WorldWideScience

Sample records for cdna sequence analysis

  1. Cloning and sequence analysis of H. contortus HC58cDNA gene ...

    African Journals Online (AJOL)

    Phylogenetic analysis revealed close evolutionary proximity of the protein sequence to counterpart sequences in the cathepsin B like proteases, suggesting that HC58cDNA was a member of the papain family. Keywords:Haemonchus contortus, HC58cDNA, cathepsin B like protease, papain family. Kenya Veterinarian Vol.

  2. Isolation and sequence analysis of a cDNA clone encoding the fifth complement component

    DEFF Research Database (Denmark)

    Lundwall, Åke B; Wetsel, Rick A; Kristensen, Torsten

    1985-01-01

    clone of 1.85 kilobase pairs was isolated. Hybridization of the mixed-sequence probe to the complementary strand of the plasmid insert and sequence analysis by the dideoxy method predicted the expected protein sequence of C5a (positions 1-12), amino-terminal to the anticipated priming site. The sequence......We have used available protein sequence data for the anaphylatoxin (C5a) portion of the fifth component of human complement (residues 19-25) to synthesize a mixed-sequence oligonucleotide probe. The labeled oligonucleotide was then used to screen a human liver cDNA library, and a single candidate cDNA...... obtained further predicted an arginine-rich sequence (RPRR) immediately upstream of the N-terminal threonine of C5a, indicating that the promolecule form of C5 is synthesized with a beta alpha-chain orientation as previously shown for pro-C3 and pro-C4. The C5 cDNA clone was sheared randomly by sonication...

  3. [Cloning and sequence analysis of Eg95 cDNA from different stages of Echinococcus granulosus in Xinjiang].

    Science.gov (United States)

    Lin, Ren-yong; Ding, Jian-bing; Wen, Hao; Zhang, Wen-bao; Li, Jun; Lu, Xiao-mei

    2003-01-01

    To study expression and sequence differences of Echinococcus granulosus 95(Eg95) antigen cDNA from different stages of protoscolex, oncosphere and adult worm of E. granulosus from Xinjiang Uighur Aut. Reg. In accordance with the sequence of Eg95 antigen cDNA, the primers of Eg95 were designed. Eg95 antigen cDNAs were amplified by PCR from protoscolex, oncosphere and adult worm cDNA libraries of E. granulosus, respectively and were cloned into pUCm-T plasmid, and sequenced. The sequences were analyzed by DNAman and GenBank/BLAST biosoftware. PCR results showed that Eg95 antigen cDNA was amplified from three stages of E. granulosus cDNA libraries. Sequencing analysis indicated that the Eg95 cDNA length was 402 bp, same as the reported data in GenBank. The Eg95 antigen cDNA was expressed in the different life-cycle stages of E. granulosus in Xinjiang and there was no nucleic acid sequence difference of Eg95 antigen among the protoscolex, oncosphere and adult worm of E. granulosus.

  4. Cloning, sequencing and expression of cDNA encoding growth ...

    Indian Academy of Sciences (India)

    Unknown

    317. 2.4 cDNA sequencing and analysis. The nucleotide sequence of the cloned H. fossilis GH. cDNA was determined by Sanger's dideoxy chain termi- nation method, using Perkin Elmer bigdye terminator kit in an ABI Prism 377 automated DNA sequencer. All other computational analysis of the GH cDNA was done using.

  5. Primary analysis of the expressed sequence tags in a pentastomid nymph cDNA library.

    Directory of Open Access Journals (Sweden)

    Jing Zhang

    Full Text Available BACKGROUND: Pentastomiasis is a rare zoonotic disease caused by pentastomids. Despite their worm-like appearance, they are commonly placed into a separate sub-class of the subphylum Crustacea, phylum Arthropoda. However, until now, the systematic classification of the pentastomids and the diagnosis of pentastomiasis are immature, and genetic information about pentastomid nylum is almost nonexistent. The objective of this study was to obtain information on pentastomid nymph genes and identify the gene homologues related to host-parasite interactions or stage-specific antigens. METHODOLOGY/PRINCIPAL FINDINGS: Total pentastomid nymph RNA was used to construct a cDNA library and 500 colonies were sequenced. Analysis shows one hundred and ninety-seven unigenes were identified. In which, 147 genes were annotated, and 75 unigenes (53.19% were mapped to 82 KEGG pathways, including 29 metabolism pathways, 29 genetic information processing pathways, 4 environmental information processing pathways, 7 cell motility pathways and 5 organismal systems pathways. Additionally, two host-parasite interaction-related gene homologues, a putative Kunitz inhibitor and a putative cysteine protease. CONCLUSION/SIGNIFICANCE: We first successfully constructed a cDNA library and gained a number of expressed sequence tags (EST from pentastomid nymphs, which will lay the foundation for the further study on pentastomids and pentastomiasis.

  6. Molecular cloning and sequence analysis of growth hormone cDNA of Neotropical freshwater fish Pacu (Piaractus mesopotamicus

    Directory of Open Access Journals (Sweden)

    Janeth Silva Pinheiro

    2008-01-01

    Full Text Available RT-PCR was used for amplifying Piaractus mesopotamicus growth hormone (GH cDNA obtained from mRNA extracted from pituitary cells. The amplified fragment was cloned and the complete cDNA sequence was determined. The cloned cDNA encompassed a sequence of 543 nucleotides that encoded a polypeptide of 178 amino acids corresponding to mature P. mesopotamicus GH. Comparison with other GH sequences showed a gap of 10 amino acids localized in the N terminus of the putative polypeptide of P. mesopotamicus. This same gap was also observed in other members of the family. Neighbor-joining tree analysis with GH sequences from fishes belonging to different taxonomic groups placed the P. mesopotamicus GH within the Otophysi group. To our knowledge, this is the first GH sequence of a Neotropical characiform fish deposited in GenBank.

  7. cDNA, genomic cloning and sequence analysis of ribosomal protein ...

    African Journals Online (AJOL)

    Ribosomal protein S4X (RPS4X) is one of the 40S ribosomal proteins encoded by the RPS4X gene. The cDNA and the genomic sequence of RPS4X were cloned successfully from giant panda (Ailuropoda melanoleuca) using reverse transcriptase-polymerase chain reaction (RT-PCR) and touchdown-PCR technology ...

  8. cDNA, genomic sequence cloning and analysis of the ribosomal ...

    African Journals Online (AJOL)

    Ribosomal protein L37A (RPL37A) is a component of 60S large ribosomal subunit encoded by the RPL37A gene, which belongs to the family of ribosomal L37AE proteins, located in the cytoplasm. The complementary deoxyribonucleic acid (cDNA) and the genomic sequence of RPL37A were cloned successfully from giant ...

  9. cDNA, genomic cloning and sequence analysis of ribosomal protein ...

    African Journals Online (AJOL)

    enoh

    2012-03-13

    Mar 13, 2012 ... Ribosomal protein S4X (RPS4X) is one of the 40S ribosomal proteins encoded by the RPS4X gene. The. cDNA and the genomic sequence of RPS4X were cloned successfully from giant panda (Ailuropoda melanoleuca) using reverse transcriptase-polymerase chain reaction (RT-PCR) and touchdown- ...

  10. Generation and Analysis of Full-length cDNA Sequences from Elephant Shark (Callorhinchus milii)

    KAUST Repository

    Kodzius, Rimantas

    2009-03-17

    Cartilaginous fishes are the oldest living group of jawed vertebrates and therefore is an important group for understanding the evolution of vertebrate genomes including the human genome. Our laboratory has proposed elephant shark (C. milii) as a model cartilaginous fish genome because of its relatively small genome size (910 Mb). The whole genome of C. milii is being sequenced (first cartilaginous fish genome to be sequenced completely). To characterize the transcriptome of C. milii and to assist in annotating exon-intron boundaries, transcriptional start sites and alternatively spliced transcripts, we are generating full-length cDNA sequences from C. milii.

  11. [Construction and sequence analysis of a normalized full-length cDNA library of Dendrobium officinale].

    Science.gov (United States)

    Jiang, Min; Wang, Jiang; Wen, Guo-Song; Xu, Shao-Zhong; Zha, Ying-Hong; Rong, Tian-Ju; Qian, Xiong

    2013-02-01

    In order to obtain functional genes, a normalized stems cDNA library was constructed from medicinal plant Dendrobium officinale. SMART (switching mechanism at 5' end of RNA transcript) cDNA synthesis combined with DSN (duplex-specific nuclease) normalization was applied to construct the normalized full-length cDNA library of D. officinale. The titer of cDNA library was about 1.3 x 10(6) cfu x mL(-1) and the average insertion size was about 1.5 kb with high recombination rate (93.9%). Random selected 163 positive clones were sequenced at single side. Bio-information analysis indicated that 147 from 150 high-quality unique sequences matched corresponding homologous proteins, and they participated in various biological processes based on GO (gene ontology). There were 8 clones with complete coding sequence, which presumed to be full-length genes. These results showed preliminarily that we successfully constructed a normalized full-length cDNA library of D. officinale which could be used to screen the functional genes related to metabolic pathways of medicinal ingredients.

  12. Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger.

    Science.gov (United States)

    Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui

    2010-10-01

    In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×10(6) pfu/ml and 1.62×10(9) pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142 bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers.

  13. Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger

    Science.gov (United States)

    Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui

    2010-01-01

    In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×106 pfu/ml and 1.62×109 pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers. PMID:20941376

  14. Construction of a cDNA library from the ephemeral plant Olimarabidopsis pumila and preliminary analysis of expressed sequence tags.

    Science.gov (United States)

    Zhao, Yun-Xia; Wei, Yan-Ling; Zhao, Ping; Xiang, Cheng-Bin; Xu, Fang; Li, Chao; Huang, Xian-Zhong

    2013-01-01

    Olimarabidopsis pumila is a close relative of the model plant Arabidopsis thaliana but, unlike A. thaliana, it is a salt-tolerant ephemeral plant that is widely distributed in semi-arid and semi-salinized regions of the Xinjiang region of China, thus providing an ideal candidate plant system for salt tolerance gene mining. A good-quality cDNA library was constructed using cap antibody to enrich full-length cDNA with the gateway technology allowing library construction without traditional methods of cloning by use of restriction enzymes. A preliminary analysis of expressed sequence tags (ESTs) was carried out. The titers of the primary and the normalized cDNA library were 1.6 x 10(6) cfu/mL and 6.7 x 10(6) cfu/mL, respectively. A total of 1093 clones were randomly selected from the normalized library for EST sequencing. By sequence analysis, 894 high-quality ESTs were generated and assembled into 736 unique sequences consisting of 72 contigs and 664 singletons. The resulting unigenes were categorized according to the gene ontology (GO) hierarchy. The potential roles of gene products associated with stress-related ESTs are discussed. The 736 unigenes were similar to A. thaliana, A. lyrata, or Thellungiella salsuginea. This research provides an overview of the mRNA expression profile and first-hand information of gene sequence expressed in young leaves of O. pumila.

  15. An analysis of expressed sequence tags of developing castor endosperm using a full-length cDNA library

    Directory of Open Access Journals (Sweden)

    Wallis James G

    2007-07-01

    Full Text Available Abstract Background Castor seeds are a major source for ricinoleate, an important industrial raw material. Genomics studies of castor plant will provide critical information for understanding seed metabolism, for effectively engineering ricinoleate production in transgenic oilseeds, or for genetically improving castor plants by eliminating toxic and allergic proteins in seeds. Results Full-length cDNAs are useful resources in annotating genes and in providing functional analysis of genes and their products. We constructed a full-length cDNA library from developing castor endosperm, and obtained 4,720 ESTs from 5'-ends of the cDNA clones representing 1,908 unique sequences. The most abundant transcripts are genes encoding storage proteins, ricin, agglutinin and oleosins. Several other sequences are also very numerous, including two acidic triacylglycerol lipases, and the oleate hydroxylase (FAH12 gene that is responsible for ricinoleate biosynthesis. The role(s of the lipases in developing castor seeds are not clear, and co-expressing of a lipase and the FAH12 did not result in significant changes in hydroxy fatty acid accumulation in transgenic Arabidopsis seeds. Only one oleate desaturase (FAD2 gene was identified in our cDNA sequences. Sequence and functional analyses of the castor FAD2 were carried out since it had not been characterized previously. Overexpression of castor FAD2 in a FAH12-expressing Arabidopsis line resulted in decreased accumulation of hydroxy fatty acids in transgenic seeds. Conclusion Our results suggest that transcriptional regulation of FAD2 and FAH12 genes maybe one of the mechanisms that contribute to a high level of ricinoleate accumulation in castor endosperm. The full-length cDNA library will be used to search for additional genes that affect ricinoleate accumulation in seed oils. Our EST sequences will also be useful to annotate the castor genome, which whole sequence is being generated by shotgun sequencing at

  16. Construction of cDNA library and preliminary analysis of expressed sequence tags from green microalga Ankistrodesmus convolutus Corda.

    Science.gov (United States)

    Thanh, Tran; Chi, Vu Thi Quynh; Abdullah, Mohd Puad; Omar, Hishamuddin; Noroozi, Mostafa; Ky, Huynh; Napis, Suhaimi

    2011-01-01

    Green microalga Ankistrodesmus convolutus Corda is a fast growing alga which produces appreciable amount of carotenoids and polyunsaturated fatty acids. To our knowledge, this is the first report on the construction of cDNA library and preliminary analysis of ESTs for this species. The titers of the primary and amplified cDNA libraries were 1.1×10(6) and 6.0×10(9) pfu/ml respectively. The percentage of recombinants was 97% in the primary library and a total of 337 out of 415 original cDNA clones selected randomly contained inserts ranging from 600 to 1,500 bps. A total of 201 individual ESTs with sizes ranging from 390 to 1,038 bps were then analyzed and the BLASTX score revealed that 35.8% of the sequences were classified as strong match, 38.3% as nominal and 25.9% as weak match. Among the ESTs with known putative function, 21.4% of them were found to be related to gene expression, 14.4% ESTs to photosynthesis, 10.9% ESTs to metabolism, 5.5% ESTs to miscellaneous, 2.0% to stress response, and the remaining 45.8% were classified as novel genes. Analysis of ESTs described in this paper can be an effective approach to isolate and characterize new genes from A. convolutus and thus the sequences obtained represented a significant contribution to the extensive database of sequences from green microalgae.

  17. Isolation and sequence analysis of a chalcone synthase cDNA of Matthiola incana R. Br. (Brassicaceae).

    Science.gov (United States)

    Epping, B; Kittel, M; Ruhnau, B; Hemleben, V

    1990-06-01

    A cDNA clone (pcM12) of the chalcone synthase (CHS) of Matthiola incana R. Br. (Brassicaceae) was isolated from a cDNA library, sequenced and analysed. It comprises the complete coding sequence for the CHS and 5' and 3' untranslated regions. The deduced amino acid sequence shows that the Matthiola incana CHS consists of 394 amino acid residues. Comparison with CHS amino acid sequences of other plants indicates more than 82% homology.

  18. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

    Directory of Open Access Journals (Sweden)

    Bendahmane Abdelhafid

    2011-05-01

    Full Text Available Abstract Background Melon (Cucumis melo, an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs and 3,073 single nucleotide polymorphisms (SNPs in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but

  19. cDNA sequence and tissue expression analysis of glucokinase from ...

    African Journals Online (AJOL)

    Yomi

    2012-01-10

    Jan 10, 2012 ... (Rattus norvegicus) were 98.1, 96.8, 80.3 and 79.8%, respectively. Phylogenetic analysis based on GK amino acid sequences. Phylogenetic analysis among eight fish species, eleven endothermic species and one amphibian species based on glucokinase amino acid sequences is shown in Figure. 3.

  20. Nucleotide sequence of a cDNA for branched chain acyltransferase with analysis of the deduced protein structure

    International Nuclear Information System (INIS)

    Hummel, K.B.; Litwer, S.; Bradford, A.P.; Aitken, A.; Danner, D.J.; Yeaman, S.J.

    1988-01-01

    Nucleotide sequence was determined for a 1.6-kilobase human cDNA putative for the branched chain acyltransferase protein of the branched chain α-ketoacid dehydrogenase complex. Translation of the sequence reveals an open reading frame encoding a 315-amino acid protein of molecular weight 35,759 followed by 560 bases of 3'-untranslated sequence. Three repeats of the polyadenylation signal hexamer ATTAAA are present prior to the polyadenylate tail. Within the open reading frame is a 10-amino acid fragment which matches exactly the amino acid sequence around the lipoate-lysine residue in bovine kidney branched chain acyltransferase, thus confirming the identity of the cDNA. Analysis of the deduced protein structure for the human branched chain acyltransferase revealed an organization into domains similar to that reported for the acyltransferase proteins of the pyruvate and α-ketoglutarate dehydrogenase complexes. This similarity in organization suggests that a more detailed analysis of the proteins will be required to explain the individual substrate and multienzyme complex specificity shown by these acyltransferases

  1. Bioinformatic analysis of barcoded cDNA libraries for small RNA profiling by next-generation sequencing.

    Science.gov (United States)

    Farazi, Thalia A; Brown, Miguel; Morozov, Pavel; Ten Hoeve, Jelle J; Ben-Dov, Iddo Z; Hovestadt, Volker; Hafner, Markus; Renwick, Neil; Mihailović, Aleksandra; Wessels, Lodewyk F A; Tuschl, Thomas

    2012-10-01

    The characterization of post-transcriptional gene regulation by small regulatory RNAs of 20-30 nt length, particularly miRNAs and piRNAs, has become a major focus of research in recent years. A prerequisite for the characterization of small RNAs is their identification and quantification across different developmental stages, normal and diseased tissues, as well as model cell lines. Here we present a step-by-step protocol for the bioinformatic analysis of barcoded cDNA libraries for small RNA profiling generated by Illumina sequencing, thereby facilitating miRNA and other small RNA profiling of large sample collections. Copyright © 2012 Elsevier Inc. All rights reserved.

  2. CONSTRUCTION OF SILKWORM MIDGUT cDNA LIBRARY FOR SCREEN AND SEQUENCE ANALYSIS OF PERITROPHIC MEMBRANE PROTEIN GENES.

    Science.gov (United States)

    Zhou, Yi-Jun; Xue, Bin; Li, Yang-Yang; Li, Fan-Chi; Ni, Min; Shen, Wei-De; Gu, Zhi-Ya; Li, Bing; Shen, Wei-De; Gu, Zhi-Ya; Li, Bing

    2016-01-01

    Silkworm is an important economic insect and the model species for Lepidoptera. The midgut of silkworm is an important physiological barrier, as its peritrophic membrane (PM) can resist pathogen invasion. In this study, a silkworm midgut cDNA library was constructed in order to identify silkworm PM genes. The capacity of the initial library was 6.92 × 10(6) pfu/ml, along with a recombination rate of 92.14% and a postamplification titer of 4.10 × 10(9) pfu/ml. Three silkworm PM protein genes were obtained by immunoscreening, two of which were chitin-binding protein (CBP) genes and one of which was a chitin deacetylase (CDA) gene as revealed by sequence analysis. Three genes were named BmCBP02, BmCBP13, and BmCDA17, and their ORF sizes are 678, 1,029, and 645 bp, respectively; all of them contain sequences of chitin-binding domains. Phylogenetic analysis indicated that BmCBP02 has the highest consensus with Mamestra configurata CBP at 61.0%; BmCBP13 has the highest consensus with Loxostege sticticalis PM CBP at 53.35%; BmCDA17 has the highest consensus with Helicoverpa armigera CDA5a at 70.83%. Tissue transcriptional analysis revealed that all three genes were specifically expressed in the midgut, and during the developmental process of fifth-instar silkworms, the transcription of all the genes showed an upward trend. This study laid a foundation for further studies on the functions of silkworm PM genes. © 2015 Wiley Periodicals, Inc.

  3. cDNA sequence quality data - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Budding yeast cDNA sequencing project cDNA sequence quality data Data detail Data name cDNA sequence quality... data DOI 10.18908/lsdba.nbdc00838-003 Description of data contents Phred's quality score. P...tion Download License Update History of This Database Site Policy | Contact Us cDNA sequence quality

  4. cDNA sequence analysis of a 29-kDa cysteine-rich surface antigen of pathogenic Entamoeba histolytica

    International Nuclear Information System (INIS)

    Torian, B.E.; Stroeher, V.L.; Stamm, W.E.; Flores, B.M.; Hagen, F.S.

    1990-01-01

    A λgt11 cDNA library was constructed from poly(U)-Spharose-selected Entamoeba histolytica trophozoite RNA in order to clone and identify surface antigens. The library was screened with rabbit polyclonal anti-E. histolytica serum. A 700-base-pair cDNA insert was isolated and the nucleotide sequence was determined. The deduced amino acid sequence of the cDNA revealed a cysteine-rich protein. DNA hybridizations showed that the gene was specific to E. histolytica since the cDNA probe reacted with DNA from four axenic strains of E. histolytica but did not react with DNA from Entamoeba invadens, Acanthamoeba castellanii, or Trichomonas vaginalis. The insert was subcloned into the expression vector pGEX-1 and the protein was expressed as a fusion with the C terminus of glutathione S-transferase. Purified fusion protein was used to generate 22 monoclonal antibodies (mAbs) and a mouse polyclonal antiserum specific for the E. histolytica portion of the fusion protein. A 29-kDa protein was identified as a surface antigen when mAbs were used to immunoprecipitate the antigen from metabolically 35 S-labeled live trophozoites. The surface location of the antigen was corroborated by mAb immunoprecipitation of a 29-kDa protein from surface- 125 I-labeled whole trophozoites as well as by the reaction of mAbs with live trophozoites in an indirect immunofluorescence assay performed at 4 degree C. Immunoblotting with mAbs demonstrated that the antigen was present on four axenic isolates tested. mAbs recognized epitopes on the 29-kDa native antigen on some but not all clinical isolates tested

  5. cDNA sequence analysis of a 29-kDa cysteine-rich surface antigen of pathogenic Entamoeba histolytica

    Energy Technology Data Exchange (ETDEWEB)

    Torian, B.E.; Stroeher, V.L.; Stamm, W.E. (Univ. of Washington, Seattle (USA)); Flores, B.M. (Louisiana State Univ. Medical Center, New Orleans (USA)); Hagen, F.S. (Zymogenetics Incorporated, Seattle, WA (USA))

    1990-08-01

    A {lambda}gt11 cDNA library was constructed from poly(U)-Spharose-selected Entamoeba histolytica trophozoite RNA in order to clone and identify surface antigens. The library was screened with rabbit polyclonal anti-E. histolytica serum. A 700-base-pair cDNA insert was isolated and the nucleotide sequence was determined. The deduced amino acid sequence of the cDNA revealed a cysteine-rich protein. DNA hybridizations showed that the gene was specific to E. histolytica since the cDNA probe reacted with DNA from four axenic strains of E. histolytica but did not react with DNA from Entamoeba invadens, Acanthamoeba castellanii, or Trichomonas vaginalis. The insert was subcloned into the expression vector pGEX-1 and the protein was expressed as a fusion with the C terminus of glutathione S-transferase. Purified fusion protein was used to generate 22 monoclonal antibodies (mAbs) and a mouse polyclonal antiserum specific for the E. histolytica portion of the fusion protein. A 29-kDa protein was identified as a surface antigen when mAbs were used to immunoprecipitate the antigen from metabolically {sup 35}S-labeled live trophozoites. The surface location of the antigen was corroborated by mAb immunoprecipitation of a 29-kDa protein from surface-{sup 125}I-labeled whole trophozoites as well as by the reaction of mAbs with live trophozoites in an indirect immunofluorescence assay performed at 4{degree}C. Immunoblotting with mAbs demonstrated that the antigen was present on four axenic isolates tested. mAbs recognized epitopes on the 29-kDa native antigen on some but not all clinical isolates tested.

  6. Cloning, sequencing and expression of cDNA encoding growth ...

    Indian Academy of Sciences (India)

    Using polymerase chain reaction (PCR) primers representing the conserved regions of fish GH sequences the 3′ region of catfish GH cDNA (540 bp) was cloned by random amplification of cDNA ends and the clone was used as a probe to isolate recombinant phages carrying the full-length cDNA sequence. The full-length ...

  7. Cloning, sequence analysis, and expression of cDNA coding for the major house dust mite allergen, Der f 1, in Escherichia coli

    Directory of Open Access Journals (Sweden)

    Y. Cui

    2008-05-01

    Full Text Available Our objective was to clone, express and characterize adult Dermatophagoides farinae group 1 (Der f 1 allergens to further produce recombinant allergens for future clinical applications in order to eliminate side reactions from crude extracts of mites. Based on GenBank data, we designed primers and amplified the cDNA fragment coding for Der f 1 by nested-PCR. After purification and recovery, the cDNA fragment was cloned into the pMD19-T vector. The fragment was then sequenced, subcloned into the plasmid pET28a(+, expressed in Escherichia coli BL21 and identified by Western blotting. The cDNA coding for Der f 1 was cloned, sequenced and expressed successfully. Sequence analysis showed the presence of an open reading frame containing 966 bp that encodes a protein of 321 amino acids. Interestingly, homology analysis showed that the Der p 1 shared more than 87% identity in amino acid sequence with Eur m 1 but only 80% with Der f 1. Furthermore, phylogenetic analyses suggested that D. pteronyssinus was evolutionarily closer to Euroglyphus maynei than to D. farinae, even though D. pteronyssinus and D. farinae belong to the same Dermatophagoides genus. A total of three cysteine peptidase active sites were found in the predicted amino acid sequence, including 127-138 (QGGCGSCWAFSG, 267-277 (NYHAVNIVGYG and 284-303 (YWIVRNSWDTTWGDSGYGYF. Moreover, secondary structure analysis revealed that Der f 1 contained an a helix (33.96%, an extended strand (17.13%, a ß turn (5.61%, and a random coil (43.30%. A simple three-dimensional model of this protein was constructed using a Swiss-model server. The cDNA coding for Der f 1 was cloned, sequenced and expressed successfully. Alignment and phylogenetic analysis suggests that D. pteronyssinus is evolutionarily more similar to E. maynei than to D. farinae.

  8. Construction of cDNA library and preliminary analysis of expressed sequence tags from tea plant [Camellia sinensis (L) O. Kuntze].

    Science.gov (United States)

    Phukon, Munmi; Namdev, Richa; Deka, Diganta; Modi, Mahendra K; Sen, Priyabrata

    2012-09-10

    Tea is the most popular non-alcoholic and healthy beverage across the world. The understanding of the genetic organization and molecular biology of tea plant, which is very poorly understood at present, is required for quantum increase in productivity and efficient use of germplasm for either cultivation or breeding program. Single-pass sequencing of randomly selected cDNA clones is the most widely accepted technique for gene identification and cloning. In the present study, a good quality cDNA library was constructed and preliminary analysis of ESTs was carried out. The titers of unamplified and amplified libraries were 1.4 × 10(6)pfu/ml and 5.27 × 10(8)pfu/ml respectively. A total of 210 cDNA clones from the constructed cDNA library were sequenced and analyzed. A total of 84 high quality Expressed Sequence Tags (ESTs) were generated, among which 71 ESTs had significant homology with sequences in NCBI non-redundant protein database by BLAST X analysis. About 80% ESTs had poly (A) tail at 3' end indicating that the cDNAs were full length. The database-matched ESTs were classified into putative cellular roles, viz. energy-related category (corresponding to 20% of total BLAST X matched ESTs), Transcription (14.2%), protein synthesis (14.2%) cell growth and division (8.6%), cell structure (5.7%), signal transduction (5.7%), transporters (2.9%), disease and defenses (2.9%), secondary metabolism (2.9%) and gene regulation (2.9%). This study provides an overview of the mRNA expression profile and first hand information of gene sequence expressed in tender leaves and apical buds of tea plant. Copyright © 2012 Elsevier B.V. All rights reserved.

  9. The venom gland transcriptome of Latrodectus tredecimguttatus revealed by deep sequencing and cDNA library analysis.

    Directory of Open Access Journals (Sweden)

    Quanze He

    Full Text Available Latrodectus tredecimguttatus, commonly known as black widow spider, is well known for its dangerous bite. Although its venom has been characterized extensively, some fundamental questions about its molecular composition remain unanswered. The limited transcriptome and genome data available prevent further understanding of spider venom at the molecular level. In the present study, we combined next-generation sequencing and conventional DNA sequencing to construct a venom gland transcriptome of the spider L. tredecimguttatus, which resulted in the identification of 9,666 and 480 high-confidence proteins among 34,334 de novo sequences and 1,024 cDNA sequences, respectively, by assembly, translation, filtering, quantification and annotation. Extensive functional analyses of these proteins indicated that mRNAs involved in RNA transport and spliceosome, protein translation, processing and transport were highly enriched in the venom gland, which is consistent with the specific function of venom glands, namely the production of toxins. Furthermore, we identified 146 toxin-like proteins forming 12 families, including 6 new families in this spider in which α-LTX-Lt1a family2 is firstly identified as a subfamily of α-LTX-Lt1a family. The toxins were classified according to their bioactivities into five categories that functioned in a coordinate way. Few ion channels were expressed in venom gland cells, suggesting a possible mechanism of protection from the attack of their own toxins. The present study provides a gland transcriptome profile and extends our understanding of the toxinome of spiders and coordination mechanism for toxin production in protein expression quantity.

  10. Ferritin from the Pacific abalone Haliotis discus hannai: Analysis of cDNA sequence, expression, and activity.

    Science.gov (United States)

    Qiu, Reng; Kan, Yunchao; Li, Dandan

    2016-02-01

    Ferritin plays an important role in iron homeostasis due to its ability to bind and sequester large amounts of iron. In this study, the gene encoding a ferritin (HdhFer2) was cloned from Pacific abalone (Haliotis discus hannai). The full-length cDNA of HdhFer2 contains a 5'-UTR of 121 bp, an ORF of 516 bp, and a 3'-UTR of 252 bp with a polyadenylation signal sequence of AATAAA and a poly(A) tail. It also contains a 31 bp iron-responsive element (IRE) in the 5'-UTR position, which is conserved in many ferritins. HdhFer2 consists of 171 amino acid residues with a predicted molecular weight (MW) ∼19.8 kDa and a theoretical isoelectric point (PI) of 4.84. The deduced amino acid sequence of HdhFer2 contains two ferritin iron-binding region signatures (IBRSs). HdhFer2 mRNA was detected in a wide range of tissues and was dominantly expressed in the gill. Infection with the bacterial pathogen Vibrio anguillarum significantly upregulated HdhFer2 expression in a time-dependent manner. Recombinant HdhFer2 (rHdhFer2) purified from Escherichia coli was able to bind ferrous iron in a concentration-dependent manner. In summary, these results suggest that HdhFer2 is a crucial protein in the iron-withholding defense system, and plays an important role in the innate immune response of abalone. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Characterization of the venom from the Australian scorpion Urodacus yaschenkoi: Molecular mass analysis of components, cDNA sequences and peptides with antimicrobial activity.

    Science.gov (United States)

    Luna-Ramírez, Karen; Quintero-Hernández, Veronica; Vargas-Jaimes, Leonel; Batista, Cesar V F; Winkel, Kenneth D; Possani, Lourival D

    2013-03-01

    The Urodacidae scorpions are the most widely distributed of the four families in Australia and represent half of the species in the continent, yet their venoms remain largely unstudied. This communication reports the first results of a proteome analysis of the venom of the scorpion Urodacus yaschenkoi performed by mass fingerprinting, after high performance liquid chromatography (HPLC) separation. A total of 74 fractions were obtained by HPLC separation allowing the identification of approximately 274 different molecular masses with molecular weights varying from 287 to 43,437 Da. The most abundant peptides were those from 1 K Da and 4-5 K Da representing antimicrobial peptides and putative potassium channel toxins, respectively. Three such peptides were chemically synthesized and tested against Gram-positive and Gram-negative bacteria showing minimum inhibitory concentration in the low micromolar range, but with moderate hemolytic activity. It also reports a transcriptome analysis of the venom glands of the same scorpion species, undertaken by constructing a cDNA library and conducting random sequencing screening of the transcripts. From the resultant cDNA library 172 expressed sequence tags (ESTs) were analyzed. These transcripts were further clustered into 120 unique sequences (23 contigs and 97 singlets). The identified putative proteins can be assorted in several groups, such as those implicated in common cellular processes, putative neurotoxins and antimicrobial peptides. The scorpion U. yaschenkoi is not known to be dangerous to humans and its venom contains peptides similar to those of Opisthacanthus cayaporum (antibacterial), Scorpio maurus palmatus (maurocalcin), Opistophthalmus carinatus (opistoporines) and Hadrurus gerstchi (scorpine-like molecules), amongst others. Copyright © 2012 Elsevier Ltd. All rights reserved.

  12. Mouse tetranectin: cDNA sequence, tissue-specific expression, and chromosomal mapping

    DEFF Research Database (Denmark)

    Ibaraki, K; Kozak, C A; Wewer, U M

    1995-01-01

    regulation, mouse tetranectin cDNA was cloned from a 16-day-old mouse embryo library. Sequence analysis revealed a 992-bp cDNA with an open reading frame of 606 bp, which is identical in length to the human tetranectin cDNA. The deduced amino acid sequence showed high homology to the human cDNA with 76......(s) of tetranectin. The sequence analysis revealed a difference in both sequence and size of the noncoding regions between mouse and human cDNAs. Northern analysis of the various tissues from mouse, rat, and cow showed the major transcript(s) to be approximately 1 kb, which is similar in size to that observed...

  13. Sequencing and comparative genomics analysis inSenecio scandensBuch.-Ham. Ex D. Don, based on full-length cDNA library.

    Science.gov (United States)

    Qian, Gang; Ping, Junjiao; Zhang, Zhen; Xu, Delin

    2014-09-03

    Senecio scandens Buch.-Ham. ex D. Don, an important antibacterial source of Chinese traditional medicine, has a widespread distribution in a few ecological habitats of China. We generated a full-length complementary DNA (cDNA) library from a sample of elite individuals with superior antibacterial properties, with satisfactory parameters such as library storage (4.30 × 10 6 CFU), efficiency of titre (1.30 × 10 6 CFU/mL), transformation efficiency (96.35%), full-length ratio (64.00%) and redundancy ratio (3.28%). The BLASTN search revealed the facile formation of counterparts between the experimental sample and Arabidopsis thaliana in view of high-homology cDNA sequence (90.79%) with e -values cDNA clones consist of the major of functional genes identified by a large set of microarray data from the present experimental material. For other Compositae species, a large set of full-length cDNA clones reported in the present article will serve as a useful resource to facilitate further research on the transferability of expressed sequence tag-derived simple sequence repeats (EST-SSR) development, comparative genomics and novel transcript profiles.

  14. Analysis of expression sequence tags from a full-length-enriched cDNA library of developing sesame seeds (Sesamum indicum).

    Science.gov (United States)

    Ke, Tao; Dong, Caihua; Mao, Han; Zhao, Yingzhong; Chen, Hong; Liu, Hongyan; Dong, Xuyan; Tong, Chaobo; Liu, Shengyi

    2011-12-24

    Sesame (Sesamum indicum) is one of the most important oilseed crops with high oil contents and rich nutrient value. However, genetic improvement efforts in sesame could not get benefit from molecular biology technology due to poor DNA and RNA sequence resources. In this study, we carried out a large scale of expressed sequence tags (ESTs) sequencing from developing sesame seeds and further conducted analysis on seed storage products-related genes. A normalized and full-length enriched cDNA library from 5 ~ 30 days old immature seeds was constructed and randomly sequenced, leading to generation of 41,248 expressed sequence tags (ESTs) which then formed 4,713 contigs and 27,708 singletons with 44.9% uniESTs being putative full-length open reading frames. Approximately 26,091 of all these uniESTs have significant matches to the counterparts in Nr database of GenBank, and 21,628 of them were assigned to one or more Gene ontology (GO) terms. Homologous genes involved in oil biosynthesis were identified including some conservative transcription factors regulating oil biosynthesis such as LEAFY COTYLEDON1 (LEC1), PICKLE (PKL), WRINKLED1 (WRI1) and majority of them were found for the first time in sesame seeds. One hundred and 17 ESTs were identified possibly involved in biosynthesis of sesame lignans, sesamin and sesamolin. In total, 9,347 putative functional genes from developing seeds were identified, which accounts for one third of total genes in the sesame genome. Further analysis of the uniESTs identified 1,949 non-redundant simple sequence repeats (SSRs). This study has provided an overview of genes expressed during sesame seed development. This collection of sesame full-length cDNAs covered a wide variety of genes in seeds, in particular, candidate genes involved in biosynthesis of sesame oils and lignans. These EST sequences enriched with full length will contribute to comparative genomic studies on sesame and other oilseed plants and serve as an abundant

  15. Analysis of expression sequence tags from a full-length-enriched cDNA library of developing sesame seeds (Sesamum indicum

    Directory of Open Access Journals (Sweden)

    Ke Tao

    2011-12-01

    Full Text Available Abstract Background Sesame (Sesamum indicum is one of the most important oilseed crops with high oil contents and rich nutrient value. However, genetic improvement efforts in sesame could not get benefit from molecular biology technology due to poor DNA and RNA sequence resources. In this study, we carried out a large scale of expressed sequence tags (ESTs sequencing from developing sesame seeds and further conducted analysis on seed storage products-related genes. Results A normalized and full-length enriched cDNA library from 5 ~ 30 days old immature seeds was constructed and randomly sequenced, leading to generation of 41,248 expressed sequence tags (ESTs which then formed 4,713 contigs and 27,708 singletons with 44.9% uniESTs being putative full-length open reading frames. Approximately 26,091 of all these uniESTs have significant matches to the counterparts in Nr database of GenBank, and 21,628 of them were assigned to one or more Gene ontology (GO terms. Homologous genes involved in oil biosynthesis were identified including some conservative transcription factors regulating oil biosynthesis such as LEAFY COTYLEDON1 (LEC1, PICKLE (PKL, WRINKLED1 (WRI1 and majority of them were found for the first time in sesame seeds. One hundred and 17 ESTs were identified possibly involved in biosynthesis of sesame lignans, sesamin and sesamolin. In total, 9,347 putative functional genes from developing seeds were identified, which accounts for one third of total genes in the sesame genome. Further analysis of the uniESTs identified 1,949 non-redundant simple sequence repeats (SSRs. Conclusions This study has provided an overview of genes expressed during sesame seed development. This collection of sesame full-length cDNAs covered a wide variety of genes in seeds, in particular, candidate genes involved in biosynthesis of sesame oils and lignans. These EST sequences enriched with full length will contribute to comparative genomic studies on sesame and

  16. cDNA, genomic sequence cloning and overexpression of ribosomal ...

    African Journals Online (AJOL)

    RPS16 of eukaryote is a component of the 40S small ribosomal subunit encoded by RPS16 gene and is also a homolog of prokaryotic RPS9. The cDNA and genomic sequence of RPS16 was cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) using reverse transcription-polymerase chain ...

  17. Mouse tetranectin: cDNA sequence, tissue-specific expression, and chromosomal mapping

    DEFF Research Database (Denmark)

    Ibaraki, K; Kozak, C A; Wewer, U M

    1995-01-01

    regulation, mouse tetranectin cDNA was cloned from a 16-day-old mouse embryo library. Sequence analysis revealed a 992-bp cDNA with an open reading frame of 606 bp, which is identical in length to the human tetranectin cDNA. The deduced amino acid sequence showed high homology to the human cDNA with 76...... in human. Although additional minor bands of 1.5 and 3.3 kb were found in Northern blots, RT-PCR (reverse transcription polymerase chain reaction) analysis failed to provide evidence that these minor bands are products of the tetranectin gene. Finally, the genetic map location for this gene, Tna...

  18. cDNA sequencing improves the detection of P53 missense mutations in colorectal cancer

    International Nuclear Information System (INIS)

    Szybka, Malgorzata; Kordek, Radzislaw; Zakrzewska, Magdalena; Rieske, Piotr; Pasz-Walczak, Grazyna; Kulczycka-Wojdala, Dominika; Zawlik, Izabela; Stawski, Robert; Jesionek-Kupnicka, Dorota; Liberski, Pawel P

    2009-01-01

    Recently published data showed discrepancies beteween P53 cDNA and DNA sequencing in glioblastomas. We hypothesised that similar discrepancies may be observed in other human cancers. To this end, we analyzed 23 colorectal cancers for P53 mutations and gene expression using both DNA and cDNA sequencing, real-time PCR and immunohistochemistry. We found P53 gene mutations in 16 cases (15 missense and 1 nonsense). Two of the 15 cases with missense mutations showed alterations based only on cDNA, and not DNA sequencing. Moreover, in 6 of the 15 cases with a cDNA mutation those mutations were difficult to detect in the DNA sequencing, so the results of DNA analysis alone could be misinterpreted if the cDNA sequencing results had not also been available. In all those 15 cases, we observed a higher ratio of the mutated to the wild type template by cDNA analysis, but not by the DNA analysis. Interestingly, a similar overexpression of P53 mRNA was present in samples with and without P53 mutations. In terms of colorectal cancer, those discrepancies might be explained under three conditions: 1, overexpression of mutated P53 mRNA in cancer cells as compared with normal cells; 2, a higher content of cells without P53 mutation (normal cells and cells showing K-RAS and/or APC but not P53 mutation) in samples presenting P53 mutation; 3, heterozygous or hemizygous mutations of P53 gene. Additionally, for heterozygous mutations unknown mechanism(s) causing selective overproduction of mutated allele should also be considered. Our data offer new clues for studying discrepancy in P53 cDNA and DNA sequencing analysis

  19. Whitefly (Bemisia tabaci genome project: analysis of sequenced clones from egg, instar, and adult (viruliferous and non-viruliferous cDNA libraries

    Directory of Open Access Journals (Sweden)

    Czosnek Henryk

    2006-04-01

    Full Text Available Abstract Background The past three decades have witnessed a dramatic increase in interest in the whitefly Bemisia tabaci, owing to its nature as a taxonomically cryptic species, the damage it causes to a large number of herbaceous plants because of its specialized feeding in the phloem, and to its ability to serve as a vector of plant viruses. Among the most important plant viruses to be transmitted by B. tabaci are those in the genus Begomovirus (family, Geminiviridae. Surprisingly, little is known about the genome of this whitefly. The haploid genome size for male B. tabaci has been estimated to be approximately one billion bp by flow cytometry analysis, about five times the size of the fruitfly Drosophila melanogaster. The genes involved in whitefly development, in host range plasticity, and in begomovirus vector specificity and competency, are unknown. Results To address this general shortage of genomic sequence information, we have constructed three cDNA libraries from non-viruliferous whiteflies (eggs, immature instars, and adults and two from adult insects that fed on tomato plants infected by two geminiviruses: Tomato yellow leaf curl virus (TYLCV and Tomato mottle virus (ToMoV. In total, the sequence of 18,976 clones was determined. After quality control, and removal of 5,542 clones of mitochondrial origin 9,110 sequences remained which included 3,843 singletons and 1,017 contigs. Comparisons with public databases indicated that the libraries contained genes involved in cellular and developmental processes. In addition, approximately 1,000 bases aligned with the genome of the B. tabaci endosymbiotic bacterium Candidatus Portiera aleyrodidarum, originating primarily from the egg and instar libraries. Apart from the mitochondrial sequences, the longest and most abundant sequence encodes vitellogenin, which originated from whitefly adult libraries, indicating that much of the gene expression in this insect is directed toward the production

  20. Analysis of expressed sequence tags (ESTs) from a normalized cDNA library and isolation of EST simple sequence repeats from the invasive cotton mealybug Phenacoccus solenopsis.

    Science.gov (United States)

    Li, Hui; Lang, Kun-Ling; Fu, Hai-Bin; Shen, Chang-Peng; Wan, Fang-Hao; Chu, Dong

    2015-12-01

    The cotton mealybug, Phenacoccus solenopsis Tinsley, is a serious and invasive pest. At present, genetic resources for studying P. solenopsis are limited, and this negatively affects genetic research on the organism and, consequently, translational work to improve management of this pest. In the present study, expressed sequence tags (ESTs) were analyzed from a normalized complementary DNA library of P. solenopsis. In addition, EST-derived microsatellite loci (also known as simple sequence repeats or SSRs) were isolated and characterized. A total of 1107 high-quality ESTs were acquired from the library. Clustering and assembly analysis resulted in 785 unigenes, which were classified functionally into 23 categories according to the Gene Ontology database. Seven EST-based SSR markers were developed in this study and are expected to be useful in characterizing how this invasive species was introduced, as well as providing insights into its genetic microevolution. © 2014 Institute of Zoology, Chinese Academy of Sciences.

  1. Construction of a full-length cDNA library and preliminary analysis of expressed sequence tags from lymphocytes of half-pipe snowboarding athletes.

    Science.gov (United States)

    Zhao, Y H; Zhang, Z B; Zhao, C Q; Zhang, Y; Wang, Y F; Guan, W J; Zhu, Z Q

    2015-10-21

    The genes of top athletes are a valuable genetic resource for the human race, and could be exploited to identify novel genes related to sports ability, as well as other functions. We analyzed the expressed sequence tags from top half-pipe snowboarding athletes using the SMART complementary DNA (cDNA) library construction method to elucidate the characteristics of the athlete genome and the differential expression of the genes it contains. Overall, we established a full-length cDNA library from the lymphocytes of half-pipe snowboarding athletes and analyzed the inserted gene fragments. We also classified those genes according to molecular function, biological characteristics, cellular composition, protein types, and signal paths. A total of 201 functional genes were noted, which were distributed in 27 pathways. TXN, MDH1, ARL1, ARPC3, ACTG1, and other genes measured in sequence may be associated with physical ability. This suggests that the SMART cDNA library constructed from the genetic material from top athletes is an effective tool for preserving genetic sports resources and providing genetic markers of physical ability for athlete selection.

  2. Construction of a muscle cDNA library of Chinese shrimp Fenneropenaeus chinensis and sequence analysis of the troponin I gene

    Science.gov (United States)

    Li, Jitao; Chen, Ping; Li, Jian; Liu, Ping; He, Yuying; Wang, Qingyin

    2010-03-01

    A muscle cDNA library of Chinese shrimp ( Fenneropenaeus chinensis) was constructed with the SMART™ cDNA Library Construction Kit. The titer of optimal primary library was 7.7×105 pfu mL-1 and that of the amplified library was 3.0×109 pfu mL-1. The percentages of the recombinant clones of primary and amplified libraries were over 98%. The insert sizes were longer than 400 bp with an average of 1000 bp. A positive clone containing a 794 bp insert was sequenced and identified encoding fast skeletal troponin I gene. This library provided a useful resource for the functional genomic research of F. chinensis.

  3. cDNA cloning, structural analysis, SNP detection and tissue ...

    Indian Academy of Sciences (India)

    Home; Journals; Journal of Genetics; Volume 96; Issue 2. cDNA cloning, structural analysis, SNP detection and tissue ... Abstract. Insulin-like growth factor 1 (IGF1) plays an important role in growth, reproduction, foetal development and cell proliferation. The present study was conducted to clone and sequence the ...

  4. CDNA encoding a polypeptide including a hevein sequence

    Science.gov (United States)

    Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

    1995-03-21

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  5. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    2000-07-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74--79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  6. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Directory of Open Access Journals (Sweden)

    Changqing Liu

    2013-05-01

    Full Text Available In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.

  7. Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

    Science.gov (United States)

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-05-24

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.

  8. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Science.gov (United States)

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  9. cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity

    International Nuclear Information System (INIS)

    Taira, M.; Yoshida, T.; Miyagawa, K.; Sakamoto, H.; Terada, M.; Sugimura, T.

    1987-01-01

    The hst gene was originally identified as a transforming gene in DNAs from human stomach cancers and from a noncancerous portion of stomach mucosa by DNA-mediated transfection assay using NIH3T3 cells. cDNA clones of hst were isolated from the cDNA library constructed from poly(A) + RNA of a secondary transformant induced by the DNA from a stomach cancer. The sequence analysis of the hst cDNA revealed the presence of two open reading frames. When this cDNA was inserted into an expression vector containing the simian virus 40 promoter, it efficiently induced the transformation of NIH3T3 cells upon transfection. It was found that one of the reading frames, which coded for 206 amino acids, was responsible for the transforming activity

  10. Expressed sequence tags: normalization and subtraction of cDNA libraries expressed sequence tags\\ normalization and subtraction of cDNA libraries.

    Science.gov (United States)

    Soares, Marcelo Bento; de Fatima Bonaldo, Maria; Hackett, Jeremiah D; Bhattacharya, Debashish

    2009-01-01

    Expressed Sequence Tags (ESTs) provide a rapid and efficient approach for gene discovery and analysis of gene expression in eukaryotes. ESTs have also become particularly important with recent expanded efforts in complete genome sequencing of understudied, nonmodel eukaryotes such as protists and algae. For these projects, ESTs provide an invaluable source of data for gene identification and prediction of exon-intron boundaries. The generation of EST data, although straightforward in concept, requires nonetheless great care to ensure the highest efficiency and return for the investment in time and funds. To this end, key steps in the process include generation of a normalized cDNA library to facilitate a high gene discovery rate followed by serial subtraction of normalized libraries to maintain the discovery rate. Here we describe in detail, protocols for normalization and subtraction of cDNA libraries followed by an example using the toxic dinoflagellate Alexandrium tamarense.

  11. Expression analysis of a ''Cucurbita'' cDNA encoding endonuclease

    International Nuclear Information System (INIS)

    Szopa, J.

    1995-01-01

    The nuclear matrices of plant cell nuclei display intrinsic nuclease activity which consists in nicking supercoiled DNA. A cDNA encoding a 32 kDa endonuclease has been cloned and sequenced. The nucleotide and deduced amino-acid sequences show high homology to known 14-3-3-protein sequences from other sources. The amino-acid sequence shows agreement with consensus sequences for potential phosphorylation by protein kinase A and C and for calcium, lipid and membrane-binding sites. The nucleotide-binding site is also present within the conserved part of the sequence. By Northern blot analysis, the differential expression of the corresponding mRNA was detected; it was the strongest in sink tissues. The endonuclease activity found on DNA-polyacrylamide gel electrophoresis coincided with mRNA content and was the highest in tuber. (author). 22 refs, 6 figs

  12. cDNA, genomic sequence cloning and overexpression of ribosomal ...

    African Journals Online (AJOL)

    Alignment analysis indicated that the nucleotide sequence of the coding sequence shows a high homology to those of Homo sapiens, Pongo abelii, Macaca fascicularis, Mus musculus, Bos taurus and Rattus norvegicus are 93.1, 92.5, 92.2, 91.1, 90.6 and 90.0% respectively. The amino acid sequence encoded by RPS20 ...

  13. Sequencing of first-strand cDNA library reveals full-length transcriptomes.

    Science.gov (United States)

    Agarwal, Saurabh; Macfarlan, Todd S; Sartor, Maureen A; Iwase, Shigeki

    2015-01-21

    Massively parallel strand-specific sequencing of RNA (ssRNA-seq) has emerged as a powerful tool for profiling complex transcriptomes. However, many current methods for ssRNA-seq suffer from the underrepresentation of both the 5' and 3' ends of RNAs, which can be attributed to second-strand cDNA synthesis. The 5' and 3' ends of RNA harbour crucial information for gene regulation; namely, transcription start sites (TSSs) and polyadenylation sites. Here we report a novel ssRNA-seq method that does not involve second-strand cDNA synthesis, as we Directly Ligate sequencing Adaptors to the First-strand cDNA (DLAF). This novel method with fewer enzymatic reactions results in a higher quality of the libraries than the conventional method. Sequencing of DLAF libraries followed by a novel analysis pipeline enables the profiling of both 5' ends and polyadenylation sites at near-base resolution. Therefore, DLAF offers the first genomics tool to obtain the 'full-length' transcriptome with a single library.

  14. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jacob

    2007-01-01

    with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression...

  15. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jacob

    2007-01-01

    public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. RESULTS: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which...

  16. Generation of cDNA expression libraries enriched for in-frame sequences

    OpenAIRE

    Davis, Claytus A.; Benzer, Seymour

    1997-01-01

    Bacterial cDNA expression libraries are made to reproduce protein sequences present in the mRNA source tissue. However, there is no control over which frame of the cDNA is translated, because translation of the cDNA must be initiated on vector sequence. In a library of nondirectionally cloned cDNAs, only some 8% of the protein sequences produced are expected to be correct. Directional cloning can increase this by a factor of two, but it does not solve the frame problem. We have therefore deve...

  17. Generation of longer 3' cDNA fragments from massively parallel signature sequencing tags.

    Science.gov (United States)

    Silva, Ana Paula M; Chen, Jianjun; Carraro, Dirce M; Wang, San Ming; Camargo, Anamaria A

    2004-07-06

    Massively Parallel Signature Sequencing (MPSS) is a powerful technique for genome-wide gene expression analysis, which, similar to SAGE, relies on the production of short tags proximal to the 3'end of transcripts. A single MPSS experiment can generate over 10(7) tags, providing a 10-fold coverage of the transcripts expressed in a human cell. A significant fraction of MPSS tags cannot be assigned to known transcripts (orphan tags) and are likely to be derived from transcripts expressed at very low levels (approximately 1 copy per cell). In order to explore the potential of MPSS for the characterization of the human transcriptome, we have adapted the GLGI protocol (Generation of Longer cDNA fragments from SAGE tags for Gene Identification) to convert MPSS tags into their corresponding 3' cDNA fragments. GLGI-MPSS was applied to 83 orphan tags and 41 cDNA fragments were obtained. The analysis of these 41 fragments allowed the identification of novel transcripts, alternative tags generated from polymorphic and alternatively spliced transcripts, as well as the detection of artefactual MPSS tags. A systematic large-scale analysis of the genome by MPSS, in combination with the use of GLGI-MPSS protocol, will certainly provide a complementary approach to generate the complete catalog of human transcripts.

  18. Generation of longer 3′ cDNA fragments from massively parallel signature sequencing tags

    Science.gov (United States)

    Silva, Ana Paula M.; Chen, Jianjun; Carraro, Dirce M.; Wang, San Ming; Camargo, Anamaria A.

    2004-01-01

    Massively Parallel Signature Sequencing (MPSS) is a powerful technique for genome-wide gene expression analysis, which, similar to SAGE, relies on the production of short tags proximal to the 3′end of transcripts. A single MPSS experiment can generate over 107 tags, providing a 10-fold coverage of the transcripts expressed in a human cell. A significant fraction of MPSS tags cannot be assigned to known transcripts (orphan tags) and are likely to be derived from transcripts expressed at very low levels (∼1 copy per cell). In order to explore the potential of MPSS for the characterization of the human transcriptome, we have adapted the GLGI protocol (Generation of Longer cDNA fragments from SAGE tags for Gene Identification) to convert MPSS tags into their corresponding 3′ cDNA fragments. GLGI-MPSS was applied to 83 orphan tags and 41 cDNA fragments were obtained. The analysis of these 41 fragments allowed the identification of novel transcripts, alternative tags generated from polymorphic and alternatively spliced transcripts, as well as the detection of artefactual MPSS tags. A systematic large-scale analysis of the genome by MPSS, in combination with the use of GLGI-MPSS protocol, will certainly provide a complementary approach to generate the complete catalog of human transcripts. PMID:15247327

  19. Generation and analysis of a large-scale expressed sequence Tag database from a full-length enriched cDNA library of developing leaves of Gossypium hirsutum L.

    Directory of Open Access Journals (Sweden)

    Min Lin

    Full Text Available BACKGROUND: Cotton (Gossypium hirsutum L. is one of the world's most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. METHODOLOGY/PRINCIPAL FINDINGS: In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR, which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves. CONCLUSIONS/SIGNIFICANCE: These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence

  20. Cloning, sequencing and expression of cDNA encoding growth ...

    Indian Academy of Sciences (India)

    Unknown

    cell embryo and the expression was monitored continuously. The expression shown here is in developing embryo and freshly hatched fish. The intensity of green colour indicate the strong expression of EGFP in all the tissues of the embryo/fry. The expression of EGPF indicates the co-expression of catfish GH cDNA and the ...

  1. 5'-end sequences of budding yeast full-length cDNA clones and quality scores - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Budding yeast cDNA sequencing project 5'-end sequences of budding yeast full-length cDNA clones and quality... scores Data detail Data name 5'-end sequences of budding yeast full-length cDNA clones and quality...or-capping method, the sequence quality score generated by the Phred software, and links to SGD, dbEST and U...es. FASTA format. Quality Phred's quality score About This Database Database Desc...g yeast full-length cDNA clones and quality scores - Budding yeast cDNA sequencing project | LSDB Archive ...

  2. Cloning and sequencing of complete τ-crystallin cDNA from ...

    Indian Academy of Sciences (India)

    Unknown

    length τ-crystallin cDNA from crocodilian lens and α-enolase from other tissues. ... human (Acc. No. NM_001428). The sequences were used to construct a phylogenetic tree depicting gene lineage, using the clustering program DNAML.

  3. Analysis of 4,664 high-quality sequence-finished poplar full-length cDNA clones and their utility for the discovery of genes responding to insect feeding

    Directory of Open Access Journals (Sweden)

    Douglas Carl J

    2008-01-01

    Full Text Available Abstract Background The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar has been established as a model system for genomics studies of growth, development, and adaptation of woody perennial plants including secondary xylem formation, dormancy, adaptation to local environments, and biotic interactions. Results As part of the poplar genome sequencing project and the development of genomic resources for poplar, we have generated a full-length (FL-cDNA collection using the biotinylated CAP trapper method. We constructed four FLcDNA libraries using RNA from xylem, phloem and cambium, and green shoot tips and leaves from the P. trichocarpa Nisqually-1 genotype, as well as insect-attacked leaves of the P. trichocarpa × P. deltoides hybrid. Following careful selection of candidate cDNA clones, we used a combined strategy of paired end reads and primer walking to generate a set of 4,664 high-accuracy, sequence-verified FLcDNAs, which clustered into 3,990 putative unique genes. Mapping FLcDNAs to the poplar genome sequence combined with BLAST comparisons to previously predicted protein coding sequences in the poplar genome identified 39 FLcDNAs that likely localize to gaps in the current genome sequence assembly. Another 173 FLcDNAs mapped to the genome sequence but were not included among the previously predicted genes in the poplar genome. Comparative sequence analysis against Arabidopsis thaliana and other species in the non-redundant database of GenBank revealed that 11.5% of the poplar FLcDNAs display no significant sequence similarity to other plant proteins. By mapping the poplar FLcDNAs against transcriptome data previously obtained with a 15.5 K cDNA microarray, we identified 153 FLcDNA clones

  4. Cloning and sequencing of dolphinfish (Coryphaena hippurus, Coryphaenidae) growth hormone-encoding cDNA.

    Science.gov (United States)

    Peduel, A D; Elizur, A; Knibb, W

    1994-01-01

    The cDNA encoding the preprotein growth hormone from the dolphinfish (Coryphaena hippurus) has been cloned and sequenced. The cDNA was derived by reverse transcription of RNA from the pituitary of a young fish using the method known as Rapid Amplification of cDNA Ends (RACE). An oligonucleotide primer corresponding to the 5' region of Pagrus major and the universal RACE primer enabled amplification using the Polymerase Chain Reaction (PCR). The dolphinfish and yellow-tail, Seriola quineqeradiata, are both members of the sub-order Percoidei (Perciforme) and their GH sequences show a high level of homology.

  5. De-novo transcriptome sequencing of a normalized cDNA pool from influenza infected ferrets.

    Directory of Open Access Journals (Sweden)

    Jeremy V Camp

    Full Text Available The ferret is commonly used as a model for studies of infectious diseases. The genomic sequence of this animal model is not yet characterized, and only a limited number of fully annotated cDNAs are currently available in GenBank. The majority of genes involved in innate or adaptive immune response are still lacking, restricting molecular genetic analysis of host response in the ferret model. To enable de novo identification of transcriptionally active ferret genes in response to infection, we performed de-novo transcriptome sequencing of animals infected with H1N1 A/California/07/2009. We also included splenocytes induced with bacterial lipopolysaccharide to allow for identification of transcripts specifically induced by gram-negative bacteria. We pooled and normalized the cDNA library in order to delimit the risk of sequencing only highly expressed genes. While normalization of the cDNA library removes the possibility of assessing expression changes between individual animals, it has been shown to increase identification of low abundant transcripts. In this study, we identified more than 19,000 partial ferret transcripts, including more than 1000 gene orthologs known to be involved in the innate and the adaptive immune response.

  6. Generation and analysis of a large-scale expressed sequence tags from a full-length enriched cDNA library of Siberian tiger (Panthera tigris altaica).

    Science.gov (United States)

    Guo, Yu; Liu, Changqing; Lu, Taofeng; Liu, Dan; Bai, Chunyu; Li, Xiangchen; Ma, Yuehui; Guan, Weijun

    2014-05-15

    In this study, a full-length enriched cDNA library was successfully constructed from Siberian tiger, the world's most endangered species. The titers of primary and amplified libraries were 1.28×10(6)pfu/mL and 1.59×10(10)pfu/mL respectively. The proportion of recombinants from unamplified library was 91.3% and the average length of exogenous inserts was 1.06kb. A total of 279 individual ESTs with sizes ranging from 316 to 1258bps were then analyzed. Furthermore, 204 unigenes were successfully annotated and involved in 49 functions of the GO classification, cell (175, 85.5%), cellular process (165, 80.9%), and binding (152, 74.5%) are the dominant terms. 198 unigenes were assigned to 156 KEGG pathways, and the pathways with the most representation are metabolic pathways (18, 9.1%). The proportion pattern of each COG subcategory was similar among Panthera tigris altaica, P. tigris tigris and Homo sapiens, and general function prediction only cluster (44, 15.8%) represents the largest group, followed by translation, ribosomal structure and biogenesis (33, 11.8%), replication, recombination and repair (24, 8.6%), and only 7.2% ESTs classified as novel genes. Moreover, the recombinant plasmid pET32a-TAT-COL6A2 was constructed, coded for the Trx-TAT-COL6A2 fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-COL6A2 recombinant protein was 2.64±0.18mg/mL. This library will provide a useful platform for the functional genome and transcriptome research of for the P. tigris and other felid animals in the future. Copyright © 2014 Elsevier B.V. All rights reserved.

  7. Cloning of human purine-nucleoside phosphorylase cDNA sequences by complementation in Escherichia coli.

    OpenAIRE

    Goddard, J M; Caput, D; Williams, S R; Martin, D W

    1983-01-01

    We have obtained cDNA clones that contain the entire coding region of the human purine-nucleoside phosphorylase (PNP; EC 2.4.2.1) mRNA. The cDNA sequences were generated by reverse transcription of PNP-enriched mRNA obtained by immunoadsorption of HeLa cell polyribosomes with monospecific antibody to human PNP. cDNA molecules that were close in length to PNP mRNA were separated by agarose gel electrophoresis and inserted into the Pst I site of the plasmid pBR322. Plasmid DNA from the pooled c...

  8. Generation of cDNA expression libraries enriched for in-frame sequences.

    Science.gov (United States)

    Davis, C A; Benzer, S

    1997-03-18

    Bacterial cDNA expression libraries are made to reproduce protein sequences present in the mRNA source tissue. However, there is no control over which frame of the cDNA is translated, because translation of the cDNA must be initiated on vector sequence. In a library of nondirectionally cloned cDNAs, only some 8% of the protein sequences produced are expected to be correct. Directional cloning can increase this by a factor of two, but it does not solve the frame problem. We have therefore developed and tested a library construction methodology using a novel vector, pKE-1, with which translation in the correct reading frame confers kanamycin resistance on the host. Following kanamycin selection, the cDNA libraries contained 60-80% open, in-frame clones. These, compared with unselected libraries, showed a 10-fold increase in the number of matches between the cDNA-encoded proteins made by the bacteria and database protein sequences. cDNA sequencing programs will benefit from the enrichment for correct coding sequences, and screening methods requiring protein expression will benefit from the enrichment for authentic translation products.

  9. Sequence of a cDNA encoding turtle high mobility group 1 protein.

    Science.gov (United States)

    Zheng, Jifang; Hu, Bi; Wu, Duansheng

    2005-07-01

    In order to understand sequence information about turtle HMG1 gene, a cDNA encoding HMG1 protein of the Chinese soft-shell turtle (Pelodiscus sinensis) was amplified by RT-PCR from kidney total RNA, and was cloned, sequenced and analyzed. The results revealed that the open reading frame (ORF) of turtle HMG1 cDNA is 606 bp long. The ORF codifies 202 amino acid residues, from which two DNA-binding domains and one polyacidic region are derived. The DNA-binding domains share higher amino acid identity with homologues sequences of chicken (96.5%) and mammalian (74%) than homologues sequence of rainbow trout (67%). The polyacidic region shows 84.6% amino acid homology with the equivalent region of chicken HMG1 cDNA. Turtle HMG1 protein contains 3 Cys residues located at completely conserved positions. Conservation in sequence and structure suggests that the functions of turtle HMG1 cDNA may be highly conserved during evolution. To our knowledge, this is the first report of HMG1 cDNA sequence in any reptilian.

  10. Construction and analysis of full-length and normalized cDNA libraries from citrus.

    Science.gov (United States)

    Marques, M Carmen; Perez-Amador, Miguel A

    2012-01-01

    We have developed an integrated method to generate a normalized cDNA collection enriched in full-length and rare transcripts from citrus, using different species and multiple tissues and developmental stages. Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. In this regard, the availability of full-length cDNA clones facilitates functional analysis of the corresponding genes enabling manipulation of their expression and the generation of a variety of tagged versions of the native protein. The development of full-length cDNA sequences has the power to improve the quality of genome annotation, as well as provide tools for functional characterization of genes.

  11. Cloning, sequencing, and expression of cDNA for human β-glucuronidase

    International Nuclear Information System (INIS)

    Oshima, A.; Kyle, J.W.; Miller, R.D.

    1987-01-01

    The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH 2 -terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length

  12. In-depth cDNA library sequencing provides quantitative gene expression profiling in cancer biomarker discovery.

    Science.gov (United States)

    Yang, Wanling; Ying, Dingge; Lau, Yu-Lung

    2009-06-01

    Quantitative gene expression analysis plays an important role in identifying differentially expressed genes in various pathological states, gene expression regulation and co-regulation, shedding light on gene functions. Although microarray is widely used as a powerful tool in this regard, it is suboptimal quantitatively and unable to detect unknown gene variants. Here we demonstrated effective detection of differential expression and co-regulation of certain genes by expressed sequence tag analysis using a selected subset of cDNA libraries. We discussed the issues of sequencing depth and library preparation, and propose that increased sequencing depth and improved preparation procedures may allow detection of many expression features for less abundant gene variants. With the reduction of sequencing cost and the emerging of new generation sequencing technology, in-depth sequencing of cDNA pools or libraries may represent a better and powerful tool in gene expression profiling and cancer biomarker detection. We also propose using sequence-specific subtraction to remove hundreds of the most abundant housekeeping genes to increase sequencing depth without affecting relative expression ratio of other genes, as transcripts from as few as 300 most abundantly expressed genes constitute about 20% of the total transcriptome. In-depth sequencing also represents a unique advantage of detecting unknown forms of transcripts, such as alternative splicing variants, fusion genes, and regulatory RNAs, as well as detecting mutations and polymorphisms that may play important roles in disease pathogenesis.

  13. New Approaches to Attenuated Hepatitis a Vaccine Development: Cloning and Sequencing of Cell-Culture Adapted Viral cDNA.

    Science.gov (United States)

    1987-10-13

    Insert frag- ments from p16 cDNA clones were subcloned into the phage vector Ml3mp8 or Ml3mpl9 qnd subjected to rapid sequencing using the...2), and selected cDNA insert fragments were subcloned into M13 vectors for sequencing. The sequence of the complete genome was determined, with over

  14. cDNA, genomic sequence cloning and overexpression of ribosomal ...

    African Journals Online (AJOL)

    PRECIOUS

    2009-11-02

    Nov 2, 2009 ... Rattus norvegicus are 93.1, 92.5, 92.2, 91.1, 90.6 and 90.0% respectively. The amino acid sequence encoded by RPS20 gene of the Giant Panda shared a high homology (100%) with those of H. sapiens,. Mac. fascicularis, Mus musculus, B. taurus and R. norvegicus, except for P. abelii (99.88%). Primary.

  15. Nucleotide sequence and infectious cDNA clone of the L1 isolate of Pea seed-borne mosaic potyvirus.

    Science.gov (United States)

    Olsen, B S; Johansen, I E

    2001-01-01

    The complete nucleotide sequence of Pea seed-borne mosaic potyvirus isolate L1 has been determined from cloned virus cDNA. The PSbMV L1 genome is 9895 nucleotides in length excluding the poly(A) tail. Computer analysis of the sequence revealed a single long open reading frame (ORF) of 9594 nucleotides. The ORF potentially encodes a polyprotein of 3198 amino acids with a deduced Mr of 363537. Nine putative proteolytic cleavage sites were identified by analogy to consensus sequences and genome arrangement in other potyviruses. Two full-length cDNA clones, p35S-L1-4 and p35S-L1-5, were assembled under control of an enhanced 35S promoter and nopaline synthase terminator. Clone p35S-L1-4 was constructed with four introns and p35S-L1-5 with five introns inserted in the cDNA. Clone p35S-L1-4 was unstable in Escherichia coli often resulting in amplification of plasmids with deletions. Clone p35S-L1-5 was stable and apparently less toxic to Escherichia coli resulting in larger bacterial colonies and higher plasmid yield. Both clones were infectious upon mechanical inoculation of plasmid DNA on susceptible pea cultivars Fjord, Scout, and Brutus. Eight pea genotypes resistant to L1 virus were also resistant to the cDNA derived L1 virus. Both native PSbMV L1 and the cDNA derived virus infected Chenopodium quinoa systemically giving rise to characteristic necrotic lesions on uninoculated leaves.

  16. Cloning and sequencing of complete τ-crystallin cDNA from ...

    Indian Academy of Sciences (India)

    Unknown

    brain, heart and gonad, suggesting both to be the product of the same gene. The study thus provides the first report on cDNA sequence of τ-crystallin from a reptilian species and also re-confirms it to be an example of the phenomenon of gene sharing as was demonstrated earlier in the case of peking duck. Moreover, the ...

  17. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250...

  18. cDNA sequence of the long mRNA for human glutamine synthase

    NARCIS (Netherlands)

    van den Hoff, M. J.; Geerts, W. J.; Das, A. T.; Moorman, A. F.; Lamers, W. H.

    1991-01-01

    Screening a human liver cDNA library in lambda ZAP revealed several clones for the mRNA of glutamine synthase. The longest clone was completely sequenced and consists of a 109 bp 5' untranslated region, a 1119 bp protein coding region, a 1498 bp 3' untranslated region and a poly(A) tract of 12 bp

  19. Detection of reverse transcriptase termination sites using cDNA ligation and massive parallel sequencing

    DEFF Research Database (Denmark)

    Kielpinski, Lukasz J; Boyd, Mette; Sandelin, Albin

    2013-01-01

    of these methods can be increased by applying massive parallel sequencing technologies.Here, we describe a versatile method for detection of reverse transcriptase termination sites based on ligation of an adapter to the 3' end of cDNA with bacteriophage TS2126 RNA ligase (CircLigase™). In the following PCR...

  20. Primary structure of a lipoxygenase from barley grain as deduced from its cDNA sequence

    NARCIS (Netherlands)

    Mechelen, J.R. van; Smits, M.; Douma, A.C.; Rouster, J.; Cameron-Mills, V.; Heidekamp, F.; Valk, B.E.

    1995-01-01

    A full length cDNA sequence for a barley grain lipoxygenase was obtained. It includes a 5' untranslated region of 69 nucleotides, an open reading frame of 2586 nucleotides encoding a protein of 862 amino acid residues and a 3' untranslated region of 142 nucleotides. The molecular mass of the encoded

  1. cDNA cloning, structural analysis, SNP detection and tissue ...

    Indian Academy of Sciences (India)

    THOMAS NAICY

    [Naicy T., Venkatachalapathy T., Aravindakshan T., Raghavan K. C., Mini M. and Shyama K. 2017 cDNA cloning, structural analysis, SNP detection and tissue expression profile of the IGF1 gene in Malabari and Attappady Black goats of India. J. Genet. 96, xx–xx]. Introduction. Insulin-like growth factor 1 (IGF1), an important ...

  2. Digital analysis of cDNA abundance; expression profiling by means of restriction fragment fingerprinting

    Directory of Open Access Journals (Sweden)

    Regenbogen Johannes

    2002-03-01

    Full Text Available Abstract Background Gene expression profiling among different tissues is of paramount interest in various areas of biomedical research. We have developed a novel method (DADA, Digital Analysis of cDNA Abundance, that calculates the relative abundance of genes in cDNA libraries. Results DADA is based upon multiple restriction fragment length analysis of pools of clones from cDNA libraries and the identification of gene-specific restriction fingerprints in the resulting complex fragment mixtures. A specific cDNA cloning vector had to be constructed that governed missing or incomplete cDNA inserts which would generate misleading fingerprints in standard cloning vectors. Double stranded cDNA was synthesized using an anchored oligo dT primer, uni-directionally inserted into the DADA vector and cDNA libraries were constructed in E. coli. The cDNA fingerprints were generated in a PCR-free procedure that allows for parallel plasmid preparation, labeling, restriction digest and fragment separation of pools of 96 colonies each. This multiplexing significantly enhanced the throughput in comparison to sequence-based methods (e.g. EST approach. The data of the fragment mixtures were integrated into a relational database system and queried with fingerprints experimentally produced by analyzing single colonies. Due to limited predictability of the position of DNA fragments on the polyacrylamid gels of a given size, fingerprints derived solely from cDNA sequences were not accurate enough to be used for the analysis. We applied DADA to the analysis of gene expression profiles in a model for impaired wound healing (treatment of mice with dexamethasone. Conclusions The method proved to be capable of identifying pharmacologically relevant target genes that had not been identified by other standard methods routinely used to find differentially expressed genes. Due to the above mentioned limited predictability of the fingerprints, the method was yet tested only with

  3. cDNA Library Enrichment of Full Length Transcripts for SMRT Long Read Sequencing.

    Science.gov (United States)

    Cartolano, Maria; Huettel, Bruno; Hartwig, Benjamin; Reinhardt, Richard; Schneeberger, Korbinian

    2016-01-01

    The utility of genome assemblies does not only rely on the quality of the assembled genome sequence, but also on the quality of the gene annotations. The Pacific Biosciences Iso-Seq technology is a powerful support for accurate eukaryotic gene model annotation as it allows for direct readout of full-length cDNA sequences without the need for noisy short read-based transcript assembly. We propose the implementation of the TeloPrime Full Length cDNA Amplification kit to the Pacific Biosciences Iso-Seq technology in order to enrich for genuine full-length transcripts in the cDNA libraries. We provide evidence that TeloPrime outperforms the commonly used SMARTer PCR cDNA Synthesis Kit in identifying transcription start and end sites in Arabidopsis thaliana. Furthermore, we show that TeloPrime-based Pacific Biosciences Iso-Seq can be successfully applied to the polyploid genome of bread wheat (Triticum aestivum) not only to efficiently annotate gene models, but also to identify novel transcription sites, gene homeologs, splicing isoforms and previously unidentified gene loci.

  4. Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

    KAUST Repository

    Sugumar, Thennarasu

    2011-12-12

    Full-length cDNA (435 bp) of the interleukin-3(IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine IL-3. There are 10 amino acid substitutions in buffalo compared with that of bovine. The amino acid sequence of buffalo IL-3 also showed very high identity with that of other ruminants, indicating functional cross-reactivity. Structural homology modelling of buffalo IL-3 protein with human IL-3 showed the presence of five helical structures.

  5. Characterization of full-length sequenced cDNA inserts (FLIcs from Atlantic salmon (Salmo salar

    Directory of Open Access Journals (Sweden)

    Lunner Sigbjørn

    2009-10-01

    Full Text Available Abstract Background Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP, the number of sequences where the full-length of the cDNA insert has been determined has been small. Results High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91% of the transcripts were annotated using Gene Ontology (GO terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS. The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS. This

  6. Construction and EST sequencing of full-length, drought stress cDNA libraries for common beans (Phaseolus vulgaris L.).

    Science.gov (United States)

    Blair, Matthew W; Fernandez, Andrea C; Ishitani, Manabu; Moreta, Danilo; Seki, Motoaki; Ayling, Sarah; Shinozaki, Kazuo

    2011-11-25

    Common bean is an important legume crop with only a moderate number of short expressed sequence tags (ESTs) made with traditional methods. The goal of this research was to use full-length cDNA technology to develop ESTs that would overlap with the beginning of open reading frames and therefore be useful for gene annotation of genomic sequences. The library was also constructed to represent genes expressed under drought, low soil phosphorus and high soil aluminum toxicity. We also undertook comparisons of the full-length cDNA library to two previous non-full clone EST sets for common bean. Two full-length cDNA libraries were constructed: one for the drought tolerant Mesoamerican genotype BAT477 and the other one for the acid-soil tolerant Andean genotype G19833 which has been selected for genome sequencing. Plants were grown in three soil types using deep rooting cylinders subjected to drought and non-drought stress and tissues were collected from both roots and above ground parts. A total of 20,000 clones were selected robotically, half from each library. Then, nearly 10,000 clones from the G19833 library were sequenced with an average read length of 850 nucleotides. A total of 4,219 unigenes were identified consisting of 2,981 contigs and 1,238 singletons. These were functionally annotated with gene ontology terms and placed into KEGG pathways. Compared to other EST sequencing efforts in common bean, about half of the sequences were novel or represented the 5' ends of known genes. The present full-length cDNA libraries add to the technological toolbox available for common bean and our sequencing of these clones substantially increases the number of unique EST sequences available for the common bean genome. All of this should be useful for both functional gene annotation, analysis of splice site variants and intron/exon boundary determination by comparison to soybean genes or with common bean whole-genome sequences. In addition the library has a large number of

  7. Construction and EST sequencing of full-length, drought stress cDNA libraries for common beans (Phaseolus vulgaris L.

    Directory of Open Access Journals (Sweden)

    Blair Matthew W

    2011-11-01

    Full Text Available Abstract Background Common bean is an important legume crop with only a moderate number of short expressed sequence tags (ESTs made with traditional methods. The goal of this research was to use full-length cDNA technology to develop ESTs that would overlap with the beginning of open reading frames and therefore be useful for gene annotation of genomic sequences. The library was also constructed to represent genes expressed under drought, low soil phosphorus and high soil aluminum toxicity. We also undertook comparisons of the full-length cDNA library to two previous non-full clone EST sets for common bean. Results Two full-length cDNA libraries were constructed: one for the drought tolerant Mesoamerican genotype BAT477 and the other one for the acid-soil tolerant Andean genotype G19833 which has been selected for genome sequencing. Plants were grown in three soil types using deep rooting cylinders subjected to drought and non-drought stress and tissues were collected from both roots and above ground parts. A total of 20,000 clones were selected robotically, half from each library. Then, nearly 10,000 clones from the G19833 library were sequenced with an average read length of 850 nucleotides. A total of 4,219 unigenes were identified consisting of 2,981 contigs and 1,238 singletons. These were functionally annotated with gene ontology terms and placed into KEGG pathways. Compared to other EST sequencing efforts in common bean, about half of the sequences were novel or represented the 5' ends of known genes. Conclusions The present full-length cDNA libraries add to the technological toolbox available for common bean and our sequencing of these clones substantially increases the number of unique EST sequences available for the common bean genome. All of this should be useful for both functional gene annotation, analysis of splice site variants and intron/exon boundary determination by comparison to soybean genes or with common bean whole

  8. cDNA sequences reveal considerable gene prediction inaccuracy in the Plasmodium falciparum genome

    Directory of Open Access Journals (Sweden)

    Valenzuela Jesus G

    2007-07-01

    Full Text Available Abstract Background The completion of the Plasmodium falciparum genome represents a milestone in malaria research. The genome sequence allows for the development of genome-wide approaches such as microarray and proteomics that will greatly facilitate our understanding of the parasite biology and accelerate new drug and vaccine development. Designing and application of these genome-wide assays, however, requires accurate information on gene prediction and genome annotation. Unfortunately, the genes in the parasite genome databases were mostly identified using computer software that could make some erroneous predictions. Results We aimed to obtain cDNA sequences to examine the accuracy of gene prediction in silico. We constructed cDNA libraries from mixed blood stages of P. falciparum parasite using the SMART cDNA library construction technique and generated 17332 high-quality expressed sequence tags (EST, including 2198 from primer-walking experiments. Assembly of our sequence tags produced 2548 contigs and 2671 singletons versus 5220 contigs and 5910 singletons when our EST were assembled with EST in public databases. Comparison of all the assembled EST/contigs with predicted CDS and genomic sequences in the PlasmoDB database identified 356 genes with predicted coding sequences fully covered by EST, including 85 genes (23.6% with introns incorrectly predicted. Careful automatic software and manual alignments found an additional 308 genes that have introns different from those predicted, with 152 new introns discovered and 182 introns with sizes or locations different from those predicted. Alternative spliced and antisense transcripts were also detected. Matching cDNA to predicted genes also revealed silent chromosomal regions, mostly at subtelomere regions. Conclusion Our data indicated that approximately 24% of the genes in the current databases were predicted incorrectly, although some of these inaccuracies could represent alternatively

  9. Generation of a large scale repertoire of Expressed Sequence Tags (ESTs from normalised rainbow trout cDNA libraries

    Directory of Open Access Journals (Sweden)

    Guiguen Yann

    2006-08-01

    Full Text Available Abstract Background Within the framework of a genomics project on livestock species (AGENAE, we initiated a high-throughput DNA sequencing program of Expressed Sequence Tags (ESTs in rainbow trout, Oncorhynchus mykiss. Results We constructed three cDNA libraries including one highly complex pooled-tissue library. These libraries were normalized and subtracted to reduce clone redundancy. ESTs sequences were produced, and 96 472 ESTs corresponding to high quality sequence reads were released on the international database, currently representing 42.5% of the overall sequence knowledge in this species. All these EST sequences and other publicly available ESTs in rainbow trout have been included on a publicly available Website (SIGENAE and have been clustered into a total of 52 930 clusters of putative transcripts groups, including 24 616 singletons. 57.1% of these 52 930 clusters are represented by at least one Agenae EST and 14 343 clusters (27.1% are only composed by Agenae ESTs. Sequence analysis also reveals that normalization and especially subtraction were effective in decreasing redundancy, and that the pooled-tissue library was representative of the initial tissue complexity. Conclusion Due to present work on the construction of rainbow trout normalized cDNA libraries and their extensive sequencing, along with other large scale sequencing programs, rainbow trout is now one of the major fish models in term of EST sequences available in a public database, just after Zebrafish, Danio rerio. This information is now used for the selection of a non redundant set of clones for producing DNA micro-arrays in order to examine global gene expression.

  10. cDNA sequences of two inducible T-cell genes

    Energy Technology Data Exchange (ETDEWEB)

    Kwon, B.S. (Indiana Univ. School of Medicine, Indianapolis (USA) Guthrie Research Institute, Sayre, PA (USA)); Weissman, S.M. (Yale Univ., New Haven, CT (USA))

    1989-03-01

    The authors have previously described a set of human T-lymphocyte-specific cDNA clones isolated by a modified differential screening procedure. Apparent full-length cDNAs containing the sequences of 14 of the 16 initial isolates were sequenced and were found to represent five different species of mRNA; three of the five species were identical to previously reported cDNA sequences of preproenkephalin, T-cell-replacing factor, and a serine esterase, respectively. The other two species, 4-1BB and L2G25B, were inducible sequences found in mRNA from both a cytolytic T-lymphocyte and a helper T-lymphocyte clone and were not previously described in T-cell mRNA; these mRNA sequences encode peptides of 256 and 92 amino acids, respectively. Both peptides contain putative leader sequences. The protein encoded by 4-1BB also has a potential membrane anchor segment and other features also seen in known receptor proteins.

  11. Construction of small RNA cDNA libraries for deep sequencing.

    Science.gov (United States)

    Lu, Cheng; Meyers, Blake C; Green, Pamela J

    2007-10-01

    Small RNAs (21-24 nucleotides) including microRNAs (miRNAs) and small interfering RNAs (siRNAs) are potent regulators of gene expression in both plants and animals. Several hundred genes encoding miRNAs and thousands of siRNAs have been experimentally identified by cloning approaches. New sequencing technologies facilitate the identification of these molecules and provide global quantitative expression data in a given biological sample. Here, we describe the methods used in our laboratory to construct small RNA cDNA libraries for high-throughput sequencing using technologies such as MPSS, 454 or SBS.

  12. Construction of small RNA cDNA libraries for high-throughput sequencing.

    Science.gov (United States)

    Lu, Cheng; Shedge, Vikas

    2011-01-01

    Small RNAs (smRNAs) play an essential role in virtually every aspect of growth and development, by regulating gene expression at the post-transcriptional and/or transcriptional level. New high-throughput sequencing technology allows for a comprehensive coverage of smRNAs in any given biological sample, and has been widely used for profiling smRNA populations in various developmental stages, tissue and cell types, or normal and disease states. In this article, we describe the method used in our laboratory to construct smRNA cDNA libraries for high-throughput sequencing.

  13. cDNA encoding a polypeptide including a hev ein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    2000-07-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  14. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    Energy Technology Data Exchange (ETDEWEB)

    Ahn, T.G.; Cohn, D.V.; Gorr, S.U.; Ornstein, D.L.; Kashdan, M.A.; Levine, M.A.

    1987-07-01

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary lambdagt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysognes demonstrated specific antibody binding to an SP4B/..beta..-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lystates with SP-I antiserum yielded parallel displacement curves of /sup 125/I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene.

  15. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    International Nuclear Information System (INIS)

    Ahn, T.G.; Cohn, D.V.; Gorr, S.U.; Ornstein, D.L.; Kashdan, M.A.; Levine, M.A.

    1987-01-01

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary λgt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysognes demonstrated specific antibody binding to an SP4B/β-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lystates with SP-I antiserum yielded parallel displacement curves of 125 I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene

  16. Identification and complete sequencing of novel human transcripts through the use of mouse orthologs and testis cDNA sequences

    DEFF Research Database (Denmark)

    Ferreira, Elisa N; Pires, Lilian C; Parmigiani, Raphael B

    2004-01-01

    The correct identification of all human genes, and their derived transcripts, has not yet been achieved, and it remains one of the major aims of the worldwide genomics community. Computational programs suggest the existence of 30,000 to 40,000 human genes. However, definitive gene identification...... can only be achieved by experimental approaches. We used two distinct methodologies, one based on the alignment of mouse orthologous sequences to the human genome, and another based on the construction of a high-quality human testis cDNA library, in an attempt to identify new human transcripts within...... the human genome sequence. We generated 47 complete human transcript sequences, comprising 27 unannotated and 20 annotated sequences. Eight of these transcripts are variants of previously known genes. These transcripts were characterized according to size, number of exons, and chromosomal localization...

  17. Molecular cloning and nucleotide sequence of full-length cDNA for sweet potato catalase mRNA.

    Science.gov (United States)

    Sakajo, S; Nakamura, K; Asahi, T

    1987-06-01

    A nearly full-length cDNA clone for catalase (pCAS01) was obtained through immunological screening of cDNA expression library constructed from size-fractionated poly(A)-rich RNA of wounded sweet potato tuberous roots by Escherichia coli expression vector-primed cDNA synthesis. Two additional catalase cDNA clones (pCAS10 and pCAS13), which contained cDNA inserts slightly longer than that of pCAS01 at their 5'-termini, were identified by colony hybridization of another cDNA library. Those three catalase cDNAs contained primary structures not identical, but closely related, to one another based on their restriction enzyme and RNase cleavage mapping analyses, suggesting that microheterogeneity exists in catalase mRNAs. The cDNA insert of pCAS13 carried the entire catalase coding capacity, since the RNA transcribed in vitro from the cDNA under the SP6 phage promoter directed the synthesis of a catalase polypeptide in the wheat germ in vitro translation assay. The nucleotide sequencing of these catalase cDNAs indicated that 1900-base catalase mRNA contained a coding region of 1476 bases. The amino acid sequence of sweet potato catalase deduced from the nucleotide sequence was 35 amino acids shorter than rat liver catalase [Furuta, S., Hayashi, H., Hijikata, M., Miyazawa, S., Osumi, T. & Hashimoto, T. (1986) Proc. Natl Acad. Sci. USA 83, 313-317]. Although these two sequences showed only 38% homology, the sequences around the amino acid residues implicated in catalytic function, heme ligand or heme contact had been well conserved during evolution.

  18. [Cloning, sequencing and subcloning of cDNA coding for group I allergen of Dermatophagoides farinae].

    Science.gov (United States)

    Yang, Qing-gui; Li, Chao-pin

    2004-06-01

    To clone, sequence and subclone the cDNA coding for group 1 allergen of Dermatophagoides farinae (Der f 1). The cDNA of Der f 1 was amplified by RT-PCR and PCR. After purified, the gene fragment was cloned into a vector pMD-18T. The recombinant plasmid pMD-18T-Der f 1 was transformed into E. coli JM109. Positive clones were screened and identified by PCR and digestion with restriction enzyme. The sequence of inserted Der f 1 gene fragment was also detected. Der f 1 was then subcloned into the vector of pET-32a(+). The Der f 1 gene fragment of Dermatophagoides farinae was specifically amplified from RNA by RT-PCR and PCR. The recombinant plasmid pMD-18T-Der f 1 and pET-32a(+)-Der f 1 was constructed and digested by Bam H I and Sac I, the size of gene fragment was 646 bp and in accordance with the expected one. The pET-32a(+)-Der f 1 subcloning has been constructed successfully.

  19. Nucleotide sequence of cloned cDNA for human sphingolipid activator protein 1 precursor

    International Nuclear Information System (INIS)

    Dewji, N.N.; Wenger, D.A.; O'Brien, J.S.

    1987-01-01

    Two cDNA clones encoding prepro-sphingolipid activator protein 1 (SAP-1) were isolated from a λ gt11 human hepatoma expression library using polyclonal antibodies. These had inserts of ≅ 2 kilobases (λ-S-1.2 and λ-S-1.3) and both were both homologous with a previously isolated clone (λ-S-1.1) for mature SAP-1. The authors report here the nucleotide sequence of the longer two EcoRI fragments of S-1.2 and S-1.3 that were not the same and the derived amino acid sequences of mature SAP-1 and its prepro form. The open reading frame encodes 19 amino acids, which are colinear with the amino-terminal sequence of mature SAP-1, and extends far beyond the predicted carboxyl terminus of mature SAP-1, indicating extensive carboxyl-terminal processing. The nucleotide sequence of cDNA encoding prepro-SAP-1 includes 1449 bases from the assigned initiation codon ATG at base-pair 472 to the stop codon TGA at base-pair 1921. The first 23 amino acids coded after the initiation ATG are characteristic of a signal peptide. The calculated molecular mass for a polypeptide encoded by 1449 bases is ≅ 53 kDa, in keeping with the reported value for pro-SAP-1. The data indicate that after removal of the signal peptide mature SAP-1 is generated by removing an additional 7 amino acids from the amino terminus and ≅ 373 amino acids from the carboxyl terminus. One potential glycosylation site was previously found in mature SAP-1. Three additional potential glycosylation sites are present in the processed carboxyl-terminal polypeptide, which they designate as P-2

  20. Construction and analysis of a cDNA library from yellow-fruit ginseng

    African Journals Online (AJOL)

    The total RNA was isolated from yellow-fruit ginseng (Panax ginseng C.A. Meyer) leaf tissue. A cDNA library of panax ginseng leaves was constructed by using pDNR-LIB vector according to the SMART cDNA library construction kit protocol. We obtained 378 high quality sequences (GenBank accession number: ...

  1. An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

    Science.gov (United States)

    Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

    2011-01-01

    cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

  2. Characterization of cDNA clones encoding rabbit and human serum paraoxonase: The mature protein retains its signal sequence

    Energy Technology Data Exchange (ETDEWEB)

    Hassett, C.; Richter, R.J.; Humbert, R.; Omiecinski, C.J.; Furlong, C.E. (Univ. of Washington, Seattle (United States)); Chapline, C.; Crabb, J.W. (W.Alton Jones Cell Science Center, Lake Placid, NY (United States))

    1991-10-22

    Serum paraoxonase hydrolyzes the toxic metabolites of a variety of organophosphorus insecticides. High serum paraoxonase levels appear to protect against the neurotoxic effects of organophosphorus substrates of this enzyme. The amino acid sequence accounting for 42% of rabbit paraoxonase was determined. From these data, two oligonucleotide probes were synthesized and used to screen a rabbit liver cDNA library. Human paraoxonase clones were isolated from a liver cDNA library by using the rabbit cDNA as a hybridization probe. Inserts from three of the longest clones were sequenced, and one full-length clone contained an open reading frame encoding 355 amino acids, four less than the rabbit paraoxonase protein. Amino-terminal sequences derived from purified rabbit and human paraoxonase proteins suggested that the signal sequence is retained, with the exception of the initiator methionine residue. Characterization of the rabbit and human paraoxonase cDNA clones confirms that the signal sequences are not processed, except for the N-terminal methionine residue. The rabbit and human cDNA clones demonstrate striking nucleotide and deduced amino acid similarities (greater than 85%), suggesting an important metabolic role and constraints on the evolution of this protein.

  3. Determination of cDNA and genomic DNA sequences of hevamine, a chitinase from the rubber tree Hevea brasiliensis

    NARCIS (Netherlands)

    Bokma, E; Spiering, M; Chow, KS; Mulder, PPMFA; Subroto, T; Beintema, JJ

    Hevamine is a chitinase from the rubber tree Hevea brasiliensis and belongs to the family 18 glycosyl hydrolases. This paper describes the cloning of hevamine DNA and cDNA sequences. Hevamine contains a signal peptide at the N-terminus and a putative vacuolar targeting sequence at the C-terminus

  4. Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project

    Directory of Open Access Journals (Sweden)

    Heinz Ruth A

    2003-09-01

    Full Text Available Abstract Background Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of small scale cDNA sequencing projects aimed to the specific identification of useful genes for breeding purposes. The objective of this work is to evaluate alternative strategies to high-throughput sequencing projects for the identification of novel genes differentially expressed in sunflower as a source of organ-specific genetic markers that can be functionally associated to important traits. Results Differential organ-specific ESTs were generated from leaf, stem, root and flower bud at two developmental stages (R1 and R4. The use of different sources of RNA as tester and driver cDNA for the construction of differential libraries was evaluated as a tool for detection of rare or low abundant transcripts. Organ-specificity ranged from 75 to 100% of non-redundant sequences in the different cDNA libraries. Sequence redundancy varied according to the target and driver cDNA used in each case. The R4 flower cDNA library was the less redundant library with 62% of unique sequences. Out of a total of 919 sequences that were edited and annotated, 318 were non-redundant sequences. Comparison against sequences in public databases showed that 60% of non-redundant sequences showed significant similarity to known sequences. The number of predicted novel genes varied among the different cDNA libraries, ranging from 56% in the R4 flower to 16 % in the R1 flower bud library. Comparison with sunflower ESTs on public databases showed that 197 of non-redundant sequences (60% did not exhibit significant similarity to previously reported sunflower ESTs. This approach helped to successfully isolate a significant number of new reported sequences

  5. Isolation and characterization of sequences homologous to the tobacco clone axi 1 (auxin independent) from a Vicia sativa nodule cDNA library

    NARCIS (Netherlands)

    Yalçin-Mendi, Y.; Çetiner, S.; Bisseling, T.

    2001-01-01

    In this research, partial nucleotide sequences of the axi 1 gene, which is related to auxin perception and transduction, isolated from Vicia sativa using cDNA library screening were investigated. Four V. sativa cDNA clones representing homologous of the tobacco axi 1 (auxin independent) cDNA clone

  6. Nucleotide sequence of Phaseolus vulgaris L. alcohol dehydrogenase encoding cDNA and three-dimensional structure prediction of the deduced protein.

    Science.gov (United States)

    Amelia, Kassim; Khor, Chin Yin; Shah, Farida Habib; Bhore, Subhash J

    2015-01-01

    Common beans (Phaseolus vulgaris L.) are widely consumed as a source of proteins and natural products. However, its yield needs to be increased. In line with the agenda of Phaseomics (an international consortium), work of expressed sequence tags (ESTs) generation from bean pods was initiated. Altogether, 5972 ESTs have been isolated. Alcohol dehydrogenase (AD) encoding gene cDNA was a noticeable transcript among the generated ESTs. This AD is an important enzyme; therefore, to understand more about it this study was undertaken. The objective of this study was to elucidate P. vulgaris L. AD (PvAD) gene cDNA sequence and to predict the three-dimensional (3D) structure of deduced protein. positive and negative strands of the PvAD cDNA clone were sequenced using M13 forward and M13 reverse primers to elucidate the nucleotide sequence. Deduced PvAD cDNA and protein sequence was analyzed for their basic features using online bioinformatics tools. Sequence comparison was carried out using bl2seq program, and tree-view program was used to construct a phylogenetic tree. The secondary structures and 3D structure of PvAD protein were predicted by using the PHYRE automatic fold recognition server. The sequencing results analysis showed that PvAD cDNA is 1294 bp in length. It's open reading frame encodes for a protein that contains 371 amino acids. Deduced protein sequence analysis showed the presence of putative substrate binding, catalytic Zn binding, and NAD binding sites. Results indicate that the predicted 3D structure of PvAD protein is analogous to the experimentally determined crystal structure of s-nitrosoglutathione reductase from an Arabidopsis species. The 1294 bp long PvAD cDNA encodes for 371 amino acid long protein that contains conserved domains required for biological functions of AD. The predicted deduced PvAD protein's 3D structure reflects the analogy with the crystal structure of Arabidopsis thaliana s-nitrosoglutathione reductase. Further study is required

  7. [Software development in data analysis and mining for cDNA microarray].

    Science.gov (United States)

    Wu, Bin; Wang, Jianguo; Wang, Miqu

    2007-12-01

    Data analysis and mining is a key issue to microarray technology and is usually implemented through software development. This paper summarizes the state-of-art software development in cDNA microarray data analysis and mining. The updated software developments are discussed in three stages: data inquisition from cDNA microarray tests, statistical treatment of cDNA data and data mining from gene network.

  8. The cDNA sequence of three hemocyanin subunits from the garden snail Helix lucorum.

    Science.gov (United States)

    De Smet, Lina; Dimitrov, Ivan; Debyser, Griet; Dolashka-Angelova, Pavlina; Dolashki, Aleksandar; Van Beeumen, Jozef; Devreese, Bart

    2011-11-10

    Hemocyanins are blue copper containing respiratory proteins residing in the hemolymph of many molluscs and arthropods. They can have different molecular masses and quaternary structures. Moreover, several molluscan hemocyanins are isolated with one, two or three isoforms occurring as decameric, didecameric, multidecameric or tubule aggregates. We could recently isolate three different hemocyanin isopolypeptides from the hemolymph of the garden snail Helix lucorum (HlH). These three structural subunits were named α(D)-HlH, α(N)-HlH and β-HlH. We have cloned and sequenced their cDNA which is the first result ever reported for three isoforms of a molluscan hemocyanin. Whereas the complete gene sequence of α(D)-HlH and β-HlH was obtained, including the 5' and 3' UTR, 180bp of the 5' end and around 900bp at the 3' end are missing for the third subunit. The subunits α(D)-HlH and β-HlH comprise a signal sequence of 19 amino acids plus a polypeptide of 3409 and 3414 amino acids, respectively. We could determine 3031 residues of the α(N)-HLH subunit. Sequence comparison with other molluscan hemocyanins shows that α(D)-HlH is more related to Aplysia californicum hemocyanin than to each of its own isopolypeptides. The structural subunits comprise 8 different functional units (FUs: a, b, c, d, e, f, g, h) and each functional unit possesses a highly conserved copper-A and copper-B site for reversible oxygen binding. Potential N-glycosylation sites are present in all three structural subunits. We confirmed that all three different isoforms are effectively produced and secreted in the hemolymph of H. lucorum by analyzing a tryptic digest of the purified native hemocyanin by MALDI-TOF and LC-FTICR mass spectrometry. Copyright © 2011 Elsevier B.V. All rights reserved.

  9. Pattern analysis approach reveals restriction enzyme cutting abnormalities and other cDNA library construction artifacts using raw EST data.

    Science.gov (United States)

    Zhou, Sun; Ji, Guoli; Liu, Xiaolin; Li, Pei; Moler, James; Karro, John E; Liang, Chun

    2012-05-03

    Expressed Sequence Tag (EST) sequences are widely used in applications such as genome annotation, gene discovery and gene expression studies. However, some of GenBank dbEST sequences have proven to be "unclean". Identification of cDNA termini/ends and their structures in raw ESTs not only facilitates data quality control and accurate delineation of transcription ends, but also furthers our understanding of the potential sources of data abnormalities/errors present in the wet-lab procedures for cDNA library construction. After analyzing a total of 309,976 raw Pinus taeda ESTs, we uncovered many distinct variations of cDNA termini, some of which prove to be good indicators of wet-lab artifacts, and characterized each raw EST by its cDNA terminus structure patterns. In contrast to the expected patterns, many ESTs displayed complex and/or abnormal patterns that represent potential wet-lab errors such as: a failure of one or both of the restriction enzymes to cut the plasmid vector; a failure of the restriction enzymes to cut the vector at the correct positions; the insertion of two cDNA inserts into a single vector; the insertion of multiple and/or concatenated adapters/linkers; the presence of 3'-end terminal structures in designated 5'-end sequences or vice versa; and so on. With a close examination of these artifacts, many problematic ESTs that have been deposited into public databases by conventional bioinformatics pipelines or tools could be cleaned or filtered by our methodology. We developed a software tool for Abnormality Filtering and Sequence Trimming for ESTs (AFST, http://code.google.com/p/afst/) using a pattern analysis approach. To compare AFST with other pipelines that submitted ESTs into dbEST, we reprocessed 230,783 Pinus taeda and 38,709 Arachis hypogaea GenBank ESTs. We found 7.4% of Pinus taeda and 29.2% of Arachis hypogaea GenBank ESTs are "unclean" or abnormal, all of which could be cleaned or filtered by AFST. cDNA terminal pattern analysis, as

  10. Pattern analysis approach reveals restriction enzyme cutting abnormalities and other cDNA library construction artifacts using raw EST data

    Directory of Open Access Journals (Sweden)

    Zhou Sun

    2012-05-01

    Full Text Available Abstract Background Expressed Sequence Tag (EST sequences are widely used in applications such as genome annotation, gene discovery and gene expression studies. However, some of GenBank dbEST sequences have proven to be “unclean”. Identification of cDNA termini/ends and their structures in raw ESTs not only facilitates data quality control and accurate delineation of transcription ends, but also furthers our understanding of the potential sources of data abnormalities/errors present in the wet-lab procedures for cDNA library construction. Results After analyzing a total of 309,976 raw Pinus taeda ESTs, we uncovered many distinct variations of cDNA termini, some of which prove to be good indicators of wet-lab artifacts, and characterized each raw EST by its cDNA terminus structure patterns. In contrast to the expected patterns, many ESTs displayed complex and/or abnormal patterns that represent potential wet-lab errors such as: a failure of one or both of the restriction enzymes to cut the plasmid vector; a failure of the restriction enzymes to cut the vector at the correct positions; the insertion of two cDNA inserts into a single vector; the insertion of multiple and/or concatenated adapters/linkers; the presence of 3′-end terminal structures in designated 5′-end sequences or vice versa; and so on. With a close examination of these artifacts, many problematic ESTs that have been deposited into public databases by conventional bioinformatics pipelines or tools could be cleaned or filtered by our methodology. We developed a software tool for Abnormality Filtering and Sequence Trimming for ESTs (AFST, http://code.google.com/p/afst/ using a pattern analysis approach. To compare AFST with other pipelines that submitted ESTs into dbEST, we reprocessed 230,783 Pinus taeda and 38,709 Arachis hypogaea GenBank ESTs. We found 7.4% of Pinus taeda and 29.2% of Arachis hypogaea GenBank ESTs are “unclean” or abnormal, all of which could be cleaned

  11. Cloning and Sequencing of Protein Kinase cDNA from Harbor Seal (Phoca vitulina Lymphocytes

    Directory of Open Access Journals (Sweden)

    Jennifer C. C. Neale

    2004-01-01

    Full Text Available Protein kinases (PKs play critical roles in signal transduction and activation of lymphocytes. The identification of PK genes provides a tool for understanding mechanisms of immunotoxic xenobiotics. As part of a larger study investigating persistent organic pollutants in the harbor seal and their possible immunomodulatory actions, we sequenced harbor seal cDNA fragments encoding PKs. The procedure, using degenerate primers based on conserved motifs of human protein tyrosine kinases (PTKs, successfully amplified nine phocid PK gene fragments with high homology to human and rodent orthologs. We identified eight PTKs and one dual (serine/threonine and tyrosine kinase. Among these were several PKs important in early signaling events through the B- and T-cell receptors (FYN, LYN, ITK and SYK and a MAP kinase involved in downstream signal transduction. V-FGR, RET and DDR2 were also expressed. Sequential activation of protein kinases ultimately induces gene transcription leading to the proliferation and differentiation of lymphocytes critical to adaptive immunity. PKs are potential targets of bioactive xenobiotics, including persistent organic pollutants of the marine environment; characterization of these molecules in the harbor seal provides a foundation for further research illuminating mechanisms of action of contaminants speculated to contribute to large-scale die-offs of marine mammals via immunosuppression.

  12. Cloning, sequencing and expression of a novel xylanase cDNA from ...

    African Journals Online (AJOL)

    A strain SH 2016, capable of producing xylanase, was isolated and identified as Aspergillus awamori, based on its physiological and biochemical characteristics as well as its ITS rDNA gene sequence analysis. A xylanase gene of 591 bp was cloned from this newly isolated A. awamori and the ORF sequence predicted a ...

  13. Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22

    International Nuclear Information System (INIS)

    Tsai-Pflugfelder, M.; Liu, L.F.; Liu, A.A.; Tewey, K.M.; Whang-Peng, J.; Knutsen, T.; Huebner, K.; Croce, C.M.; Wang, J.C.

    1988-01-01

    Two overlapping cDNA clones encoding human DNA topoisomerase II were identified by two independent methods. In one, a human cDNA library in phage λ was screened by hybridization with a mixed oligonucleotide probe encoding a stretch of seven amino acids found in yeast and Drosophila DNA topoisomerase II; in the other, a different human cDNA library in a λgt11 expression vector was screened for the expression of antigenic determinants that are recognized by rabbit antibodies specific to human DNA topoisomerase II. The entire coding sequences of the human DNA topoisomerase II gene were determined from these and several additional clones, identified through the use of the cloned human TOP2 gene sequences as probes. Hybridization between the cloned sequences and mRNA and genomic DNA indicates that the human enzyme is encoded by a single-copy gene. The location of the gene was mapped to chromosome 17q21-22 by in situ hybridization of a cloned fragment to metaphase chromosomes and by hybridization analysis with a panel of mouse-human hybrid cell lines, each retaining a subset of human chromosomes

  14. (+)-(10R)-Germacrene A synthase from goldenrod, Solidago canadensis; cDNA isolation, bacterial expression and functional analysis.

    Science.gov (United States)

    Prosser, Ian; Phillips, Andy L; Gittings, Simon; Lewis, Mervyn J; Hooper, Antony M; Pickett, John A; Beale, Michael H

    2002-08-01

    Profiling of sesquiterpene hydrocarbons in extracts of goldenrod, Solidago canadensis, by GC-MS revealed the presence of both enantiomers of germacrene D and lesser amounts of germacrene A, alpha-humulene, and beta-caryophyllene. A similarity-based cloning strategy using degenerate oligonucleotide primers, based on conserved amino acid sequences in known plant sesquiterpene synthases and RT-PCR, resulted in the isolation of a full length sesquiterpene synthase cDNA. Functional expression of the cDNA in E. coli, as an N-terminal thioredoxin fusion protein using the pET32b vector yielded an enzyme that was readily purified by nickel-chelate affinity chromatography. Chiral GC-MS analysis of products from of (3)H- and (2)H-labelled farnesyl diphosphate identified the enzyme as (+)-(10R)-germacrene A synthase. Sequence analysis and molecular modelling was used to compare this enzyme with the mechanistically related epi-aristolochene synthase from tobacco.

  15. Construction of full-length cDNA library and development of EST-derived simple sequence repeat (EST-SSR) markers in Senecio scandens.

    Science.gov (United States)

    Qian, Gang; Ping, Junjiao; Lu, Jian; Zhang, Zhen; Wang, Lei; Xu, Delin

    2014-12-01

    Senecio scandens Buch.-Ham. ex D. Don (Compositae) is a crucial source of Chinese traditional medicine with antibacterial properties. We constructed a cDNA library and obtained expressed sequence tags (ESTs) to show the distribution of gene ontology annotations for mRNAs, using an individual plant with superior antibacterial characteristics. Analysis of comparative genomics indicates that the putative uncharacterized proteins (21.07%) might be derived from "molecular function unknown" clones or rare transcripts. Furthermore, the Compositae had high cross-species transferability of EST-derived simple sequence repeats (EST-SSR), based on valid amplifications of 206 primer pairs developed from the newly assembled expressed sequence tag sequences in Artemisia annua L. Among those EST-SSR markers, 52 primers showed polymorphic amplifications between individuals with contrasting diverse antibacterial traits. Our sequence data and molecular markers will be cost-effective tools for further studies such as genome annotation, molecular breeding, and novel transcript profiles within Compositae species.

  16. cDNA cloning, structural analysis, SNP detection and tissue ...

    Indian Academy of Sciences (India)

    THOMAS NAICY

    E-mail: naicy@kvasu.ac.in. and conception rate ... transformed into DH5α strain of Escherichia coli and the clones harbouring ... Primer pairs for caprine IGF1 and GAPDH were designed using Primer3 software (table 1). RTq-PCR was conducted in a 25 μL reaction volume containing 50 ng of cDNA and 2× Max- ima SYBR ...

  17. Nucleotide sequence of a cDNA coding for the barley seed protein CMa: an inhibitor of insect α-amylase

    DEFF Research Database (Denmark)

    Rasmussen, Søren Kjærsgård; Johansson, A.

    1992-01-01

    The primary structure of the insect alpha-amylase inhibitor CMa of barley seeds was deduced from a full-length cDNA clone pc43F6. Analysis of RNA from barley endosperm shows high levels 15 and 20 days after flowering. The cDNA predicts an amino acid sequence of 119 residues preceded by a signal...... peptide of 25 amino acids. Ala and Leu account for 55% of the signal peptide. CMa is 60-85% identical with alpha-amylase inhibitors of wheat, but shows less than 50% identity to trypsin inhibitors of barley and wheat. The 10 Cys residues are located in identical positions compared to the cereal inhibitor...

  18. Construction and analysis of full-length cDNA library of Cryptosporidium parvum.

    Science.gov (United States)

    Yamagishi, Junya; Wakaguri, Hiroyuki; Sugano, Sumio; Kawano, Suguru; Fujisaki, Kozo; Sugimoto, Chihiro; Watanabe, Junichi; Suzuki, Yutaka; Kimata, Isao; Xuan, Xuenan

    2011-06-01

    A full-length cDNA library was constructed from the sporozoite of Cryptosporidium parvum. Normalized clones were subjected to Solexa shotgun sequencing, and then complete sequences for 1066 clones were reconfigured. Detailed analyses of the sequences revealed that 13.5% of the transcripts were spliced; the average and median 5' UTR lengths were 213.5 and 122 nucleotides, respectively. There were 148 inconsistencies out of 562 examined genes between the experimentally described cDNA sequence and the predicted sequence from its genome. In addition, we identified 118 sequences that had little homology against annotated genes of C. parvum as prospective candidates for addable genes. These observations should improve the reliability of C. parvum transcriptome and provide a versatile resource for further studies. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  19. Isolation and sequence of a cDNA clone for human tyrosinase that maps at the mouse c-albino locus.

    Science.gov (United States)

    Kwon, B S; Haq, A K; Pomerantz, S H; Halaban, R

    1987-01-01

    Screening of a lambda gt11 human melanocyte cDNA library with antibodies against hamster tyrosinase (monophenol, L-dopa:oxygen oxidoreductase, EC 1.14.18.1) resulted in the isolation of 16 clones. The cDNA inserts from 13 of the 16 clones cross-hybridized with each other, indicating that they were from related mRNA species. One of the cDNA clones, Pmel34, detected one mRNA species with an approximate length of 2.4 kilobases that was expressed preferentially in normal and malignant melanocytes but not in other cell types. The amino acid sequence deduced from the nucleotide sequence showed that the putative human tyrosinase is composed of 548 amino acids with a molecular weight of 62,610. The deduced protein contains glycosylation sites and histidine-rich sites that could be used for copper binding. Southern blot analysis of DNA derived from newborn mice carrying lethal albino deletion mutations revealed that Pmel34 maps near or at the c-albino locus, the position of the structural gene for tyrosinase. Images PMID:2823263

  20. Cloning and sequencing of the cDNA encoding a core protein of the paired helical filament of Alzheimer's disease: Identification as the microtubule-associated protein tau

    International Nuclear Information System (INIS)

    Goedert, M.; Wischik, C.M.; Crowther, R.A.; Walker, J.E.; Klug, A.

    1988-01-01

    Screening of cDNA libraries prepared from the frontal cortex of an Alzheimer's disease patient and from fetal human brain has led to isolation of the cDNA for a core protein of the paired helical filament of Alzheimer's disease. The partial amino acid sequence of this core protein was used to design synthetic oligonucleotide probes. The cDNA encodes a protein of 352 amino acids that contains a characteristic amino acid repeat in its carboxyl-terminal half. This protein is highly homologous to the sequence of the mouse microtubule-associated protein tau and thus constitutes the human equivalent of mouse tau. RNA blot analysis indicates the presence of two major transcripts, 6 and 2 kilobases long, with a wide distribution in normal human brain. Tau protein mRNAs were found in normal amounts in the frontal cortex from patients with Alzheimer's disease. The proof that at least part of tau protein forms a component of the paired helical filament core opens the way to understanding the mode of formation of paired helical filaments and thus, ultimately, the pathogenesis of Alzheimer's disease

  1. cDNA, amino acid carbohydrate sequence of barley seed-specific peroxidase BP 1

    DEFF Research Database (Denmark)

    Johansson, A.; Rasmussen, Søren Kjærsgård; Harthill, J.E.

    1992-01-01

    of 69% in the translated region, a 90% G or C preference in the wobble position of the codons and a typical signal peptide sequence. N-terminal amino acid sequencing and sequence analysis of tryptic peptides verified 98% of the sequence of the mature BP 1 which contains 309 amino acid residues. BP 1...... biological role of this enzyme. The barley peroxidase is processed at the C-terminus and might be targeted to the vacuole. The single site of glycosylation is located near the C-terminus in the N-glycosylation sequon -Asn-Cys-Ser- in which Cys forms part of a disulphide bridge. The major glycan is a typical...

  2. Identification and characterization of a novel legume-like lectin cDNA sequence from the red marine algae Gracilaria fisheri.

    Science.gov (United States)

    Suttisrisung, Sukanya; Senapin, Saengchan; Withyachumnarnkul, Boonsirm; Wongprasert, Kanokpan

    2011-12-01

    A legume-type lectin (L-Lectin) gene of the red algae Gracilaria fisheri (GFL) was cloned by rapid amplification of cDNA ends (RACE). The full-length cDNA of GFL was 1714 bp and contained a 1542 bp open reading frame encoding 513 amino acids with a predicted molecular mass of 56.5 kDa. Analysis of the putative amino acid sequence with NCBI-BLAST revealed a high homology (30-68%) with legume-type lectins (L-lectin) from Griffithsia japonica, Clavispora lusitaniae, Acyrthosiphon pisum, Tetraodon nigroviridis and Xenopus tropicalis. Phylogenetic relationship analysis showed the highest sequence identity to a glycoprotein of the red algae Griffithsia japonica (68%) (GenBank number AAM93989). Conserved Domain Database analysis detected an N-terminal carbohydrate recognition domain (CRD), the characteristic of L-lectins, which contained two sugar binding sites and a metal binding site. The secondary structure prediction of GFL showed a beta-sheet structure, connected with turn and coil. The most abundant structural element of GFL was the random coil, while the alpha-helixes were distributed at the N- and C-termini, and 21 beta-sheets were distributed in the CRD. Computer analysis of three-dimensional structure showed a common feature of L-lectins of GFL, which included an overall globular shape that was composed of a beta-sandwich of two anti-parallel beta-sheets, monosaccharide binding sites, were on the top of the structure and in proximity with a metal binding site. Northern blot analysis using a DIG-labelled probe derived from a partial GFL sequence revealed a hybridization signal of (approx.) 1.7 kb consistent with the length of the full-length GFL cDNA identified by RACE. No detectable band was observed from control total RNA extracted from filamentous green algae.

  3. Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing

    Directory of Open Access Journals (Sweden)

    Adam D. Hargreaves

    2015-11-01

    Full Text Available Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete and Sanger-based ESTs (15/29. We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.

  4. 5'-end sequences of budding yeast full-length cDNA clones - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data ...cription Download License Update History of This Database Site Policy | Contact Us 5'-end sequences of buddi

  5. Characterization and sequence analysis of cysteine and glycine-rich ...

    African Journals Online (AJOL)

    Primers specific for CSRP3 were designed using known cDNA sequences of Bos taurus published in database with different accession numbers. Polymerase chain reaction (PCR) was performed and products were purified and sequenced. Sequence analysis and alignment were carried out using CLUSTAL W (1.83).

  6. Human uroporphyrinogen III synthase: Molecular cloning, nucleotide sequence, and expression of a full-length cDNA

    International Nuclear Information System (INIS)

    Tsai, Shihfeng; Bishop, D.F.; Desnick, R.J.

    1988-01-01

    Uroporphyrinogen III synthase, the fourth enzyme in the heme biosynthetic pathway, is responsible for conversion of the linear tetrapyrrole, hydroxymethylbilane, to the cyclic tetrapyrrole, uroporphyrinogen III. The deficient activity of URO-synthase is the enzymatic defect in the autosomal recessive disorder congenital erythropoietic porphyria. To facilitate the isolation of a full-length cDNA for human URO-synthase, the human erythrocyte enzyme was purified to homogeneity and 81 nonoverlapping amino acids were determined by microsequencing the N terminus and four tryptic peptides. Two synthetic oligonucleotide mixtures were used to screen 1.2 x 10 6 recombinants from a human adult liver cDNA library. Eight clones were positive with both oligonucleotide mixtures. Of these, dideoxy sequencing of the 1.3 kilobase insert from clone pUROS-2 revealed 5' and 3' untranslated sequences of 196 and 284 base pairs, respectively, and an open reading frame of 798 base pairs encoding a protein of 265 amino acids with a predicted molecular mass of 28,607 Da. The isolation and expression of this full-length cDNA for human URO-synthase should facilitate studies of the structure, organization, and chromosomal localization of this heme biosynthetic gene as well as the characterization of the molecular lesions causing congenital erythropoietic porphyria

  7. RNA-ligase-dependent biases in miRNA representation in deep-sequenced small RNA cDNA libraries

    Science.gov (United States)

    Hafner, Markus; Renwick, Neil; Brown, Miguel; Mihailović, Aleksandra; Holoch, Daniel; Lin, Carolina; Pena, John T.G.; Nusbaum, Jeffrey D.; Morozov, Pavel; Ludwig, Janos; Ojo, Tolulope; Luo, Shujun; Schroth, Gary; Tuschl, Thomas

    2011-01-01

    Sequencing of small RNA cDNA libraries is an important tool for the discovery of new RNAs and the analysis of their mutational status as well as expression changes across samples. It requires multiple enzyme-catalyzed steps, including sequential oligonucleotide adapter ligations to the 3′ and 5′ ends of the small RNAs, reverse transcription (RT), and PCR. We assessed biases in representation of miRNAs relative to their input concentration, using a pool of 770 synthetic miRNAs and 45 calibrator oligoribonucleotides, and tested the influence of Rnl1 and two variants of Rnl2, Rnl2(1–249) and Rnl2(1–249)K227Q, for 3′-adapter ligation. The use of the Rnl2 variants for adapter ligations yielded substantially fewer side products compared with Rnl1; however, the benefits of using Rnl2 remained largely obscured by additional biases in the 5′-adapter ligation step; RT and PCR steps did not have a significant impact on read frequencies. Intramolecular secondary structures of miRNA and/or miRNA/3′-adapter products contributed to these biases, which were highly reproducible under defined experimental conditions. We used the synthetic miRNA cocktail to derive correction factors for approximation of the absolute levels of individual miRNAs in biological samples. Finally, we evaluated the influence of 5′-terminal 5-nt barcode extensions for a set of 20 barcoded 3′ adapters and observed similar biases in miRNA read distribution, thereby enabling cost-saving multiplex analysis for large-scale miRNA profiling. PMID:21775473

  8. cDNA cloning, structural analysis, SNP detection and tissue ...

    Indian Academy of Sciences (India)

    The present study was conducted to clone and sequence the full-length coding sequence of the caprine IGF1 gene from Attappady Black and Malabari breeds, two indigenous goat breeds of south India, to analyse its structure, and to ascertainthe relative abundance of IGF1 mRNA in different tissues. The caprine IGF1 ...

  9. cDNA cloning, structural analysis, SNP detection and tissue ...

    Indian Academy of Sciences (India)

    THOMAS NAICY

    Abstract. Insulin-like growth factor 1 (IGF1) plays an important role in growth, reproduction, foetal development and cell proliferation. The present study was conducted to clone and sequence the full-length coding sequence of the caprine IGF1 gene from. Attappady Black and Malabari breeds, two indigenous goat breeds of ...

  10. Generation of longer 3′ cDNA fragments from massively parallel signature sequencing tags

    OpenAIRE

    Silva, Ana Paula M.; Chen, Jianjun; Carraro, Dirce M.; Wang, San Ming; Camargo, Anamaria A.

    2004-01-01

    Massively Parallel Signature Sequencing (MPSS) is a powerful technique for genome-wide gene expression analysis, which, similar to SAGE, relies on the production of short tags proximal to the 3′end of transcripts. A single MPSS experiment can generate over 107 tags, providing a 10-fold coverage of the transcripts expressed in a human cell. A significant fraction of MPSS tags cannot be assigned to known transcripts (orphan tags) and are likely to be derived from transcripts expressed at very l...

  11. Acetylcholinesterase of the sand fly, Phlebotomus papatasi (Scopoli): cDNA sequence, baculovirus expression, and biochemical properties.

    Science.gov (United States)

    Temeyer, Kevin B; Brake, Danett K; Tuckow, Alexander P; Li, Andrew Y; Pérez de León, Adalberto A

    2013-02-04

    Millions of people and domestic animals around the world are affected by leishmaniasis, a disease caused by various species of flagellated protozoans in the genus Leishmania that are transmitted by several sand fly species. Insecticides are widely used for sand fly population control to try to reduce or interrupt Leishmania transmission. Zoonotic cutaneous leishmaniasis caused by L. major is vectored mainly by Phlebotomus papatasi (Scopoli) in Asia and Africa. Organophosphates comprise a class of insecticides used for sand fly control, which act through the inhibition of acetylcholinesterase (AChE) in the central nervous system. Point mutations producing an altered, insensitive AChE are a major mechanism of organophosphate resistance in insects and preliminary evidence for organophosphate-insensitive AChE has been reported in sand flies. This report describes the identification of complementary DNA for an AChE in P. papatasi and the biochemical characterization of recombinant P. papatasi AChE. A P. papatasi Israeli strain laboratory colony was utilized to prepare total RNA utilized as template for RT-PCR amplification and sequencing of cDNA encoding acetylcholinesterase 1 using gene specific primers and 3'-5'-RACE. The cDNA was cloned into pBlueBac4.5/V5-His TOPO, and expressed by baculovirus in Sf21 insect cells in serum-free medium. Recombinant P. papatasi acetylcholinesterase was biochemically characterized using a modified Ellman's assay in microplates. A 2309 nucleotide sequence of PpAChE1 cDNA [GenBank: JQ922267] of P. papatasi from a laboratory colony susceptible to insecticides is reported with 73-83% nucleotide identity to acetylcholinesterase mRNA sequences of Culex tritaeniorhynchus and Lutzomyia longipalpis, respectively. The P. papatasi cDNA ORF encoded a 710-amino acid protein [GenBank: AFP20868] exhibiting 85% amino acid identity with acetylcholinesterases of Cx. pipiens, Aedes aegypti, and 92% amino acid identity for L. longipalpis. Recombinant P

  12. Selective and flexible depletion of problematic sequences from RNA-seq libraries at the cDNA stage.

    Science.gov (United States)

    Archer, Stuart K; Shirokikh, Nikolay E; Preiss, Thomas

    2014-05-26

    A major hurdle to transcriptome profiling by deep-sequencing technologies is that abundant transcripts, such as rRNAs, can overwhelm the libraries, severely reducing transcriptome-wide coverage. Methods for depletion of such unwanted sequences typically require treatment of RNA samples prior to library preparation, are costly and not suited to unusual species and applications. Here we describe Probe-Directed Degradation (PDD), an approach that employs hybridisation to DNA oligonucleotides at the single-stranded cDNA library stage and digestion with Duplex-Specific Nuclease (DSN). Targeting Saccharomyces cerevisiae rRNA sequences in Illumina HiSeq libraries generated by the split adapter method we show that PDD results in efficient removal of rRNA. The probes generate extended zones of depletion as a function of library insert size and the requirements for DSN cleavage. Using intact total RNA as starting material, probes can be spaced at the minimum anticipated library size minus 20 nucleotides to achieve continuous depletion. No off-target bias is detectable when comparing PDD-treated with untreated libraries. We further provide a bioinformatics tool to design suitable PDD probe sets. We find that PDD is a rapid procedure that results in effective and specific depletion of unwanted sequences from deep-sequencing libraries. Because PDD acts at the cDNA stage, handling of fragile RNA samples can be minimised and it should further be feasible to remediate existing libraries. Importantly, PDD preserves the original RNA fragment boundaries as is required for nucleotide-resolution footprinting or base-cleavage studies. Finally, as PDD utilises unmodified DNA oligonucleotides it can provide a low-cost option for large-scale projects, or be flexibly customised to suit different depletion targets, sample types and organisms.

  13. In vitro and in silico cloning of Xenopus laevis SOD2 cDNA and its phylogenetic analysis.

    Science.gov (United States)

    Purrello, Michele; Di Pietro, Cinzia; Ragusa, Marco; Pulvirenti, Alfredo; Giugno, Rosalba; Di Pietro, Valentina; Emmanuele, Giovanni; Travali, Salvo; Scalia, Marina; Shasha, Dennis; Ferro, Alfredo

    2005-02-01

    By using the methodology of both wet and dry biology (i.e., RT-PCR and cycle sequencing, and biocomputational technology, respectively) and the data obtained through the Genome Projects, we have cloned Xenopus laevis SOD2 (MnSOD) cDNA and determined its nucleotide sequence. These data and the deduced protein primary structure were compared with all the other SOD2 nucleotide and amino acid sequences from eukaryotes and prokaryotes, published in public databases. The analysis was performed by using both Clustal W, a well known and widely used program for sequence analysis, and AntiClustAl, a new algorithm recently created and implemented by our group. Our results demonstrate a very high conservation of the enzyme amino acid sequence during evolution, which proves a close structure-function relationship. This is to be expected for very ancient molecules endowed with critical biological functions, performed through a specific structural organization. The nucleotide sequence conservation is less pronounced: this too was foreseeable, due to neutral mutations and to the species-specific codon usage. The data obtained by using AntiClustAl are comparable with those produced with Clustal W, which validates this algorithm as an important new tool for biocomputational analysis. Finally, it is noteworthy that evolutionary trees, drawn by using all the available data on SOD2 nucleotide sequences and amino acid and either Clustal W or AntiClustAl, are comparable to those obtained through phylogenetic analysis based on fossil records.

  14. Intervening sequences in a plant gene-comparison of the partial sequence of cDNA and genomic DNA of French bean phaseolin

    Science.gov (United States)

    Sun, S. M.; Slightom, J. L.; Hall, T. C.

    1981-01-01

    A plant gene coding for the major storage protein (phaseolin, G1-globulin) of the French bean was isolated from a genomic library constructed in the phage vector Charon 24A. Comparison of the nucleotide sequence of part of the gene with that of the cloned messenger RNA (cDNA) revealed the presence of three intervening sequences, all beginning with GTand ending with AG. The 5' and 3' boundaries of intervening sequences TVS-A (88 base pairs) and IVS-B (124 base pairs) are similar to those described for animal and viral genes, but the 3' boundary of IVS-C (129 base pairs) shows some differences. A sequence of 185 amino acids deduced from the cloned DMAs represents about 40% of a phaseolin polypeptide.

  15. Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.

    Directory of Open Access Journals (Sweden)

    Reginaldo M Kuroshu

    Full Text Available BACKGROUND: Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. METHODOLOGY: We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded, and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. CONCLUSIONS: The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.

  16. Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.

    Science.gov (United States)

    Kuroshu, Reginaldo M; Watanabe, Junichi; Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka; Kasahara, Masahiro

    2010-05-07

    Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.

  17. Construction and analysis of cotton (Gossypium arboreum L. drought-related cDNA library

    Directory of Open Access Journals (Sweden)

    Zhang Chao-Jun

    2009-07-01

    Full Text Available Abstract Background Drought is one of the most important environmental factors causing water stress for cotton, and it greatly limits cotton growth and crop productivity. So far only a few drought-tolerance genes have been functionally characterized in details, and most efforts on this topic have been made in model organisms. Therefore, to identify more drought-related genes in cotton plays a crucial role in elucidating the underlying mechanisms of drought tolerance as well as utilizing bioengineering techniques to improve the tolerance in this organism. Findings Here we constructed a subtractive drought-tolerance cDNA library using suppressive subtractive hybridization (SSH. Through differential screening and bioinformatics analysis, we identified 392 positive clones with differential expression, corresponding 265 unique genes. By BLAST search against Genbank, we found that more than half of these EST sequences were homologous to those previously known drought-related genes and that there were 57 sequences with unknown functions, suggesting that many more genes are involved in this complex trait. Moreover, using RT-PCR, we examined the expression of nine representative candidate genes and confirmed that their expression levels were increased at different levels under drought stress. Conclusion Our results show that drought tolerance is a complex trait in cotton, which involves the coordination of many genes and multiple metabolism pathways. The candidate EST sequences we identified here would facilitate further functional studies of drought-related genes and provide important insights into the molecular mechanisms of drought-stress tolerance and genetic breeding in cotton.

  18. cDNA cloning and expression analysis of two distinct Sox8 genes in ...

    Indian Academy of Sciences (India)

    2010-08-06

    Aug 6, 2010 ... cDNA cloning and expression analysis of two distinct Sox8 genes in. Paramisgurnus dabryanus (Cypriniformes). XIAOHUA XIA, JIE ZHAO, QIYAN DU and ZHONGJIE CHANG. ∗. Molecular and Genetic Laboratory, College of Life Sciences, Henan Normal University, 46 East of Construction Road,. Xinxiang ...

  19. cDNA cloning and primary structure analysis of invariant chain in ...

    African Journals Online (AJOL)

    cDNA cloning and primary structure analysis of invariant chain in Chinese Pengze crucian carp. X Liu, W Yu, J Li, F Chen, S Liu, C Wu, J Xu. Abstract. Invariant chain (Ii) plays an important role in MHC class II molecules assembly and exogenous peptide presentation in vertebrates. Although mammalian Ii has been ...

  20. Isolation and analysis of water stress induced genes in maize seedlings by subtractive PCR and cDNA macroarray.

    Science.gov (United States)

    Zheng, Jun; Zhao, Jinfeng; Tao, Yazhong; Wang, Jianhua; Liu, Yunjun; Fu, Junjie; Jin, Ying; Gao, Peng; Zhang, Jinpeng; Bai, Yunfeng; Wang, Guoying

    2004-08-01

    In order to identify genes induced during the water stress response in maize (Zea mays) seedlings, suppression subtractive hybridization (SSH) was performed using mixed cDNAs prepared from maize seedlings treated with 20% PEG as testers and cDNAs from unstressed maize seedlings as drivers. A forward subtractive cDNA library was constructed, from which 960 recombinant colonies were picked and amplified. Through differential screening of the subtractive cDNA library, 533 clones were identified as water stress induced. After sequencing, 190 unique expressed sequence tags (ESTs) were obtained by clustering and blast analysis, which included transcripts that had previously been reported as responsive to stress as well as some functionally unknown transcripts. The ESTs with significant protein homology were sorted into 13 functional categories. A cDNA marcoarray containing the 190 unique ESTs was used to analyze their expression profiles in maize seedling during both PEG treatment and natural drought. The results indicated that 67 ESTs in leaves and 113 ESTs in roots were significantly up-regulated by PEG-stress. 123 ESTs were found to be up-regulated for at least one time-course point in either maize leaves or roots. Correspondingly, 163 ESTs were significantly up-regulated by drought stress. Results from the hierarchical cluster analysis suggest that the leaves and roots of maize seedlings had different expression profiles after PEG treatment and that there was a lot of overlap between PEG- and drought-stress induced up-regulated transcripts. A set of transcripts has been identified, which have significantly increased expression and probably involved in water stress signaling pathway based on data analysis.

  1. Growth hormone and prolactin in Andrias davidianus: cDNA cloning, tissue distribution and phylogenetic analysis.

    Science.gov (United States)

    Yang, Liping; Meng, Zining; Liu, Yun; Zhang, Yong; Liu, Xiaochun; Lu, Danqi; Huang, Junhai; Lin, Haoran

    2010-01-15

    The Chinese giant salamander (Andrias davidianus) is one of the largest and 'living fossil' species of amphibian. To obtain genetic information for this species, the cDNAs encoding growth hormone (adGH) and prolactin (adPRL) were cloned from a pituitary cDNA library. The isolated adGH cDNA consisted of 864 bp and encoded a propeptide of 215 amino acids, while the cDNA of adPRL was 1106 bp in length and encoded a putative peptide of 229 amino acids. Expression of the GH and PRL mRNA was only detected in the pituitary. Phylogenetic analyses were performed based on the isolated pituitary hormone sequences using maximum parsimony and neighbor-joining algorithms. The clustering results are similar to that based on the morphological characteristics or the rRNA genes, which indicate that the two orders (Anura and Caudata) of amphibian were monophyletic, and that A. davidianus was diverged early in the Caudate clade. These results indicated that both the GH and PRL sequence might be useful to study the phylogenies of relatively moderate evolved groups.

  2. [Construction and analysis of suppression subtractive cDNA libraries of continuous monoculture Rehmannia glutinosa].

    Science.gov (United States)

    Zhang, Zhongyi; Fan, Huamin; Yang, Yanhui; Li, Mingjie; Li, Juan; Xu, Haixia; Chen, Junying; Chen, Xinjian

    2011-02-01

    To explore the molecular mechanism of continuous monoculture problem by constructing the cDNA libraries of continuous monoculture Rehmannia glutinosa. To use the suppression subtractive hybridization (SSH) technique to construct the forward and reverse subtractive cDNA libraries of continuous monoculture R. glutinosa to adopt blue-white colony screening and PCR to detect the positive clones which would be sequenced and analyzed by bioinformatics. The subtracted cDNA libraries of continuous monoculture R. glutinosa. were successfully constructed, and the result showed that the forward and reverse subtracted libraries obtained 300 positive clones, respectively. The forward and reverse libraries got different ESTs, and produced 232 (forward library) and 214 (reverse library) unique ESTs by sequencing. Based on homology search of BLASTX and BLASTN in NCBI, 200 and 195 of unique ESTs were homologous to known genes in the forward and reverse libraries, respectively. Categories of orthologous group (COG) showed that the forward and reverse libraries got 60 and 61 ESTs with the corresponding gene annotation, involving 21 metabolic pathways. The information of differential expression genes in continuous monoculture R. glutinosa, and their functional annotation of differentially expressed genes indicate that continuous monoculture has a profound effect on expression of the genes in R. glutinosa. Furthermore, the research analyzed several key genes in response to replant problem, which provided a foundation for revealing the molecular mechanism of continuous monoculture R. glutinosa.

  3. Biological sequence analysis

    DEFF Research Database (Denmark)

    Durbin, Richard; Eddy, Sean; Krogh, Anders Stærmose

    This book provides an up-to-date and tutorial-level overview of sequence analysis methods, with particular emphasis on probabilistic modelling. Discussed methods include pairwise alignment, hidden Markov models, multiple alignment, profile searches, RNA secondary structure analysis, and phylogene......This book provides an up-to-date and tutorial-level overview of sequence analysis methods, with particular emphasis on probabilistic modelling. Discussed methods include pairwise alignment, hidden Markov models, multiple alignment, profile searches, RNA secondary structure analysis...

  4. Attenuation and cell culture adaptation of hepatitis A virus (HAV): a genetic analysis with HAV cDNA.

    OpenAIRE

    Cohen, J I; Rosenblum, B; Feinstone, S M; Ticehurst, J; Purcell, R H

    1989-01-01

    RNA transcripts of hepatitis A virus (HAV) HM-175 cDNA from attenuated, cell culture-adapted HAV were infectious in cell culture. A full-length HAV cDNA from wild-type HAV (propagated in marmosets in vivo) was constructed. Chimeric cDNAs that contained portions of both wild-type and attenuated genomes were produced. Oligonucleotide-directed mutagenesis was used to engineer a point mutation into the VP1 gene of attenuated HAV cDNA, so that the sequence of this capsid protein would be identical...

  5. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds

    Directory of Open Access Journals (Sweden)

    Sugantham Priyanka Annabel

    2010-10-01

    Full Text Available Abstract Background Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with an objective of isolating as many functional genes as possible from J. curcas by large scale sequencing of expressed sequence tags (ESTs. Results A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 106 clones and average insert size of the clones was 2.1 kb. Totally 12,084 ESTs were sequenced to average high quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes which did not show similarity with any genes in the public database may encode for unique genes. Functional classification revealed unigenes related to broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes. Conclusions The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding

  6. A novel method of differential gene expression analysis using multiple cDNA libraries applied to the identification of tumour endothelial genes.

    Science.gov (United States)

    Herbert, John M J; Stekel, Dov; Sanderson, Sharon; Heath, Victoria L; Bicknell, Roy

    2008-04-07

    In this study, differential gene expression analysis using complementary DNA (cDNA) libraries has been improved. Firstly by the introduction of an accurate method of assigning Expressed Sequence Tags (ESTs) to genes and secondly, by using a novel likelihood ratio statistical scoring of differential gene expression between two pools of cDNA libraries. These methods were applied to the latest available cell line and bulk tissue cDNA libraries in a two-step screen to predict novel tumour endothelial markers. Initially, endothelial cell lines were in silico subtracted from non-endothelial cell lines to identify endothelial genes. Subsequently, a second bulk tumour versus normal tissue subtraction was employed to predict tumour endothelial markers. From an endothelial cDNA library analysis, 431 genes were significantly up regulated in endothelial cells with a False Discovery Rate adjusted q-value of 0.01 or less and 104 of these were expressed only in endothelial cells. Combining the cDNA library data with the latest Serial Analysis of Gene Expression (SAGE) library data derived a complete list of 459 genes preferentially expressed in endothelium. 27 genes were predicted tumour endothelial markers in multiple tissues based on the second bulk tissue screen. This approach represents a significant advance on earlier work in its ability to accurately assign an EST to a gene, statistically measure differential expression between two pools of cDNA libraries and predict putative tumour endothelial markers before entering the laboratory. These methods are of value and available http://www.compbio.ox.ac.uk/data/diffex.html to researchers that are interested in the analysis of transcriptomic data.

  7. cDNA cloning and transcriptional controlling of a novel low dose radiation-induced gene and its function analysis

    International Nuclear Information System (INIS)

    Zhou Pingkun; Sui Jianli

    2002-01-01

    Objective: To clone a novel low dose radiation-induced gene (LRIGx) and study its function as well as its transcriptional changes after irradiation. Methods: Its cDNA was obtained by DDRT-PCR and RACE techniques. Northern blot hybridization was used to investigate the gene transcription. Bioinformatics was employed to analysis structure and function of this gene. Results: LRIGx cDNA was cloned. The sequence of LRIGx was identical to a DNA clone located in human chromosome 20 q 11.2-12 Bioinformatics analysis predicted an encoded protein with a conserved helicase domain. Northern analysis revealed a ∼8.5 kb transcript which was induced after 0.2 Gy as well as 0.02 Gy irradiation, and the transcript level was increased 5 times at 4 h after 0.2 Gy irradiation. The induced level of LRIGx transcript by 2.0 Gy high dose was lower than by 0.2 Gy. Conclusion: A novel low dose radiation-induced gene has been cloned. It encodes a protein with a conserved helicase domain that could involve in DNA metabolism in the cellular process of radiation response

  8. Screening for candidate genes related to breast cancer with cDNA microarray analysis

    Directory of Open Access Journals (Sweden)

    Yu-Juan Xiang

    2015-06-01

    Full Text Available Objective: The aim of this study was to reveal the exact changes during the occurrence of breast cancer to explore significant new and promising genes or factors related to this disease. Methods: We compared the gene expression profiles of breast cancer tissues with its uninvolved normal breast tissues as controls using the cDNA microarray analysis in seven breast cancer patients. Further, one representative gene, named IFI30, was quantitatively analyzed by real-time PCR to confirm the result of the cDNA microarray analysis. Results: A total of 427 genes were identified with significantly differential expression, 221 genes were up-regulated and 206 genes were down-regulated. And the result of cDNA microarray analysis was validated by detection of IFI30 mRNA level changes by real-time PCR. Genes for cell proliferation, cell cycle, cell division, mitosis, apoptosis, and immune response were enriched in the up-regulated genes, while genes for cell adhesion, proteolysis, and transport were significantly enriched in the down-regulated genes in breast cancer tissues compared with normal breast tissues by a gene ontology analysis. Conclusion: Our present study revealed a range of differentially expressed genes between breast cancer tissues and normal breast tissues, and provide candidate genes for further study focusing on the pathogenesis and new biomarkers for breast cancer. Keywords: Breast neoplasms, Candidate genes, Microarray

  9. Human nuclear respiratory factor 2 alpha subunit cDNA: isolation, subcloning, sequencing, and in situ hybridization of transcripts in normal and monocularly deprived macaque visual system.

    Science.gov (United States)

    Guo, A; Nie, F; Wong-Riley, M

    2000-02-07

    Nuclear respiratory factor 2 (NRF-2) has been shown to contribute to the transcriptional regulation of a number of subunits of respiratory chain enzymes, including cytochrome c oxidase (CO). Our recent study demonstrated a parallel distribution of the alpha subunit proteins of NRF-2 (NRF-2 alpha) with CO in the monkey striate cortex, and that it can be regulated by neuronal activity. To determine whether this regulation is at the transcriptional level, the present study examined the expression of NRF-2 alpha mRNA in normal and monocularly deprived adult monkeys. A partial NRF-2 alpha cDNA was isolated from a human brain cDNA library. Sequence analysis revealed that it shared 99% identity with the published sequence from human HeLa cells. Riboprobes of NRF-2 alpha was generated and labeled with digoxigenin-11-UTP for in situ hybridization. The expression pattern of NRF-2 alpha mRNA in the normal striate cortex paralleled that of CO activity. It was highly expressed in layers IVC and VI, which contained high levels of CO, and more densely expressed in puffs of layers II and III than in interpuffs. In monkeys monocularly treated with tetrodotoxin for 1 day to 2 weeks, both NRF-2 alpha expression and CO activity were reduced in deprived ocular dominance columns of the visual cortex and in deprived layers of the lateral geniculate nucleus. These data indicate that, in the normal and visually deprived adult monkeys, NRF-2 alpha is regulated by neuronal activity at the transcriptional level.

  10. Study on construction of cDNA library of the treated changliver cell and quality analysis

    OpenAIRE

    Juntang, Lin; Pramanik, Jogenananda; Congrui, Wang; Huiyong, Zhang; Huigen, Feng; Baosheng, Yang; Yuchang, Li; Cunshuan, Xu

    2004-01-01

    The study aims to construct cDNA library of Changliver cell by SMART (switching mechanism at 5′ end of RNA transcript) technique and analyze its quality. cDNA of Changliver cell was made with RT-PCR and LD-PCR (long-distance PCR), the cDNA library was constructed with SMART cDNA library construction kit. Through testing, the high quality cDNA library containing whole long cDNA of Changliver cell had been constructed. The titer of the amplified cDNA library was 4.5 × 1010 pfu/ml and the averag...

  11. Construction and analysis of SSH cDNA library of human vascular endothelial cells related to gastrocarcinoma.

    Science.gov (United States)

    Liu, Yong-Bo; Wei, Zhao-Xia; Li, Li; Li, Hang-Sheng; Chen, Hui; Li, Xiao-Wen

    2003-11-01

    To construct subtracted cDNA libraries of human vascular endothelial cells (VECs) related to gastrocarcinoma using suppression substractive hybridization (SSH) and to analyze cDNA libraries of gastrocarcinoma and VECs in Cancer Gene Anatomy Project (CGAP) database. Human VECs related to gastric adenocarcinoma and corresponding normal tissue were separated by magnetic beads coupled with antibody CD31 (Dynabeads CD31). A few amount of total RNA were synthesized and amplified by SMART PCR cDNA Synthesis Kit. Then, using SSH and T/A cloning techniques, cDNA fragments of differentially expressed genes in human VECs of gastric adenocarcinoma were inserted into JM109 bacteria. One hundred positive bacteria clones were randomly picked and identified by colony PCR method. To analyze cDNA libraries of gastrocarcinoma and VECs in CGAP database, the tools of Library Finder, cDNA xProfiler, Digital GENE Expression Displayer (DGED), and Digital Differential Display (DDD) were used. Forward and reverse subtraction cDNA libraries of human VECs related to gastrocarcinoma were constructed successfully with SSH and T/A cloning techniques. Analysis of CGAP database indicated that no appropriate library of VECs related to carcinoma was constructed. Construction of subtraction cDNA libraries of human VECs related to gastrocarcinoma was successful and necessary, which laid a foundation for screening and cloning new and specific genes of VECs related to gastrocarcinoma.

  12. Increased mRNA expression of a laminin-binding protein in human colon carcinoma: Complete sequence of a full-length cDNA encoding the protein

    International Nuclear Information System (INIS)

    Yow, Hsiukang; Wong, Jau Min; Chen, Hai Shiene; Lee, C.; Steele, G.D. Jr.; Chen, Lanbo

    1988-01-01

    Reliable markers to distinguish human colon carcinoma from normal colonic epithelium are needed particularly for poorly differentiated tumors where no useful marker is currently available. To search for markers the authors constructed cDNA libraries from human colon carcinoma cell lines and screened for clones that hybridize to a greater degree with mRNAs of colon carcinomas than with their normal counterparts. Here they report one such cDNA clone that hybridizes with a 1.2-kilobase (kb) mRNA, the level of which is ∼9-fold greater in colon carcinoma than in adjacent normal colonic epithelium. Blot hybridization of total RNA from a variety of human colon carcinoma cell lines shows that the level of this 1.2-kb mRNA in poorly differentiated colon carcinomas is as high as or higher than that in well-differentiated carcinomas. Molecular cloning and complete sequencing of cDNA corresponding to the full-length open reading frame of this 1.2-kb mRNA unexpectedly show it to contain all the partial cDNA sequence encoding 135 amino acid residues previously reported for a human laminin receptor. The deduced amino acid sequence suggests that this putative laminin-binding protein from human colon carcinomas consists of 295 amino acid residues with interesting features. There is an unusual C-terminal 70-amino acid segment, which is trypsin-resistant and highly negatively charged

  13. Evaluation and Adaptation of a Laboratory-Based cDNA Library Preparation Protocol for Retrospective Sequencing of Archived MicroRNAs from up to 35-Year-Old Clinical FFPE Specimens

    Directory of Open Access Journals (Sweden)

    Olivier Loudig

    2017-03-01

    Full Text Available Formalin-fixed paraffin-embedded (FFPE specimens, when used in conjunction with patient clinical data history, represent an invaluable resource for molecular studies of cancer. Even though nucleic acids extracted from archived FFPE tissues are degraded, their molecular analysis has become possible. In this study, we optimized a laboratory-based next-generation sequencing barcoded cDNA library preparation protocol for analysis of small RNAs recovered from archived FFPE tissues. Using matched fresh and FFPE specimens, we evaluated the robustness and reproducibility of our optimized approach, as well as its applicability to archived clinical specimens stored for up to 35 years. We then evaluated this cDNA library preparation protocol by performing a miRNA expression analysis of archived breast ductal carcinoma in situ (DCIS specimens, selected for their relation to the risk of subsequent breast cancer development and obtained from six different institutions. Our analyses identified six miRNAs (miR-29a, miR-221, miR-375, miR-184, miR-363, miR-455-5p differentially expressed between DCIS lesions from women who subsequently developed an invasive breast cancer (cases and women who did not develop invasive breast cancer within the same time interval (control. Our thorough evaluation and application of this laboratory-based miRNA sequencing analysis indicates that the preparation of small RNA cDNA libraries can reliably be performed on older, archived, clinically-classified specimens.

  14. Evaluation and Adaptation of a Laboratory-Based cDNA Library Preparation Protocol for Retrospective Sequencing of Archived MicroRNAs from up to 35-Year-Old Clinical FFPE Specimens.

    Science.gov (United States)

    Loudig, Olivier; Wang, Tao; Ye, Kenny; Lin, Juan; Wang, Yihong; Ramnauth, Andrew; Liu, Christina; Stark, Azadeh; Chitale, Dhananjay; Greenlee, Robert; Multerer, Deborah; Honda, Stacey; Daida, Yihe; Spencer Feigelson, Heather; Glass, Andrew; Couch, Fergus J; Rohan, Thomas; Ben-Dov, Iddo Z

    2017-03-14

    Formalin-fixed paraffin-embedded (FFPE) specimens, when used in conjunction with patient clinical data history, represent an invaluable resource for molecular studies of cancer. Even though nucleic acids extracted from archived FFPE tissues are degraded, their molecular analysis has become possible. In this study, we optimized a laboratory-based next-generation sequencing barcoded cDNA library preparation protocol for analysis of small RNAs recovered from archived FFPE tissues. Using matched fresh and FFPE specimens, we evaluated the robustness and reproducibility of our optimized approach, as well as its applicability to archived clinical specimens stored for up to 35 years. We then evaluated this cDNA library preparation protocol by performing a miRNA expression analysis of archived breast ductal carcinoma in situ (DCIS) specimens, selected for their relation to the risk of subsequent breast cancer development and obtained from six different institutions. Our analyses identified six miRNAs (miR-29a, miR-221, miR-375, miR-184, miR-363, miR-455-5p) differentially expressed between DCIS lesions from women who subsequently developed an invasive breast cancer (cases) and women who did not develop invasive breast cancer within the same time interval (control). Our thorough evaluation and application of this laboratory-based miRNA sequencing analysis indicates that the preparation of small RNA cDNA libraries can reliably be performed on older, archived, clinically-classified specimens.

  15. Evaluation and Adaptation of a Laboratory-Based cDNA Library Preparation Protocol for Retrospective Sequencing of Archived MicroRNAs from up to 35-Year-Old Clinical FFPE Specimens

    Science.gov (United States)

    Loudig, Olivier; Wang, Tao; Ye, Kenny; Lin, Juan; Wang, Yihong; Ramnauth, Andrew; Liu, Christina; Stark, Azadeh; Chitale, Dhananjay; Greenlee, Robert; Multerer, Deborah; Honda, Stacey; Daida, Yihe; Spencer Feigelson, Heather; Glass, Andrew; Couch, Fergus J.; Rohan, Thomas; Ben-Dov, Iddo Z.

    2017-01-01

    Formalin-fixed paraffin-embedded (FFPE) specimens, when used in conjunction with patient clinical data history, represent an invaluable resource for molecular studies of cancer. Even though nucleic acids extracted from archived FFPE tissues are degraded, their molecular analysis has become possible. In this study, we optimized a laboratory-based next-generation sequencing barcoded cDNA library preparation protocol for analysis of small RNAs recovered from archived FFPE tissues. Using matched fresh and FFPE specimens, we evaluated the robustness and reproducibility of our optimized approach, as well as its applicability to archived clinical specimens stored for up to 35 years. We then evaluated this cDNA library preparation protocol by performing a miRNA expression analysis of archived breast ductal carcinoma in situ (DCIS) specimens, selected for their relation to the risk of subsequent breast cancer development and obtained from six different institutions. Our analyses identified six miRNAs (miR-29a, miR-221, miR-375, miR-184, miR-363, miR-455-5p) differentially expressed between DCIS lesions from women who subsequently developed an invasive breast cancer (cases) and women who did not develop invasive breast cancer within the same time interval (control). Our thorough evaluation and application of this laboratory-based miRNA sequencing analysis indicates that the preparation of small RNA cDNA libraries can reliably be performed on older, archived, clinically-classified specimens. PMID:28335433

  16. PCR amplification and sequences of cDNA clones for the small and large subunits of ADP-glucose pyrophosphorylase from barley tissues.

    Science.gov (United States)

    Villand, P; Aalen, R; Olsen, O A; Lüthi, E; Lönneborg, A; Kleczkowski, L A

    1992-06-01

    Several cDNAs encoding the small and large subunit of ADP-glucose pyrophosphorylase (AGP) were isolated from total RNA of the starchy endosperm, roots and leaves of barley by polymerase chain reaction (PCR). Sets of degenerate oligonucleotide primers, based on previously published conserved amino acid sequences of plant AGP, were used for synthesis and amplification of the cDNAs. For either the endosperm, roots and leaves, the restriction analysis of PCR products (ca. 550 nucleotides each) has revealed heterogeneity, suggesting presence of three transcripts for AGP in the endosperm and roots, and up to two AGP transcripts in the leaf tissue. Based on the derived amino acid sequences, two clones from the endosperm, beps and bepl, were identified as coding for the small and large subunit of AGP, respectively, while a leaf transcript (blpl) encoded the putative large subunit of AGP. There was about 50% identity between the endosperm clones, and both of them were about 60% identical to the leaf cDNA. Northern blot analysis has indicated that beps and bepl are expressed in both the endosperm and roots, while blpl is detectable only in leaves. Application of the PCR technique in studies on gene structure and gene expression of plant AGP is discussed.

  17. Model for a transcript map of human chromosome 21: isolation of new coding sequences from exon and enriched cDNA libraries.

    Science.gov (United States)

    Yaspo, M L; Gellen, L; Mott, R; Korn, B; Nizetic, D; Poustka, A M; Lehrach, H

    1995-08-01

    The construction of a transcriptional map for human chromosome 21 requires the generation of a specific catalogue of genes, together with corresponding mapping information. Towards this goal, we conducted a pilot study on a pool of random chromosome 21 cosmids representing 2 Mb of non-contiguous DNA. Exon-amplification and cDNA selection methods were used in combination to extract the coding content from these cosmids, and to derive expressed sequences libraries. These libraries and the source cosmid library were arrayed at high density for hybridisation screening. A strategy was used which related data obtained by multiple hybridisations of clones originating from one library, screened against the other libraries. In this way, it was possible to integrate the information with the physical map and to compare the gene recovery rate of each technique. cDNAs and exons were grouped into bins delineated by EcoRI cosmid fragments, and a subset of 91 cDNAs and 29 exons have been sequenced. These sequences defined 79 non-overlapping potential coding segments distributed in 24 transcriptional units, which were mapped along 21q. Northern blot analysis performed for a subset of cDNAs indicated the existence of a cognate transcript. Comparison to databases indicated three segments matching to known chromosome 21 genes: PFKL, COL6A1 and S100B and six segments matching to unmapped anonymous expressed sequence tags (ESTs). At the translated nucleotide level, strong homologies to known proteins were found with ATP-binding transporters of the ABC family and the dihydroorotase domain of pyrimidine synthetases. These data strongly suggest that bona fide partial genes have been isolated. Several of the newly isolated transcriptional units map to clinically important regions, in particular those involved in Down's syndrome, progressive myoclonus epilepsia and auto-immune polyglandular disease. The study presented here illustrates the complementarity of exon-amplification and cDNA

  18. Sequencing over 13 000 expressed sequence tags from six subtractive cDNA libraries of wild and modern wheats following slow drought stress.

    Science.gov (United States)

    Ergen, Neslihan Z; Budak, Hikmet

    2009-03-01

    A deeper understanding of the drought response and genetic improvement of the cultivated crops for better tolerance requires attention because of the complexity of the drought response syndrome and the loss of genetic diversity during domestication. We initially screened about 200 wild emmer wheat genotypes and then focused on 26 of these lines, which led to the selection of two genotypes with contrasting responses to water deficiency. Six subtractive cDNA libraries were constructed, and over 13 000 expressed sequence tags (ESTs) were sequenced using leaf and root tissues of wild emmer wheat genotypes TR39477 (tolerant) and TTD-22 (sensitive), and modern wheat variety Kiziltan drought stressed for 7 d. Clustering and assembly of ESTs resulted in 2376 unique sequences (1159 without hypothetical proteins and no hits), 75% of which were represented only once. At this level of EST sampling, each tissue shared a very low percentage of transcripts (13-26%). The data obtained indicated that the genotypes shared common elements of drought stress as well as distinctly differential expression patterns that might be illustrative of their contrasting ability to tolerate water deficiencies. The new EST data generated here provide a highly diverse and rich source for gene discovery in wheat and other grasses.

  19. Full-Length Venom Protein cDNA Sequences from Venom-Derived mRNA: Exploring Compositional Variation and Adaptive Multigene Evolution.

    Science.gov (United States)

    Modahl, Cassandra M; Mackessy, Stephen P

    2016-06-01

    Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides

  20. Construction and evaluation of normalized cDNA libraries enriched with full-length sequences for rapid discovery of new genes from Sisal (Agave sisalana Perr.) different developmental stages.

    Science.gov (United States)

    Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

    2012-10-12

    To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing.

  1. Construction and Evaluation of Normalized cDNA Libraries Enriched with Full-Length Sequences for Rapid Discovery of New Genes from Sisal (Agave sisalana Perr. Different Developmental Stages

    Directory of Open Access Journals (Sweden)

    Jun-Feng Li

    2012-10-01

    Full Text Available To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN. This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing.

  2. Cloning and sequencing of a cDNA for the delta-subunit of photosynthetic ATP-synthase (EC 3.6.1.34) from pea (Pisum sativum).

    Science.gov (United States)

    Hoesche, J A; Berzborn, R J

    1992-12-29

    lambda gt10 cDNA clones for the nuclear encoded subunit delta of chloroplast ATP-synthase from Pisum sativum have been isolated. The 5' end was completed by PCR. The sequenced cDNA codes for the import precursor. N-Terminal sequencing of the mature protein isolated from chloroplasts revealed that the processing sites of the transit peptide from Pisum sativum and Spinacea oleracea are similar. The overall homology of the deduced amino acid sequences of the mature delta proteins from higher plants is about 40%. The conservation among hydrophilic residues is higher than for hydrophobic ones, indicating that the surface of delta is important for its function within the ATP-synthase.

  3. Characterization and phylogenetic analysis of lectin gene cDNA isolated from sea cucumber ( Apostichopus japonicus) body wall

    Science.gov (United States)

    Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli

    2017-12-01

    As a `living fossil' of species origin and `rich treasure' of food and nutrition development, sea cucumber has received a lot of attentions from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment which is highly homologous with the genes of lectin family, named AjL ( Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163 amino acids secretory protein being homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing), and agglutinated against rabbit red blood cells. AjL was similar to chain A of CEL-IV in space structure. We predicted that AjL may play the same role of CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most of other species, which was fused by uncertain sequences during the evolution and encoded different proteins with diverse functions. Our findings provided the insights into the function and characteristics of lectin genes invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.

  4. Identification of testis-relevant genes using in silico analysis from testis ESTs and cDNA microarray in the black tiger shrimp (Penaeus monodon

    Directory of Open Access Journals (Sweden)

    Wongsurawat Thidathip

    2010-08-01

    Full Text Available Abstract Background Poor reproductive maturation of the black tiger shrimp (Penaeus monodon in captivity is one of the serious threats to sustainability of the shrimp farming industry. Understanding molecular mechanisms governing reproductive maturation processes requires the fundamental knowledge of integrated expression profiles in gonads of this economically important species. In P. monodon, a non-model species for which the genome sequence is not available, expressed sequence tag (EST and cDNA microarray analyses can help reveal important transcripts relevant to reproduction and facilitate functional characterization of transcripts with important roles in male reproductive development and maturation. Results In this study, a conventional testis EST library was exploited to reveal novel transcripts. A total of 4,803 ESTs were unidirectionally sequenced and analyzed in silico using a customizable data analysis package, ESTplus. After sequence assembly, 2,702 unique sequences comprised of 424 contigs and 2,278 singletons were identified; of these, 1,133 sequences are homologous to genes with known functions. The sequences were further characterized according to gene ontology categories (41% biological process, 24% molecular function, 35% cellular component. Through comparison with EST libraries of other tissues of P. monodon, 1,579 transcripts found only in the testis cDNA library were identified. A total of 621 ESTs have not been identified in penaeid shrimp. Furthermore, cDNA microarray analysis revealed several ESTs homologous to testis-relevant genes were more preferentially expressed in testis than in ovary. Representatives of these transcripts, homologs of saposin (PmSap and Dmc1 (PmDmc1, were further characterized by RACE-PCR. The more abundant expression levels in testis than ovary of PmSap and PmDmc1 were verified by quantitative real-time PCR in juveniles and wild broodstock of P. monodon. Conclusions Without a genome sequence, a

  5. Monoterpene biosynthesis in lemon (Citrus limon) cDNA isolation and functional analysis of four monoterpene synthases

    NARCIS (Netherlands)

    Lücker, J.; Tamer, El M.K.; Schwab, W.; Verstappen, F.W.A.; Plas, van der L.H.W.; Bouwmeester, H.J.; Verhoeven, H.A.

    2002-01-01

    Citrus limon possesses a high content and large variety of monoterpenoids, especially in the glands of the fruit flavedo. The genes responsible for the production of these monoterpenes have never been isolated. By applying a random sequencing approach to a cDNA library from mRNA isolated from the

  6. Identification and complete sequencing of novel human transcripts through the use of mouse orthologs and testis cDNA sequences

    DEFF Research Database (Denmark)

    Ferreira, Elisa N; Pires, Lilian C; Parmigiani, Raphael B

    2004-01-01

    The correct identification of all human genes, and their derived transcripts, has not yet been achieved, and it remains one of the major aims of the worldwide genomics community. Computational programs suggest the existence of 30,000 to 40,000 human genes. However, definitive gene identification ......, and a search for protein domains was undertaken based on their putative open reading frames. In silico expression analysis suggests that some of these transcripts are expressed at low levels and in a restricted set of tissues....

  7. EST analysis and annotation of transcripts derived from a trichome-specific cDNA library from Salvia fruticosa.

    Science.gov (United States)

    Chatzopoulou, Fani M; Makris, Antonios M; Argiriou, Anagnostis; Degenhardt, Jörg; Kanellis, Angelos K

    2010-05-01

    Greek sage (Salvia fruticosa Mill., Syn. Salvia triloba L.) is appreciated for its essential oil which is used as an aromatic spice and active against a wide range of microorganisms and viruses. The essential oil is dominated by terpenoids and flavonoids which are produced and stored in glandular trichomes on the plant surface. The present study aims to give insights into the metabolic activities of S. fruticosa trichomes on a transcriptome level. A total of 2,304 clones were sequenced from a cDNA library from leaves' trichomes of S. fruticosa. Exclusion of sequences shorter than 100 bp resulted in 1,615 high-quality ESTs with a mean length of 592 bp. Cluster analysis indicated the presence of 197 contigs (908 clones) and 707 singletons, generating a total of 904 unique sequences. Of the 904 unique ESTs, 628 (69.5%) had significant hits in the non-redundant protein database and were annotated. A total of 517 (82.3%) sequences were functionally classified using the gene ontologies (GO) and established pathway associations to 220 (24.3%) sequences in Kyoto encyclopedia of genes and genomes (KEGG). In addition, 52 (5.8%) of the unique ESTs revealed a GO biological term with relation to terpenoid (78 ESTs), phenylpropanoid (43 ESTs), flavonoid (18 ESTs) or alkaloid (10 ESTs) biosynthesis or to P450s (26 ESTs). Expression analysis of a selected set of genes known to be involved in the pathways of secondary metabolite synthesis showed higher expression levels in trichomes, validating the tissue specificity of the analyzed glandular trichome library.

  8. Localization of the human fibromodulin gene (FMOD) to chromosome 1q32 and completion of the cDNA sequence

    Energy Technology Data Exchange (ETDEWEB)

    Sztrolovics, R.; Grover, J.; Roughley, P.J. [McGill Univ., Montreal (Canada)] [and others

    1994-10-01

    This report describes the cloning of the 3{prime}-untranslated region of the human fibromodulin cDNA and its use to map the gene. For somatic cell hybrids, the generation of the PCR product was concordant with the presence of chromosome 1 and discordant with the presence of all other chromosomes, confirming that the fibromodulin gene is located within region q32 of chromosome 1. The physical mapping of genes is a critical step in the process of identifying which genes may be responsible for various inherited disorders. Specifically, the mapping of the fibromodulin gene now provides the information necessary to evaluate its potential role in genetic disorders of connective tissues. The analysis of previously reported diseases mapped to chromosome 1 reveals two genes located in the proximity of the fibromodulin locus. These are Usher syndrome type II, a recessive disorder characterized by hearing loss and retinitis pigmentosa, and Van der Woude syndrome, a dominant condition associated with abnormalities such as cleft lip and palate and hyperdontia. The genes for both of these disorders have been projected to be localized to 1q32 of a physical map that integrates available genetic linkage and physical data. However, it seems improbable that either of these disorders, exhibiting restricted tissue involvement, could be linked to the fibromodulin gene, given the wide tissue distribution of the encoded proteoglycan, although it remains possible that the relative importance of the quantity and function of the proteoglycan may avry between tissues. 11 refs., 1 fig.

  9. Phenoloxidase from the sea cucumber Apostichopus japonicus: cDNA cloning, expression and substrate specificity analysis.

    Science.gov (United States)

    Jiang, Jingwei; Zhou, Zunchun; Dong, Ying; Sun, Hongjuan; Chen, Zhong; Yang, Aifu; Gao, Shan; Wang, Bai; Jiang, Bei; Guan, Xiaoyan

    2014-02-01

    Phenoloxidase (PO) is a crucial component of the immune system of echinoderms. In the present study, the full-length cDNA of PO (AjPO) was cloned from coelomocytes of the sea cucumber Apostichopus japonicus using 3'- and 5'-rapid amplification of cDNA ends (RACE) PCR method, which is 2508 bp, with an open reading frame (ORF) of 2040 bp encoding 679 amino acids. AjPO contains a transmembrane domain, and three Cu-oxidase domains with copper binding centers formed by 10 histidines, one cysteine and one methionine respectively. Phylogenetic analysis revealed that AjPO was clustered with laccase-type POs of invertebrates. Using the isolated membrane proteins as crude AjPO, the enzyme could catalyze the substrates catechol, L-3,4-dihydroxyphenylalanine (L-DOPA), dopamine and hydroquinone, but failed to oxidize tyrosine. The results described above collectively proved that AjPO was a membrane-binding laccase-type PO. The quantitative real-time PCR (qRT-PCR) analysis revealed that AjPO mRNA was expressed in muscle, body wall, coelomocytes, tube feet, respiratory tree and intestine with the highest expression level in coelomocytes. AjPO could be significantly induced by lipopolysaccharide (LPS), peptidoglycan (PGN), Zymosan A and polyinosinic-polycytidylic acid (PolyI:C), suggesting AjPO is closely involved in the defense against the infection of bacteria, fungi and double-stranded RNA viruses. Copyright © 2013 Elsevier Ltd. All rights reserved.

  10. Crustacean hyperglycemic hormones of two cold water crab species, Chionoecetes opilio and C. japonicus: isolation of cDNA sequences and localization of CHH neuropeptide in eyestalk ganglia.

    Science.gov (United States)

    Chung, J Sook; Ahn, I S; Yu, O H; Kim, D S

    2015-04-01

    Crustacean hyperglycemic hormone (CHH) is primarily known for its prototypical function in hyperglycemia which is induced by the release of CHH. The CHH release takes place as an adaptive response to the energy demands of the animals experiencing stressful environmental, physiological or behavioral conditions. Although >63 decapod CHH nucleotide sequences are known (GenBank), the majority of them is garnered from the species inhabiting shallow and warm water. In order to understand the adaptive role of CHH in Chionoecetes opilio and Chionoecetes japonicus inhabiting deep water environments, we first aimed for the isolation of the full-length cDNA sequence of CHH from the eyestalk ganglia of C. opilio (ChoCHH) and C. japonicus (ChjCHH) using degenerate PCR and 5' and 3' RACE. Cho- and ChjCHH cDNA sequences are identical in 5' UTR and ORF with 100% sequence identity of the putative 138aa of preproCHHs. The length of 3' UTR ChjCHH cDNA sequence is 39 nucleotides shorter than that of ChoCHH. This is the first report in decapod crustaceans that two different species have the identical sequence of CHH. ChoCHH expression increases during embryogenesis of C. opilio and is significantly higher in adult males and females. C. japonicus males have slightly higher ChjCHH expression than C. opilio males, but no statistical difference. In both species, the immunostaining intensity of CHH is stronger in the sinus gland than that of X-organ cells. Future studies will enable us to gain better understanding of the comparative metabolic physiology and endocrinology of cold, deep water species of Chionoecetes spp. Copyright © 2014 Elsevier Inc. All rights reserved.

  11. High-throughput screening of suppression subtractive hybridization cDNA libraries using DNA microarray analysis.

    Science.gov (United States)

    van den Berg, Noëlani; Crampton, Bridget G; Hein, Ingo; Birch, Paul R J; Berger, Dave K

    2004-11-01

    Efficient construction of cDNA libraries enriched for differentially expressed transcripts is an important first step in many biological investigations. We present a quantitative procedure for screening cDNA libraries constructed by suppression subtractive hybridization (SSH). The methodology was applied to two independent SSHs from pearl millet and banana. Following two-color cyanin dye labeling and hybridization of subtracted tester with either unsubtracted driver or unsubtracted tester cDNAs to the SSH libraries arrayed on glass slides, two values were calculated for each clone, an enrichment ratio 1 (ER1) and an enrichment ratio 2 (ER2). Graphical representation of ER1 and ER2 enabled the identification of clones that were likely to represent up-regulated transcripts. Normalization of each clone by the SSH process was determined from the ER2 values, thereby indicating whether clones represented rare or abundant transcripts. Differential expression of pearl millet and banana clones identified from both libraries by this quantitative approach was verified by inverse Northern blot analysis.

  12. Analysis of cDNA libraries from developing seeds of guar (Cyamopsis tetragonoloba (L. Taub

    Directory of Open Access Journals (Sweden)

    Dixon Richard A

    2007-11-01

    Full Text Available Abstract Background Guar, Cyamopsis tetragonoloba (L. Taub, is a member of the Leguminosae (Fabaceae family and is economically the most important of the four species in the genus. The endosperm of guar seed is a rich source of mucilage or gum, which forms a viscous gel in cold water, and is used as an emulsifier, thickener and stabilizer in a wide range of foods and industrial applications. Guar gum is a galactomannan, consisting of a linear (1→4-β-linked D-mannan backbone with single-unit, (1→6-linked, α-D-galactopyranosyl side chains. To better understand regulation of guar seed development and galactomannan metabolism we created cDNA libraries and a resulting EST dataset from different developmental stages of guar seeds. Results A database of 16,476 guar seed ESTs was constructed, with 8,163 and 8,313 ESTs derived from cDNA libraries I and II, respectively. Library I was constructed from seeds at an early developmental stage (15–25 days after flowering, DAF, and library II from seeds at 30–40 DAF. Quite different sets of genes were represented in these two libraries. Approximately 27% of the clones were not similar to known sequences, suggesting that these ESTs represent novel genes or may represent non-coding RNA. The high flux of energy into carbohydrate and storage protein synthesis in guar seeds was reflected by a high representation of genes annotated as involved in signal transduction, carbohydrate metabolism, chaperone and proteolytic processes, and translation and ribosome structure. Guar unigenes involved in galactomannan metabolism were identified. Among the seed storage proteins, the most abundant contig represented a conglutin accounting for 3.7% of the total ESTs from both libraries. Conclusion The present EST collection and its annotation provide a resource for understanding guar seed biology and galactomannan metabolism.

  13. Frameshift mutations in infectious cDNA clones of Citrus tristeza virus: a strategy to minimize the toxicity of viral sequences to Escherichia coli

    International Nuclear Information System (INIS)

    Satyanarayana, Tatineni; Gowda, Siddarame; Ayllon, Maria A.; Dawson, William O.

    2003-01-01

    The advent of reverse genetics revolutionized the study of positive-stranded RNA viruses that were amenable for cloning as cDNAs into high-copy-number plasmids of Escherichia coli. However, some viruses are inherently refractory to cloning in high-copy-number plasmids due to toxicity of viral sequences to E. coli. We report a strategy that is a compromise between infectivity of the RNA transcripts and toxicity to E. coli effected by introducing frameshift mutations into 'slippery sequences' near the viral 'toxicity sequences' in the viral cDNA. Citrus tristeza virus (CTV) has cDNA sequences that are toxic to E. coli. The original full-length infectious cDNA of CTV and a derivative replicon, CTV-ΔCla, cloned into pUC119, resulted in unusually limited E. coli growth. However, upon sequencing of these cDNAs, an additional uridinylate (U) was found in a stretch of U's between nts 3726 and 3731 that resulted in a change to a reading frame with a stop codon at nt 3734. Yet, in vitro produced RNA transcripts from these clones infected protoplasts, and the resulting progeny virus was repaired. Correction of the frameshift mutation in the CTV cDNA constructs resulted in increased infectivity of in vitro produced RNA transcripts, but also caused a substantial increase of toxicity to E. coli, now requiring 3 days to develop visible colonies. Frameshift mutations created in sequences not suspected to facilitate reading frame shifting and silent mutations introduced into oligo(U) regions resulted in complete loss of infectivity, suggesting that the oligo(U) region facilitated the repair of the frameshift mutation. Additional frameshift mutations introduced into other oligo(U) regions also resulted in transcripts with reduced infectivity similarly to the original clones with the +1 insertion. However, only the frameshift mutations introduced into oligo(U) regions that were near and before the toxicity region improved growth and stability in E. coli. These data demonstrate that

  14. Image sequence analysis

    CERN Document Server

    1981-01-01

    The processing of image sequences has a broad spectrum of important applica­ tions including target tracking, robot navigation, bandwidth compression of TV conferencing video signals, studying the motion of biological cells using microcinematography, cloud tracking, and highway traffic monitoring. Image sequence processing involves a large amount of data. However, because of the progress in computer, LSI, and VLSI technologies, we have now reached a stage when many useful processing tasks can be done in a reasonable amount of time. As a result, research and development activities in image sequence analysis have recently been growing at a rapid pace. An IEEE Computer Society Workshop on Computer Analysis of Time-Varying Imagery was held in Philadelphia, April 5-6, 1979. A related special issue of the IEEE Transactions on Pattern Anal­ ysis and Machine Intelligence was published in November 1980. The IEEE Com­ puter magazine has also published a special issue on the subject in 1981. The purpose of this book ...

  15. The construction of a cDNA library enriched for immune genes and the analysis of 7535 ESTs from Chinese mitten crab Eriocheir sinensis.

    Science.gov (United States)

    Gai, Yunchao; Wang, Lingling; Zhao, Jianmin; Qiu, Limei; Song, Linsheng; Li, Ling; Mu, Changkao; Wang, Wan; Wang, Mengqiang; Zhang, Ying; Yao, Xuemei; Yang, Jialong

    2009-12-01

    Chinese mitten crab Eriocheir sinensis is one of the most important aquaculture crustacean species in China. A cDNA library was constructed from hemocytes of E. sinensis challenged with the mixture of Listonella anguillarum and Staphylococcus aureus, and randomly sequenced to collect genomic information and identify genes involved in immune defense response. Single-pass 5' sequencing of 10368 clones yielded 7535 high quality ESTs (Expressed Sequence Tags) and these ESTs were assembled into 2943 unigenes. BLAST analysis revealed that 1706 unigenes (58.0% of the total) or 4593 ESTs (61.0% of the total) were novel genes that had no significant matches to any protein sequences in the public databases. The rest 1237 unigenes (42.0% of the total) were closely matched to the known genes or sequences deposited in public databases, which could be classed into 20 or 23 classifications according to "molecular function" or "biological process" respectively based on the Gene Ontology (GO). And 221 unigenes (7.5% of all 2943 unigenes, 17.9% of matched unigenes) or 969 ESTs (12.9% of all 7535 ESTs, 32.9% of matched ESTs) were identified to be immune genes. The relative higher proportion of immune-related genes in the present cDNA library than that in the normal library of E. sinensis and other crustaceans libraries, and the differences and changes in percentage and quantity of some key immune-related genes especially the immune inducible genes between two E. sinensis cDNA libraries may derive from the bacteria challenge to the Chinese mitten crab. The results provided a well-characterized EST resource for the genomics community, gene discovery especially for the identification of host-defense genes and pathways in crabs as well as other crustaceans.

  16. Gene expression profile analysis in human hepatocellular carcinoma by cDNA microarray.

    Science.gov (United States)

    Chung, Eun Jung; Sung, Young Kwan; Farooq, Mohammad; Kim, Younghee; Im, Sanguk; Tak, Won Young; Hwang, Yoon Jin; Kim, Yang Il; Han, Hyung Soo; Kim, Jung-Chul; Kim, Moon Kyu

    2002-12-31

    We performed gene expression profiling of normal and hepatocellular carcinoma (HCC) liver tissues using a high-density microarray that contained 3,063 human cDNA. The results of a microarray hybridization experiment from eight different HCC tissues were analyzed and classified by the Cluster program. Among these differentially-expressed genes, the galectin-3, serine/threonine kinase SGK, translation factor eIF-4A, -4B, -3, fibroblast growth factor receptor, and ribosomal protein L35A were up-regulated; the mRNAs of Nip3, decorin, and the insulin-like growth factor binding protein-3 were down-regulated in HCC. The differential expression of these genes was further confirmed by an RT-PCR analysis. In addition, our data suggest that the gene expression profile of HCC varies according to the histological types.

  17. [cDNA library construction from panicle meristem of finger millet].

    Science.gov (United States)

    Radchuk, V; Pirko, Ia V; Isaenkov, S V; Emets, A I; Blium, Ia B

    2014-01-01

    The protocol for production of full-size cDNA using SuperScript Full-Length cDNA Library Construction Kit II (Invitrogen) was tested and high quality cDNA library from meristematic tissue of finger millet panicle (Eleusine coracana (L.) Gaertn) was created. The titer of obtained cDNA library comprised 3.01 x 10(5) CFU/ml in avarage. In average the length of cDNA insertion consisted about 1070 base pairs, the effectivity of cDNA fragment insertions--99.5%. The selective sequencing of cDNA clones from created library was performed. The sequences of cDNA clones were identified with usage of BLAST-search. The results of cDNA library analysis and selective sequencing represents prove good functionality and full length character of inserted cDNA clones. Obtained cDNA library from meristematic tissue of finger millet panicle represents good and valuable source for isolation and identification of key genes regulating metabolism and meristematic development and for mining of new molecular markers to conduct out high quality genetic investigations and molecular breeding as well.

  18. Sequence analysis on microcomputers.

    Science.gov (United States)

    Cannon, G C

    1987-10-02

    Overall, each of the program packages performed their tasks satisfactorily. For analyses where there was a well-defined answer, such as a search for a restriction site, there were few significant differences between the program sets. However, for tasks in which a degree of flexibility is desirable, such as homology or similarity determinations and database searches, DNASTAR consistently afforded the user more options in conducting the required analysis than did the other two packages. However, for laboratories where sequence analysis is not a major effort and the expense of a full sequence analysis workstation cannot be justified, MicroGenie and IBI-Pustell offer a satisfactory alternative. MicroGenie is a polished program system. Many may find that its user interface is more "user friendly" than the standard menu-driven interfaces. Its system of filing sequences under individual passwords facilitates use by more than one person. MicroGenie uses a hardware device for software protection that occupies a card slot in the computer on which it is used. Although I am sympathetic to the problem of software piracy, I feel that a less drastic solution is in order for a program likely to be sharing limited computer space with other software packages. The IBI-Pustell package performs the required analysis functions as accurately and quickly as MicroGenie but it lacks the clearness and ease of use. The menu system seems disjointed, and new or infrequent users often find themselves at apparent "dead-end menus" where the only clear alternative is to restart the entire program package. It is suggested from published accounts that the user interface is going to be upgraded and perhaps when that version is available, use of the system will be improved. The documentation accompanying each package was relatively clear as to how to run the programs, but all three packages assumed that the user was familiar with the computational techniques employed. MicroGenie and IBI-Pustell further

  19. Acetylcholinesterase of the Sand Fly, Phlebotomus papatasi (Scopoli): cDNA Sequence, Baculovirus Expression, and Biochemical Properties

    Science.gov (United States)

    2013-01-01

    and domestic animals around the world are affected by leishmaniasis, a disease caused by various species of flagellated protozoans in the genus...distribution unlimited 13. SUPPLEMENTARY NOTES 14. ABSTRACT Background: Millions of people and domestic animals around the world are affected by...nucleotides, [GenBank: DQ898276]) respectively, as well as to other arthropod AChEs. The P. papatasi AChE cDNA ORF encodes a 710-amino acid protein

  20. [Construction and preliminary analysis of a full-length cDNA library for Paris polyphylla var. yunnanensis].

    Science.gov (United States)

    Zhao, Shuang; Dong, Xu; Ma, Teng

    2014-01-01

    A full-length cDNA library of Paris polyphylla var. yunnanensis was constructed in order to research the genes relating to growing development and the genes regulation of its secondary metabolite biosynthesis. The total RNA was extracted from Paris polyphylla var. yunnanensis using modified Trizol method. The SMART (switching mechanism at 5' end of RNA transcript )technology was appliedl to construct the full-length cDNA library. The library titer,recombinant rate and length of insert fragments were determined,the sequences of the library were analyzed by Blastx and were compared to GenBank database. The capacity of the library was 2. 5 x 107 cfu/mL, the recombinant rate was 98.5% and the average size of the inserted fragment was 1.5 kb. 9 ESTs (Expressed Sequence Tags) were relating to growing development and 5 ESTs were relating to regulation of secondary metabolite biosynthesis among 149 ESTs obtained from 192 clones sequenced. A full-length cDNA library of Paris polyphylla var. yunnanensis is constructed by SMART technology successfully, and the library has enough capacity, high recombinant rate and long insert fragment for the further research to screen and identify the functional genes of Paris polyphylla var. yunnanensis.

  1. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis.

    Science.gov (United States)

    Fernandez, Paula; Di Rienzo, Julio; Fernandez, Luis; Hopp, H Esteban; Paniego, Norma; Heinz, Ruth A

    2008-01-26

    Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses as drought, low temperatures and salinity arises as one of the main constrains nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags) were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profiles of organ-specific ESTs by fluorescence microarray assay, in response to high sodium chloride concentration and chilling treatments with the aim to identify and follow up candidate genes for early responses to abiotic stress in sunflower. Abiotic-related expressed genes were the target of this characterization through a gene expression analysis using an organ-specific cDNA fluorescence microarray approach in response to high salinity and low temperatures. The experiment included three independent replicates from leaf samples. We analyzed 317 unigenes previously isolated from differential organ-specific cDNA libraries from leaf, stem and flower at R1 and R4 developmental stage. A statistical analysis based on mean comparison by ANOVA and ordination by Principal Component Analysis allowed the detection of 80 candidate genes for either salinity and/or chilling stresses. Out of them, 50 genes were up or down regulated under both stresses, supporting common regulatory mechanisms and general responses to chilling and salinity. Interestingly 15 and 12 sequences were up regulated or down regulated specifically in one stress but not in the other, respectively. These genes are potentially involved in different regulatory mechanisms including transcription/translation/protein degradation/protein folding/ROS production or ROS-scavenging. Differential gene expression patterns were confirmed by qRT-PCR for 12.5% of the microarray candidate sequences. Eighty genes isolated from organ-specific cDNA libraries

  2. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis

    Directory of Open Access Journals (Sweden)

    Paniego Norma

    2008-01-01

    Full Text Available Abstract Background Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses as drought, low temperatures and salinity arises as one of the main constrains nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profiles of organ-specific ESTs by fluorescence microarray assay, in response to high sodium chloride concentration and chilling treatments with the aim to identify and follow up candidate genes for early responses to abiotic stress in sunflower. Results Abiotic-related expressed genes were the target of this characterization through a gene expression analysis using an organ-specific cDNA fluorescence microarray approach in response to high salinity and low temperatures. The experiment included three independent replicates from leaf samples. We analyzed 317 unigenes previously isolated from differential organ-specific cDNA libraries from leaf, stem and flower at R1 and R4 developmental stage. A statistical analysis based on mean comparison by ANOVA and ordination by Principal Component Analysis allowed the detection of 80 candidate genes for either salinity and/or chilling stresses. Out of them, 50 genes were up or down regulated under both stresses, supporting common regulatory mechanisms and general responses to chilling and salinity. Interestingly 15 and 12 sequences were up regulated or down regulated specifically in one stress but not in the other, respectively. These genes are potentially involved in different regulatory mechanisms including transcription/translation/protein degradation/protein folding/ROS production or ROS-scavenging. Differential gene expression patterns were confirmed by qRT-PCR for 12.5% of the microarray candidate sequences. Conclusion

  3. cDNA cloning, mRNA distribution and heterogeneity, chromosomal location, and RFLP analysis of human osteopontin (OPN)

    DEFF Research Database (Denmark)

    Young, M F; Kerr, J M; Termine, J D

    1990-01-01

    A human osteopontin (OP) cDNA was isolated from a library made from primary cultures of human bone cells. The distribution of osteopontin mRNA in human tissues was investigated by Northern analysis and showed that the human message was predominant in cultures of bone cells and in decidua cells is...

  4. A simple and novel method for RNA-seq library preparation of single cell cDNA analysis by hyperactive Tn5 transposase.

    Science.gov (United States)

    Brouilette, Scott; Kuersten, Scott; Mein, Charles; Bozek, Monika; Terry, Anna; Dias, Kerith-Rae; Bhaw-Rosun, Leena; Shintani, Yasunori; Coppen, Steven; Ikebe, Chiho; Sawhney, Vinit; Campbell, Niall; Kaneko, Masahiro; Tano, Nobuko; Ishida, Hidekazu; Suzuki, Ken; Yashiro, Kenta

    2012-10-01

    Deep sequencing of single cell-derived cDNAs offers novel insights into oncogenesis and embryogenesis. However, traditional library preparation for RNA-seq analysis requires multiple steps with consequent sample loss and stochastic variation at each step significantly affecting output. Thus, a simpler and better protocol is desirable. The recently developed hyperactive Tn5-mediated library preparation, which brings high quality libraries, is likely one of the solutions. Here, we tested the applicability of hyperactive Tn5-mediated library preparation to deep sequencing of single cell cDNA, optimized the protocol, and compared it with the conventional method based on sonication. This new technique does not require any expensive or special equipment, which secures wider availability. A library was constructed from only 100 ng of cDNA, which enables the saving of precious specimens. Only a few steps of robust enzymatic reaction resulted in saved time, enabling more specimens to be prepared at once, and with a more reproducible size distribution among the different specimens. The obtained RNA-seq results were comparable to the conventional method. Thus, this Tn5-mediated preparation is applicable for anyone who aims to carry out deep sequencing for single cell cDNAs. Copyright © 2012 Wiley Periodicals, Inc.

  5. Identification and functional analysis of a new glyphosate resistance gene from a fungus cDNA library.

    Science.gov (United States)

    Tao, Bo; Shao, Bai-Hui; Qiao, Yu-Xin; Wang, Xiao-Qin; Chang, Shu-Jun; Qiu, Li-Juan

    2017-08-01

    Glyphosate is a widely used broad spectrum herbicide; however, this limits its use once crops are planted. If glyphosate-resistant crops are grown, glyphosate can be used for weed control in crops. While several glyphosate resistance genes are used in commercial glyphosate tolerant crops, there is interest in identifying additional genes for glyphosate tolerance. This research constructed a high-quality cDNA library form the glyphosate-resistant fungus Aspergillus oryzae RIB40 to identify genes that may confer resistance to glyphosate. Using a medium containing glyphosate (120mM), we screened several clones from the library. Based on a nucleotide sequence analysis, we identified a gene of unknown function (GenBank accession number: XM_001826835.2) that encoded a hypothetical 344-amino acid protein. The gene was named MFS40. Its ORF was amplified to construct an expression vector, pGEX-4T-1-MFS40, to express the protein in Escherichia coli BL21. The gene conferred glyphosate tolerance to E. coli ER2799 cells. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Screening of cDNA libraries on glass slide microarrays.

    Science.gov (United States)

    Berger, Dave K; Crampton, Bridget G; Hein, Ingo; Vos, Wiesner

    2007-01-01

    A quantitative screening method was developed to evaluate the quality of cDNA libraries constructed by suppression subtraction hybridization (SSH) or other enrichment techniques. The SSH technique was adapted to facilitate screening of the resultant library on a small number of glass slide microarrays. A simple data analysis pipeline named SSHscreen using "linear models for microarray data" (limma) functions in the R computing environment was developed to identify clones in the cDNA libraries that are significantly differentially expressed, and to determine if they were rare or abundant in the original treated sample. This approach facilitates the choice of clones from the cDNA library for further analysis, such as DNA sequencing, Northern blotting, RT-PCR, or detailed expression profiling using a custom cDNA microarray. Furthermore, this strategy is particularly useful for studies of nonmodel organisms for which there is little genome sequence information.

  7. Cloning and structural analysis of alpha-latroinsectotoxin cDNA. Abundance of ankyrin-like repeats.

    Science.gov (United States)

    Kiyatkin, N; Dulubova, I; Grishin, E

    1993-04-01

    alpha-Latroinsectotoxin (alpha-LIT), purified from venom glands of the black widow spider Latrodectus mactans tredecimguttatus, is a presynaptic neurotoxin selective only for insects. A cDNA encoding the putative alpha-LIT precursor was isolated from a spider venom gland cDNA library. The cDNA contains a 4236-base-pair open reading frame corresponding to a 157826-Da protein composed of 1411 amino acids. The mature alpha-LIT, with molecular mass approximately 130 kDa, is probably derived from double processing in the N-terminal and C-terminal regions of the primary translation product. The structure region, extending over residues 464-1176, is composed almost entirely of ankyrin-like repeats which represent a motif also found in the alpha-latrotoxin (alpha-LTX), which has selective action on vertebrates. Total alignment of the alpha-LIT and alpha-LTX amino acid sequences reveals an overall similarity of 34.1%. Strong sequence divergence is observed in analogous cysteine-rich regions situated within the ankyrin-repeat domains of both alpha-LIT and alpha-LTX.

  8. Full-length transcriptome analysis using a bias-free cDNA library prepared with the vector-capping method.

    Science.gov (United States)

    Kato, Seishi; Oshikawa, Mio; Ohtoko, Kuniyo

    2011-01-01

    Full-length complementary DNAs (cDNAs) are an essential resource for functional genomics. Recently, we have developed a simple and efficient method for preparing a full-length cDNA library from a small amount of total RNA, named the "vector-capping" method. The biggest advantage of this method is that the intactness of the cDNA can be assured by the presence of dG at the 5' end of the full-length cDNA. Furthermore, the cDNA library represents the mRNA population in the cell owing to a bias-free procedure. In this chapter, we describe not only the protocol for preparing the library but also the points for analyzing the 5'-end sequence of the obtained cDNA.

  9. Cloning of canine myocilin cDNA and molecular analysis of the myocilin gene in Shiba Inu dogs.

    Science.gov (United States)

    Kato, Kumiko; Sasaki, Nobuo; Matsunaga, Satoru; Nishimura, Ryohei; Ogawa, Hiroyuki

    2007-01-01

    To identify canine myocilin cDNA and compare its sequence in glaucomatous and nonglaucomatous Shiba Inu dogs with closed and open iridocorneal angles (ICAs). Total RNA was extracted from the ciliary body of the eyes of a healthy Beagle, and the canine myocilin gene was cloned and sequenced. Of the Shiba Inu dogs tested, five were glaucomatous with closed ICA, three were nonglaucomatous with open ICA, and two were nonglaucomatous with closed ICA. The genomic DNA of these dogs was extracted from peripheral blood leukocytes. The exons of the canine myocilin gene were amplified using the polymerase chain reaction (PCR), and sequenced. The frequency of mutation in canine myocilin DNA was verified in these dogs by using the myocilin cDNA of a Beagle. The canine myocilin cDNA was 1452 bp long and contained the entire open reading frame encoding 483 amino acids. A leucine zipper-like motif and olfactomedin-like domain were conserved in the amino acid residues. The presence of sequence variants in the genomic DNA of Shiba Inu dogs was independent of the occurrence of glaucoma and ICA grading. Myocilin RNA was detected in the ciliary body and trabecular meshwork (TM) of a Beagle. The myocilin sequence of Shiba Inu dogs suggests that myocilin mutations are unlikely to play a significant role in the pathogenesis of primary closed-angle glaucoma in this breed. However, several mutations in the myocilin gene in exon 1 of Shiba Inu dogs may predispose them to an obstruction in the anterior aqueous outflow.

  10. Hybrid Sequencing of Full-Length cDNA Transcripts of Stems and Leaves in Dendrobium officinale

    Directory of Open Access Journals (Sweden)

    Liu He

    2017-10-01

    Full Text Available Dendrobium officinale is an extremely valuable orchid used in traditional Chinese medicine, so sought after that it has a higher market value than gold. Although the expression profiles of some genes involved in the polysaccharide synthesis have previously been investigated, little research has been carried out on their alternatively spliced isoforms in D. officinale. In addition, information regarding the translocation of sugars from leaves to stems in D. officinale also remains limited. We analyzed the polysaccharide content of D. officinale leaves and stems, and completed in-depth transcriptome sequencing of these two diverse tissue types using second-generation sequencing (SGS and single-molecule real-time (SMRT sequencing technology. The results of this study yielded a digital inventory of gene and mRNA isoform expressions. A comparative analysis of both transcriptomes uncovered a total of 1414 differentially expressed genes, including 844 that were up-regulated and 570 that were down-regulated in stems. Of these genes, one sugars will eventually be exported transporter (SWEET and one sucrose transporter (SUT are expressed to a greater extent in D. officinale stems than in leaves. Two glycosyltransferase (GT and four cellulose synthase (Ces genes undergo a distinct degree of alternative splicing. In the stems, the content of polysaccharides is twice as much as that in the leaves. The differentially expressed GT and transcription factor (TF genes will be the focus of further study. The genes DoSWEET4 and DoSUT1 are significantly expressed in the stem, and are likely to be involved in sugar loading in the phloem.

  11. Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

    Directory of Open Access Journals (Sweden)

    Hayashizaki Yoshihide

    2009-06-01

    Full Text Available Abstract Background Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals. Results As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues is under adaptive selection. Conclusion We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the

  12. Human glutamate pyruvate transaminase (GPT): Localization to 8q24.3, cDNA and genomic sequences, and polymorphic sites

    Energy Technology Data Exchange (ETDEWEB)

    Sohocki, M.M.; Sullivan, L.S.; Daiger, S.P. [Univ. of Texas Health Science Center, Houston, TX (United States)] [and others

    1997-03-01

    Two frequent protein variants of glutamate pyruvate transaminase (GPT) (E.C.2.6.1.2) have been used as genetic markers in humans for more than two decades, although chromosomal mapping of the GPT locus in the 1980s produced conflicting results. To resolve this conflict and develop useful DNA markers for this gene, we isolated and characterized cDNA and genomic clones of GPT. We have definitively mapped human GPT to the terminus of 8q using several methods. First, two cosmids shown to contain the GPT sequence were derived from a chromosome 8-specific library. Second, by fluorescence in situ hybridization, we mapped the cosmid containing the human GPT gene to chromosome band 8q24.3. Third, we mapped the rat gpt cDNA to the syntenic region of rat chromosome 7. Finally, PCR primers specific to human GPT amplify sequences contained within a {open_quotes}half-YAC{close_quotes} from the long arm of chromosome 8, that is, a YAC containing the 8q telomere. The human GPT genomic sequence spans 2.7 kb and consists of 11 exons, ranging in size from 79 to 243 bp. The exonic sequence encodes a protein of 495 amino acids that is nearly identical to the previously reported protein sequence of human GPT-1. The two polymorphic GPT isozymes are the result of a nucleotide substitution in codon 14. In addition, a cosmid containing the GPT sequence also contains a previously unmapped, polymorphic microsatellite sequence, D8S421. The cloned GPT gene and associated polymorphisms will be useful for linkage and physical mapping of disease loci that map to the terminus of 8q, including atypical vitelliform macular dystrophy (VMD1) and epidermolysis bullosa simplex, type Ogna (EBS1). In addition, this will be a useful system for characterizing the telomeric region of 8q. Finally, determination of the molecular basis of the GPT isozyme variants will permit PCR-based detection of this world-wide polymorphism. 22 refs., 3 figs.

  13. Construction of a full-length cDNA library and analysis of expressed ...

    African Journals Online (AJOL)

    sunny t

    2015-06-10

    Jun 10, 2015 ... projects provide a very useful and quick means of accessing gene sequence and expression information. (Manickavelu et al., 2012). Some reports have proven that projects based on ESTs are powerful tools for both the analysis of gene ..... genes controlling many important traits of agronomic importance ...

  14. Construction of a full-length cDNA library and analysis of expressed ...

    African Journals Online (AJOL)

    ... in the GenBank databases. Cluster analysis allowed the identification of 61 unique sequences. These genes were classified into six types by Gene Ontology (GO) annotation. The results also indicated that unigenes of C. capsularis have higher homology to Populus trichocarpa, Ricinus communis and Corchorus olitorius.

  15. Probe-Directed Degradation (PDD) for Flexible Removal of Unwanted cDNA Sequences from RNA-Seq Libraries.

    Science.gov (United States)

    Archer, Stuart K; Shirokikh, Nikolay E; Preiss, Thomas

    2015-04-01

    Most applications for RNA-seq require the depletion of abundant transcripts to gain greater coverage of the underlying transcriptome. The sequences to be targeted for depletion depend on application and species and in many cases may not be supported by commercial depletion kits. This unit describes a method for generating RNA-seq libraries that incorporates probe-directed degradation (PDD), which can deplete any unwanted sequence set, with the low-bias split-adapter method of library generation (although many other library generation methods are in principle compatible). The overall strategy is suitable for applications requiring customized sequence depletion or where faithful representation of fragment ends and lack of sequence bias is paramount. We provide guidelines to rapidly design specific probes against the target sequence, and a detailed protocol for library generation using the split-adapter method including several strategies for streamlining the technique and reducing adapter dimer content. Copyright © 2015 John Wiley & Sons, Inc.

  16. Monitoring expression profiles of rice (Oryza sativa L.) genes under abiotic stresses using cDNA Microarray Analysis (abstract)

    International Nuclear Information System (INIS)

    Rabbani, M.A.

    2005-01-01

    Transcript regulation in response to cold, drought, high salinity and ABA application was investigated in rice (Oryza sativa L., Nipponbare) with microarray analysis including approx. 1700 independent DNA elements derived from three cDNA libraries constructed from 15-day old rice seedlings stressed with drought, cold and high salinity. A total of 141 non-redundant genes were identified, whose expression ratios were more than three-fold compared with the control genes for at least one of stress treatments in microarray analysis. However, after RNA gel blot analysis, a total of 73 genes were identified, among them the transcripts of 36, 62, 57 and 43 genes were found increased after cold, drought, high salinity and ABA application, respectively. Sixteen of these identified genes have been reported previously to be stress inducible in rice, while 57 of which are novel that have not been reported earlier as stress responsive in rice. We observed a strong association in the expression patterns of stress responsive genes and found 15 stress inducible genes that responded to all four treatments. Based on Venn diagram analysis, 56 genes were induced by both drought and high salinity, whereas 22 genes were upregulated by both cold and high salinity stress. Similarly 43 genes were induced by both drought stress and ABA application, while only 17 genes were identified as cold and ABA inducible genes. These results indicated the existence of greater cross talk between drought, ABA and high salinity stress signaling processes than those between cold and ABA, and cold and high salinity stress signaling pathways. The cold, drought, high salinity and ABA inducible genes were classified into four gene groups from their expression profiles. Analysis of data enabled us to identify a number of promoters and possible cis-acting DNA elements of several genes induced by a variety of abiotic stresses by combining expression data with genomic sequence data of rice. Comparative analysis of

  17. cDNA cloning and expression analysis of two distinct Sox8 genes in Paramisgurnus dabryanus (Cypriniformes).

    Science.gov (United States)

    Xia, Xiaohua; Zhao, Jie; Du, Qiyan; Chang, Zhongjie

    2010-08-01

    The Sox9 gene attracts a lot of attention because of its connection with gonadal development and differentiation. However, Sox8, belonging to the same subgroup SoxE, has rarely been studied. To investigate the function as well as the evolutionary origin of SOXE subgroup, we amplified the genomic DNA of Paramisgurnus dabryanu using a pair of degenerate primers. Using rapid amplification of the cDNA ends (RACE), it was discovered that P. dabryanu has two duplicates: Sox8a and Sox8b. Each has an intron of different length in the conserved HMG-box region. The overall sequence similarity of the deduced amino acid of PdSox8a and PdSox8b was 46.26%, and only two amino acids changed in the HMG-box. This is the first evidence showing that there are two distinct duplications of Sox8 genes in Cypriniformes. Southern blot analysis showed only one hybrid band, with lengths 7.4 or 9.2 kb. Both semi-quantitative RT-PCR and real-time quantitative PCR assay displayed that both PdSox8a and PdSox8b are downregulated during early embryonic development. In adult tissues, the two Sox8 genes expressed ubiquitously, and expression levels are particularly high in the gonads and brain. In gonads, both PdSox8a and PdSox8b are expressed at a higher level in the tesis than in the ovary. PdSox8a and PdSox8b may have functional overlaps and are essential for the neuronal development and differentiation of gonads.

  18. Normalized cDNA libraries

    Science.gov (United States)

    Soares, Marcelo B.; Efstratiadis, Argiris

    1997-01-01

    This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.

  19. Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing

    Science.gov (United States)

    Mohr, Sabine; Ghanem, Eman; Smith, Whitney; Sheeter, Dennis; Qin, Yidan; King, Olga; Polioudakis, Damon; Iyer, Vishwanath R.; Hunicke-Smith, Scott; Swamy, Sajani; Kuersten, Scott; Lambowitz, Alan M.

    2013-01-01

    Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility (“retrohoming”) by a process that requires reverse transcription of a highly structured, 2–2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3′ ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications. PMID:23697550

  20. cDNA sequences and organization of IgM heavy chain genes in two holostean fish.

    Science.gov (United States)

    Wilson, M R; van Ravenstein, E; Miller, N W; Clem, L W; Middleton, D L; Warr, G W

    1995-01-01

    Immunoglobulin M heavy chain (mu) sequences of two holostean fish, the bowfin, Amia calva, and the longnose gar, Lepisosteus osseus, were amplified from spleen mRNA by RACE-PCR, cloned, and sequenced. Each mu chain showed the conserved four constant domain structure typical of a secreted mu chain. Southern blot analyses with specific heavy chain variable (VH) and constant (CH) region probes suggest that both fish possess an IgH locus that resembles that of the teleosts, amphibians, and mammals in its organization. The overall sequence similarity of gar and bowfin mu chains was 60% and 48% at the nucleotide and amino acid levels, respectively, while similarity to the mu chains of teleosts and elasmobranchs was lower. The bowfin mu chain possesses a distinctive proline-rich sequence at the C mu 1/C mu 2 boundary; a shorter proline-rich sequence is present at this position in the gar mu chain. Both gar and bowfin show, in their C mu 4 sequences, motifs that could serve as cryptic splice donor sites for the production of mRNA encoding the membrane-bound form of the mu chains, and the bowfin also shows a potential cryptic splice donor site in the C mu 3 exon.

  1. [Construction of suppression subtractive hybridization cDNA library of half-blood males of Dermacentor silvarum and analysis of differentially expressed genes].

    Science.gov (United States)

    Liu, Qi; Wang, Wei-lin; Meng, Qing-feng; Xu, Zhan; Cui, Jie; Liu, Xin-xin; Wang, Wei-li

    2014-08-01

    To construct a suppression subtractive hybridization (SSH) cDNA library of half-blood males of Dermacentor silvarum, and analyze the differentially expressed genes. Total RNA was extracted from the half-blood males and unfed males of D. silvarum. cDNA was synthesized following the protocol of SMARTER cDNA synthesis kit. After Rsa I digestion, cDNA was ligated to adaptors. The cDNA from the half-blood males was used as the tester, and unfed males as the driver. The SSH library was constructed using TaKaRa PCR-select cDNA subtraction kit. Differentially expressed cDNAs were amplified by nested PCR, cloned into PMD-18T vector, transformed into E. coli DH5alpha, and the white-blue plaque selection was used to get the positive clones. The titer of SSH library and the recombination efficiency were calculated. Individual colonies were randomly selected from library. Subtractive efficiency of the subtracted cDNA library was examined by reverse Northern blotting and RT-PCR. Positive clones with differentially expressed genes were sequenced. Homology comparison and function prediction were performed by Blastn and Blastx. The bands of double-stranded cDNAs from half-blood males and unfed males of D. silvarum were dispersed and longer than 500 bp. After Rsa I digestion, the ds cDNA-fragments were 100-1000 bp. The ligation reaction efficiency of adaptor was more than 25%. Nested PCR showed that the bands of subtracted ds cDNA were gathered, ranging from 250 to 500 bp. The titer of SSH library was 700,000 pfu/ml, and the recombination efficiency was 88.5% (239/270). Reverse Northern hybridization revealed that the clones showed stronger signals in half-blood males cDNA probes than in unfed males cDNA probes. RT-PCR showed that among the eight random selected positive clones, 5 clones were up-expressed under half-blood condition. A total of 87 differentially expressed sequence tags (ESTs, 200-800 bp) were obtained from 115 positive clones. Among the 87 ESTs, 53 ESTs showed

  2. cDNA cloning and expression analysis of a mannose-binding lectin ...

    Indian Academy of Sciences (India)

    Pinellia pedatisecta agglutinin (PPA) is a very basic protein that accumulates in the tuber of P. pedatisecta. PPA is a hetero-tetramer protein of 40 kDa, composed of two polypeptide chains A (about 12 kDa) and two polypeptides chains B (about 12 kDa). The full-length cDNA of PPA was cloned from P. pedatisecta using ...

  3. A novel method using edge detection for signal extraction from cDNA microarray image analysis.

    Science.gov (United States)

    Kim, J H; Kim, H Y; Lee, Y S

    2001-06-30

    Gene expression analyses by probes of hybridization from mRNA to cDNA targets arrayed on membranes or activated glass surfaces have revolutionized the way of profiling mega level gene expression. The main remaining problems however are sensitivity of detection, reproducibility and data processing. During processing of microarray images, especially irregularities of spot position and shape could generate significant errors: small regions of signal spots can be mis-included into background area and vice versa. Here we report a novel method to eliminate such obstacles by sensing their edges. Application of edge detection technology on separating spots from the background decreases the probability of the errors and gives more accurate information about the states of spots such as the pixel number, degree of fragmentation, width and height of spot, and circumference of spot. Such information can be used for the quality control of cDNA microarray experiments and filtering of low quality spots. We analyzed the cDNA microarray image that contains 10,368 genes using edge detection and compared the result with that of conventional method which draws circle around the spot.

  4. Analysis and Functional Annotation of an Expressed Sequence Tag Collection for Tropical Crop Sugarcane

    Science.gov (United States)

    Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

    2003-01-01

    To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979

  5. Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane.

    Science.gov (United States)

    Vettore, André L; da Silva, Felipe R; Kemper, Edson L; Souza, Glaucia M; da Silva, Aline M; Ferro, Maria Inês T; Henrique-Silva, Flavio; Giglioti, Eder A; Lemos, Manoel V F; Coutinho, Luiz L; Nobrega, Marina P; Carrer, Helaine; França, Suzelei C; Bacci Júnior, Mauricio; Goldman, Maria Helena S; Gomes, Suely L; Nunes, Luiz R; Camargo, Luis E A; Siqueira, Walter J; Van Sluys, Marie-Anne; Thiemann, Otavio H; Kuramae, Eiko E; Santelli, Roberto V; Marino, Celso L; Targon, Maria L P N; Ferro, Jesus A; Silveira, Henrique C S; Marini, Danyelle C; Lemos, Eliana G M; Monteiro-Vitorello, Claudia B; Tambor, José H M; Carraro, Dirce M; Roberto, Patrícia G; Martins, Vanderlei G; Goldman, Gustavo H; de Oliveira, Regina C; Truffi, Daniela; Colombo, Carlos A; Rossi, Magdalena; de Araujo, Paula G; Sculaccio, Susana A; Angella, Aline; Lima, Marleide M A; de Rosa Júnior, Vicente E; Siviero, Fábio; Coscrato, Virginia E; Machado, Marcos A; Grivet, Laurent; Di Mauro, Sonia M Z; Nobrega, Francisco G; Menck, Carlos F M; Braga, Marilia D V; Telles, Guilherme P; Cara, Frank A A; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

    2003-12-01

    To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged.

  6. Image analysis for DNA sequencing

    International Nuclear Information System (INIS)

    Palaniappan, K.; Huang, T.S.

    1991-01-01

    This paper reports that there is a great deal of interest in automating the process of DNA (deoxyribonucleic acid) sequencing to support the analysis of genomic DNA such as the Human and Mouse Genome projects. In one class of gel-based sequencing protocols autoradiograph images are generated in the final step and usually require manual interpretation to reconstruct the DNA sequence represented by the image. The need to handle a large volume of sequence information necessitates automation of the manual autoradiograph reading step through image analysis in order to reduce the length of time required to obtain sequence data and reduce transcription errors. Various adaptive image enhancement, segmentation and alignment methods were applied to autoradiograph images. The methods are adaptive to the local characteristics of the image such as noise, background signal, or presence of edges. Once the two-dimensional data is converted to a set of aligned one-dimensional profiles waveform analysis is used to determine the location of each band which represents one nucleotide in the sequence. Different classification strategies including a rule-based approach are investigated to map the profile signals, augmented with the original two-dimensional image data as necessary, to textual DNA sequence information

  7. Nucleotide sequence of the cDNA encoding the precursor of the beta subunit of rat lutropin.

    OpenAIRE

    Chin, W W; Godine, J E; Klein, D R; Chang, A S; Tan, L K; Habener, J F

    1983-01-01

    We have determined the nucleotide sequences of cDNAs encoding the precursor of the beta subunit of rat lutropin, a polypeptide hormone that regulates gonadal function, including the development of gametes and the production of steroid sex hormones. The cDNAs were prepared from poly(A)+ RNA derived from the pituitary glands of rats 4 weeks after ovariectomy and were cloned in bacterial plasmids. Bacterial colonies containing transfected plasmids were screened by hybridization with a 32P-labele...

  8. Construction of Geobacillus thermoglucosidasius cDNA library and analysis of genes expressed in response to heat stress.

    Science.gov (United States)

    Tripathy, S; Maiti, N K

    2014-03-01

    Thermophiles exhibit various kinds of molecular mechanisms to survive in extreme environment, but their behavioral responses to long duration stress is poorly understood until date. In the present study, we have prospected for the genes differentially expressed in response to long duration heat stress in thermophilic bacteria. A cDNA library was constructed from Geobacillus thermoglucosidasius grown with a temperature upshift of 10 °C from optimum growth temperature of 45 °C for 16 h. A total of 451 clones from the library were sequenced with accurate base calling that generated 257 high quality sequences with an average read length of 350 bp. We queried our collection of single pass sequences against the NCBI non-redundant database using the BLASTX algorithm and obtained sequences that showed significant similarity (>60%) with heat shock proteins, metabolic proteins and hypothetical proteins. The expressed sequence tags (ESTs) expressed in response to heat stress were annotated that further commuted a strong interaction network among one another. The ESTs based on the best hits were validated by RT-PCR. Di- and tri-nucleotide repeat motifs were also found to be associated with 17 genes involved in heat shock response, metabolism, transport and transcriptional regulation. The present results provide the novel identification of the putative genes responsible for imparting tolerance to bacteria under heat stress and unveil their role for survival of life in environmental extremes.

  9. Trehalose as a good candidate for enriching full-length cDNAs in cDNA library construction.

    Science.gov (United States)

    Chen, Lei; Cao, Lixue; Zhou, Longhai; Jing, Yudong; Chen, Zuozhou; Deng, Cheng; Shen, Yu; Chen, Liangbiao

    2007-01-10

    It has been reported that the disaccharide trehalose is capable of increasing the thermostability and thermoactivity of reverse transcriptase, and therefore improving the length of cDNA synthesis. However, no test has been done on how the disaccharide trehalose performs in the context of the entire cDNA synthesis processes, or whether it can seamlessly integrate into the commercially available cDNA synthesis kit. In this report, we optimized a protocol to incorporate trehalose in the Stratagene's cDNA library construction kit in order to demonstrate great improvement in cDNA's length (average length of 1.8 kb in the trehalose group versus 1.0 kb in the control). Sequence analysis of the cDNA clones showed that the addition of trehalose did not increase the error rate of the RT products but greatly increase the quantity of full-length in cDNA library.

  10. Complete cDNA sequence of the preproform of human pregnancy-associated plasma protein-A. Evidence for expression in the brain and induction by cAMP

    DEFF Research Database (Denmark)

    Haaning, Jesper; Oxvig, Claus; Overgaard, Michael Toft

    1996-01-01

    A cDNA that encodes the prepropeptide of pregnancy-associated plasma protein-A (preproPAPP-A), a putative metalloproteinase, has been cloned and sequenced. PAPP-A is synthesized in the placenta as a 1627-residue precursor preproprotein with a putative 22-residue signal peptide and a highly basic...

  11. cDNA sequence and tissue distribution of the mRNA for bovine and murine p11, the S100-related light chain of the protein-tyrosine kinase substrate p36 (calpactin I)

    DEFF Research Database (Denmark)

    Saris, Chris J M; Kristensen, Torsten; D’Eustachio, Peter

    1987-01-01

    We have isolated and sequenced cDNA clones of bovine nd murine pl 1 mRNAs. The nonpolyadenylated mRNAs are predicted to be 614 and 600 nucleotides, respectively. The p l l mRNAs both contain a 291 nucleotide open reading frame, preceded by a 5”untranslated region of 73 nucleotides in bovine p l l m...

  12. Cloning and sequence of cDNA encoding 1-aminocyclo- propane-1-carboxylate oxidase in Vanda flowers

    Directory of Open Access Journals (Sweden)

    Pattana Srifah Huehne

    2013-08-01

    Full Text Available The 1-aminocyclopropane-1-carboxylate oxidase (ACO gene in the final step of ethylene biosynthesis was isolated from ethylene-sensitive Vanda Miss Joaquim flowers. This consists of 1,242 base pairs (bp encoding for 326 amino acid residues. To investigate the specific divergence in orchid ACO sequences, the deduced Vanda ACO was aligned with five other orchid ACOs. The results reveal that the ACO sequences within Doritaenopsis, Phalaenopsis and Vanda show highly conserved and almost 95% identical homology, while the ACOs isolated from Cymbidium, Dendrobium and Cattleya are 8788% identical to Vanda ACO. In addition, the 2-oxoglutarate- Fe(II_oxygenase (Oxy domain of orchid ACOs consists of a higher degree of amino acid conservation than that of the non-haem dioxygenase (DIOX_N domain. The overall homology regions of Vanda ACO are commonly folded into 12 α-helices and 12 β-sheets similar to the three dimensional template-structure of Petunia ACO. This Vanda ACO cloned gene is highly expressed in flower tissue compared with root and leaf tissues. In particular, there is an abundance of ACO transcript accumulation in the column followed by the lip and the perianth of Vanda Miss Joaquim flowers at the fully-open stage.

  13. Construction and analysis of SSH cDNA library of human vascular endothelial cells related to gastrocarcinoma

    OpenAIRE

    Liu, Yong-Bo; Wei, Zhao-Xia; Li, Li; Li, Hang-Sheng; Chen, Hui; Li, Xiao-Wen

    2003-01-01

    AIM: To construct subtracted cDNA libraries of human vascular endothelial cells (VECs) related to gastrocarcinoma using suppression substractive hybridization (SSH) and to analyze cDNA libraries of gastrocarcinoma and VECs in Cancer Gene Anatomy Project (CGAP) database.

  14. Identification of differentially-expressed genes potentially implicated in drought response in pitaya (Hylocereus undatus) by suppression subtractive hybridization and cDNA microarray analysis.

    Science.gov (United States)

    Fan, Qing-Jie; Yan, Feng-Xia; Qiao, Guang; Zhang, Bing-Xue; Wen, Xiao-Peng

    2014-01-01

    Drought is one of the most severe threats to the growth, development and yield of plant. In order to unravel the molecular basis underlying the high tolerance of pitaya (Hylocereus undatus) to drought stress, suppression subtractive hybridization (SSH) and cDNA microarray approaches were firstly combined to identify the potential important or novel genes involved in the plant responses to drought stress. The forward (drought over drought-free) and reverse (drought-free over drought) suppression subtractive cDNA libraries were constructed using in vitro shoots of cultivar 'Zihonglong' exposed to drought stress and drought-free (control). A total of 2112 clones, among which half were from either forward or reverse SSH library, were randomly picked up to construct a pitaya cDNA microarray. Microarray analysis was carried out to verify the expression fluctuations of this set of clones upon drought treatment compared with the controls. A total of 309 expressed sequence tags (ESTs), 153 from forward library and 156 from reverse library, were obtained, and 138 unique ESTs were identified after sequencing by clustering and blast analyses, which included genes that had been previously reported as responsive to water stress as well as some functionally unknown genes. Thirty six genes were mapped to 47 KEGG pathways, including carbohydrate metabolism, lipid metabolism, energy metabolism, nucleotide metabolism, and amino acid metabolism of pitaya. Expression analysis of the selected ESTs by reverse transcriptase polymerase chain reaction (RT-PCR) corroborated the results of differential screening. Moreover, time-course expression patterns of these selected ESTs further confirmed that they were closely responsive to drought treatment. Among the differentially expressed genes (DEGs), many are related to stress tolerances including drought tolerance. Thereby, the mechanism of drought tolerance of this pitaya genotype is a very complex physiological and biochemical process, in

  15. A Universal Next-Generation Sequencing Protocol To Generate Noninfectious Barcoded cDNA Libraries from High-Containment RNA Viruses

    Science.gov (United States)

    Moser, Lindsey A.; Ramirez-Carvajal, Lisbeth; Puri, Vinita; Pauszek, Steven J.; Matthews, Krystal; Dilley, Kari A.; Mullan, Clancy; McGraw, Jennifer; Khayat, Michael; Beeri, Karen; Yee, Anthony; Dugan, Vivien; Heise, Mark T.; Frieman, Matthew B.; Rodriguez, Luis L.; Bernard, Kristen A.; Wentworth, David E.

    2016-01-01

    ABSTRACT Several biosafety level 3 and/or 4 (BSL-3/4) pathogens are high-consequence, single-stranded RNA viruses, and their genomes, when introduced into permissive cells, are infectious. Moreover, many of these viruses are select agents (SAs), and their genomes are also considered SAs. For this reason, cDNAs and/or their derivatives must be tested to ensure the absence of infectious virus and/or viral RNA before transfer out of the BSL-3/4 and/or SA laboratory. This tremendously limits the capacity to conduct viral genomic research, particularly the application of next-generation sequencing (NGS). Here, we present a sequence-independent method to rapidly amplify viral genomic RNA while simultaneously abolishing both viral and genomic RNA infectivity across multiple single-stranded positive-sense RNA (ssRNA+) virus families. The process generates barcoded DNA amplicons that range in length from 300 to 1,000 bp, which cannot be used to rescue a virus and are stable to transport at room temperature. Our barcoding approach allows for up to 288 barcoded samples to be pooled into a single library and run across various NGS platforms without potential reconstitution of the viral genome. Our data demonstrate that this approach provides full-length genomic sequence information not only from high-titer virion preparations but it can also recover specific viral sequence from samples with limited starting material in the background of cellular RNA, and it can be used to identify pathogens from unknown samples. In summary, we describe a rapid, universal standard operating procedure that generates high-quality NGS libraries free of infectious virus and infectious viral RNA. IMPORTANCE This report establishes and validates a standard operating procedure (SOP) for select agents (SAs) and other biosafety level 3 and/or 4 (BSL-3/4) RNA viruses to rapidly generate noninfectious, barcoded cDNA amenable for next-generation sequencing (NGS). This eliminates the burden of testing all

  16. Toward a better knowledge of the molecular evolution of phosphoenolpyruvate carboxylase by comparison of partial cDNA sequences.

    Science.gov (United States)

    Gehrig, H H; Heute, V; Kluge, M

    1998-01-01

    To get deeper insight into the evolution of phosphoenolpyruvate carboxylase we have identified PEPC fragments (about 1,100 bp) of another 12 plants species not yet investigated in this context. The selected plants include one Chlorophyta, two Bryophyta, four Pteridophyta, and five Spermatophyta species. The obtained phylogenetic trees on PEPC isoforms are the most complete ones up to now available. Independent of their manner of construction, the resulting dendrograms are very similar and fully consistent with the main topology as it is postulated for the evolution of the higher terrestrial plants. We found a distinct clustering of the PEPC sequences of the prokaryotes, the algae, and the spermatophytes. PEPC isoforms of the archegoniates are located in the phylogenetic trees between the algae and spermatophytes. Our results strengthen the view that the PEPC is a very useful molecular marker with which to visualize phylogenetic trends both on the metabolic and organismic levels.

  17. Cloning and sequence analysis of lily and tobacco guanylate kinases.

    Science.gov (United States)

    Kumar, V

    2000-03-01

    Guanylate kinase is an essential enzyme in the nucleotide biosynthetic pathway, catalyzing the reversible transfer of the terminal phosphoryl group of ATP to GMP or dGMP. This enzyme has been well studied from several organisms and many structural and functional details have been characterized. Animal GMP kinases have also been implicated in signal transduction pathways. However, the corresponding role by plant derived GMP kinases remains to be elucidated. Full-length cDNA clones encoding enzymatically active guanylate kinases were isolated from cDNA libraries of lily and tobacco. Lily cDNA is predicted to encode a 392-amino acid protein with a molecular mass of 43.1 kDa and carries amino- and carboxy- terminal extensions of the guanylate kinase (GK)-like domain. But tobacco cDNA is predicted to encode a smaller protein of 297-amino acids with a molecular mass of 32.7 kDa. The amino acid residues known to participate in the catalytic activity of functionally characterized GMP kinases, are also conserved in GK domains of LGK-1 and NGK-1. The GK domains of NGK-1, LGK-1 and previously characterized AGK-1 from Arabidopsis exhibit 74-84% identity, whereas their N- and C-terminal domains are more divergent with amino acid conservation in the order of 48-55%. Phylogenetic analysis on the deduced amino acid sequences reveals that NGK-1 and LGK-1 form one distinct subgroup along with AGK-1 and AGK-2 homologues from Arabidopsis. Isolation of GMP kinases from diverse plant species like lily and tobacco adds a new dimension in understanding their role in cell signaling pathways that are associated with plant growth and development.

  18. Integrated sequence analysis. Final report

    International Nuclear Information System (INIS)

    Andersson, K.; Pyy, P.

    1998-02-01

    The NKS/RAK subprojet 3 'integrated sequence analysis' (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term 'methodology' denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA,HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally follows a discussion about the project and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply the in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as

  19. Salmo salar and Esox lucius full-length cDNA sequences reveal changes in evolutionary pressures on a post-tetraploidization genome

    Directory of Open Access Journals (Sweden)

    Holt Robert A

    2010-04-01

    Full Text Available Abstract Background Salmonids are one of the most intensely studied fish, in part due to their economic and environmental importance, and in part due to a recent whole genome duplication in the common ancestor of salmonids. This duplication greatly impacts species diversification, functional specialization, and adaptation. Extensive new genomic resources have recently become available for Atlantic salmon (Salmo salar, but documentation of allelic versus duplicate reference genes remains a major uncertainty in the complete characterization of its genome and its evolution. Results From existing expressed sequence tag (EST resources and three new full-length cDNA libraries, 9,057 reference quality full-length gene insert clones were identified for Atlantic salmon. A further 1,365 reference full-length clones were annotated from 29,221 northern pike (Esox lucius ESTs. Pairwise dN/dS comparisons within each of 408 sets of duplicated salmon genes using northern pike as a diploid out-group show asymmetric relaxation of selection on salmon duplicates. Conclusions 9,057 full-length reference genes were characterized in S. salar and can be used to identify alleles and gene family members. Comparisons of duplicated genes show that while purifying selection is the predominant force acting on both duplicates, consistent with retention of functionality in both copies, some relaxation of pressure on gene duplicates can be identified. In addition, there is evidence that evolution has acted asymmetrically on paralogs, allowing one of the pair to diverge at a faster rate.

  20. Integrated sequence analysis. Final report

    Energy Technology Data Exchange (ETDEWEB)

    Andersson, K.; Pyy, P

    1998-02-01

    The NKS/RAK subprojet 3 `integrated sequence analysis` (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term `methodology` denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA,HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally follows a discussion about the project and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply the in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as

  1. Construction and analysis of cDNA libraries from the antennae of Batocera horsfieldi and expression pattern of putative odorant binding proteins.

    Science.gov (United States)

    Li, Hui; Zhang, Aijun; Chen, Li-Zhen; Zhang, Guoan; Wang, Man-Qun

    2014-04-19

    A high-quality cDNA library was constructed from female and male antenna of the longhorned beetle, Batocera horsfieldi (Hope) (Coleoptera: Cerambycidae), a serious pest of Populus (Salicales: Salicaceae). The titer was approximately 2.37 × 106 pfu/mL, and this complies with the test requirement. From the libraries, 692 clones were selected randomly, sequenced, and further analyzed, and the recombinational efficiency reached 93.85%. By alignment and cluster analysis, we identified four odorant binding proteins, two pheromone-binding proteins (have the characteristic six conserved cysteine residues), four Minus-C odorant binding proteins (lost two conserved cysteines), and three chemosensory proteins. In this study, we describe the identification and characterization of four new cDNAs that encode Minus-C odorant binding proteins (Minus-C OBPs) from B. horsfieldi antennal cDNA libraries. Our investigation focused on the expression pattern of the Minus-C OBP genes in various tissues in both sexes at different developmental stages, using reverse transcription PCR (RT-PCR) and realtime PCR (qPCR) strategies. Minus-C OBP1, 2, and 3 were expressed in all tested tissues, with the exception of the head (without antenna, labial palps, and maxillary palps). Minus-C OBP4 was expressed in the antenna, legs, and abdomen, but not in the labial palps, maxillary palps, or head. The qPCR results revealed MinusC OBPs were expressed in the antenna throughout the adult life, and that the transcript levels of these genes depended on the sex, age, and mating status of adults. This is an open access paper. We use the Creative Commons Attribution 3.0 license that permits unrestricted use, provided that the paper is properly attributed.

  2. Sequence Handling by Sequence Analysis Toolbox v1.0

    DEFF Research Database (Denmark)

    Ingrell, Christian Ravnsborg; Matthiesen, Rune; Jensen, Ole Nørregaard

    2006-01-01

    analysis toolbox v1.0 was to have a general purpose sequence analyzing tool that can import sequences obtained by high-throughput sequencing methods. The program includes algorithms for calculation or prediction of isoelectric point, hydropathicity index, transmembrane segments, and glycosylphosphatidyl......The fact that mass spectrometry have become a high-throughput method calls for bioinformatic tools for automated sequence handling and prediction. For efficient use of bioinformatic tools, it is important that these tools are integrated or interfaced with each other. The purpose of sequence...... inositol-anchored proteins....

  3. Identification of Multiple Stress Responsive Genes by Sequencing a Normalized cDNA Library from Sea-Land Cotton (Gossypium barbadense L..

    Directory of Open Access Journals (Sweden)

    Bin Zhou

    Full Text Available Plants often face multiple stresses including drought, extreme temperature, salinity, nutrition deficiency and biotic stresses during growth and development. All the stresses result in a series of physiological and metabolic reactions and then generate reversible inhibition of metabolism and growth and can cause seriously irreversible damage, even death. At each stage of cotton growth, environmental stress conditions pose devastating threats to plant growth and development, especially yield and quality. Due to the complex stress conditions and unclear molecular mechanisms of stress response, there is an urgent need to explore the mechanisms of cotton response against abiotic stresses.A normalized cDNA library was constructed using Gossypium barbadense Hai-7124 treated with different stress conditions (heat, cold, salt, drought, potassium and phosphorus deficit and Verticillium dahliae infection. Random sequencing of this library generated 6,047 high-quality expressed sequence tags (ESTs. The ESTs were clustered and assembled into 3,135 uniESTs, composed of 2,497 contigs and 638 singletons. The blastx results demonstrated 2,746 unigenes showing significant similarity to known genes, 74 uniESTs displaying significant similarity to genes of predicted proteins, and 315 uniESTs remain uncharacterized. Functional classification unveiled the abundance of uniESTs in binding, catalytic activity, and structural molecule activity. Annotations of the uniESTs by the plant transcription factor database (PlantTFDB and Plant Stress Protein Database (PSPDB disclosed that transcription factors and stress-related genes were enriched in the current library. The expression of some transcription factors and specific stress-related genes were verified by RT-PCR under various stress conditions.Annotation results showed that a huge number of genes respond to stress in our study, such as MYB-related, C2H2, FAR1, bHLH, bZIP, MADS, and mTERF. These results will improve our

  4. Generation of full-length cDNA libraries: focus on plants.

    Science.gov (United States)

    Seki, Motoaki; Kamiya, Asako; Carninci, Piero; Hayashizaki, Yoshihide; Shinozaki, Kazuo

    2009-01-01

    Full-length cDNAs are essential for the correct annotation of transcriptional units and gene products from genomic sequence data and for functional analysis of the genes. Full-length cDNA libraries are very important resources for isolation of the full-length cDNAs. The biotinylated cap trapper method using the trehalose-thermostabilized reverse transcriptase has been developed and has become an efficient method for construction of high-content full-length cDNA libraries. We have constructed full-length cDNA libraries from various plants and animals using this method. The protocol of the method is described in this chapter.

  5. Molecular cloning, sequence characteristics, and tissue expression analysis of ECE1 gene in Tibetan pig.

    Science.gov (United States)

    Wang, Yan-Dong; Zhang, Jian; Li, Chuan-Hao; Xu, Hai-Peng; Chen, Wei; Zeng, Yong-Qing; Wang, Hui

    2015-10-25

    Low air pressure and low oxygen partial pressure at high altitude seriously affect the survival and development of human beings and animals. ECE1 is a recently discovered gene that is involved in anti-hypoxia, but the full-length cDNA sequence has not been obtained. For a better understanding of the structure and function of the ECE1 gene and to study its effect in Tibetan pig, the cDNA of the ECE1 gene from the muscle of Tibetan pig was cloned, sequenced and characterized. The ECE1 full-length cDNA sequence consists of 2262 bp coding sequence (CDS) that encodes 753 amino acids with a molecular mass of 85,449 kD, 2 bp 5'UTR and 1507 bp 3'UTR. In addition, the phylogenetic tree analysis revealed that the Tibetan pig ECE1 has a closer genetic relationship and evolution distance with the land mammals ECE1. Furthermore, analysis by qPCR showed that the ECE1 transcript is constitutively expressed in the 10 tissues tested: the liver, subcutaneous fat, kidney, muscle, stomach, heart, brain, spleen, pancreas, and lung. These results serve as a foundation for further insight into the Tibetan pig ECE1 gene. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Complete nucleotide sequences and construction of full-length infectious cDNA clones of cucumber green mottle mosaic virus (CGMMV) in a versatile newly developed binary vector including both 35S and T7 promoters.

    Science.gov (United States)

    Park, Chan-Hwan; Ju, Hye-Kyoung; Han, Jae-Yeong; Park, Jong-Seo; Kim, Ik-Hyun; Seo, Eun-Young; Kim, Jung-Kyu; Hammond, John; Lim, Hyoun-Sub

    2017-04-01

    Seed-transmitted viruses have caused significant damage to watermelon crops in Korea in recent years, with cucumber green mottle mosaic virus (CGMMV) infection widespread as a result of infected seed lots. To determine the likely origin of CGMMV infection, we collected CGMMV isolates from watermelon and melon fields and generated full-length infectious cDNA clones. The full-length cDNAs were cloned into newly constructed binary vector pJY, which includes both the 35S and T7 promoters for versatile usage (agroinfiltration and in vitro RNA transcription) and a modified hepatitis delta virus ribozyme sequence to precisely cleave RNA transcripts at the 3' end of the tobamovirus genome. Three CGMMV isolates (OMpj, Wpj, and Mpj) were separately evaluated for infectivity in Nicotiana benthamiana, demonstrated by either Agroinfiltration or inoculation with in vitro RNA transcripts. CGMMV nucleotide identities to other tobamoviruses were calculated from pairwise alignments using DNAMAN. CGMMV identities were 49.89% to tobacco mosaic virus; 49.85% to pepper mild mottle virus; 50.47% to tomato mosaic virus; 60.9% to zucchini green mottle mosaic virus; and 60.96% to kyuri green mottle mosaic virus, confirming that CGMMV is a distinct species most similar to other cucurbit-infecting tobamoviruses. We further performed phylogenetic analysis to determine relationships of our new Korean CGMMV isolates to previously characterized isolates from Canada, China, India, Israel, Japan, Korea, Russia, Spain, and Taiwan available from NCBI. Analysis of CGMMV amino acid sequences showed three major clades, broadly typified as 'Russian,' 'Israeli,' and 'Asian' groups. All of our new Korean isolates fell within the 'Asian' clade. Neither the 128 nor 186 kDa RdRps of the three new isolates showed any detectable gene silencing suppressor function.

  7. Bioinformatics and expressional analysis of cDNA clones from floral buds

    Science.gov (United States)

    Pawełkowicz, Magdalena Ewa; Skarzyńska, Agnieszka; Cebula, Justyna; Hincha, Dirck; ZiÄ bska, Karolina; PlÄ der, Wojciech; Przybecki, Zbigniew

    2017-08-01

    The application of genomic approaches may serve as an initial step in understanding the complexity of biochemical network and cellular processes responsible for regulation and execution of many developmental tasks. The molecular mechanism of sex expression in cucumber is still not elucidated. A study of differential expression was conducted to identify genes involved in sex determination and floral organ morphogenesis. Herein, we present generation of expression sequence tags (EST) obtained by differential hybridization (DH) and subtraction technique (cDNA-DSC) and their characteristic features such as molecular function, involvement in biology processes, expression and mapping position on the genome.

  8. The tetramethylammonium chloride method for screening of cDNA libraries using highly degenerate oligonucleotides obtained by backtranslation of amino-acid sequences

    DEFF Research Database (Denmark)

    Honoré, B; Madsen, Peder; Leffers, H

    1993-01-01

    We describe a method for screening of cDNA libraries with highly degenerate oligonucleotides using tetramethylammonium chloride (TMAC). This method is a convenient alternative to using probes generated by the polymerase chain reaction (PCR), especially when these cannot easily be made. Nylon...

  9. [Construction and analysis of a forward and reverse subtractive cDNA library from leaves and stem of Polygonum sibiricum Laxm. under salt stress].

    Science.gov (United States)

    Liu, Guan-Jun; Liu, Ming-Kun; Xu, Zhi-Ru; Yan, Xiu-Feng; Wei, Zhi-Gang; Yang, Chuan-Ping

    2009-04-01

    Using cDNAs prepared from the leaves and stems of Polygonum sibiricum Laxm. treated with NaHCO3 stress for 48 h as testers and cDNAs from unstressed P. sibiricum leaves and stems as drivers library, suppression subtractive hybridization (SSH) was employed to construct a cDNA subtracted library, which contained 2 282 valid sequences including 598 ESTs in the stems forward SSH library and 490 ESTs in the stem reverse SSH library, 627 ESTs in the leaf forward SSH library and 567 in the leaf reverse SSH library. According to the functional catalogue of MIPs and the comparison of the reverse and forward SSH libraries of the stem and leaf, the responses to NaHCO3 stress were different between leaf and stem, except for the same trend in cell rescue defense and transport facilitation. The trend in the metabolism, energy, photosynthesis, protein synthesis, transcription, and signal transduction was opposite. RT-PCR analysis demonstrated that the expression of 12 putative stress related genes in the NaHCO3-treated leaves and stems was different from that in the untreated leaves and stems. This indicated that different mechanisms might be responsible for reactions of leaf and stem in P. sibiricum. The results from this study are useful in understanding the molecular mechanism of saline-alkali tolerance in P. sibiricum.

  10. Molecular characterization of a Leishmania donovani cDNA clone with similarity to human 20S proteasome a-type subunit

    DEFF Research Database (Denmark)

    Christensen, C B; Jørgensen, L; Jensen, A T

    2000-01-01

    Using plasma from patients infected or previously infected with Leishmania donovanii, we isolated a L. donovanii cDNA clone with similarity to the proteasome a-type subunit from humans and other eukaryotes. The cDNA clone, designated LePa, was DNA sequenced and Northern blot analysis of L...

  11. RNA Sequencing Analysis of Salivary Extracellular RNA.

    Science.gov (United States)

    Majem, Blanca; Li, Feng; Sun, Jie; Wong, David T W

    2017-01-01

    Salivary biomarkers for disease detection, diagnostic and prognostic assessments have become increasingly well established in recent years. In this chapter we explain the current leading technology that has been used to characterize salivary non-coding RNAs (ncRNAs) from the extracellular RNA (exRNA) fraction: HiSeq from Illumina® platform for RNA sequencing. Therefore, the chapter is divided into two main sections regarding the type of the library constructed (small and long ncRNA libraries), from saliva collection, RNA extraction and quantification to cDNA library generation and corresponding QCs. Using these invaluable technical tools, one can identify thousands of ncRNA species in saliva. These methods indicate that salivary exRNA provides an efficient medium for biomarker discovery of oral and systemic diseases.

  12. De novo transcriptome sequencing and sequence analysis of the malaria vector Anopheles sinensis (Diptera: Culicidae)

    Science.gov (United States)

    2014-01-01

    Background Anopheles sinensis is the major malaria vector in China and Southeast Asia. Vector control is one of the most effective measures to prevent malaria transmission. However, there is little transcriptome information available for the malaria vector. To better understand the biological basis of malaria transmission and to develop novel and effective means of vector control, there is a need to build a transcriptome dataset for functional genomics analysis by large-scale RNA sequencing (RNA-seq). Methods To provide a more comprehensive and complete transcriptome of An. sinensis, eggs, larvae, pupae, male adults and female adults RNA were pooled together for cDNA preparation, sequenced using the Illumina paired-end sequencing technology and assembled into unigenes. These unigenes were then analyzed in their genome mapping, functional annotation, homology, codon usage bias and simple sequence repeats (SSRs). Results Approximately 51.6 million clean reads were obtained, trimmed, and assembled into 38,504 unigenes with an average length of 571 bp, an N50 of 711 bp, and an average GC content 51.26%. Among them, 98.4% of unigenes could be mapped onto the reference genome, and 69% of unigenes could be annotated with known biological functions. Homology analysis identified certain numbers of An. sinensis unigenes that showed homology or being putative 1:1 orthologues with genomes of other Dipteran species. Codon usage bias was analyzed and 1,904 SSRs were detected, which will provide effective molecular markers for the population genetics of this species. Conclusions Our data and analysis provide the most comprehensive transcriptomic resource and characteristics currently available for An. sinensis, and will facilitate genetic, genomic studies, and further vector control of An. sinensis. PMID:25000941

  13. Molecular cloning, sequence analysis and phylogeny of first caudata g-type lysozyme in axolotl (Ambystoma mexicanum).

    Science.gov (United States)

    Yu, Haining; Gao, Jiuxiang; Lu, Yiling; Guang, Huijuan; Cai, Shasha; Zhang, Songyan; Wang, Yipeng

    2013-11-01

    Lysozymes are key proteins that play important roles in innate immune defense in many animal phyla by breaking down the bacterial cell-walls. In this study, we report the molecular cloning, sequence analysis and phylogeny of the first caudate amphibian g-lysozyme: a full-length spleen cDNA library from axolotl (Ambystoma mexicanum). A goose-type (g-lysozyme) EST was identified and the full-length cDNA was obtained using RACE-PCR. The axolotl g-lysozyme sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 184 amino acids. The calculated molecular mass and the theoretical isoelectric point (pl) of this mature protein are 21523.0 Da and 4.37, respectively. Expression of g-lysozyme mRNA is predominantly found in skin, with lower levels in spleen, liver, muscle, and lung. Phylogenetic analysis revealed that caudate amphibian g-lysozyme had distinct evolution pattern for being juxtaposed with not only anura amphibian, but also with the fish, bird and mammal. Although the first complete cDNA sequence for caudate amphibian g-lysozyme is reported in the present study, clones encoding axolotl's other functional immune molecules in the full-length cDNA library will have to be further sequenced to gain insight into the fundamental aspects of antibacterial mechanisms in caudate.

  14. Cloning of cDNA encoding steroid 11β-hydroxylase (P450c11)

    International Nuclear Information System (INIS)

    Chua, S.C.; Szabo, P.; Vitek, A.; Grzeschik, K.H.; John, M.; White, P.C.

    1987-01-01

    The authors have isolated bovine and human adrenal cDNA clones encoding the adrenal cytochrome P-450 specific for 11β-hydroxylation (P450c11). A bovine adrenal cDNA library constructed in the bacteriophage λ vector gt10 was probed with a previously isolated cDNA clone corresponding to part of the 3' untranslated region of the 4.2-kilobase (kb) mRNA encoding P450c11. Several clones with 3.2-kb cDNA inserts were isolated. Sequence analysis showed that they overlapped the original probe by 300 base pairs (bp). Combined cDNA and RNA sequence data demonstrated a continuous open reading frame of 1509 bases. P450c11 is predicted to contain 479 amino acid residues in the mature protein in addition to a 24-residue amino-terminal mitochondrial signal sequence. A bovine clone was used to isolate a homologous clone with a 3.5-kb insert from a human adrenal cDNA library. A region of 1100 bp was 81% homologous to 769 bp of the coding sequence of the bovine cDNA except for a 400-bp segment presumed to be an unprocessed intron. Hybridization of the human cDNA to DNA from a panel of human-rodent somatic cell hybrid lines and in situ hybridization to metaphase spreads of human chromosomes localized the gene to the middle of the long arm of chromosome 8. These data should be useful in developing reagents for heterozygote detection and prenatal diagnosis of 11β-hydroxylase deficiency, the second most frequent cause of congenital adrenal hyperplasia

  15. cDNA cloning and expression analysis of a mannose-binding lectin ...

    Indian Academy of Sciences (India)

    PRAKASH

    Phaseolus vulgaris erythroagglutinin (PHA-E) (Sun et al. 1995). The nature of the lectin correlates with its mannose- binding active site. 3.3 Southern blot analysis. The presence of multiple copies of the mannose-binding lectin gene in the genome has been reported in many plant species, particularly those belonging to the ...

  16. Nonlinear analysis of biological sequences

    Energy Technology Data Exchange (ETDEWEB)

    Torney, D.C.; Bruno, W.; Detours, V. [and others

    1998-11-01

    This is the final report of a three-year, Laboratory Directed Research and Development (LDRD) project at the Los Alamos National Laboratory (LANL). The main objectives of this project involved deriving new capabilities for analyzing biological sequences. The authors focused on tabulating the statistical properties exhibited by Human coding DNA sequences and on techniques of inferring the phylogenetic relationships among protein sequences related by descent.

  17. Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

    Directory of Open Access Journals (Sweden)

    Yang Jie

    2017-01-01

    Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2 of five species: horse (89%, human (90%, chimpanzee (89%, rhesus monkey (89% and mouse (85%; thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.

  18. [cDNA libraries construction and screening in gene expression profiling of disease resistance in wheat].

    Science.gov (United States)

    Luo, Meng; Kong, Xiu-Ying; Liu, Yue; Zhou, Rong-Hua; Jia, Ji-Zeng

    2002-09-01

    A wheat line, Bai Nong 3217/Mardler BC5F4 with resistance to powdery mildew, was used to construct a conventional cDNA library and a suppression subtractive hybridization (SSH) cDNA library from wheat leaves inoculated by Erysiphe graminis DC. Three hundred and eighty-seven non-redundant ESTs from the conventional cDNA library and 760 ESTs from the SSH cDNA library were obtained, and the ESTs similarity analysis using BLASTn and BLASTx were conducted by comparing these ESTs with sequences in GenBank. The results showed that the redundancy of some kinds of genes such as photosynthesis related genes and ribosome related genes was higher in the conventional cDNA library but the varieties and quantities of disease resistance genes were less than those in the SSH cDNA library. The SSH cDNA library was found to have obvious advantages in gene expression profiling of disease resistance such as simple library construction procedure, rich specific DRR (disease-resistance-related) genes and decreased sequencing amount. To acquire genes that were involved in the powdery mildew resistance of wheat, hybridization with high-density dots membranes was used to screen the two libraries. The result showed that the method was relatively simple in operation, and the membranes could be used for many times. But some problems also existed with this screening method. For instance, a large amount of mRNA and radioactive isotope were needed and the hybridization procedure must be repeated several times to obtain stable hybridization results. About 54.1% function-known ESTs in the SSH cDNA library were identified to be DRR genes by screening. There were 247 clones of the SSH cDNA library that had positive signal in the repeated hybridizations with the pathogen uninfected probe. The identified DRR genes distributed in the whole procedure of powdery mildew resistance, but mainly focused on the SAR (systemic of acquired resistance).

  19. Generation and analysis of expressed sequence tags (ESTs) for marker development in yam (Dioscorea alata L.)

    Science.gov (United States)

    2011-01-01

    Background Anthracnose (Colletotrichum gloeosporioides) is a major limiting factor in the production of yam (Dioscorea spp.) worldwide. Availability of high quality sequence information is necessary for designing molecular markers associated with resistance. However, very limited sequence information pertaining to yam is available at public genome databases. Therefore, this collaborative project was developed for genetic improvement and germplasm characterization of yams using molecular markers. The current investigation is focused on studying gene expression, by large scale generation of ESTs, from one susceptible (TDa 95-0310) and two resistant yam genotypes (TDa 87-01091, TDa 95-0328) challenged with the fungus. Total RNA was isolated from young leaves of resistant and susceptible genotypes and cDNA libraries were sequenced using Roche 454 technology. Results A total of 44,757 EST sequences were generated from the cDNA libraries of the resistant and susceptible genotypes. Greater than 56% of ESTs were annotated using MapMan Mercator tool and Blast2GO search tools. Gene annotations were used to characterize the transcriptome in yam and also perform a differential gene expression analysis between the resistant and susceptible EST datasets. Mining for SSRs in the ESTs revealed 1702 unique sequences containing SSRs and 1705 SSR markers were designed using those sequences. Conclusion We have developed a comprehensive annotated transcriptome data set in yam to enrich the EST information in public databases. cDNA libraries were constructed from anthracnose fungus challenged leaf tissues for transcriptome characterization, and differential gene expression analysis. Thus, it helped in identifying unique transcripts in each library for disease resistance. These EST resources provide the basis for future microarray development, marker validation, genetic linkage mapping and QTL analysis in Dioscorea species. PMID:21303556

  20. Generation and analysis of expressed sequence tags (ESTs for marker development in yam (Dioscorea alata L.

    Directory of Open Access Journals (Sweden)

    Robert Asiedu

    2011-02-01

    Full Text Available Abstract Background Anthracnose (Colletotrichum gloeosporioides is a major limiting factor in the production of yam (Dioscorea spp. worldwide. Availability of high quality sequence information is necessary for designing molecular markers associated with resistance. However, very limited sequence information pertaining to yam is available at public genome databases. Therefore, this collaborative project was developed for genetic improvement and germplasm characterization of yams using molecular markers. The current investigation is focused on studying gene expression, by large scale generation of ESTs, from one susceptible (TDa 95-0310 and two resistant yam genotypes (TDa 87-01091, TDa 95-0328 challenged with the fungus. Total RNA was isolated from young leaves of resistant and susceptible genotypes and cDNA libraries were sequenced using Roche 454 technology. Results A total of 44,757 EST sequences were generated from the cDNA libraries of the resistant and susceptible genotypes. Greater than 56% of ESTs were annotated using MapMan Mercator tool and Blast2GO search tools. Gene annotations were used to characterize the transcriptome in yam and also perform a differential gene expression analysis between the resistant and susceptible EST datasets. Mining for SSRs in the ESTs revealed 1702 unique sequences containing SSRs and 1705 SSR markers were designed using those sequences. Conclusion We have developed a comprehensive annotated transcriptome data set in yam to enrich the EST information in public databases. cDNA libraries were constructed from anthracnose fungus challenged leaf tissues for transcriptome characterization, and differential gene expression analysis. Thus, it helped in identifying unique transcripts in each library for disease resistance. These EST resources provide the basis for future microarray development, marker validation, genetic linkage mapping and QTL analysis in Dioscorea species.

  1. Construction and analysis of antennal cDNA library from rice striped stem borer, Chilo suppressalis (Walker) (Lepidoptera: Pyralidae), and expression profiles of putative odorant-binding protein and chemosensory protein genes.

    Science.gov (United States)

    Gong, Zhong-Jun; Liu, Su; Jiang, Yan-Dong; Zhou, Wen-Wu; Liang, Qing-Mei; Cheng, Jiaan; Zhang, Chuan-Xi; Zhu, Zeng-Rong; Gurr, Geoff M

    2015-05-01

    In this study, we constructed a high-quality cDNA library from the antennae of the Chilo suppressalis (Walker) (Lepidoptera: Pyralidae). A total of 1,235 colonies with inserts greater than 0.7 kb were sequenced and analyzed. Homology searching coupled with bioinformatics analysis identified 15 and 7 cDNA sequences, respectively, encoding putative odorant-binding proteins (OBPs) and chemosensory proteins (CSPs). A phylogenetic tree of CsupCSPs showed that each CsupCSP has orthologs in Manduca sexta and Bombyx mori with strong bootstrapping support. One CSP was either very specific or more related to the CSPs of another species than to conspecific CSP. The expression profiles of the OBPs and CSPs in different tissues were measured by real-time quantitative PCR. The results revealed that of the 11 OBP genes, the transcript levels of CsupOBP1, CsupOBP5, and CsupOBP7 were higher in both male and female antennae than those in other tissues. And CsupCSP7 was highly expressed in both male and female antennae. Based on these results, the possible physiological functions of CsupOBPs and CsupCSPs were discussed. © 2015 Wiley Periodicals, Inc.

  2. cDNA Microarray Analysis of Serially Sampled Cervical Cancer Specimens From Patients Treated With Thermochemoradiotherapy

    International Nuclear Information System (INIS)

    Borkamo, Erling Dahl; Schem, Baard-Christian; Fluge, Oystein; Bruland, Ove; Dahl, Olav; Mella, Olav

    2009-01-01

    Purpose: To elucidate changes in gene expression after treatment with regional thermochemoradiotherapy in locally advanced squamous cell cervical cancer. Methods and Materials: Tru-Cut biopsy specimens were serially collected from 16 patients. Microarray gene expression levels before and 24 h after the first and second trimodality treatment sessions were compared. Pathway and network analyses were conducted by use of Ingenuity Pathways Analysis (IPA; Ingenuity Systems, Redwood City, CA). Single gene expressions were analyzed by quantitative real-time reverse transcription-polymerase chain reaction. Results: We detected 53 annotated genes that were differentially expressed after trimodality treatment. Central in the three top networks detected by IPA were interferon alfa, interferon beta, and interferon gamma receptor; nuclear factor κB; and tumor necrosis factor, respectively. These genes encode proteins that are important in regulation cell signaling, proliferation, gene expression, and immune stimulation. Biological processes over-represented among the 53 genes were fibrosis, tumorigenesis, and immune response. Conclusions: Microarrays showed minor changes in gene expression after thermochemoradiotherapy in locally advanced cervical cancer. We detected 53 differentially expressed genes, mainly involved in fibrosis, tumorigenesis, and immune response. A limitation with the use of serial biopsy specimens was low quality of ribonucleic acid from tumors that respond to highly effective therapy. Another 'key limitation' is timing of the post-treatment biopsy, because 24 h may be too late to adequately assess the impact of hyperthermia on gene expression.

  3. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    Directory of Open Access Journals (Sweden)

    Gao Zhihong

    2010-07-01

    Full Text Available Abstract Background Expressed Sequence Tag (EST has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047, among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65% and low in the peach (46%, and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.

  4. Sequence analysis of Leukemia DNA

    Science.gov (United States)

    Nacong, Nasria; Lusiyanti, Desy; Irawan, Muhammad. Isa

    2018-03-01

    Cancer is a very deadly disease, one of which is leukemia disease or better known as blood cancer. The cancer cell can be detected by taking DNA in laboratory test. This study focused on local alignment of leukemia and non leukemia data resulting from NCBI in the form of DNA sequences by using Smith-Waterman algorithm. SmithWaterman algorithm was invented by TF Smith and MS Waterman in 1981. These algorithms try to find as much as possible similarity of a pair of sequences, by giving a negative value to the unequal base pair (mismatch), and positive values on the same base pair (match). So that will obtain the maximum positive value as the end of the alignment, and the minimum value as the initial alignment. This study will use sequences of leukemia and 3 sequences of non leukemia.

  5. Extending Immunological Profiling in the Gilthead Sea Bream, Sparus aurata, by Enriched cDNA Library Analysis, Microarray Design and Initial Studies upon the Inflammatory Response to PAMPs

    Directory of Open Access Journals (Sweden)

    Sebastian Boltaña

    2017-02-01

    Full Text Available This study describes the development and validation of an enriched oligonucleotide-microarray platform for Sparus aurata (SAQ to provide a platform for transcriptomic studies in this species. A transcriptome database was constructed by assembly of gilthead sea bream sequences derived from public repositories of mRNA together with reads from a large collection of expressed sequence tags (EST from two extensive targeted cDNA libraries characterizing mRNA transcripts regulated by both bacterial and viral challenge. The developed microarray was further validated by analysing monocyte/macrophage activation profiles after challenge with two Gram-negative bacterial pathogen-associated molecular patterns (PAMPs; lipopolysaccharide (LPS and peptidoglycan (PGN. Of the approximately 10,000 EST sequenced, we obtained a total of 6837 EST longer than 100 nt, with 3778 and 3059 EST obtained from the bacterial-primed and from the viral-primed cDNA libraries, respectively. Functional classification of contigs from the bacterial- and viral-primed cDNA libraries by Gene Ontology (GO showed that the top five represented categories were equally represented in the two libraries: metabolism (approximately 24% of the total number of contigs, carrier proteins/membrane transport (approximately 15%, effectors/modulators and cell communication (approximately 11%, nucleoside, nucleotide and nucleic acid metabolism (approximately 7.5% and intracellular transducers/signal transduction (approximately 5%. Transcriptome analyses using this enriched oligonucleotide platform identified differential shifts in the response to PGN and LPS in macrophage-like cells, highlighting responsive gene-cassettes tightly related to PAMP host recognition. As observed in other fish species, PGN is a powerful activator of the inflammatory response in S. aurata macrophage-like cells. We have developed and validated an oligonucleotide microarray (SAQ that provides a platform enriched for the study

  6. Characterization of squid enolase mRNA: sequence analysis, tissue distribution, and axonal localization.

    Science.gov (United States)

    Chun, J T; Gioio, A E; Crispino, M; Giuditta, A; Kaplan, B B

    1995-08-01

    Enolase is a glycolytic enzyme whose amino acid sequence is highly conserved across a wide range of animal species. In mammals, enolase is known to be a dimeric protein composed of distinct but closely related subunits: alpha (non-neuronal), beta (muscle-specific), and gamma (neuron-specific). However, little information is available on the primary sequence of enolase in invertebrates. Here we report the isolation of two overlapping cDNA clones and the putative primary structure of the enzyme from the squid (Loligo pealii) nervous system. The composite sequence of those cDNA clones is 1575 bp and contains the entire coding region (1302 bp), as well as 66 and 207 bp of 5' and 3' untranslated sequence, respectively. Cross-species comparison of enolase primary structure reveals that squid enolase shares over 70% sequence identity to vertebrate forms of the enzyme. The greatest degree of sequence similarity was manifest to the alpha isoform of the human homologue. Results of Northern analysis revealed a single 1.6 kb mRNA species, the relative abundance of which differs approximately 10-fold between various tissues. Interestingly, evidence derived from in situ hybridization and polymerase chain reaction experiments indicate that the mRNA encoding enolase is present in the squid giant axon.

  7. Development of a porcine skeletal muscle cDNA microarray: analysis of differential transcript expression in phenotypically distinct muscles

    Directory of Open Access Journals (Sweden)

    Stear Michael

    2003-03-01

    Full Text Available Abstract Background Microarray profiling has the potential to illuminate the molecular processes that govern the phenotypic characteristics of porcine skeletal muscles, such as hypertrophy or atrophy, and the expression of specific fibre types. This information is not only important for understanding basic muscle biology but also provides underpinning knowledge for enhancing the efficiency of livestock production. Results We report on the de novo development of a composite skeletal muscle cDNA microarray, comprising 5500 clones from two developmentally distinct cDNA libraries (longissimus dorsi of a 50-day porcine foetus and the gastrocnemius of a 3-day-old pig. Clones selected for the microarray assembly were of low to moderate abundance, as indicated by colony hybridisation. We profiled the differential expression of genes between the psoas (red muscle and the longissimus dorsi (white muscle, by co-hybridisation of Cy3 and Cy5 labelled cDNA derived from these two muscles. Results from seven microarray slides (replicates correctly identified genes that were expected to be differentially expressed, as well as a number of novel candidate regulatory genes. Quantitative real-time RT-PCR on selected genes was used to confirm the results from the microarray. Conclusion We have developed a porcine skeletal muscle cDNA microarray and have identified a number of candidate genes that could be involved in muscle phenotype determination, including several members of the casein kinase 2 signalling pathway.

  8. Genomic Profiles in Stage I Primary Non Small Cell Lung Cancer Using Comparative Genomic Hybridization Analysis of cDNA Microarrays

    Directory of Open Access Journals (Sweden)

    Feng Jiang

    2004-09-01

    Full Text Available To investigate the genomic aberrations that are involved in lung tumorigenesis and therefore may be developed as biomarkers for lung cancer diagnosis, we characterized the genomic copy number changes associated with individual genes in 14 tumors from patients with primary non small cell lung cancer (NSCLC. Six squamous cell carcinomas (SQCAs and eight adenocarcinomas (ADCAs were examined by high-resolution comparative genomic hybridization (CGH analysis of cDNA microarray. The SQCAs and ADCAs shared common frequency distributions of recurrent genomic gains of 63 genes and losses of 72 genes. Cluster analysis using 57 genes defined the genomic differences between these two major histologic types of NSCLC. Genomic aberrations from a set of 18 genes showed distinct difference of primary ADCAs from their paired normal lung tissues. The genomic copy number of four genes was validated by fluorescence in situ hybridization of 32 primary NSCLC tumors, including those used for cDNA microarray CGH analysis; a strong correlation with cDNA microarray CGH data emerged. The identified genomic aberrations may be involved in the initiation and progression of lung tumorigenesis and, most importantly, may be developed as new biomarkers for the early detection and classification of lung cancer.

  9. The midgut transcriptome of Lutzomyia longipalpis: comparative analysis of cDNA libraries from sugar-fed, blood-fed, post-digested and Leishmania infantum chagasi-infected sand flies

    Directory of Open Access Journals (Sweden)

    Elnaiem Dia-Eldin

    2008-01-01

    Full Text Available Abstract Background In the life cycle of Leishmania within the alimentary canal of sand flies the parasites have to survive the hostile environment of blood meal digestion, escape the blood bolus and attach to the midgut epithelium before differentiating into the infective metacyclic stages. The molecular interactions between the Leishmania parasites and the gut of the sand fly are poorly understood. In the present work we sequenced five cDNA libraries constructed from midgut tissue from the sand fly Lutzomyia longipalpis and analyzed the transcripts present following sugar feeding, blood feeding and after the blood meal has been processed and excreted, both in the presence and absence of Leishmania infantum chagasi. Results Comparative analysis of the transcripts from sugar-fed and blood-fed cDNA libraries resulted in the identification of transcripts differentially expressed during blood feeding. This included upregulated transcripts such as four distinct microvillar-like proteins (LuloMVP1, 2, 4 and 5, two peritrophin like proteins, a trypsin like protein (Lltryp1, two chymotrypsin like proteins (LuloChym1A and 2 and an unknown protein. Downregulated transcripts by blood feeding were a microvillar-like protein (LuloMVP3, a trypsin like protein (Lltryp2 and an astacin-like metalloprotease (LuloAstacin. Furthermore, a comparative analysis between blood-fed and Leishmania infected midgut cDNA libraries resulted in the identification of the transcripts that were differentially expressed due to the presence of Leishmania in the gut of the sand fly. This included down regulated transcripts such as four microvillar-like proteins (LuloMVP1,2, 4 and 5, a Chymotrypsin (LuloChym1A and a carboxypeptidase (LuloCpepA1, among others. Upregulated midgut transcripts in the presence of Leishmania were a peritrophin like protein (LuloPer1, a trypsin-like protein (Lltryp2 and an unknown protein. Conclusion This transcriptome analysis represents the largest set

  10. Robustness analysis of chiller sequencing control

    International Nuclear Information System (INIS)

    Liao, Yundan; Sun, Yongjun; Huang, Gongsheng

    2015-01-01

    Highlights: • Uncertainties with chiller sequencing control were systematically quantified. • Robustness of chiller sequencing control was systematically analyzed. • Different sequencing control strategies were sensitive to different uncertainties. • A numerical method was developed for easy selection of chiller sequencing control. - Abstract: Multiple-chiller plant is commonly employed in the heating, ventilating and air-conditioning system to increase operational feasibility and energy-efficiency under part load condition. In a multiple-chiller plant, chiller sequencing control plays a key role in achieving overall energy efficiency while not sacrifices the cooling sufficiency for indoor thermal comfort. Various sequencing control strategies have been developed and implemented in practice. Based on the observation that (i) uncertainty, which cannot be avoided in chiller sequencing control, has a significant impact on the control performance and may cause the control fail to achieve the expected control and/or energy performance; and (ii) in current literature few studies have systematically addressed this issue, this paper therefore presents a study on robustness analysis of chiller sequencing control in order to understand the robustness of various chiller sequencing control strategies under different types of uncertainty. Based on the robustness analysis, a simple and applicable method is developed to select the most robust control strategy for a given chiller plant in the presence of uncertainties, which will be verified using case studies

  11. [Construction of cDNA expression library of unfed female Haemaphysalis longicornis and immuno-screening].

    Science.gov (United States)

    Chai, Hui-ping; Liu, Guang-yuan; Zhang, Lin; Gong, Zhen-li; Xie, Jun-ren; Tian, Zhan-cheng; Wang, Lu; Jia, Ning

    2009-02-28

    To construct a cDNA expression library from unfed female tick Haemaphysalis longicornis for screening and cloning potential antigenic genes. Total RNA was isolated from unfed female ticks, mRNA was purified and a library of oligo (dT) -primed cDNA with added directional EcoR I /Hind III linkers was constructed from the purified mRNA. The constructed cDNA was ligated to the EcoR I /Hind III arms of the lambda SCREEN vector. Pure phage stocks were harvested by plaque purification and converted to plasmid subclones by plating phage on host strain BM25.8. Recombinant plasmids that were subcloned to E. coli BM25.8 were isolated and transformed into E. coli JM109. Recombinant plasmids abstracted from JM109 were identified by PCR and sequencing. The recombinant phage DNA was packaged by using phage-marker packaging extracts, resulting in a primary cDNA library with a size of 1.8 x 10(6) pfu. Data showed 100% of the library were recombinant and the titer of the amplified library was 2.4 x 10(9) pfu/ml. Forty-two clones of encoding immunodominant antigens were obtained from the cDNA library. Sequence analysis revealed 12 unique cDNA sequences and the encoded putative proteins showed similarities to H. longicornis tropomyosin mRNA, Rhipicephalus annulatus unknown larval protein mRNA, chromosome 2R of Drosophila melanogaster, mitochondrial DNA of H. flava, clones HqL09 unkown mRNA and Hq05 mRNA of H. qinghaiensis, and myosin alkali light chain protein mRNA. The cDNA expression library from unfed female H. longicornis was successfully constructed and screening of protective genes may provide candidate antigens of the tick.

  12. [Whole cDNA sequence cloning and expression of chicken L-FABP gene and its relationship with lipid deposition of hybrid chickens].

    Science.gov (United States)

    Yu, Ying; Wang, Dong; Sun, Dong-Xiao; Xu, Gui-Yun; Li, Jun-Ying; Zhang, Yuan

    2011-07-01

    Liver fatty acid-binding protein (L-FABP) is closely related to intracellular transportation and deposition of lipids. A positive differential displayed fragment was found in the liver tissue among Silkie (CC), CAU-brown chicken (CD), and their reciprocal hybrids (CD and DC) at 8 weeks-old using differential display RT-PCR techniques (DDRT-PCR). Through recycling, sequencing, and alignment analysis, the fragment was identified as chicken liver fatty acid-binding protein gene (L-FABP, GenBank accession number AY321365). Reverse Northern dot blot and semi-quantitative RT-PCR revealed that the avian L-FABP gene was over-expressed in the liver tissue of the reciprocal hybrids (CD and DC) compared to their parental lines (CC and DD), which was consistent with the fact that higher abdomen fat weight and wider inter-muscular fat width observed in the reciprocal hybrids. Considering the higher expression of L-FABP may contribute to the increased lipid deposition in the hybrid chickens, the functional study of avian L-FABP is warranted in future.

  13. Role of normalization in the elimination of abundant myelin sequences in spinal cord cDNA libraries produced by suppression subtractive hybridization.

    Science.gov (United States)

    Lathia, K B; Yan, Z; Clapshaw, P A

    2009-12-01

    Spinal cord libraries subtracted against visual cortex using suppression subtractive hybridization SSH are dominated by abundant gene sequences derived from myelin elements. We compared our subtracted library results of three of these abundant sequences to published expressed sequence tag libraries that are not normalized and not subtracted and presumed representatives of murine spinal cord mRNA abundance. We show that: all three abundant sequences, myelin basic protein (Mbp), proteolipid protein (Plp1) and Ferretin heavy chain (Fth1) are highly expressed in spinal cord when this structure is compared to visual cortex; myelin basic protein is represented in our subtracted libraries but at a low frequency, whereas Plp1 and Fth1 represent nearly one-third of all sequences in these libraries; mirror orientation selection, a procedure designed to reduce background sequences, generates libraries very similar in abundance to SSH; proteolipid protein can be reduced in these libraries by adding Plp1 sequences to the driver in the SSH procedure and also by subtracting Plp1 directly from tester and driver. We conclude that adequate normalization is essential to reduce the presence of abundant sequences in SSH libraries.

  14. Molecular cloning of growth hormone encoding cDNA of Indian ...

    Indian Academy of Sciences (India)

    A modified rapid amplification of cDNA ends (RACE) strategy has been developed for cloning highly conserved cDNA sequences. Using this modified method, the growth hormone (GH) encoding cDNA sequences of Labeo rohita, Cirrhina mrigala and Catla catla have been cloned, characterized and overexpressed in ...

  15. Molecular cloning of growth hormone encoding cDNA of Indian

    Indian Academy of Sciences (India)

    A modified rapid amplification of cDNA ends (RACE) strategy has been developed for cloning highly conserved cDNA sequences. Using this modified method, the growth hormone (GH) encoding cDNA sequences of Labeo rohita, Cirrhina mrigala and Catla catla have been cloned, characterized and overexpressed in ...

  16. Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA

    International Nuclear Information System (INIS)

    Indik, Z.; Yeh, H.; Ornstein-goldstein, N.; Sheppard, P.; Anderson, N.; Rosenbloom, J.C.; Peltonen, L.; Rosenbloom, J.

    1987-01-01

    Poly(A) + RNA, isolated from a single 7-mo fetal human aorta, was used to synthesize cDNA by the RNase H method, and the cDNA was inserted into λgt10. Recombinant phage containing elastin sequences were identified by hybridization with cloned, exon-containing fragments of the human elastin gene. Three clones containing inserts of 3.3, 2.7, and 2.3 kilobases were selected for further analysis. Three overlapping clones containing 17.8 kilobases of the human elastin gene were also isolated from genomic libraries. Complete sequence analysis of the six clones demonstrated that: (i) the cDNA encompassed the entire translated portion of the mRNA encoding 786 amino acids, including several unusual hydrophilic amino acid sequences not previously identified in porcine tropoelastin, (ii) exons encoding either hydrophobic or crosslinking domains in the protein alternated in the gene, and (iii) a great abundance of Alu repetitive sequences occurred throughout the introns. The data also indicated substantial alternative splicing of the mRNA. These results suggest the potential for significant variation in the precise molecular structure of the elastic fiber in the human population

  17. Automated genome sequence analysis and annotation.

    Science.gov (United States)

    Andrade, M A; Brown, N P; Leroy, C; Hoersch, S; de Daruvar, A; Reich, C; Franchini, A; Tamames, J; Valencia, A; Ouzounis, C; Sander, C

    1999-05-01

    Large-scale genome projects generate a rapidly increasing number of sequences, most of them biochemically uncharacterized. Research in bioinformatics contributes to the development of methods for the computational characterization of these sequences. However, the installation and application of these methods require experience and are time consuming. We present here an automatic system for preliminary functional annotation of protein sequences that has been applied to the analysis of sets of sequences from complete genomes, both to refine overall performance and to make new discoveries comparable to those made by human experts. The GeneQuiz system includes a Web-based browser that allows examination of the evidence leading to an automatic annotation and offers additional information, views of the results, and links to biological databases that complement the automatic analysis. System structure and operating principles concerning the use of multiple sequence databases, underlying sequence analysis tools, lexical analyses of database annotations and decision criteria for functional assignments are detailed. The system makes automatic quality assessments of results based on prior experience with the underlying sequence analysis tools; overall error rates in functional assignment are estimated at 2.5-5% for cases annotated with highest reliability ('clear' cases). Sources of over-interpretation of results are discussed with proposals for improvement. A conservative definition for reporting 'new findings' that takes account of database maturity is presented along with examples of possible kinds of discoveries (new function, family and superfamily) made by the system. System performance in relation to sequence database coverage, database dynamics and database search methods is analysed, demonstrating the inherent advantages of an integrated automatic approach using multiple databases and search methods applied in an objective and repeatable manner. The GeneQuiz system

  18. Construction of cDNA library of Pyrocystis lunula (Pyrophyta)

    Science.gov (United States)

    Sui, Zhenghong; Kowallik, Klaus V.

    2004-10-01

    Complementary DNA library of a dinoflagellate Pyrocystis lunula was constructed for the purpose of expression sequence tags analysis. The RNA isolated from this alga was about 20µgg-1 net cells, and the band intensity ratio of 28S/18 S in electrophoresis pattern was nearly 1 to 1. Different cDNA/vector molar ratios were exploited in the ligating reaction to be optimized. The clones produced by cDNA/vector molar ratio of 3.75 to 1 were desirable, most of whose inserts were longer than 300 bp. The recombinants insert length of the unfractionation cDNA library was largely shorter than 500 bp. However, in the fractionation library made from high molecule weight cDNA parts, over seventy percent of the recombinants contained inserts longer than 1 kb, some of which were even longer than 3 kb. Operating concerns were discussed at the end.

  19. KLONING cDNA HORMON PERTUMBUHAN DARI IKAN GURAME (Osphronemus gouramy

    Directory of Open Access Journals (Sweden)

    Estu Nugroho

    2016-11-01

    Full Text Available Penelitian mengenai kloning cDNA pengkode hormon pertumbuhan ikan gurame telah dilakukan. Tujuan dari penelitian ini adalah untuk memperoleh sekuens DNA komplemen hormon pertumbuhan sebagai langkah awal dalam rangka pengembangan teknologi rekayasa genetik ikan gurame. Empat buah kelenjar hifopisa ikan gurame digunakan sebagai bahan bakunya dan dilakukan proses ekstraksi RNA total dari kelenjar hipofisa, dilanjutkan dengan sintesis cDNA, amplifikasi PCR, purifikasi fragmen DNA dari gel, ligasi produk PCR dengan vektor kloning, transformasi dan inkubasi bakteri, seleksi koloni bakteri putih, isolasi plasmid, dan sekuensing. Hasil sekuensing menunjukkan bahwa panjang produk amplifikasi PCR adalah 843 bp yang menyandikan 204 asam amino residu dan mengandung sekuens-sekuens yang konserf untuk gen hormon pertumbuhan (GH. Analisis homologi menunjukkan kesamaan sekuens hasil isolasi antara 52,4%--97,6% dengan gen GH ikan lainnya, dengan persentase homologi tertinggi adalah dengan ikan sepat. Dengan demikian dapat disimpulkan bahwa sekuens hasil isolasi merupakan sekuens gen GH. Dari hasil analisis sekuens terlihat bahwa gen GH ikan gurame secara evolusi adalah konserf. Research on cDNA cloning encoded the gouramy growth hormone was conducted. The aim of the research was to get complementary DNA, cDNA, sequences of growth hormone as an initial step to develop genetic engineering of gouramy fish. Four pituitary glands of the gouramy were taken and then processed with total RNA extraction, and continued with cDNA synthesis, PCR amplification, DNA fragment purification from the gel, PCR product legation with cloning vector, transformation and incubation of bacteria, white colony bacteria selection, plasmid isolation and sequencing analysis. Sequencing result showed that the amplified PCR product length had 834 bp, encoding 204 amino acid residue and contained conserve sequence for GH (growth hormone gen. Homolog analysis showed sequence similarity of

  20. cDNA encoding the chicken ortholog of the mouse dilute gene product. Sequence comparison reveals a myosin I subfamily with conserved C-terminal domains.

    Science.gov (United States)

    Sanders, G; Lichte, B; Meyer, H E; Kilimann, M W

    1992-10-26

    We report the cDNA-deduced primary structure of the chicken counterpart of the murine dilute gene product, a member of the myosin I family. Comparison of the chicken and mouse sequences reveals a distinct pattern of domains of high and low sequence conservation. An internal deletion of 25 amino acids probably reflects differential mRNA processing. Compared with other myosin heavy chain molecules, sequence similarity is highest with the MYO2 gene product of Saccharomyces cerevisiae. The MYO2 protein, implicated in vectorial vesicle transport, is homologous to the dilute protein over practically its entire length. In addition, the C-terminal domain of the dilute protein is highly similar to a putative glutamic acid decarboxylase sequence cloned from mouse brain. Alternatively, this closely related clone might represent an isoform of the dilute protein derived from a second gene, potentially involved in genetic conditions related to dilute.

  1. cDNA cloning and immunological characterization of the rye grass allergen Lol p I.

    Science.gov (United States)

    Perez, M; Ishioka, G Y; Walker, L E; Chesnut, R W

    1990-09-25

    The complete amino acid sequence of two "isoallergenic" forms of Lol p I, the major rye grass (Lolium perenne) pollen allergen, was deduced from cDNA sequence analysis. cDNA clones isolated from a Lolium perenne pollen library contained an open reading frame coding for a 240-amino acid protein. Comparison of the nucleotide and deduced amino acid sequence of two of these clones revealed four changes at the amino acid level and numerous nucleotide differences. Both clones contained one possible asparagine-linked glycosylation site. Northern blot analysis shows one RNA species of 1.2 kilobases. Based on the complete amino acid sequence of Lol p I, overlapping peptides covering the entire molecule were synthesized. Utilizing these peptides we have identified a determinant within the Lol p I molecule that is recognized by human leukocyte antigen class II-restricted T cells obtained from persons allergic to rye grass pollen.

  2. Molecular cloning of lupin leghemoglobin cDNA

    DEFF Research Database (Denmark)

    Konieczny, A; Jensen, E O; Marcker, K A

    1987-01-01

    Poly(A)+ RNA isolated from root nodules of yellow lupin (Lupinus luteus, var. Ventus) has been used as a template for the construction of a cDNA library. The ds cDNA was synthesized and inserted into the Hind III site of plasmid pBR 322 using synthetic Hind III linkers. Clones containing sequences...... specific for nodules were selected by differential colony hybridization using 32P-labeled cDNA synthesized either from nodule poly(A)+ RNA or from poly(A)+ RNA of uninfected root as probes. Among the recombinant plasmids, the cDNA gene for leghemoglobin was identified. The protein structure derived from...... its nucleotide sequence was consistent with known amino acid sequence of lupin Lb II. The cloned lupin Lb cDNA hybridized to poly(A)+ RNA from nodules only, which is in accordance with the general concept, that leghemoglobin is expressed exclusively in nodules. Udgivelsesdato: 1987-null...

  3. Whole genome sequence phylogenetic analysis of four Mexican rabies viruses isolated from cattle.

    Science.gov (United States)

    Bárcenas-Reyes, I; Loza-Rubio, E; Cantó-Alarcón, G J; Luna-Cozar, J; Enríquez-Vázquez, A; Barrón-Rodríguez, R J; Milián-Suazo, F

    2017-08-01

    Phylogenetic analysis of the rabies virus in molecular epidemiology has been traditionally performed on partial sequences of the genome, such as the N, G, and P genes; however, that approach raises concerns about the discriminatory power compared to whole genome sequencing. In this study we characterized four strains of the rabies virus isolated from cattle in Querétaro, Mexico by comparing the whole genome sequence to that of strains from the American, European and Asian continents. Four cattle brain samples positive to rabies and characterized as AgV11, genotype 1, were used in the study. A cDNA sequence was generated by reverse transcription PCR (RT-PCR) using oligo dT. cDNA samples were sequenced in an Illumina NextSeq 500 platform. The phylogenetic analysis was performed with MEGA 6.0. Minimum evolution phylogenetic trees were constructed with the Neighbor-Joining method and bootstrapped with 1000 replicates. Three large and seven small clusters were formed with the 26 sequences used. The largest cluster grouped strains from different species in South America: Brazil, and the French Guyana. The second cluster grouped five strains from Mexico. A Mexican strain reported in a different study was highly related to our four strains, suggesting common source of infection. The phylogenetic analysis shows that the type of host is different for the different regions in the American Continent; rabies is more related to bats. It was concluded that the rabies virus in central Mexico is genetically stable and that it is transmitted by the vampire bat Desmodus rotundus. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Molecular cloning of lupin leghemoglobin cDNA

    DEFF Research Database (Denmark)

    Konieczny, A; Jensen, E O; Marcker, K A

    1987-01-01

    Poly(A)+ RNA isolated from root nodules of yellow lupin (Lupinus luteus, var. Ventus) has been used as a template for the construction of a cDNA library. The ds cDNA was synthesized and inserted into the Hind III site of plasmid pBR 322 using synthetic Hind III linkers. Clones containing sequences...... its nucleotide sequence was consistent with known amino acid sequence of lupin Lb II. The cloned lupin Lb cDNA hybridized to poly(A)+ RNA from nodules only, which is in accordance with the general concept, that leghemoglobin is expressed exclusively in nodules. Udgivelsesdato: 1987-null...

  5. Analysis of expressed sequence tags from the Ulva prolifera (Chlorophyta)

    Science.gov (United States)

    Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song

    2010-01-01

    In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera ( Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related with chloroplast and ribosomal protein, GO functional classification showed 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. All the evidences displayed that U. prolifera had substance and energy foundation for the intense photosynthesis and the rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).

  6. Construction of cDNA library from intestine, mesentery and coelomocyte of Apostichopus japonicus Selenka infected with Vibrio sp. and a preliminary analysis of immunity-related genes

    Science.gov (United States)

    Liu, Hongzhan; Zheng, Fengrong; Sun, Xiuqin; Cai, Yimei

    2012-06-01

    The aquaculture of sea cucumber Apostichopus japonicus (Echinodermata, Holothuroidea) has grown rapidly during recent years and has become an important sector of the marine industry in Northern China. However, with the rapid growth of the industry and the use of non-standard culture techniques, epidemic diseases of A. japonicus now pose increasing problems to the industry. To screen the genes with stress response to bacterial infection in sea cucumber at a genome wide level, we constructed a cDNA library from A. japonicus Selenka (Aspidochirotida: Stichopodidae) after infecting them with Vibrio sp. for 48 h. Total RNA was extracted from the intestine, mesentery and coelomocyte of infected sea cucumber using Trizol and mRNA was isolated by Oligotex mRNA Kits. The ligated cDNAs were transformed into DH5α, and a library of 3.24×105 clones (3.24×105 cfu mL-1) was obtained with the sizes of inserted fragments ranging from 0.8 to 2.5 kb. Sequencing the cDNA clones resulted in a total of 1106 ESTs that passed the quality control. BlastX and BlastN searches have identified 168 (31.5%) ESTs sharing significant homology with known sequences in NCBI protein or nucleotide databases. Among a panel of 25 putative immunity-related genes, serum lectin isoform, complement component 3, complement component 3-like genes were further studied by real-time PCR and they all increased more than 5 fold in response to Vibrio sp. challenge. Our library provides a valuable molecular tool for future study of invertebrate immunity against bacterial infection and our gene expression data indicates the importance of the immune system in the evolution and development of sea cucumber.

  7. Identification and expression analysis of cDNA encoding chitinase-like protein (CLP) gene in Japanese scallop Mizuhopecten yessoensis.

    Science.gov (United States)

    Gao, L; Xu, G J; Su, H; Gao, X G; Li, Y F; Bao, X B; Liu, W D; He, C B

    2014-12-18

    Chitinase-like proteins (CLP) are important members of the glycoside hydrolase family 18 (GH18) and are involved in growth control and remodeling processes. In this study, a CLP transcript was isolated and sequenced from the Japanese scallop (Mizuhopecten yessoensis) after screening expressed sequence tags. The full-length complementary DNA of M. yessoensis CLP (My-Clp1) was 1555 bp in length, consisting of a 75-bp 5'-untranslated region (UTR), a 160-bp 3'-UTR, and a 1320-bp open reading frame bearing characteristics of the GH18 family. The My-Clp1 protein was well conserved, with similar domain structures and architecture across species (e.g., from mollusks to mammals). Expression analysis in healthy tissues and across developmental stages revealed a strong preference for expression; My-Clp1 was abundantly expressed in the mantle and throughout metamorphosis, which suggests the involvement of My-Clp1 in the synthesis of extracellular components, and tissue degeneration and remodeling. My-Clp1 expression was induced after infection with a bacterial pathogen, Vibrio anguillarum, suggesting its involvement in immunity against this intracellular pathogen.

  8. Construction of equalized short hairpin RNA library from human brain cDNA.

    Science.gov (United States)

    Xu, Lei; Li, Jingqi; Liu, Li; Lu, Lixia; Gao, Jingxia; Li, Xueli

    2007-02-20

    Short hairpin RNA (shRNA) library is a powerful new tool for high-throughput loss-of-function genetic screens in mammalian cells. An shRNA library can be constructed from synthetic oligonucleotides or enzymatically cleaved natural cDNA. Here, we describe a new method for constructing equalized shRNA libraries from cDNA. First, enzymatically digested cDNA fragments are equalized by a suppression PCR-based method modified from suppression subtractive hybridization. The efficiency of equalization was confirmed by quantitative real-time PCR. The fragments are then converted into an shRNA library by a series of enzymatic treatments. With this new technology, we constructed a library from human brain cDNA. Sequence analysis showed that most of the randomly selected clones had inverted repeat sequences converted from different cDNA. After transfecting HEK 293T cells and detecting gene expression, three out of eight clones were demonstrated to significantly inhibit their target genes.

  9. Cloning, expression, and mapping of GDP-D-mannose pyrophosphorylase cDNA from tomato (Lycopersicon esculentum).

    Science.gov (United States)

    Zou, Li-Ping; Li, Han-Xia; Ouyang, Bo; Zhang, Jun-Hong; Ye, Zhi-Biao

    2006-08-01

    GDP-D-mannose pyrophosphorylase (GMP, EC 2.7.7.22) catalyzes the synthesis of GDP-D-mannose and represents the first committed step in plant ascorbic acid biosynthesis. Using potato GMP cDNA sequence as a querying probe, 65 highly homologous tomato ESTs were obtained from dbEST of GenBank and the putative cDNA sequence of tomato GMP was assembled. The full-length GMP cDNA of tomato was cloned by RACE-PCR with primers designed according to the assembled cDNA sequence. The full-length cDNA sequence contained a complete open reading frame (ORF) of 1,086 bp, which encoded 361 amino acid residues. This gene was designated as LeGMP (GenBank accession No. AY605668). Homology analysis of LeGMP showed a 96% identity with potato GMP and the deduced amino acid showed 99%, 97%, 91% and 89% homology with GMP from potato, tobacco, alfalfa and Arabidopsis thaliana, respectively. Northern blot analysis showed that LeGMP was constitutively expressed in roots, stems, leaves, flowers and fruits of tomato; but the expression levels varied. LeGMP was mapped to 3-D using 75 tomato introgression lines (ILs), each containing a single homozygous RFLP-defined chromosome segment from the green-fruited species Lycopersicon pennellii.

  10. Single pass cDNA sequencing - a powerful tool to analyse gene expression in preparasytic juveniles of the southern root-knot nematode Meliodogyne incognita

    NARCIS (Netherlands)

    Dautova, M.; Rosso, M.N.; Abad, P.; Gommers, F.L.; Bakker, J.; Smant, G.

    2001-01-01

    Expressed sequence tags (EST) have been widely used to assist in gene discovery in various organisms (e.g., Arabidopsis thaliana, Caenorhabditis elegans, Mus musculus, and Homo sapiens). In this paper we describe an EST project, which aims to investigate gene expression in Meloidogyne incognita at

  11. Yeast two-hybrid analysis of a human trabecular meshwork cDNA library identified EFEMP2 as a novel PITX2 interacting protein.

    Science.gov (United States)

    Acharya, Moulinath; Sharp, Michael W; Mirzayans, Farideh; Footz, Tim; Huang, Lijia; Birdi, Chanchal; Walter, Michael A

    2012-01-01

    Mutations in the homeobox transcription factor paired-like homeodomain transcription factor 2 (PITX2) cause Axenfeld-Reiger syndrome (ARS), which is associated with anterior segment dysgenesis (ASD) and glaucoma. To understand ARS pathogenesis, it is essential to know the normal functions of PITX2 and the proteins with which PITX2 interacts in the eye. Therefore, we used a unique cDNA library that we created from human trabecular meshwork (TM) primary cells to discover PITX2-interacting proteins (PIPs). A human TM cDNA library was created from primary cells in the ProQuest Two-Hybrid prey vector: pEXP-AD502. Human PITX2A and PITX2C isoforms were used independently as "bait" to identify novel PIPs. A total of 1.25×10⁶ clones were screened by yeast two-hybrid (Y2H) analyses. PIPs obtained from each Y2H experiment were confirmed by yeast retransformation and mammalian co-immunoprecipitation assays. EGF-containing fibulin-like extracellular matrix protein 2 (EFEMP2) was identified by both PITX2A and PITX2C isoforms as a novel PIP from Y2H analyses. EFEMP2 is 443 amino acids long with six epidermal growth factor (EGF)-like modules and one fibulin-like module. The PITX2-interaction domain in EFEMP2 lies between the second EGF-like module and the COOH-terminal fibulin-like module. Co-immunoprecipitation assays in COS-7 cells confirmed the interaction between PITX2 and EFEMP2. We discovered EFEMP2 as a novel PITX2-interacting protein. Further, our cDNA library made from human TM primary cells is a unique and effective resource to identify novel interacting proteins for glaucoma and ASD candidates. This resource could be used both for discovery and validation of interactomes identified from in silico analysis.

  12. Sequence analysis by iterated maps, a review.

    Science.gov (United States)

    Almeida, Jonas S

    2014-05-01

    Among alignment-free methods, Iterated Maps (IMs) are on a particular extreme: they are also scale free (order free). The use of IMs for sequence analysis is also distinct from other alignment-free methodologies in being rooted in statistical mechanics instead of computational linguistics. Both of these roots go back over two decades to the use of fractal geometry in the characterization of phase-space representations. The time series analysis origin of the field is betrayed by the title of the manuscript that started this alignment-free subdomain in 1990, 'Chaos Game Representation'. The clash between the analysis of sequences as continuous series and the better established use of Markovian approaches to discrete series was almost immediate, with a defining critique published in same journal 2 years later. The rest of that decade would go by before the scale-free nature of the IM space was uncovered. The ensuing decade saw this scalability generalized for non-genomic alphabets as well as an interest in its use for graphic representation of biological sequences. Finally, in the past couple of years, in step with the emergence of BigData and MapReduce as a new computational paradigm, there is a surprising third act in the IM story. Multiple reports have described gains in computational efficiency of multiple orders of magnitude over more conventional sequence analysis methodologies. The stage appears to be now set for a recasting of IMs with a central role in processing nextgen sequencing results.

  13. Construction and analysis of an SSH cDNA library of early heat-induced genes of Vigna aconitifolia variety RMO-40.

    Science.gov (United States)

    Rampuria, Sakshi; Joshi, Uma; Palit, Paramita; Deokar, Amit A; Meghwal, Raju R; Mohapatra, T; Srinivasan, R; Bhatt, K V; Sharma, Ramavtar

    2012-11-01

    Moth bean ( Vigna aconitifolia (Jacq.) Marechal) is an important grain legume crop grown in rain fed areas of hot desert regions of Thar, India, under scorching sun rays with very little supplementation of water. An SSH cDNA library was generated from leaf tissues of V. aconitifolia var. RMO-40 exposed to an elevated temperature of 42 °C for 5 min to identify early-induced genes. A total of 488 unigenes (114 contigs and 374 singletons) were derived by cluster assembly and sequence alignment of 738 ESTs; out of 206 ESTs (28%) of unknown proteins, 160 ESTs (14%) were found to be novel to moth bean. Only 578 ESTs (78%) showed significant BLASTX similarity (<1 × 10(-6)) in the NCBI non-redundant database. Gene ontology functional classification terms were retrieved for 479 (65%) sequences, and 339 sequences were annotated with 165 EC codes and mapped to 68 different KEGG pathways. Four hundred and fifty-two ESTs were further annotated with InterProScan (IPS), and no IPS was assigned to 153 ESTs. In addition, the expression level of 27 ESTs in response to heat stress was evaluated through semiquantitative RT-PCR assay. Approximately 20 different signaling genes and 16 different transcription factors have been shown to be associated with heat stress in moth bean for the first time.

  14. Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays.

    Science.gov (United States)

    Brenner, S; Johnson, M; Bridgham, J; Golda, G; Lloyd, D H; Johnson, D; Luo, S; McCurdy, S; Foy, M; Ewan, M; Roth, R; George, D; Eletr, S; Albrecht, G; Vermaas, E; Williams, S R; Moon, K; Burcham, T; Pallas, M; DuBridge, R B; Kirchner, J; Fearon, K; Mao, J; Corcoran, K

    2000-06-01

    We describe a novel sequencing approach that combines non-gel-based signature sequencing with in vitro cloning of millions of templates on separate 5 microm diameter microbeads. After constructing a microbead library of DNA templates by in vitro cloning, we assembled a planar array of a million template-containing microbeads in a flow cell at a density greater than 3x10(6) microbeads/cm2. Sequences of the free ends of the cloned templates on each microbead were then simultaneously analyzed using a fluorescence-based signature sequencing method that does not require DNA fragment separation. Signature sequences of 16-20 bases were obtained by repeated cycles of enzymatic cleavage with a type IIs restriction endonuclease, adaptor ligation, and sequence interrogation by encoded hybridization probes. The approach was validated by sequencing over 269,000 signatures from two cDNA libraries constructed from a fully sequenced strain of Saccharomyces cerevisiae, and by measuring gene expression levels in the human cell line THP-1. The approach provides an unprecedented depth of analysis permitting application of powerful statistical techniques for discovery of functional relationships among genes, whether known or unknown beforehand, or whether expressed at high or very low levels.

  15. Anchoring a Defined Sequence to the 55' Ends of mRNAs : The Bolt to Clone Rare Full Length mRNAs and Generate cDNA Libraries porn a Few Cells.

    Science.gov (United States)

    Baptiste, J; Milne Edwards, D; Delort, J; Mallet, J

    1993-01-01

    Among numerous applications, the polymerase chain reaction (PCR) (1,2) provides a convenient means to clone 5' ends of rare mRNAs and to generate cDNA libraries from tissue available in amounts too low to be processed by conventional methods. Basically, the amplification of cDNAs by the PCR requires the availability of the sequences of two stretches of the molecule to be amplified. A sequence can easily be imposed at the 5' end of the first-strand cDNAs (corresponding to the 3' end of the mRNAs) by priming the reverse transcription with a specific primer (for cloning the 5' end of rare messenger) or with an oligonucleotide tailored with a poly (dT) stretch (for cDNA library construction), taking advantage of the poly (A) sequence that is located at the 3' end of mRNAs. Several strategies have been devised to tag the 3' end of the ss-cDNAs (corresponding to the 55' end of the mRNAs). We (3) and others have described strategies based on the addition of a homopolymeric dG (4,5) or dA (6,7) tail using terminal deoxyribonucleotide transferase (TdT) ("anchor-PCR" [4]). However, this strategy has important limitations. The TdT reaction is difficult to control and has a low efficiency (unpublished observations). But most importantly, the return primers containing a homopolymeric (dC or dT) tail generate nonspecific amplifications, a phenomenon that prevents the isolation of low abundance mRNA species and/or interferes with the relative abundance of primary clones in the library. To circumvent these drawbacks, we have used two approaches. First, we devised a strategy based on a cRNA enrichment procedure, which has been useful to eliminate nonspecific-PCR products and to allow detection and cloning of cDNAs of low abundance (3). More recently, to avoid the nonspecific amplification resulting from the annealing of the homopolymeric tail oligonucleotide, we have developed a novel anchoring strategy that is based on the ligation of an oligonucleotide to the 35' end of ss

  16. Preliminary hazard analysis using sequence tree method

    International Nuclear Information System (INIS)

    Huang Huiwen; Shih Chunkuan; Hung Hungchih; Chen Minghuei; Yih Swu; Lin Jiinming

    2007-01-01

    A system level PHA using sequence tree method was developed to perform Safety Related digital I and C system SSA. The conventional PHA is a brainstorming session among experts on various portions of the system to identify hazards through discussions. However, this conventional PHA is not a systematic technique, the analysis results strongly depend on the experts' subjective opinions. The analysis quality cannot be appropriately controlled. Thereby, this research developed a system level sequence tree based PHA, which can clarify the relationship among the major digital I and C systems. Two major phases are included in this sequence tree based technique. The first phase uses a table to analyze each event in SAR Chapter 15 for a specific safety related I and C system, such as RPS. The second phase uses sequence tree to recognize what I and C systems are involved in the event, how the safety related systems work, and how the backup systems can be activated to mitigate the consequence if the primary safety systems fail. In the sequence tree, the defense-in-depth echelons, including Control echelon, Reactor trip echelon, ESFAS echelon, and Indication and display echelon, are arranged to construct the sequence tree structure. All the related I and C systems, include digital system and the analog back-up systems are allocated in their specific echelon. By this system centric sequence tree based analysis, not only preliminary hazard can be identified systematically, the vulnerability of the nuclear power plant can also be recognized. Therefore, an effective simplified D3 evaluation can be performed as well. (author)

  17. Molecular cloning and characterization of a cDNA encoding ...

    African Journals Online (AJOL)

    enoh

    2012-03-29

    Nanjing) co., Ltd. The nucleotide sequences of these primers are as follows: ..... Ebizuka Y (2000). Molecular cloning and characterization of a cDNA for Glycyrrhiza glabra cycloartenol synthase. Biol. Pharm. Bull. 23(2):231-234.

  18. Information theory applications for biological sequence analysis.

    Science.gov (United States)

    Vinga, Susana

    2014-05-01

    Information theory (IT) addresses the analysis of communication systems and has been widely applied in molecular biology. In particular, alignment-free sequence analysis and comparison greatly benefited from concepts derived from IT, such as entropy and mutual information. This review covers several aspects of IT applications, ranging from genome global analysis and comparison, including block-entropy estimation and resolution-free metrics based on iterative maps, to local analysis, comprising the classification of motifs, prediction of transcription factor binding sites and sequence characterization based on linguistic complexity and entropic profiles. IT has also been applied to high-level correlations that combine DNA, RNA or protein features with sequence-independent properties, such as gene mapping and phenotype analysis, and has also provided models based on communication systems theory to describe information transmission channels at the cell level and also during evolutionary processes. While not exhaustive, this review attempts to categorize existing methods and to indicate their relation with broader transversal topics such as genomic signatures, data compression and complexity, time series analysis and phylogenetic classification, providing a resource for future developments in this promising area.

  19. Digital image sequence processing, compression, and analysis

    CERN Document Server

    Reed, Todd R

    2004-01-01

    IntroductionTodd R. ReedCONTENT-BASED IMAGE SEQUENCE REPRESENTATIONPedro M. Q. Aguiar, Radu S. Jasinschi, José M. F. Moura, andCharnchai PluempitiwiriyawejTHE COMPUTATION OF MOTIONChristoph Stiller, Sören Kammel, Jan Horn, and Thao DangMOTION ANALYSIS AND DISPLACEMENT ESTIMATION IN THE FREQUENCY DOMAINLuca Lucchese and Guido Maria CortelazzoQUALITY OF SERVICE ASSESSMENT IN NEW GENERATION WIRELESS VIDEO COMMUNICATIONSGaetano GiuntaERROR CONCEALMENT IN DIGITAL VIDEOFrancesco G.B. De NataleIMAGE SEQUENCE RESTORATION: A WIDER PERSPECTIVEAnil KokaramVIDEO SUMMARIZATIONCuneyt M. Taskiran and Edward

  20. Identification and expression analysis of peroxisome proliferator-activated receptors cDNA in a reptile, the leopard gecko (Eublepharis macularius).

    Science.gov (United States)

    Kato, Keisuke; Oka, Yoshitaka; Park, Min Kyun

    2008-05-01

    Despite the physiological and evolutionary significance of lipid metabolism in amniotes, the molecular mechanisms involved have been unclear in reptiles. To elucidate this, we investigated peroxisome proliferators-activated receptors (PPARs) in the leopard gecko (Eublepharis macularius). PPARs belong to a nuclear hormone-receptor family mainly involved in lipid metabolism. Although PPARs have been widely studied in mammals, little information about them is yet available from reptiles. We identified in the leopard gecko partial cDNA sequences of PPARalpha and beta, and full sequences of two isoforms of PPARgamma. This is the first report of reptilian PPARgamma mRNA isoforms. We also evaluated the organ distribution of expression of these genes by using RT-PCR and competitive PCR. The expression level of PPARalpha mRNA was highest in the large intestine, and moderate in the liver and kidney. The expression level of PPARbeta mRNA was highest in the kidney and large intestine, and moderate in the liver. Similarly to the expression of human PPARgamma isoforms, PPARgammaa was expressed ubiquitously, whereas the expression of PPARgammab was restricted. The highest levels of their expression, however, were observed in the large intestine, rather than in the adipose tissue as in mammals. Taken together, these results showed that the profile of PPARbeta mRNA expression in the leopard gecko is similar to that in mammals, and that those of PPAR alpha and gamma are species specific. This may reflect adaptation to annual changes in lipid storage due to seasonal food availability.

  1. OTU analysis using metagenomic shotgun sequencing data.

    Directory of Open Access Journals (Sweden)

    Xiaolin Hao

    Full Text Available Because of technological limitations, the primer and amplification biases in targeted sequencing of 16S rRNA genes have veiled the true microbial diversity underlying environmental samples. However, the protocol of metagenomic shotgun sequencing provides 16S rRNA gene fragment data with natural immunity against the biases raised during priming and thus the potential of uncovering the true structure of microbial community by giving more accurate predictions of operational taxonomic units (OTUs. Nonetheless, the lack of statistically rigorous comparison between 16S rRNA gene fragments and other data types makes it difficult to interpret previously reported results using 16S rRNA gene fragments. Therefore, in the present work, we established a standard analysis pipeline that would help confirm if the differences in the data are true or are just due to potential technical bias. This pipeline is built by using simulated data to find optimal mapping and OTU prediction methods. The comparison between simulated datasets revealed a relationship between 16S rRNA gene fragments and full-length 16S rRNA sequences that a 16S rRNA gene fragment having a length >150 bp provides the same accuracy as a full-length 16S rRNA sequence using our proposed pipeline, which could serve as a good starting point for experimental design and making the comparison between 16S rRNA gene fragment-based and targeted 16S rRNA sequencing-based surveys possible.

  2. Display of a maize cDNA library on baculovirus infected insect cells.

    Science.gov (United States)

    Meller Harel, Helene Y; Fontaine, Veronique; Chen, Hongying; Jones, Ian M; Millner, Paul A

    2008-08-12

    Maize is a good model system for cereal crop genetics and development because of its rich genetic heritage and well-characterized morphology. The sequencing of its genome is well advanced, and new technologies for efficient proteomic analysis are needed. Baculovirus expression systems have been used for the last twenty years to express in insect cells a wide variety of eukaryotic proteins that require complex folding or extensive posttranslational modification. More recently, baculovirus display technologies based on the expression of foreign sequences on the surface of Autographa californica (AcMNPV) have been developed. We investigated the potential of a display methodology for a cDNA library of maize young seedlings. We constructed a full-length cDNA library of young maize etiolated seedlings in the transfer vector pAcTMVSVG. The library contained a total of 2.5 x 10(5) independent clones. Expression of two known maize proteins, calreticulin and auxin binding protein (ABP1), was shown by western blot analysis of protein extracts from insect cells infected with the cDNA library. Display of the two proteins in infected insect cells was shown by selective biopanning using magnetic cell sorting and demonstrated proof of concept that the baculovirus maize cDNA display library could be used to identify and isolate proteins. The maize cDNA library constructed in this study relies on the novel technology of baculovirus display and is unique in currently published cDNA libraries. Produced to demonstrate proof of principle, it opens the way for the development of a eukaryotic in vivo display tool which would be ideally suited for rapid screening of the maize proteome for binding partners, such as proteins involved in hormone regulation or defence.

  3. Toward a cDNA map of the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Korenberg, J.R.; Chen, X.N. [Cedars-Sinai Research Institute, Los Angeles, CA (United States); Adams, M.D.; Venter, J.C. [Institute for Genomic Research, Gaithersburg, MD (United States)

    1995-09-20

    Advances in the Human Genome Project are shaping the strategies for identifying the 50,000-100,000 human genes. High-resolution genetic maps of the human genome combined with sequencing herald an era of rapid regional definition of disease genes. However, only once their chromosomes band location is known will the systematic partial sequencing of thousands of random cDNA clones provide the reagents for the rapid assessment of the genes responsible for the inherited disorders. We now present an approach to the rapid determination of map position and therefore to the creation of a transcribed map of the human genome. Sensitive fluorescence in situ hybridization has been combined with high-resolution chromosome banding and random cDNA sequencing to 41 cDNAs with an average insert size of < 2 kb to single human chromosome bands. The results provide 15 new genes, with database and functional information, as candidates for human disease. These include the large extracellular single-related kinase (HUMERK), the ERK activator kinase (PRKMK1), a new member of the RAS oncogene family, protein phosphotase 2 regulatory subunit B alpha isoform (PPP2R2A), and a novel human gene with very high homology to a plant membrane transport family. Further, an analysis of expressed genes associated with pseudogenes showed that by using these techniques, it is possible to detect accurately the transcribed locus within a multigene or processed pseudogene family in most cases. These findings suggest that direct cDNA mapping using fluorescence in situ hybridization provides an accurate and rapid approach to the definition of a transcribed map of the human genome. This low-cost, high-resolution (205 Mb) mapping greatly enhances the speed with which these genes can be subsequently assigned to contigs. This assignment provides a necessary first step in understanding the relationship of the genes to both acquired and inherited human diseases. 16 refs., 1 fig., 3 tabs.

  4. Comparative analysis of protein coding sequences from human, mouse, and the domesticated pig  

    DEFF Research Database (Denmark)

    Jørgensen, Frank Grønlund; Hobolth, Asger; Hornshøj, H.

    2005-01-01

    Background The availability of abundant sequence data from key model organisms has made large scale studies of molecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu...... indicate that a large fraction of these genes may have lost their function quite recently or may still be functional genes in some or all of the three mammalian species. Conclusions We present a comparative analysis of protein coding genes from three major mammalian lineages. Our study demonstrates...... the usefulness of codon-based likelihood models in detecting selection and it illustrates the value of sequencing organisms at different phylogenetic distances for comparative studies....

  5. [Cloning and sequence analysis of thioredoxin peroxidase gene from Taenia multiceps].

    Science.gov (United States)

    Li, Yong-guang; Li, Wen-hui; Gai, Wen-yan; Yao, Ju-xia; Qu, Zi-gang; Jia, Wan-zhong; Radu, Blaga; Fu, Bao-quan

    2011-02-28

    Protoscoleces of Taenia multiceps were collected from the naturally infected sheep and total RNA was extracted. Specific primers were designed according to TaHe2-D11 mRNA sequence and T. multiceps thioredoxin peroxidase gene (TmTPx) was amplified by RT-PCR. PCR products were ligated into pMD18-T vector and transformed to E. coli DH5alpha. The recombinant plasmids were identified by restriction digestion and sequencing. A 614 bp cDNA was amplified. The TmTPx open reading frame (591 bp) encoded a 196-amino acid protein with Mr 21,690, pI 7.61. Bioinformatics analysis indicated that TmTPx had a typical 2-Cys Prx conserved domain. Phylogenetic tree revealed that T. multiceps had the closest relationship to T. asiatica, followed by T. solium and T. crassiceps, E. granulosus and E. multilocularis.

  6. Isolation and analysis of genes mainly expressed in adult mouse heart using subtractive hybridization cDNA library.

    Science.gov (United States)

    Komurcu-Bayrak, Evrim; Ozsait, Bilge; Erginel-Unaltuna, Nihan

    2012-08-01

    Subtractive hybridization cDNA library (SHL) is one of the powerful approaches for isolating differentially expressed genes. Using this technique between mouse heart and skeletal muscle (skm) tissues, we aimed to construct a cDNA-library that was specific to heart tissue and to identify the potential candidate genes that might be responsible for the development of cardiac diseases or related pathophysiological conditions. In the first step of the study, we created a cDNA-library between mouse heart and skm tissues. The homologies of the randomly selected 215 clones were analyzed and then classified by function. A total of 146 genes were analyzed for their expression profiles in the heart and skm tissues in published mouse microarray dataset. In the second step, we analyzed the expression patterns of the selected genes by Northern blot and RNA in situ hybridization (RISH). In Northern blot analyses, the expression levels of Myl3, Myl2, Mfn2, Dcn, Pdlim4, mt-Co3, mt-Co1, Atpase6 and Tsc22d1 genes were higher in heart than skm. For first time with this study, expression patterns of Pdlim4 and Tsc22d1 genes in mouse heart and skm were shown by RISH. In the last step, 43 genes in this library were identified to have relationships mostly with cardiac diseases and/or related phenotypes. This is the first study reporting differentially expressed genes in healthy mouse heart using SHL technique. This study confirms our hypothesis that tissue-specific genes are most likely to have a disease association, if they possess mutations.

  7. Studies on the isolation, structural analysis and tissue localization of fetal antigen 1 and its relation to a human adrenal-specific cDNA, pG2

    DEFF Research Database (Denmark)

    Jensen, Charlotte Harken; Teisner, Børge; Højrup, Peter

    1993-01-01

    sequence was back-translated into the appropriate degenerate sequence of nucleic acids, fetal antigen 1 could be partially aligned to a 'human adrenal-specific mRNA, pG2'. The indirect immunoperoxidase technique demonstrated fetal antigen 1 in fetal hepatocytes, glandular cells of fetal pancreas...... is encoded by the mRNA defined by the cDNA clone pG2, but definitive sequencing and expression studies of this mRNA have not been achieved. Udgivelsesdato: 1993-Apr...

  8. Sequence Matching Analysis for Curriculum Development

    Directory of Open Access Journals (Sweden)

    Liem Yenny Bendatu

    2015-06-01

    Full Text Available Many organizations apply information technologies to support their business processes. Using the information technologies, the actual events are recorded and utilized to conform with predefined model. Conformance checking is an approach to measure the fitness and appropriateness between process model and actual events. However, when there are multiple events with the same timestamp, the traditional approach unfit to result such measures. This study attempts to develop a sequence matching analysis. Considering conformance checking as the basis of this approach, this proposed approach utilizes the current control flow technique in process mining domain. A case study in the field of educational process has been conducted. This study also proposes a curriculum analysis framework to test the proposed approach. By considering the learning sequence of students, it results some measurements for curriculum development. Finally, the result of the proposed approach has been verified by relevant instructors for further development.

  9. Generation and analysis of expressed sequence tags from the ciliate protozoan parasite Ichthyophthirius multifiliis

    Directory of Open Access Journals (Sweden)

    Arias Covadonga

    2007-06-01

    Full Text Available Abstract Background The ciliate protozoan Ichthyophthirius multifiliis (Ich is an important parasite of freshwater fish that causes 'white spot disease' leading to significant losses. A genomic resource for large-scale studies of this parasite has been lacking. To study gene expression involved in Ich pathogenesis and virulence, our goal was to generate expressed sequence tags (ESTs for the development of a powerful microarray platform for the analysis of global gene expression in this species. Here, we initiated a project to sequence and analyze over 10,000 ESTs. Results We sequenced 10,368 EST clones using a normalized cDNA library made from pooled samples of the trophont, tomont, and theront life-cycle stages, and generated 9,769 sequences (94.2% success rate. Post-sequencing processing led to 8,432 high quality sequences. Clustering analysis of these ESTs allowed identification of 4,706 unique sequences containing 976 contigs and 3,730 singletons. These unique sequences represent over two million base pairs (~10% of Plasmodium falciparum genome, a phylogenetically related protozoan. BLASTX searches produced 2,518 significant (E-value -5 hits and further Gene Ontology (GO analysis annotated 1,008 of these genes. The ESTs were analyzed comparatively against the genomes of the related protozoa Tetrahymena thermophila and P. falciparum, allowing putative identification of additional genes. All the EST sequences were deposited by dbEST in GenBank (GenBank: EG957858–EG966289. Gene discovery and annotations are presented and discussed. Conclusion This set of ESTs represents a significant proportion of the Ich transcriptome, and provides a material basis for the development of microarrays useful for gene expression studies concerning Ich development, pathogenesis, and virulence.

  10. Cloning of gibberellin 3 beta-hydroxylase cDNA and analysis of endogenous gibberellins in the developing seeds in watermelon.

    Science.gov (United States)

    Kang, Hong-Gyu; Jun, Sung-Hoon; Kim, Joonyul; Kawaide, Hiroshi; Kamiya, Yuji; An, Gynheung

    2002-02-01

    We have isolated Cv3h, a cDNA clone from the developing seeds of watermelon, and have demonstrated significant amino acid homology with gibberellin (GA) 3 beta-hydroxylases. This cDNA clone was expressed in Escherichia coli as a fusion protein that oxidized GA(9) and GA(12) to GA(4) and GA(14), respectively. The Cv3h protein had the highest similarity with pumpkin GA 2 beta,3 beta-hydroxylase, but did not possess 2 beta-hydroxylation function. RNA blot analysis showed that the gene was expressed primarily in the inner parts of developing seeds, up to 10 d after pollination (DAP). In the parthenocarpic fruits induced by treatment with 1-(2-chloro-4-pyridyl)-3-phenylurea (CPPU), the embryo and endosperm of the seeds were undeveloped, whereas the integumental tissues, of maternal origin, showed nearly normal development. Cv3h mRNA was undetectable in the seeds of CPPU-treated fruits, indicating that the GA 3 beta-hydroxylase gene was expressed in zygotic cells. In our analysis of endogenous GAs from developing seeds, GA(9) and GA(4) were detected at high levels but those of GA(20) and GA(1) were very low. This demonstrates that GA biosynthesis in seeds prefers a non-13-hydroxylation pathway over an early 13-hydroxylation pathway. We also analyzed endogenous GAs from seeds of the parthenocarpic fruits. The level of bioactive GA(4 )was much lower there than in normal seeds, indicating that bioactive GAs, unconnected with Cv3h, exist in integumental tissues during early seed development.

  11. Improved coverage of cDNA-AFLP by sequential digestion of immobilized cDNA

    Directory of Open Access Journals (Sweden)

    Karlovsky Petr

    2008-10-01

    Full Text Available Abstract Background cDNA-AFLP is a transcriptomics technique which does not require prior sequence information and can therefore be used as a gene discovery tool. The method is based on selective amplification of cDNA fragments generated by restriction endonucleases, electrophoretic separation of the products and comparison of the band patterns between treated samples and controls. Unequal distribution of restriction sites used to generate cDNA fragments negatively affects the performance of cDNA-AFLP. Some transcripts are represented by more than one fragment while other escape detection, causing redundancy and reducing the coverage of the analysis, respectively. Results With the goal of improving the coverage of cDNA-AFLP without increasing its redundancy, we designed a modified cDNA-AFLP protocol. Immobilized cDNA is sequentially digested with several restriction endonucleases and the released DNA fragments are collected in mutually exclusive pools. To investigate the performance of the protocol, software tool MECS (Multiple Enzyme cDNA-AFLP Simulation was written in Perl. cDNA-AFLP protocols described in the literatur and the new sequential digestion protocol were simulated on sets of cDNA sequences from mouse, human and Arabidopsis thaliana. The redundancy and coverage, the total number of PCR reactions, and the average fragment length were calculated for each protocol and cDNA set. Conclusion Simulation revealed that sequential digestion of immobilized cDNA followed by the partitioning of released fragments into mutually exclusive pools outperformed other cDNA-AFLP protocols in terms of coverage, redundancy, fragment length, and the total number of PCRs. Primers generating 30 to 70 amplicons per PCR provided the highest fraction of electrophoretically distinguishable fragments suitable for normalization. For A. thaliana, human and mice transcriptome, the use of two marking enzymes and three sequentially applied releasing enzymes for each

  12. Protein sequence analysis using Hewlett-Packard biphasic sequencing cartridges in an applied biosystems 473A protein sequencer.

    Science.gov (United States)

    Tang, S; Mozdzanowski, J; Anumula, K R

    1999-01-01

    Protein sequence analysis using an adsorptive biphasic sequencing cartridge, a set of two coupled columns introduced by Hewlett-Packard for protein sequencing by Edman degradation, in an Applied Biosystems 473A protein sequencer has been demonstrated. Samples containing salts, detergents, excipients, etc. (e.g., formulated protein drugs) can be easily analyzed using the ABI sequencer. Simple modifications to the ABI sequencer to accommodate the cartridge extend its utility in the analysis of difficult samples. The ABI sequencer solvents and reagents were compatible with the HP cartridge for sequencing. Sequence information up to ten residues can be easily generated by this nonoptimized procedure, and it is sufficient for identifying proteins by database search and for preparing a DNA probe for cloning novel proteins.

  13. Genome sequence and analysis of Lactobacillus helveticus

    Directory of Open Access Journals (Sweden)

    Paola eCremonesi

    2013-01-01

    Full Text Available The microbiological characterization of lactobacilli is historically well developed, but the genomic analysis is recent. Because of the widespread use of L. helveticus in cheese technology, information concerning the heterogeneity in this species is accumulating rapidly. Recently, the genome of five L. helveticus strains was sequenced to completion and compared with other genomically characterized lactobacilli. The genomic analysis of the first sequenced strain, L. helveticus DPC 4571, isolated from cheese and selected for its characteristics of rapid lysis and high proteolytic activity, has revealed a plethora of genes with industrial potential including those responsible for key metabolic functions such as proteolysis, lipolysis, and cell lysis. These genes and their derived enzymes can facilitate the production of cheese and cheese derivatives with potential for use as ingredients in consumer foods. In addition, L. helveticus has the potential to produce peptides with a biological function, such as angiotensin converting enzyme (ACE inhibitory activity, in fermented dairy products, demonstrating the therapeutic value of this species. A most intriguing feature of the genome of L. helveticus is the remarkable similarity in gene content with many intestinal lactobacilli. Comparative genomics has allowed the identification of key gene sets that facilitate a variety of lifestyles including adaptation to food matrices or the gastrointestinal tract.As genome sequence and functional genomic information continues to explode, key features of the genomes of L. helveticus strains continue to be discovered, answering many questions but also raising many new ones.

  14. Sequence analysis and over-expression of ribosomal protein S28 ...

    African Journals Online (AJOL)

    RPS28 is a component of the 40S small ribosomal subunit encoded by RPS28 gene, which is specific to eukaryotes. The cDNA and the genomic sequence of RPS28 were cloned successfully from the Giant Panda using RT-PCR technology and Touchdown-PCR, respectively. Both sequences were analyzed preliminarily ...

  15. Normalizing cDNA libraries.

    Science.gov (United States)

    Bogdanov, Ekaterina A; Shagina, Irina; Barsova, Ekaterina V; Kelmanson, Ilya; Shagin, Dmitry A; Lukyanov, Sergey A

    2010-04-01

    The characterization of rare messages in cDNA libraries is complicated by the substantial variations that exist in the abundance levels of different transcripts in cells and tissues. The equalization (normalization) of cDNA is a helpful approach for decreasing the prevalence of abundant transcripts, thereby facilitating the assessment of rare transcripts. This unit provides a method for duplex-specific nuclease (DSN)-based normalization, which allows for the fast and reliable equalization of cDNA, thereby facilitating the generation of normalized, full-length-enriched cDNA libraries, and enabling efficient RNA analyses. (c) 2010 by John Wiley & Sons, Inc.

  16. Generation and analysis of expressed sequence tags from NaCl-treated Glycine soja.

    Science.gov (United States)

    Ji, Wei; Li, Yong; Li, Jie; Dai, Cui-hong; Wang, Xi; Bai, Xi; Cai, Hua; Yang, Liang; Zhu, Yan-ming

    2006-02-22

    Salinization causes negative effects on plant productivity and poses an increasingly serious threat to the sustainability of agriculture. Wild soybean (Glycine soja) can survive in highly saline conditions, therefore provides an ideal candidate plant system for salt tolerance gene mining. As a first step towards the characterization of genes that contribute to combating salinity stress, we constructed a full-length cDNA library of Glycine soja (50109) leaf treated with 150 mM NaCl, using the SMART technology. Random expressed sequence tag (EST) sequencing of 2,219 clones produced 2,003 cleaned ESTs for gene expression analysis. The average read length of cleaned ESTs was 454 bp, with an average GC content of 40%. These ESTs were assembled using the PHRAP program to generate 375 contigs and 696 singlets. The resulting unigenes were categorized according to the Gene Ontology (GO) hierarchy. The potential roles of gene products associated with stress related ESTs were discussed. We compared the EST sequences of Glycine soja to that of Glycine max by using the blastn algorithm. Most expressed sequences from wild soybean exhibited similarity with soybean. All our EST data are available on the Internet (GenBank_Accn: DT082443-DT084445). The Glycine soja ESTs will be used to mine salt tolerance gene, whose full-length cDNAs will be obtained easily from the full-length cDNA library. Comparison of Glycine soja ESTs with those of Glycine max revealed the potential to investigate the wild soybean's expression profile using the soybean's gene chip. This will provide opportunities to understand the genetic mechanisms underlying stress response of plants.

  17. Identification of a Cryptic Bacterial Promoter in Mouse (mdr1a P-Glycoprotein cDNA.

    Directory of Open Access Journals (Sweden)

    Kristen M Pluchino

    Full Text Available The efflux transporter P-glycoprotein (P-gp is an important mediator of various pharmacokinetic parameters, being expressed at numerous physiological barriers and also in multidrug-resistant cancer cells. Molecular cloning of homologous cDNAs is an important tool for the characterization of functional differences in P-gp between species. However, plasmids containing mouse mdr1a cDNA display significant genetic instability during cloning in bacteria, indicating that mdr1a cDNA may be somehow toxic to bacteria, allowing only clones containing mutations that abrogate this toxicity to survive transformation. We demonstrate here the presence of a cryptic promoter in mouse mdr1a cDNA that causes mouse P-gp expression in bacteria. This expression may account for the observed toxicity of mdr1a DNA to bacteria. Sigma 70 binding site analysis and GFP reporter plasmids were used to identify sequences in the first 321 bps of mdr1a cDNA capable of initiating bacterial protein expression. An mdr1a M107L cDNA containing a single residue mutation at the proposed translational start site was shown to allow sub-cloning of mdr1a in E. coli while retaining transport properties similar to wild-type P-gp. This mutant mdr1a cDNA may prove useful for efficient cloning of mdr1a in E. coli.

  18. High-Throughput Plasmid cDNA Library Screening

    Energy Technology Data Exchange (ETDEWEB)

    Wan, Kenneth H.; Yu, Charles; George, Reed A.; Carlson, JosephW.; Hoskins, Roger A.; Svirskas, Robert; Stapleton, Mark; Celniker, SusanE.

    2006-05-24

    Libraries of cDNA clones are valuable resources foranalysing the expression, structure, and regulation of genes, as well asfor studying protein functions and interactions. Full-length cDNA clonesprovide information about intron and exon structures, splice junctionsand 5'- and 3'-untranslated regions (UTRs). Open reading frames (ORFs)derived from cDNA clones can be used to generate constructs allowingexpression of native proteins and N- or C-terminally tagged proteins.Thus, obtaining full-length cDNA clones and sequences for most or allgenes in an organism is critical for understanding genome functions.Expressed sequence tag (EST) sequencing samples cDNA libraries at random,which is most useful at the beginning of large-scale screening projects.However, as projects progress towards completion, the probability ofidentifying unique cDNAs via EST sequencing diminishes, resulting in poorrecovery of rare transcripts. We describe an adapted, high-throughputprotocol intended for recovery of specific, full-length clones fromplasmid cDNA libraries in five days.

  19. Gene structure, cDNA characterization and RNAi-based functional analysis of a myeloid differentiation factor 88 homolog in Tenebrio molitor larvae exposed to Staphylococcus aureus infection.

    Science.gov (United States)

    Patnaik, Bharat Bhusan; Patnaik, Hongray Howrelia; Seo, Gi Won; Jo, Yong Hun; Lee, Yong Seok; Lee, Bok Luel; Han, Yeon Soo

    2014-10-01

    Myeloid differentiation factor 88 (MyD88), an intracellular adaptor protein involved in Toll/Toll-like receptor (TLR) signal processing, triggers activation of nuclear factor-kappaB (NF-κB) transcription factors. In the present study, we analyzed the gene structure and biological function of MyD88 in a coleopteran insect, Tenebrio molitor (TmMyD88). The TmMyD88 gene was 1380 bp in length and consisted of five exons and four introns. The 5'-flanking sequence revealed several putative transcription factor binding sites, such as STAT-4, AP-1, cJun, cfos, NF-1 and many heat shock factor binding elements. The cDNA contained a typical death domain, a conservative Toll-like interleukin-1 receptor (TIR) domain, and a C-terminal extension (CTE). The TmMyD88 TIR domain showed three significantly conserved motifs for interacting with the TIR domain of TLRs. TmMyD88 was grouped within the invertebrate cluster of the phylogenetic tree and shared 75% sequence identity with the TIR domain of Tribolium castaneum MyD88. Homology modeling of the TmMyD88 TIR domain revealed five parallel β-strands surrounded by five α-helices that adopted loop conformations to function as an adaptor. TmMyD88 expression was upregulated 7.3- and 4.79-fold after 12 and 6h, respectively, of challenge with Staphylococcus aureus and fungal β-1,3 glucan. Silencing of the TmMyD88 transcript by RNA interference led to reduced resistance of the host to infection by S. aureus. These results indicate that TmMyD88 is required for survival against Staphylococcus infection. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Sequencing and analysis of full-length cDNAs, 5'-ESTs and 3'-ESTs from a cartilaginous fish, the elephant shark (Callorhinchus milii).

    KAUST Repository

    Brenner, Sydney

    2012-10-08

    Cartilaginous fishes are the most ancient group of living jawed vertebrates (gnathostomes) and are, therefore, an important reference group for understanding the evolution of vertebrates. The elephant shark (Callorhinchus milii), a holocephalan cartilaginous fish, has been identified as a model cartilaginous fish genome because of its compact genome (∼910 Mb) and a genome project has been initiated to obtain its whole genome sequence. In this study, we have generated and sequenced full-length enriched cDNA libraries of the elephant shark using the \\'oligo-capping\\' method and Sanger sequencing. A total of 6,778 full-length protein-coding cDNA and 10,701 full-length noncoding cDNA were sequenced from six tissues (gills, intestine, kidney, liver, spleen, and testis) of the elephant shark. Analysis of their polyadenylation signals showed that polyadenylation usage in elephant shark is similar to that in mammals. Furthermore, both coding and noncoding transcripts of the elephant shark use the same proportion of canonical polyadenylation sites. Besides BLASTX searches, protein-coding transcripts were annotated by Gene Ontology, InterPro domain, and KEGG pathway analyses. By comparing elephant shark genes to bony vertebrate genes, we identified several ancient genes present in elephant shark but differentially lost in tetrapods or teleosts. Only ∼6% of elephant shark noncoding cDNA showed similarity to known noncoding RNAs (ncRNAs). The rest are either highly divergent ncRNAs or novel ncRNAs. In addition to full-length transcripts, 30,375 5\\'-ESTs and 41,317 3\\'-ESTs were sequenced and annotated. The clones and transcripts generated in this study are valuable resources for annotating transcription start sites, exon-intron boundaries, and UTRs of genes in the elephant shark genome, and for the functional characterization of protein sequences. These resources will also be useful for annotating genes in other cartilaginous fishes whose genomes have been targeted for

  1. Comparative analysis of sequences from PT 2013

    DEFF Research Database (Denmark)

    Mikkelsen, Susie Sommer

    . All but one sequence mapped to the MCP gene while the last sequence mapped to the Neurofilament gene. Approx. half of the sequences contained no errors while the rest differed with 88-99 percent similarity with most having 99% similarity. One sequence, when BLASTed, showed most similarity to European...... Sheatfish and not EHNV. Generally, mistakes occurred at the ends of the sequences. This can be due to several factors. One is that the sequence has not been trimmed of the sequence primer sites. Another is the lack of quality control of the chromatogram. Finally, sequencing in just one direction can result...

  2. UNIVERSAL PRIMERS FOR THE AMPLIFICATION AND SEQUENCE ANALYSIS OF ACTIN-1 FROM DIVERSE MOSQUITO SPECIES

    Science.gov (United States)

    STALEY, MOLLY; DORMAN, KARIN S.; BARTHOLOMAY, LYRIC C.; FERNÁNDEZ-SALAS, ILDEFONSO; FARFAN-ALE, JOSE A.; LOROÑO-PINO, MARIA A.; GARCIA-REJON, JULIAN E.; IBARRA-JUAREZ, LUIS

    2010-01-01

    We report the development of universal primers for the reverse-transcription polymerase chain reaction (RT-PCR) amplification and nucleotide sequence analysis of actin cDNAs from taxonomically diverse mosquito species. Primers specific to conserved regions of the invertebrate actin-1 gene were designed after actin cDNA sequences of Anopheles gambiae, Bombyx mori, Drosophila melanogaster, and Caenorhabditis elegans. The efficacy of these primers was determined by RT-PCR with the use of total RNA from mosquitoes belonging to 30 species and 8 genera (Aedes, Anopheles, Culex, Deinocerites, Mansonia, Psorophora, Toxorhynchites, and Wyeomyia). The RT-PCR products were sequenced, and sequence data were used to design additional primers. One primer pair, denoted as Act-2F (5′-ATGGTCGGYATGGGNCAGAAGGACTC-3′) and Act-8R (5′-GATTCCATACCCAGGAAG-GADGG-3′), successfully amplified an RT-PCR product of the expected size (683-nt) in all mosquito spp. tested. We propose that this primer pair can be used as an internal control to test the quality of RNA from mosquitoes collected in vector surveillance studies. These primers can also be used in molecular experiments in which the detection, amplification or silencing of a ubiquitously expressed mosquito housekeeping gene is necessary. Sequence and phylogenetic data are also presented in this report. PMID:20649132

  3. Genomic organization, sequence characterization and expression analysis of Tenebrio molitor apolipophorin-III in response to an intracellular pathogen, Listeria monocytogenes.

    Science.gov (United States)

    Noh, Ju Young; Patnaik, Bharat Bhusan; Tindwa, Hamisi; Seo, Gi Won; Kim, Dong Hyun; Patnaik, Hongray Howrelia; Jo, Yong Hun; Lee, Yong Seok; Lee, Bok Luel; Kim, Nam Jung; Han, Yeon Soo

    2014-01-25

    Apolipophorin III (apoLp-III) is a well-known hemolymph protein having a functional role in lipid transport and immune response of insects. We cloned full-length cDNA encoding putative apoLp-III from larvae of the coleopteran beetle, Tenebrio molitor (TmapoLp-III), by identification of clones corresponding to the partial sequence of TmapoLp-III, subsequently followed with full length sequencing by a clone-by-clone primer walking method. The complete cDNA consists of 890 nucleotides, including an ORF encoding 196 amino acid residues. Excluding a putative signal peptide of the first 20 amino acid residues, the 176-residue mature apoLp-III has a calculated molecular mass of 19,146Da. Genomic sequence analysis with respect to its cDNA showed that TmapoLp-III was organized into four exons interrupted by three introns. Several immune-related transcription factor binding sites were discovered in the putative 5'-flanking region. BLAST and phylogenetic analyses reveal that TmapoLp-III has high sequence identity (88%) with Tribolium castaneum apoLp-III but shares little sequence homologies (molitor. Copyright © 2013 Elsevier B.V. All rights reserved.

  4. [Construction and immunoscreening of cDNA library of Babesia orientalis].

    Science.gov (United States)

    Liu, Qin; Zhou, Dan-Na; Zhou, Yan-Qin; Zhang, Ying; He, Lan; Yao, Bao-An; Zhao, Jun-Long

    2009-06-01

    To construct a cDNA library for Babesia orientalis and screen immunologically positive clones. Total RNA of B. orientalis in red blood cells from an infected calf was isolated. cDNA was synthesized by reverse transcriptase, amplified by PCR and ligated into lambdaTriplEx2 vector. The recombined vectors were packaged and the unamplified cDNA library was constructed. The cDNA library was then amplified and immunologically screened with rabbit anti-B. orientalis serum. The recombinant lambdaTriplEx2 of positive clones were converted to the corresponding recombinant pTriplEx2. The inserted fragments were identified by PCR amplification. The plasmids were sequenced and compared against GenBank database by Blast. The titer of the unamplified library was 2.0 x 10(6) pfu/ml. The inserted fragment length of the library ranged from 500 to 3,000 bp, and the recombination efficiency accounted for 98.8%. The titer of the amplified library was 5.8 x 10(8) pfu/ml. Three positive clones were selected by serum immunological screening and named B04, B05, and B41, respectively. The inserted fragments of the B04, B05 and B41 were about 1,300 bp, 1,000 bp, and 2,400 bp, respectively. Sequence analysis revealed that the 3 clones contained open reading frames. Blast results showed that they were highly homologous to the nuclear movement protein gene, the hypothetical protein gene and the heat shock protein 70 (HSP70) gene, respectively. The deduced amino acid sequences of B04, B05 and B41 contained 310, 192 and 647 amino acid residues, with Mr of 34,000, 21,000, and 70,700, respectively. A qualified cDNA library of B. orientalis has been constructed and three positive clones of B. orientalis discovered.

  5. Bayesian Correlation Analysis for Sequence Count Data.

    Directory of Open Access Journals (Sweden)

    Daniel Sánchez-Taltavull

    Full Text Available Evaluating the similarity of different measured variables is a fundamental task of statistics, and a key part of many bioinformatics algorithms. Here we propose a Bayesian scheme for estimating the correlation between different entities' measurements based on high-throughput sequencing data. These entities could be different genes or miRNAs whose expression is measured by RNA-seq, different transcription factors or histone marks whose expression is measured by ChIP-seq, or even combinations of different types of entities. Our Bayesian formulation accounts for both measured signal levels and uncertainty in those levels, due to varying sequencing depth in different experiments and to varying absolute levels of individual entities, both of which affect the precision of the measurements. In comparison with a traditional Pearson correlation analysis, we show that our Bayesian correlation analysis retains high correlations when measurement confidence is high, but suppresses correlations when measurement confidence is low-especially for entities with low signal levels. In addition, we consider the influence of priors on the Bayesian correlation estimate. Perhaps surprisingly, we show that naive, uniform priors on entities' signal levels can lead to highly biased correlation estimates, particularly when different experiments have widely varying sequencing depths. However, we propose two alternative priors that provably mitigate this problem. We also prove that, like traditional Pearson correlation, our Bayesian correlation calculation constitutes a kernel in the machine learning sense, and thus can be used as a similarity measure in any kernel-based machine learning algorithm. We demonstrate our approach on two RNA-seq datasets and one miRNA-seq dataset.

  6. A basic analysis toolkit for biological sequences

    Directory of Open Access Journals (Sweden)

    Siragusa Enrico

    2007-09-01

    Full Text Available Abstract This paper presents a software library, nicknamed BATS, for some basic sequence analysis tasks. Namely, local alignments, via approximate string matching, and global alignments, via longest common subsequence and alignments with affine and concave gap cost functions. Moreover, it also supports filtering operations to select strings from a set and establish their statistical significance, via z-score computation. None of the algorithms is new, but although they are generally regarded as fundamental for sequence analysis, they have not been implemented in a single and consistent software package, as we do here. Therefore, our main contribution is to fill this gap between algorithmic theory and practice by providing an extensible and easy to use software library that includes algorithms for the mentioned string matching and alignment problems. The library consists of C/C++ library functions as well as Perl library functions. It can be interfaced with Bioperl and can also be used as a stand-alone system with a GUI. The software is available at http://www.math.unipa.it/~raffaele/BATS/ under the GNU GPL.

  7. An analysis of sequence alignment: heuristic algorithms.

    Science.gov (United States)

    Bucak, I Ö; Uslan, V

    2010-01-01

    Sequence alignment becomes challenging with an increase in size and number of sequences. Finding optimal or near optimal solutions for sequence alignment is one of the most important operations in bioinformatics. This study aims to survey heuristics applied for the sequence alignment problem summarized in a time line.

  8. Whole genome sequence analysis of Mycobacterium suricattae

    KAUST Repository

    Dippenaar, Anzaan

    2015-10-21

    Tuberculosis occurs in various mammalian hosts and is caused by a range of different lineages of the Mycobacterium tuberculosis complex (MTBC). A recently described member, Mycobacterium suricattae, causes tuberculosis in meerkats (Suricata suricatta) in Southern Africa and preliminary genetic analysis showed this organism to be closely related to an MTBC pathogen of rock hyraxes (Procavia capensis), the dassie bacillus. Here we make use of whole genome sequencing to describe the evolution of the genome of M. suricattae, including known and novel regions of difference, SNPs and IS6110 insertion sites. We used genome-wide phylogenetic analysis to show that M. suricattae clusters with the chimpanzee bacillus, previously isolated from a chimpanzee (Pan troglodytes) in West Africa. We propose an evolutionary scenario for the Mycobacterium africanum lineage 6 complex, showing the evolutionary relationship of M. africanum and chimpanzee bacillus, and the closely related members M. suricattae, dassie bacillus and Mycobacterium mungi.

  9. Expressed sequence tag analysis of the soybean rust pathogen Phakopsora pachyrhizi.

    Science.gov (United States)

    Posada-Buitrago, Martha Lucia; Frederick, Reid D

    2005-12-01

    Soybean rust is caused by the obligate fungal pathogen Phakopsora pachyrhizi Sydow. A unidirectional cDNA library was constructed using mRNA isolated from germinating P. pachyrhizi urediniospores to identify genes expressed at this physiological stage. Single pass sequence analysis of 908 clones revealed 488 unique expressed sequence tags (ESTs, unigenes) of which 107 appeared as multiple copies. BLASTX analysis identified 189 unigenes with significant similarities (Evalue<10(-5)) to sequences deposited in the NCBI non-redundant protein database. A search against the NCBI dbEST using the BLASTN algorithm revealed 32 ESTs with high or moderate similarities to plant and fungal sequences. Using the Expressed Gene Anatomy Classification, 31.7% of these ESTs were involved in primary metabolism, 14.3% in gene/protein expression, 7.4% in cell structure and growth, 6.9% in cell division, 4.8% in cell signaling/cell communication, and 4.8% in cell/organism defense. Approximately 29.6% of the identities were to hypothetical proteins and proteins with unknown function.

  10. Cloning and sequence analysis of cDNA encoding a putative juvenile hormone esterase from the Colorado potato beetle.

    NARCIS (Netherlands)

    Vermunt, A.M.W.; Koopmanschap, A.B.; Vlak, J.M.; Kort, de C.A.D.

    1998-01-01

    In the Colorado potato beetle, Leptinotarsa decemlineata, reproduction and diapause are mediated by the juvenile hormone (JH) titer in the hemolymph. This titer is controlled by JH synthesis in the corpora allata and by JH degradation. The main pathway of JH degradation is by JH esterase in the

  11. Cloning of the cDNA for human 12-lipoxygenase

    International Nuclear Information System (INIS)

    Izumi, T.; Hoshiko, S.; Radmark, O.; Samuelsson, B.

    1990-01-01

    A full-length cDNA clone encoding 12-lipoxygenase was isolated from a human platelet cDNA library by using a cDNA for human reticulocyte 15-lipoxygenase as probe for the initial screening. The cDNA had an open reading frame encoding 662 amino acid residues with a calculated molecular weight of 75,590. Three independent clones revealed minor heterogeneities in their DNA sequences. Thus, in three positions of the deduced amino acid sequence, there is a choice between two different amino acids. The deduced sequence from the clone plT3 showed 65% identity with human reticulocyte 15-lipoxygenase and 42% identity with human leukocyte 5-lipoxygenase. The 12-lipoxygenase cDNA recognized a 3.0-kilobase mRNA species in platelets and human erythroleukemia cells (HEL cells). Phorbol 12-tetradecanoyl 13-acetate induced megakaryocytic differentiation of HEL cells and 12-lipoxygenase activity and increased mRNA for 12-lipoxygenase. The identity of the cloned 12-lipoxygenase was assured by expression in a mammalian cell line (COS cells). Human platelet 12-lipoxygenase has been difficult to purify to homogeneity. The cloning of this cDNA will increase the possibilities to elucidate the structure and function of this enzyme

  12. MLPA and cDNA analysis improves COL4A5 mutation detection in X-linked Alport syndrome

    DEFF Research Database (Denmark)

    Hertz, JM; Juncker, I; Marcussen, N

    2008-01-01

    for 10-15% of mutations. We have established a method for mutation analysis of COL4A5 based on reverse transcriptase-polymerase chain reaction analysis of mRNA from cultured skin fibroblasts and multiplex ligation-dependent probe amplification (MLPA) on genomic DNA. One advantage of using skin biopsies...

  13. Construction and selection of subtracted cDNA library of mouse hepatocarcinoma cell lines with different lymphatic metastasis potential

    Science.gov (United States)

    Hou, Li; Tang, Jan-Wu; Cui, Xiao-Nan; Wang, Bo; Song, Bo; Sun, Lei

    2004-01-01

    AIM: In order to elucidate the molecular mechanism of lymphatic metastasis of hepatocarcinoma, we detected the difference of gene expression between mouse hepatocarcinoma cell lines Hca-F and Hca-P with different lymphatic metastasis potential. METHODS: cDNA of Hca-F cells was used as a tester and cDNA of Hca-P cells was used as a driver. cDNAs highly expressed in Hca-F cells were isolated by the suppression subtractive hybridization (SSH) method. The isolated cDNA was cloned into T/A cloning vector. The ligation products were transformed into DH5 α competent cells. Individual clones were randomly selected and used for PCR amplification. Vector DNA from positive clones was isolated for sequencing. RESULTS: There were 800 positive clones in amplified subtracted cDNA library. Random analysis of 160 clones with PCR showed that 95% of the clones contained 100-700 bp inserts. Analysis of 20 sequenced cDNA clones randomly picked from the SSH library revealed 4 known genes (mouse heat shock protein 84 ku, DNA helicase, ribosomal protein S13 ,ethanol induced 6 gene) and 3 expressed sequence tags (ESTs). Four cDNAs showed no homology and presumably represent novel genes. CONCLUSION: A subtracted cDNA library of differentially expressed genes in mouse heptocarcinoma cell lines with different lymphatic metastasis potential was successfully constructed with SSH and T/A cloning techniques. The library is efficient and lays a solid foundation for searching new lymphatic metastasis related genes. The expression of mouse heat shock protein gene, DNA helicase and other 4 novel gene may be different between mouse heptocarcinoma cell lines with different lymphatic metastasis potential. PMID:15285011

  14. Soluble forms of tumor necrosis factor receptors (TNF-Rs). The cDNA for the type I TNF-R, cloned using amino acid sequence data of its soluble form, encodes both the cell surface and a soluble form of the receptor

    DEFF Research Database (Denmark)

    Nophar, Y; Kemper, O; Brakebusch, C

    1990-01-01

    the extracellular domain of the type I TNF-R matches the COOH-terminal sequence of TBPI. Amino acid sequences in the extracellular domain also fully match other sequences found in TBPI. On the other hand, amino acid sequences in the soluble form of the type II TNF-R (TBPII), while indicating a marked homology...... found to have effects characteristic of TNF, including stimulating phosphorylation of specific cellular proteins. Oligonucleotide probes designed on the basis of the NH2-terminal amino acid sequence of TBPI were used to clone the cDNA for the structurally related cell surface type 1 TNF-R. It is notable...... that although this receptor can signal the phosphorylation of cellular proteins, it appears from its amino acid sequence to be devoid of intrinsic protein kinase activity. The extracellular domain of the receptor is composed of four internal cysteine-rich repeats, homologous to structures repeated four times...

  15. RIKEN Integrated Sequence Analysis (RISA) System—384-Format Sequencing Pipeline with 384 Multicapillary Sequencer

    Science.gov (United States)

    Shibata, Kazuhiro; Itoh, Masayoshi; Aizawa, Katsunori; Nagaoka, Sumiharu; Sasaki, Nobuya; Carninci, Piero; Konno, Hideaki; Akiyama, Junichi; Nishi, Katsuo; Kitsunai, Tokuji; Tashiro, Hideo; Itoh, Mari; Sumi, Noriko; Ishii, Yoshiyuki; Nakamura, Shin; Hazama, Makoto; Nishine, Tsutomu; Harada, Akira; Yamamoto, Rintaro; Matsumoto, Hiroyuki; Sakaguchi, Sumito; Ikegami, Takashi; Kashiwagi, Katsuya; Fujiwake, Syuji; Inoue, Kouji; Togawa, Yoshiyuki; Izawa, Masaki; Ohara, Eiji; Watahiki, Masanori; Yoneda, Yuko; Ishikawa, Tomokazu; Ozawa, Kaori; Tanaka, Takumi; Matsuura, Shuji; Kawai, Jun; Okazaki, Yasushi; Muramatsu, Masami; Inoue, Yorinao; Kira, Akira; Hayashizaki, Yoshihide

    2000-01-01

    The RIKEN high-throughput 384-format sequencing pipeline (RISA system) including a 384-multicapillary sequencer (the so-called RISA sequencer) was developed for the RIKEN mouse encyclopedia project. The RISA system consists of colony picking, template preparation, sequencing reaction, and the sequencing process. A novel high-throughput 384-format capillary sequencer system (RISA sequencer system) was developed for the sequencing process. This system consists of a 384-multicapillary auto sequencer (RISA sequencer), a 384-multicapillary array assembler (CAS), and a 384-multicapillary casting device. The RISA sequencer can simultaneously analyze 384 independent sequencing products. The optical system is a scanning system chosen after careful comparison with an image detection system for the simultaneous detection of the 384-capillary array. This scanning system can be used with any fluorescent-labeled sequencing reaction (chain termination reaction), including transcriptional sequencing based on RNA polymerase, which was originally developed by us, and cycle sequencing based on thermostable DNA polymerase. For long-read sequencing, 380 out of 384 sequences (99.2%) were successfully analyzed and the average read length, with more than 99% accuracy, was 654.4 bp. A single RISA sequencer can analyze 216 kb with >99% accuracy in 2.7 h (90 kb/h). For short-read sequencing to cluster the 3′ end and 5′ end sequencing by reading 350 bp, 384 samples can be analyzed in 1.5 h. We have also developed a RISA inoculator, RISA filtrator and densitometer, RISA plasmid preparator which can handle throughput of 40,000 samples in 17.5 h, and a high-throughput RISA thermal cycler which has four 384-well sites. The combination of these technologies allowed us to construct the RISA system consisting of 16 RISA sequencers, which can process 50,000 DNA samples per day. One haploid genome shotgun sequence of a higher organism, such as human, mouse, rat, domestic animals, and plants, can

  16. RIKEN integrated sequence analysis (RISA) system--384-format sequencing pipeline with 384 multicapillary sequencer.

    Science.gov (United States)

    Shibata, K; Itoh, M; Aizawa, K; Nagaoka, S; Sasaki, N; Carninci, P; Konno, H; Akiyama, J; Nishi, K; Kitsunai, T; Tashiro, H; Itoh, M; Sumi, N; Ishii, Y; Nakamura, S; Hazama, M; Nishine, T; Harada, A; Yamamoto, R; Matsumoto, H; Sakaguchi, S; Ikegami, T; Kashiwagi, K; Fujiwake, S; Inoue, K; Togawa, Y

    2000-11-01

    The RIKEN high-throughput 384-format sequencing pipeline (RISA system) including a 384-multicapillary sequencer (the so-called RISA sequencer) was developed for the RIKEN mouse encyclopedia project. The RISA system consists of colony picking, template preparation, sequencing reaction, and the sequencing process. A novel high-throughput 384-format capillary sequencer system (RISA sequencer system) was developed for the sequencing process. This system consists of a 384-multicapillary auto sequencer (RISA sequencer), a 384-multicapillary array assembler (CAS), and a 384-multicapillary casting device. The RISA sequencer can simultaneously analyze 384 independent sequencing products. The optical system is a scanning system chosen after careful comparison with an image detection system for the simultaneous detection of the 384-capillary array. This scanning system can be used with any fluorescent-labeled sequencing reaction (chain termination reaction), including transcriptional sequencing based on RNA polymerase, which was originally developed by us, and cycle sequencing based on thermostable DNA polymerase. For long-read sequencing, 380 out of 384 sequences (99.2%) were successfully analyzed and the average read length, with more than 99% accuracy, was 654.4 bp. A single RISA sequencer can analyze 216 kb with >99% accuracy in 2.7 h (90 kb/h). For short-read sequencing to cluster the 3' end and 5' end sequencing by reading 350 bp, 384 samples can be analyzed in 1.5 h. We have also developed a RISA inoculator, RISA filtrator and densitometer, RISA plasmid preparator which can handle throughput of 40,000 samples in 17.5 h, and a high-throughput RISA thermal cycler which has four 384-well sites. The combination of these technologies allowed us to construct the RISA system consisting of 16 RISA sequencers, which can process 50,000 DNA samples per day. One haploid genome shotgun sequence of a higher organism, such as human, mouse, rat, domestic animals, and plants, can be

  17. Multilocus sequence analysis of the family Halomonadaceae.

    Science.gov (United States)

    de la Haba, Rafael R; Márquez, M Carmen; Papke, R Thane; Ventosa, Antonio

    2012-03-01

    Multilocus sequence analysis (MLSA) protocols have been developed for species circumscription for many taxa. However, at present, no studies based on MLSA have been performed within any moderately halophilic bacterial group. To test the usefulness of MLSA with these kinds of micro-organisms, the family Halomonadaceae, which includes mainly halophilic bacteria, was chosen as a model. This family comprises ten genera with validly published names and 85 species of environmental, biotechnological and clinical interest. In some cases, the phylogenetic relationships between members of this family, based on 16S rRNA gene sequence comparisons, are not clear and a deep phylogenetic analysis using several housekeeping genes seemed appropriate. Here, MLSA was applied using the 16S rRNA, 23S rRNA, atpA, gyrB, rpoD and secA genes for species of the family Halomonadaceae. Phylogenetic trees based on the individual and concatenated gene sequences revealed that the family Halomonadaceae formed a monophyletic group of micro-organisms within the order Oceanospirillales. With the exception of the genera Halomonas and Modicisalibacter, all other genera within this family were phylogenetically coherent. Five of the six studied genes (16S rRNA, 23S rRNA, gyrB, rpoD and secA) showed a consistent evolutionary history. However, the results obtained with the atpA gene were different; thus, this gene may not be considered useful as an individual gene phylogenetic marker within this family. The phylogenetic methods produced variable results, with those generated from the maximum-likelihood and neighbour-joining algorithms being more similar than those obtained by maximum-parsimony methods. Horizontal gene transfer (HGT) plays an important evolutionary role in the family Halomonadaceae; however, the impact of recombination events in the phylogenetic analysis was minimized by concatenating the six loci, which agreed with the current taxonomic scheme for this family. Finally, the findings of

  18. Constructing and detecting a cDNA library for mites.

    Science.gov (United States)

    Hu, Li; Zhao, YaE; Cheng, Juan; Yang, YuanJun; Li, Chen; Lu, ZhaoHui

    2015-10-01

    RNA extraction and construction of complementary DNA (cDNA) library for mites have been quite challenging due to difficulties in acquiring tiny living mites and breaking their hard chitin. The present study is to explore a better method to construct cDNA library for mites that will lay the foundation on transcriptome and molecular pathogenesis research. We selected Psoroptes cuniculi as an experimental subject and took the following steps to construct and verify cDNA library. First, we combined liquid nitrogen grinding with TRIzol for total RNA extraction. Then, switching mechanism at 5' end of the RNA transcript (SMART) technique was used to construct full-length cDNA library. To evaluate the quality of cDNA library, the library titer and recombination rate were calculated. The reliability of cDNA library was detected by sequencing and analyzing positive clones and genes amplified by specific primers. The results showed that the RNA concentration was 836 ng/μl and the absorbance ratio at 260/280 nm was 1.82. The library titer was 5.31 × 10(5) plaque-forming unit (PFU)/ml and the recombination rate was 98.21%, indicating that the library was of good quality. In the 33 expressed sequence tags (ESTs) of P. cuniculi, two clones of 1656 and 1658 bp were almost identical with only three variable sites detected, which had an identity of 99.63% with that of Psoroptes ovis, indicating that the cDNA library was reliable. Further detection by specific primers demonstrated that the 553-bp Pso c II gene sequences of P. cuniculi had an identity of 98.56% with those of P. ovis, confirming that the cDNA library was not only reliable but also feasible.

  19. Development of a simple and powerful method, cDNA AFLP-SSPAG ...

    African Journals Online (AJOL)

    Differential cDNAs were easily obtained from silver stained cDNA-AFLP separated on polyacylamide gels. The cDNA was then reamplified, cloned and fragments were sequenced. Sequenced clones were used as probes in northern dot blot analyses and library screening. Full-length cDNA was cloned from a library ...

  20. Time fluctuation analysis of forest fire sequences

    Science.gov (United States)

    Vega Orozco, Carmen D.; Kanevski, Mikhaïl; Tonini, Marj; Golay, Jean; Pereira, Mário J. G.

    2013-04-01

    Forest fires are complex events involving both space and time fluctuations. Understanding of their dynamics and pattern distribution is of great importance in order to improve the resource allocation and support fire management actions at local and global levels. This study aims at characterizing the temporal fluctuations of forest fire sequences observed in Portugal, which is the country that holds the largest wildfire land dataset in Europe. This research applies several exploratory data analysis measures to 302,000 forest fires occurred from 1980 to 2007. The applied clustering measures are: Morisita clustering index, fractal and multifractal dimensions (box-counting), Ripley's K-function, Allan Factor, and variography. These algorithms enable a global time structural analysis describing the degree of clustering of a point pattern and defining whether the observed events occur randomly, in clusters or in a regular pattern. The considered methods are of general importance and can be used for other spatio-temporal events (i.e. crime, epidemiology, biodiversity, geomarketing, etc.). An important contribution of this research deals with the analysis and estimation of local measures of clustering that helps understanding their temporal structure. Each measure is described and executed for the raw data (forest fires geo-database) and results are compared to reference patterns generated under the null hypothesis of randomness (Poisson processes) embedded in the same time period of the raw data. This comparison enables estimating the degree of the deviation of the real data from a Poisson process. Generalizations to functional measures of these clustering methods, taking into account the phenomena, were also applied and adapted to detect time dependences in a measured variable (i.e. burned area). The time clustering of the raw data is compared several times with the Poisson processes at different thresholds of the measured function. Then, the clustering measure value

  1. Statistical analysis of next generation sequencing data

    CERN Document Server

    Nettleton, Dan

    2014-01-01

    Next Generation Sequencing (NGS) is the latest high throughput technology to revolutionize genomic research. NGS generates massive genomic datasets that play a key role in the big data phenomenon that surrounds us today. To extract signals from high-dimensional NGS data and make valid statistical inferences and predictions, novel data analytic and statistical techniques are needed. This book contains 20 chapters written by prominent statisticians working with NGS data. The topics range from basic preprocessing and analysis with NGS data to more complex genomic applications such as copy number variation and isoform expression detection. Research statisticians who want to learn about this growing and exciting area will find this book useful. In addition, many chapters from this book could be included in graduate-level classes in statistical bioinformatics for training future biostatisticians who will be expected to deal with genomic data in basic biomedical research, genomic clinical trials and personalized med...

  2. SVAMP: Sequence variation analysis, maps and phylogeny

    KAUST Repository

    Naeem, Raeece

    2014-04-03

    Summary: SVAMP is a stand-alone desktop application to visualize genomic variants (in variant call format) in the context of geographical metadata. Users of SVAMP are able to generate phylogenetic trees and perform principal coordinate analysis in real time from variant call format (VCF) and associated metadata files. Allele frequency map, geographical map of isolates, Tajima\\'s D metric, single nucleotide polymorphism density, GC and variation density are also available for visualization in real time. We demonstrate the utility of SVAMP in tracking a methicillin-resistant Staphylococcus aureus outbreak from published next-generation sequencing data across 15 countries. We also demonstrate the scalability and accuracy of our software on 245 Plasmodium falciparum malaria isolates from three continents. Availability and implementation: The Qt/C++ software code, binaries, user manual and example datasets are available at http://cbrc.kaust.edu.sa/svamp. © The Author 2014.

  3. cDNA cloning, mRNA distribution and heterogeneity, chromosomal location, and RFLP analysis of human osteopontin (OPN)

    DEFF Research Database (Denmark)

    Young, M F; Kerr, J M; Termine, J D

    1990-01-01

    on a region of 4q that is near the centromere. A high-frequency restriction fragment length polymorphism was evident in the DNA from 29 unrelated individuals using the enzyme BglII. Analysis of total genomic DNA by digestion with several restriction enzymes, Southern blotting, and hybridization with the human...

  4. Pig genome sequence - analysis and publication strategy

    DEFF Research Database (Denmark)

    Archibald, Alan L.; Bolund, Lars; Churcher, Carol

    2010-01-01

    BACKGROUND: The pig genome is being sequenced and characterised under the auspices of the Swine Genome Sequencing Consortium. The sequencing strategy followed a hybrid approach combining hierarchical shotgun sequencing of BAC clones and whole genome shotgun sequencing. RESULTS: Assemblies...... of the BAC clone derived genome sequence have been annotated using the Pre-Ensembl and Ensembl automated pipelines and made accessible through the Pre-Ensembl/Ensembl browsers. The current annotated genome assembly (Sscrofa9) was released with Ensembl 56 in September 2009. A revised assembly (Sscrofa10......) is under construction and will incorporate whole genome shotgun sequence (WGS) data providing > 30x genome coverage. The WGS sequence, most of which comprise short Illumina/Solexa reads, were generated from DNA from the same single Duroc sow as the source of the BAC library from which clones were...

  5. cDNA library preparation.

    Science.gov (United States)

    Kooiker, Maarten; Xue, Gang-Ping

    2014-01-01

    The construction of full-length cDNA libraries allows researchers to study gene expression and protein interactions and undertake gene discovery. Recent improvements allow the construction of high-quality cDNA libraries, with small amounts of mRNA. In parallel, these improvements allow for the incorporation of adapters into the cDNA, both at the 5' and 3' end of the cDNA. The 3' adapter is attached to the oligo-dT primer that is used by the reverse transcriptase, whereas the 5' adapter is incorporated by the template switching properties of the MMLV reverse transcriptase. This allows directional cloning and eliminates inefficient steps like adapter ligation, phosphorylation, and methylation. Another important step in the construction of high-quality cDNA libraries is the normalization. The difference in the levels of expression between genes might be several orders of magnitude. Therefore, it is essential that the cDNA library is normalized. With a recently discovered enzyme, duplex-specific nuclease, it is possible to normalize the cDNA library, based on the fact that more abundant molecules are more likely to reanneal after denaturation compared to rare molecules.

  6. Cloning the human lysozyme cDNA: inverted Alu repeat in the mRNA and in situ hybridization for macrophages and Paneth cells.

    OpenAIRE

    Chung, L P; Keshav, S; Gordon, S

    1988-01-01

    Lysozyme is a major secretory product of human and rodent macrophages and a useful marker for myelomonocytic cells. Based on the known human lysozyme amino acid sequence, oligonucleotides were synthesized and used as probes to screen a phorbol 12-myristate 13-acetate-treated U937 cDNA library. A full-length human lysozyme cDNA clone, pHL-2, was obtained and characterized. Sequence analysis shows that human lysozyme, like chicken lysozyme, has an 18-amino-acid-long signal peptide, but unlike t...

  7. Molecular characterization of a Leishmania donovani cDNA clone with similarity to human 20S proteasome a-type subunit

    DEFF Research Database (Denmark)

    Christensen, C B; Jørgensen, L; Jensen, A T

    2000-01-01

    Using plasma from patients infected or previously infected with Leishmania donovanii, we isolated a L. donovanii cDNA clone with similarity to the proteasome a-type subunit from humans and other eukaryotes. The cDNA clone, designated LePa, was DNA sequenced and Northern blot analysis of L...... out of 25 patients with visceral leishmaniasis and four out of 18 patients with cutaneous leishmaniasis contained IgG antibodies which reacted with the purified LePa fusion protein as evaluated in an ELISA. The LePa DNA sequence was inserted into an eukaryotic expression vector and Balb/c mice were...

  8. Construction and analysis of the cDNA subtraction library of yeast and mycelial phases of Sporothrix globosa isolated in China: identification of differentially expressed genes*

    Science.gov (United States)

    Hu, Qing-bi; He, Yu; Zhou, Xun

    2015-01-01

    Species included in the Sporothrix schenckii complex are temperature-dependent with dimorphic growth and cause sporotrichosis that is characterized by chronic and fatal lymphocutaneous lesions. The putative species included in the Sporothrix complex are S. brasiliensis, S. globosa, S. mexicana, S. pallida, S. schenckii, and S. lurei. S. globosa is the causal agent of sporotrichosis in China, and its pathogenicity appears to be closely related to the dimorphic transition, i.e. from the mycelial to the yeast phase, it adapts to changing environmental conditions. To determine the molecular mechanisms of the switching process that mediates the dimorphic transition of S. globosa, suppression subtractive hybridization (SSH) was used to prepare a complementary DNA (cDNA) subtraction library from the yeast and mycelial phases. Bioinformatics analysis was performed to profile the relationship between differently expressed genes and the dimorphic transition. Two genes that were expressed at higher levels by the yeast form were selected, and their differential expression levels were verified using a quantitative real-time reverse transcriptase polymerase chain reaction (qRT-PCR). It is believed that these differently expressed genes are involved in the pathogenesis of S. globosa infection in China. PMID:26642182

  9. Construction and analysis of the cDNA subtraction library of yeast and mycelial phases of Sporothrix globosa isolated in China: identification of differentially expressed genes.

    Science.gov (United States)

    Hu, Qing-bi; He, Yu; Zhou, Xun

    2015-12-01

    Species included in the Sporothrix schenckii complex are temperature-dependent with dimorphic growth and cause sporotrichosis that is characterized by chronic and fatal lymphocutaneous lesions. The putative species included in the Sporothrix complex are S. brasiliensis, S. globosa, S. mexicana, S. pallida, S. schenckii, and S. lurei. S. globosa is the causal agent of sporotrichosis in China, and its pathogenicity appears to be closely related to the dimorphic transition, i.e. from the mycelial to the yeast phase, it adapts to changing environmental conditions. To determine the molecular mechanisms of the switching process that mediates the dimorphic transition of S. globosa, suppression subtractive hybridization (SSH) was used to prepare a complementary DNA (cDNA) subtraction library from the yeast and mycelial phases. Bioinformatics analysis was performed to profile the relationship between differently expressed genes and the dimorphic transition. Two genes that were expressed at higher levels by the yeast form were selected, and their differential expression levels were verified using a quantitative real-time reverse transcriptase polymerase chain reaction (qRT-PCR). It is believed that these differently expressed genes are involved in the pathogenesis of S. globosa infection in China.

  10. Development of Transcriptomic Markers for Population Analysis Using Restriction Site Associated RNA Sequencing (RARseq.

    Directory of Open Access Journals (Sweden)

    Magdy S Alabady

    Full Text Available We describe restriction site associated RNA sequencing (RARseq, an RNAseq-based genotype by sequencing (GBS method. It includes the construction of RNAseq libraries from double stranded cDNA digested with selected restriction enzymes. To test this, we constructed six single- and six-dual-digested RARseq libraries from six F2 pitcher plant individuals and sequenced them on a half of a Miseq run. On average, the de novo approach of population genome analysis detected 544 and 570 RNA SNPs, whereas the reference transcriptome-based approach revealed an average of 1907 and 1876 RNA SNPs per individual, from single- and dual-digested RARseq data, respectively. The average numbers of RNA SNPs and alleles per loci are 1.89 and 2.17, respectively. Our results suggest that the RARseq protocol allows good depth of coverage per loci for detecting RNA SNPs and polymorphic loci for population genomics and mapping analyses. In non-model systems where complete genomes sequences are not always available, RARseq data can be analyzed in reference to the transcriptome. In addition to enriching for functional markers, this method may prove particularly useful in organisms where the genomes are not favorable for DNA GBS.

  11. Movement Pattern Analysis Based on Sequence Signatures

    Directory of Open Access Journals (Sweden)

    Seyed Hossein Chavoshi

    2015-09-01

    Full Text Available Increased affordability and deployment of advanced tracking technologies have led researchers from various domains to analyze the resulting spatio-temporal movement data sets for the purpose of knowledge discovery. Two different approaches can be considered in the analysis of moving objects: quantitative analysis and qualitative analysis. This research focuses on the latter and uses the qualitative trajectory calculus (QTC, a type of calculus that represents qualitative data on moving point objects (MPOs, and establishes a framework to analyze the relative movement of multiple MPOs. A visualization technique called sequence signature (SESI is used, which enables to map QTC patterns in a 2D indexed rasterized space in order to evaluate the similarity of relative movement patterns of multiple MPOs. The applicability of the proposed methodology is illustrated by means of two practical examples of interacting MPOs: cars on a highway and body parts of a samba dancer. The results show that the proposed method can be effectively used to analyze interactions of multiple MPOs in different domains.

  12. Cloning the human lysozyme cDNA: Inverted Alu repeat in the mRNA and in situ hybridization for macrophages and Paneth cells

    International Nuclear Information System (INIS)

    Chung, L.P.; Keshav, S.; Gordon, S.

    1988-01-01

    Lysozyme is a major secretory product of human and rodent macrophages and a useful marker for myelomonocytic cells. Based on the known human lysozyme amino acid sequence, oligonucleotides were synthesized and used as probes to screen a phorbol 12-myristate 13-acetate-treated U937 cDNA library. A full-length human lysozyme cDNA clone, pHL-2, was obtained and characterized. Sequence analysis shows that human lysozyme, like chicken lysozyme, has in 18-amino-acid-long signal peptide, but unlike the chicken lysozyme cDNA, the human lysozyme cDNA has a >1-kilobase-long 3' nontranslated sequence. Interestingly, within this 3' region, an inverted repeat of the Alu family of repetitive sequences was discovered. In RNA blot analyses, DNA probes prepared from pHL-2 can be used to detect lysozyme mRNA not only from human but also from mouse and rat. Moreover, by in situ hybridization, complementary RNA transcripts have been used as probes to detect lysozyme mRNA in mouse macrophages and Paneth cells. This human lysozyme cDNA clone is therefore likely to be a useful molecular probe for studying macrophage distribution and gene expression

  13. Cloning the human lysozyme cDNA: Inverted Alu repeat in the mRNA and in situ hybridization for macrophages and Paneth cells

    Energy Technology Data Exchange (ETDEWEB)

    Chung, L.P.; Keshav, S.; Gordon, S.

    1988-09-01

    Lysozyme is a major secretory product of human and rodent macrophages and a useful marker for myelomonocytic cells. Based on the known human lysozyme amino acid sequence, oligonucleotides were synthesized and used as probes to screen a phorbol 12-myristate 13-acetate-treated U937 cDNA library. A full-length human lysozyme cDNA clone, pHL-2, was obtained and characterized. Sequence analysis shows that human lysozyme, like chicken lysozyme, has in 18-amino-acid-long signal peptide, but unlike the chicken lysozyme cDNA, the human lysozyme cDNA has a >1-kilobase-long 3' nontranslated sequence. Interestingly, within this 3' region, an inverted repeat of the Alu family of repetitive sequences was discovered. In RNA blot analyses, DNA probes prepared from pHL-2 can be used to detect lysozyme mRNA not only from human but also from mouse and rat. Moreover, by in situ hybridization, complementary RNA transcripts have been used as probes to detect lysozyme mRNA in mouse macrophages and Paneth cells. This human lysozyme cDNA clone is therefore likely to be a useful molecular probe for studying macrophage distribution and gene expression.

  14. Sequencing and analysis of the Mediterranean amphioxus (Branchiostoma lanceolatum transcriptome.

    Directory of Open Access Journals (Sweden)

    Silvan Oulion

    Full Text Available BACKGROUND: The basally divergent phylogenetic position of amphioxus (Cephalochordata, as well as its conserved morphology, development and genetics, make it the best proxy for the chordate ancestor. Particularly, studies using the amphioxus model help our understanding of vertebrate evolution and development. Thus, interest for the amphioxus model led to the characterization of both the transcriptome and complete genome sequence of the American species, Branchiostoma floridae. However, recent technical improvements allowing induction of spawning in the laboratory during the breeding season on a daily basis with the Mediterranean species Branchiostoma lanceolatum have encouraged European Evo-Devo researchers to adopt this species as a model even though no genomic or transcriptomic data have been available. To fill this need we used the pyrosequencing method to characterize the B. lanceolatum transcriptome and then compared our results with the published transcriptome of B. floridae. RESULTS: Starting with total RNA from nine different developmental stages of B. lanceolatum, a normalized cDNA library was constructed and sequenced on Roche GS FLX (Titanium mode. Around 1.4 million of reads were produced and assembled into 70,530 contigs (average length of 490 bp. Overall 37% of the assembled sequences were annotated by BlastX and their Gene Ontology terms were determined. These results were then compared to genomic and transcriptomic data of B. floridae to assess similarities and specificities of each species. CONCLUSION: We obtained a high-quality amphioxus (B. lanceolatum reference transcriptome using a high throughput sequencing approach. We found that 83% of the predicted genes in the B. floridae complete genome sequence are also found in the B. lanceolatum transcriptome, while only 41% were found in the B. floridae transcriptome obtained with traditional Sanger based sequencing. Therefore, given the high degree of sequence conservation

  15. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

    Directory of Open Access Journals (Sweden)

    Liu Chang

    2012-12-01

    Full Text Available Abstract Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.

  16. Efficient computational methods for sequence analysis of small RNAs

    OpenAIRE

    Cozen, Gozde

    2007-01-01

    With the discovery of small regulatory RNAs, there has been a tremendous increase in the number of RNA sequencing projects. Meanwhile, novel high-throughput sequencing technologies, which can sequence as much as 500000 small RNA sequences in one run, have emerged. The challenge of processing this rapidly growing data can be addressed by optimizing current analysis approaches for small RNA sequences. We present fast register-level methods for small RNA pairwise alignment and small RNA to genom...

  17. Method for construction of normalized cDNA libraries

    Science.gov (United States)

    Soares, Marcelo B.; Efstratiadis, Argiris

    1998-01-01

    This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries.

  18. Noncoding sequence classification based on wavelet transform analysis: part I

    Science.gov (United States)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.

  19. Identification of late O{sub 3}-responsive genes in Arabidopsis thaliana by cDNA microarray analysis

    Energy Technology Data Exchange (ETDEWEB)

    D' Haese, D. [Univ. of Antwerp, Dept. of Biology, Antwerp (BE) and Univ. of Newcastle, School of Biology and Psychology, Div. of Biology, Newcastle-Upon-Tyne (United Kingdom); Horemans, N.; Coen, W. De; Guisez, Y. [Univ. of Antwerp, Dept. of Biology, Antwerp (Belgium)

    2006-09-15

    To better understand the response of a plant to 0{sub 3} stress, an integrated microarray analysis was performed on Arabidopsis plants exposed during 2 days to purified air or 150 nl l{sup -1} O{sub 3}, 8 h day-l. Agilent Arabidopsis 2 Oligo Microarrays were used of which the reliability was confirmed by quantitative real-time PCR of nine randomly selected genes. We confirmed the O{sub 3} responsiveness of heat shock proteins (HSPs), glutathione-S-tranferases and genes involved in cell wall stiffening and microbial defence. Whereas, a previous study revealed that during an early stage of the O{sub 3} stress response, gene expression was strongly dependent on jasmonic acid and ethylene, we report that at a later stage (48 h) synthesis of jasrnonic acid and ethylene was downregulated. In addition, we observed the simultaneous induction of salicylic acid synthesis and genes involved in programmed cell death and senescence. Also typically, the later stage of the response to O{sub 3} appeared to be the induction of the complete pathway leading to the biosynthesis of anthocyanin diglucosides and the induction of thioredoxin-based redox control. Surprisingly absent in the list of induced genes were genes involved in ASC-dependent antioxidation, few of which were found to be induced after 12 h of 0{sub 3} exposure in another study. We discuss these and other particular results of the microarray analysis and provide a map depicting significantly affected genes and their pathways highlighting their interrelationships and subcellular localization. (au)

  20. Cloning and sequencing of the Atlantic salmon (Salmo salar) cytochrome c oxidase subunit III gene (coxIII) and analysis of coxIII expression during parr-smolt transformation.

    Science.gov (United States)

    Hardiman, G; Byrnes, L; Peden, J; Wolff, J; Gannon, F

    1994-08-01

    Smoltification is the process whereby salmon alter their metabolism in preparation for movement from freshwater to seawater. Differential screening of a cDNA library prepared from post-smolt salmon liver mRNA led to the selection of a smoltification-induced sequence. Analysis of this cDNA revealed that it partially encoded subunit III of the enzyme cytochrome c oxidase. The complete coxIII sequence was amplified from salmon genomic DNA using consensus oligonucleotides based on ATPase 6 and tRNA(GLY) sequences from Pacific salmonid species. Cytochrome c oxidase subunit III liver mRNA levels were found to be significantly increased in salmon smolts. Northern blot analysis revealed a coxIII transcript of approximately 750 bp in all salmon tissues tested except blood. The DNA sequence of coxIII employs the mammalian mitochondrial genetic code and is strongly conserved when compared with that of other species.

  1. Molecular cloning and sequencing analysis of the interferon receptor (IFNAR-1) from Columba livia.

    Science.gov (United States)

    Li, Chao; Chang, Wei Shan

    2014-01-01

    Partial sequence cloning of interferon receptor (IFNAR-1) of Columba livia. In order to obtain a certain length (630 bp) of gene, a pair of primers was designed according to the conserved nucleotide sequence of Gallus (EU477527.1) and Taeniopygia guttata (XM_002189232.1) IFNAR-1 gene fragment that was published by GenBank. Special primers were designed by the Race method to amplify the 3'terminal cDNA. The Columba livia IFNAR-1 displayed 88.5%, 80.5% and 73.8% nucleotide identity to Falco peregrinus, Gallus and Taeniopygia guttata, respectively. Phylogenetic analysis of the IFNAR1 gene showed that the relationship of Columba livia, Falco peregrinus and chicken had high homology. We successfully obtained a Columba livia IFNAR-1 gene partial sequence. Analysis of the genetic tree showed that the relationship of Columba livia and Falco peregrinus IFNAR-1 had high homology. This result can be used as reference for further research and practical application.

  2. Zooplankton diversity analysis through single-gene sequencing of a community sample

    Directory of Open Access Journals (Sweden)

    Nishida Mutsumi

    2009-09-01

    Full Text Available Abstract Background Oceans cover more than 70% of the earth's surface and are critical for the homeostasis of the environment. Among the components of the ocean ecosystem, zooplankton play vital roles in energy and matter transfer through the system. Despite their importance, understanding of zooplankton biodiversity is limited because of their fragile nature, small body size, and the large number of species from various taxonomic phyla. Here we present the results of single-gene zooplankton community analysis using a method that determines a large number of mitochondrial COI gene sequences from a bulk zooplankton sample. This approach will enable us to estimate the species richness of almost the entire zooplankton community. Results A sample was collected from a depth of 721 m to the surface in the western equatorial Pacific off Pohnpei Island, Micronesia, with a plankton net equipped with a 2-m2 mouth opening. A total of 1,336 mitochondrial COI gene sequences were determined from the cDNA library made from the sample. From the determined sequences, the occurrence of 189 species of zooplankton was estimated. BLASTN search results showed high degrees of similarity (>98% between the query and database for 10 species, including holozooplankton and merozooplankton. Conclusion In conjunction with the Census of Marine Zooplankton and Barcode of Life projects, single-gene zooplankton community analysis will be a powerful tool for estimating the species richness of zooplankton communities.

  3. Analysis of the effects of sex hormone background on the rat choroid plexus transcriptome by cDNA microarrays.

    Directory of Open Access Journals (Sweden)

    Telma Quintela

    Full Text Available The choroid plexus (CP are highly vascularized branched structures that protrude into the ventricles of the brain, and form a unique interface between the blood and the cerebrospinal fluid (CSF, the blood-CSF barrier, that are the main site of production and secretion of CSF. Sex hormones are widely recognized as neuroprotective agents against several neurodegenerative diseases, and the presence of sex hormones cognate receptors suggest that it may be a target for these hormones. In an effort to provide further insight into the neuroprotective mechanisms triggered by sex hormones we analyzed gene expression differences in the CP of female and male rats subjected to gonadectomy, using microarray technology. In gonadectomized female and male animals, 3045 genes were differentially expressed by 1.5-fold change, compared to sham controls. Analysis of the CP transcriptome showed that the top-five pathways significantly regulated by the sex hormone background are olfactory transduction, taste transduction, metabolism, steroid hormone biosynthesis and circadian rhythm pathways. These results represent the first overview of global expression changes in CP of female and male rats induced by gonadectomy and suggest that sex hormones are implicated in pathways with central roles in CP functions and CSF homeostasis.

  4. Optimized cDNA libraries for virus-induced gene silencing (VIGS using tobacco rattle virus

    Directory of Open Access Journals (Sweden)

    Page Jonathan E

    2008-01-01

    Full Text Available Abstract Background Virus-induced gene silencing (VIGS has emerged as a method for performing rapid loss-of-function experiments in plants. Despite its expanding use, the effect of host gene insert length and other properties on silencing efficiency have not been systematically tested. In this study, we probed the optimal properties of cDNA fragments of the phytoene desaturase (PDS gene for efficient VIGS in Nicotiana benthamiana using tobacco rattle virus (TRV. Results NbPDS inserts of between 192 bp and 1304 bp led to efficient silencing as determined by analysis of leaf chlorophyll a levels. The region of the NbPDS cDNA used for silencing had a small effect on silencing efficiency with 5' and 3' located inserts performing more poorly than those from the middle. Silencing efficiency was reduced by the inclusion of a 24 bp poly(A or poly(G homopolymeric region. We developed a method for constructing cDNA libraries for use as a source of VIGS-ready constructs. Library construction involved the synthesis of cDNA on a solid phase support, digestion with RsaI to yield short cDNA fragments lacking poly(A tails and suppression subtractive hybridization to enrich for differentially expressed transcripts. We constructed two cDNA libraries from methyl-jasmonate treated N. benthamiana roots and obtained 2948 ESTs. Thirty percent of the cDNA inserts were 401–500 bp in length and 99.5% lacked poly(A tails. To test the efficiency of constructs derived from the VIGS-cDNA libraries, we silenced the nicotine biosynthetic enzyme, putrescine N-methyltransferase (PMT, with ten different VIGS-NbPMT constructs ranging from 122 bp to 517 bp. Leaf nicotine levels were reduced by more than 90% in all plants infected with the NbPMT constructs. Conclusion Based on the silencing of NbPDS and NbPMT, we suggest the following design guidelines for constructs in TRV vectors: (1 Insert lengths should be in the range of ~200 bp to ~1300 bp, (2 they should be positioned in

  5. Comparative analysis of protein coding sequences from human, mouse and the domesticated pig

    DEFF Research Database (Denmark)

    Jørgensen, Frank Grønlund; Hobolth, Asger; Hornshøj, Henrik

    2005-01-01

    Background: The availability of abundant sequence data from key model organisms has made large scale studies of mulecular evolution an exciting possibility. Here we use full length cDNA alignments comprising more than 700,000 nucleotides from human, mouse, pig and the Japanese pufferfish Fugu...

  6. Sequencing small RNA: introduction and data analysis fundamentals.

    Science.gov (United States)

    Mehta, Jai Prakash

    2014-01-01

    Small RNAs are important transcriptional regulators within cells. With the advent of powerful Next Generation Sequencing platforms, sequencing small RNAs seems to be an obvious choice to understand their expression and its downstream effect. Additionally, sequencing provides an opportunity to identify novel and polymorphic miRNA. However, the biggest challenge is the appropriate data analysis pipeline, which is still in phase of active development by various academic groups. This chapter describes basic and advanced steps for small RNA sequencing analysis including quality control, small RNA alignment and quantification, differential expression analysis, novel small RNA identification, target prediction, and downstream analysis. We also provide a list of various resources for small RNA analysis.

  7. Project Report: Automatic Sequence Processor Software Analysis

    Science.gov (United States)

    Benjamin, Brandon

    2011-01-01

    The Mission Planning and Sequencing (MPS) element of Multi-Mission Ground System and Services (MGSS) provides space missions with multi-purpose software to plan spacecraft activities, sequence spacecraft commands, and then integrate these products and execute them on spacecraft. Jet Propulsion Laboratory (JPL) is currently is flying many missions. The processes for building, integrating, and testing the multi-mission uplink software need to be improved to meet the needs of the missions and the operations teams that command the spacecraft. The Multi-Mission Sequencing Team is responsible for collecting and processing the observations, experiments and engineering activities that are to be performed on a selected spacecraft. The collection of these activities is called a sequence and ultimately a sequence becomes a sequence of spacecraft commands. The operations teams check the sequence to make sure that no constraints are violated. The workflow process involves sending a program start command, which activates the Automatic Sequence Processor (ASP). The ASP is currently a file-based system that is comprised of scripts written in perl, c-shell and awk. Once this start process is complete, the system checks for errors and aborts if there are any; otherwise the system converts the commands to binary, and then sends the resultant information to be radiated to the spacecraft.

  8. Novel algorithms for protein sequence analysis

    NARCIS (Netherlands)

    Ye, Kai

    2008-01-01

    Each protein is characterized by its unique sequential order of amino acids, the so-called protein sequence. Biology”s paradigm is that this order of amino acids determines the protein”s architecture and function. In this thesis, we introduce novel algorithms to analyze protein sequences. Chapter 1

  9. Differential gene expression in a DNA double-strand-break repair mutant XRS-5 defective in Ku80. Analysis by cDNA microarray

    Energy Technology Data Exchange (ETDEWEB)

    Chan, John Y.H.; Chen, Lung-Kun; Chang, Jui-Feng [National Yang Ming Univ., Taipei, Taiwan (China). Inst. of Radiological Sciences] (and others)

    2001-12-01

    The ability of cells to rejoin DNA double-strand breaks (DSBs) usually correlates with their radiosensitivity. This correlation has been demonstrated in radiosensitive cells, including the Chinese hamster ovary mutant XRS-5. XRS-5 is defective in a DNA end-binding protein, Ku80, which is a component of a DNA-dependent protein kinase complex used for joining strand breaks. However, Ku80-deficient cells are known to be retarded in cell proliferation and growth as well as other yet to be identified defects. Using custom-made 600-gene cDNA microarray filters, we found differential gene expressions between the wild-type and XRS-5 cells. Defective Ku80 apparently affects the expression of several repair genes, including topoisomerase-I and -IIA, ERCC5, MLH1, and ATM. In contrast, other DNA repair-associated genes, such as GADD45A, EGR1 MDM2 and p53, were not affected. In addition, for large numbers of growth-associated genes, such as cyclins and clks, the growth factors and cytokines were also affected. Down-regulated expression was also found in several categories of seemingly unrelated genes, including apoptosis, angiogenesis, kinase and signaling, phosphatase, stress protein, proto-oncogenes and tumor suppressors, transcription and translation factors. A RT-PCR analysis confirmed that the XRS-5 cells used were defective in Ku80 expression. The diversified groups of genes being affected could mean that Ku80, a multi-functional DNA-binding protein, not only affects DNA repair, but is also involved in transcription regulation. Our data, taken together, indicate that there are specific genes being modulated in Ku80- deficient cells, and that some of the DNA repair pathways and other biological functions are apparently linked, suggesting that a defect in one gene could have global effects on many other processes. (author)

  10. Gene expression analysis of the rat testis after treatment with di(2-ethylhexyl) phthalate using cDNA microarray and real-time RT-PCR

    International Nuclear Information System (INIS)

    Kijima, Kazuyasu; Toyosawa, Kaoru; Yasuba, Masashi; Matsuoka, Nobuo; Adachi, Tetsuya; Komiyama, Masatoshi; Mori, Chisato

    2004-01-01

    To investigate the effects of di(2-ethylhexyl) phthalate (DEHP) on gene expression in rat testis, 6-week-old male Sprague-Dawley rats were given a single oral dose of 20 or 2000 mg/kg and euthanized 3, 6, 24, or 72 h thereafter. Terminal deoxynucleotidyl transferase-mediated dUTP nick-end labeling (TUNEL)-positive cells were significantly increased in the testis at 24 and 72 h after the exposure to 2000 mg/kg of DEHP. On cDNA microarray analysis, in addition to apoptosis-related genes, genes associated with atrophy, APEX nuclease, MutS homologue (E. coli), testosterone-repressed-prostatic-message-2 (TRPM-2), connective tissue growth factor, collagen alpha 2 type V, and cell adhesion kinase were differentially expressed. To investigate the relationship between histopathological alteration and gene expression, we selected genes associated with apoptosis and analyzed their expression by real-time quantitative reverse transcription-polymerase chain reaction (RT-PCR). With 20 mg/kg of DEHP treatment, bcl-2, key gene related to apoptosis, was increased. Up-regulation of bcl-2, inhibitor of Apaf-1/caspase-9/caspase-2 cascade of apoptosis, may be related to the fact that no morphological apoptotic change was induced after dosing of 20 mg/kg DEHP. With 2000 mg/kg of DEHP treatment, the apoptotic activator cascade, Fas/FasL, FADD/caspase-8/caspase-3 cascade, and Apaf-1/caspase-9/caspase-2 cascade were increased and bcl-2 was decreased. Thus, these gene regulations might lead the cells into apoptosis in the case of high exposure to DEHP. In contrast, FADD/caspase-10/caspase-6 cascade and caspase-11/caspase-3 cascade were not increased. These results indicate that the cascades of FADD/caspase-10/caspase-6 and caspase-11/caspase-3 are not related to apoptosis with DEHP treatment

  11. Isolation and Cloning of cDNA Fragment of Gene Encoding for Multidrug Resistance Associated Protein from M. affine.

    Directory of Open Access Journals (Sweden)

    Utut Widyastuti Suharsono

    2008-11-01

    Full Text Available Isolation and Cloning of cDNA Fragment of Gene Encoding for Multidrug Resistance Associated Protein from M. affine. M. affine can grow well in acid soil with high level of soluble aluminum. One of the important proteins in the detoxifying xenobiotic stress including acid and Al stresses is a multidrug resistance associated protein (MRP encoded by mrp gene. The objective of this research is to isolate and clone the cDNA fragment of MaMrp encoding MRP from M. affine. By reverse transcription, total cDNA had been synthesized from the total RNA as template. The fragment of cDNA MaMrp had been successfully isolated by PCR by using total cDNA as template and mrp primer designed from A. thaliana, yeast, and human. This fragment was successfully inserted into pGEM-T Easy and the recombinant plasmid was successfully introduced into E. coli DH5α. Nucleotide sequence analysis showed that the lenght of MaMrp fragment is 633 bp encoding 208 amino acids. Local alignment analysis based on nucleotide of mRNA showed that MaMrp fragment is 69% identical to AtMrp1 and 63% to AtMrp from A. thaliana. Based on deduced amino acid sequence, MaMRP is 84% identical to part of AtMRP13, 77% to AtMRP12, and 73% to AtMRP1 from A. thaliana respectively. Alignment analysis with AtMRP1 showed that MaMRP fragment is located in TM1 and NBF1 domains and has a specific amino acid sequence QCKAQLQNMEEE.

  12. Analysis and prediction of baculovirus promoter sequences.

    Science.gov (United States)

    Xing, Ke; Deng, Riqiang; Wang, Jinwen; Feng, Jinghua; Huang, Mingsong; Wang, Xunzhang

    2005-10-01

    Consensus patterns of baculovirus sequences upstream from the translational initiation sites have been analyzed and a web tool, Local Alignment Promoter Predictor (LAPP), for the prediction of baculovirus promoter sequences has also been developed. Potential consensus sequences, i.e., TCATTGT, TCTTGTA, CTCGTAA, TCCATTT and TCATT plus TCGT in approximately 30 bp spacing context, have been found in baculovirus promoter regions, in addition to well-characterized late and early promoter elements G/T/ATAAG and TATAA, which is accompanied about 30-bp downstream by a transcriptional initiation sequence CAGT or CATT. Promoter prediction is performed by a dynamic programming algorithm based on maximal segment pair measure with scores above some cutoff against each sequence in a refined promoter database. The algorithm was able to discriminate between promoter and non-promoter sequences in a test set of baculovirus sequences with prediction specificity and sensitivity superior to that using five other eukaryotic promoter recognition programs available on the Internet. A web server that implements the LAPP with continually updated promoter database is freely available at http://life.zsu.edu.cn/LAPP/.

  13. High-resolution analysis of the 5'-end transcriptome using a next generation DNA sequencer.

    Directory of Open Access Journals (Sweden)

    Shin-ichi Hashimoto

    Full Text Available Massively parallel, tag-based sequencing systems, such as the SOLiD system, hold the promise of revolutionizing the study of whole genome gene expression due to the number of data points that can be generated in a simple and cost-effective manner. We describe the development of a 5'-end transcriptome workflow for the SOLiD system and demonstrate the advantages in sensitivity and dynamic range offered by this tag-based application over traditional approaches for the study of whole genome gene expression. 5'-end transcriptome analysis was used to study whole genome gene expression within a colon cancer cell line, HT-29, treated with the DNA methyltransferase inhibitor, 5-aza-2'-deoxycytidine (5Aza. More than 20 million 25-base 5'-end tags were obtained from untreated and 5Aza-treated cells and matched to sequences within the human genome. Seventy three percent of the mapped unique tags were associated with RefSeq cDNA sequences, corresponding to approximately 14,000 different protein-coding genes in this single cell type. The level of expression of these genes ranged from 0.02 to 4,704 transcripts per cell. The sensitivity of a single sequence run of the SOLiD platform was 100-1,000 fold greater than that observed from 5'end SAGE data generated from the analysis of 70,000 tags obtained by Sanger sequencing. The high-resolution 5'end gene expression profiling presented in this study will not only provide novel insight into the transcriptional machinery but should also serve as a basis for a better understanding of cell biology.

  14. Transcriptome sequencing and analysis of leaf tissue of Avicennia marina using the Illumina platform.

    Directory of Open Access Journals (Sweden)

    Jianzi Huang

    Full Text Available Avicennia marina is a widely distributed mangrove species that thrives in high-salinity habitats. It plays a significant role in supporting coastal ecosystem and holds unique potential for studying molecular mechanisms underlying ecological adaptation. Despite and sometimes because of its numerous merits, this species is facing increasing pressure of exploitation and deforestation. Both study on adaptation mechanisms and conservation efforts necessitate more genomic resources for A. marina. In this study, we used Illumina sequencing of an A. marina foliar cDNA library to generate a transcriptome dataset for gene and marker discovery. We obtained 40 million high-quality reads and assembled them into 91,125 unigenes with a mean length of 463 bp. These unigenes covered most of the publicly available A. marina Sanger ESTs and greatly extended the repertoire of transcripts for this species. A total of 54,497 and 32,637 unigenes were annotated based on homology to sequences in the NCBI non-redundant and the Swiss-prot protein databases, respectively. Both Gene Ontology (GO analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG pathway analysis revealed some transcriptomic signatures of stress adaptation for this halophytic species. We also detected an extraordinary amount of transcripts derived from fungal endophytes and demonstrated the utility of transcriptome sequencing in surveying endophyte diversity without isolating them out of plant tissues. Additionally, we identified 3,423 candidate simple sequence repeats (SSRs from 3,141 unigenes with a density of one SSR locus every 8.25 kb sequence. Our transcriptomic data will provide valuable resources for ecological, genetic and evolutionary studies in A. marina.

  15. Transcriptome sequencing and analysis of leaf tissue of Avicennia marina using the Illumina platform.

    Science.gov (United States)

    Huang, Jianzi; Lu, Xiang; Zhang, Wanke; Huang, Rongfeng; Chen, Shouyi; Zheng, Yizhi

    2014-01-01

    Avicennia marina is a widely distributed mangrove species that thrives in high-salinity habitats. It plays a significant role in supporting coastal ecosystem and holds unique potential for studying molecular mechanisms underlying ecological adaptation. Despite and sometimes because of its numerous merits, this species is facing increasing pressure of exploitation and deforestation. Both study on adaptation mechanisms and conservation efforts necessitate more genomic resources for A. marina. In this study, we used Illumina sequencing of an A. marina foliar cDNA library to generate a transcriptome dataset for gene and marker discovery. We obtained 40 million high-quality reads and assembled them into 91,125 unigenes with a mean length of 463 bp. These unigenes covered most of the publicly available A. marina Sanger ESTs and greatly extended the repertoire of transcripts for this species. A total of 54,497 and 32,637 unigenes were annotated based on homology to sequences in the NCBI non-redundant and the Swiss-prot protein databases, respectively. Both Gene Ontology (GO) analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis revealed some transcriptomic signatures of stress adaptation for this halophytic species. We also detected an extraordinary amount of transcripts derived from fungal endophytes and demonstrated the utility of transcriptome sequencing in surveying endophyte diversity without isolating them out of plant tissues. Additionally, we identified 3,423 candidate simple sequence repeats (SSRs) from 3,141 unigenes with a density of one SSR locus every 8.25 kb sequence. Our transcriptomic data will provide valuable resources for ecological, genetic and evolutionary studies in A. marina.

  16. Molecular cloning and mammalian expression of human beta 2-glycoprotein I cDNA

    DEFF Research Database (Denmark)

    Kristensen, Torsten; Schousboe, Inger; Boel, Espen

    1991-01-01

    Human β2-glycoprotein (β2gpI) cDNA was isolated from a liver cDNA library and sequenced. The cDNA encoded a 19-residue hydrophobic signal peptide followed by the mature β2gpI of 326 amino acid residues. In liver and in the hepatoma cell line HepG2 there are two mRNA species of about 1.4 and 4.3 kb...

  17. Incident sequence analysis; event trees, methods and graphical symbols

    International Nuclear Information System (INIS)

    1980-11-01

    When analyzing incident sequences, unwanted events resulting from a certain cause are looked for. Graphical symbols and explanations of graphical representations are presented. The method applies to the analysis of incident sequences in all types of facilities. By means of the incident sequence diagram, incident sequences, i.e. the logical and chronological course of repercussions initiated by the failure of a component or by an operating error, can be presented and analyzed simply and clearly

  18. Computer-aided visualization and analysis system for sequence evaluation

    Energy Technology Data Exchange (ETDEWEB)

    Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

    2004-05-11

    A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.

  19. Analysis of Neuronal Sequences Using Pairwise Biases

    Science.gov (United States)

    2015-08-27

    semantic memory (knowledge of facts) and implicit memory (e.g., how to ride a bike ). Evidence for the participation of the hippocampus in the formation of...very different from each other in many ways including duration and number of spikes. Still, these sequences share a similar trend in the general order...1 and 2 precede all other spikes in both s and s�). Many other sequences share this property with s and s�; in fact, we can completely characterize

  20. Molecular cloning of cDNA for lysenin, a novel protein in the earthworm Eisenia foetida that causes contraction of rat vascular smooth muscle.

    Science.gov (United States)

    Sekizawa, Y; Kubo, T; Kobayashi, H; Nakajima, T; Natori, S

    1997-05-20

    Lysenin, which causes contraction of rat vascular smooth muscle, is a protein that was isolated from the earthworm Eisenia foetida. A cDNA encoding lysenin was isolated by use of a partial cDNA probe that had been generated by the PCR with a primer designed by reference to an internal peptide sequence of lysenin. This clone had an ORF encoding 297 amino acid residues. The amino acid sequence deduced from the cDNA revealed the absence of any significant homology to those of previously characterized vasoactive substances. The recombinant lysenin was produced in Escherichia coli. This protein and native lysenin isolated from the earthworm had similar contractive activities when tested on rat aorta. Northern blot analysis of the RNA from various tissues of the earthworm indicated that lysenin is produced by the coelomocytes.

  1. Cloning of the human androgen receptor cDNA

    International Nuclear Information System (INIS)

    Govindan, M.V.; Burelle, M.; Cantin, C.; Kabrie, C.; Labrie, F.; Lachance, Y.; Leblanc, G.; Lefebvre, C.; Patel, P.; Simard, J.

    1988-01-01

    The authors discuss how in order to define the functional domains of the human androgen receptor, complementary DNA (cDNA) clones encoding the human androgen receptor (hAR) have been isolated from a human testis λgtll cDNA library using synthetic oligonnucleotide probes, homologous to segments of the human glucocorticoid, estradiol and progesterone receptors. The cDNA clones corresponding to the human glucocorticoid, estradiol and progesterone receptors were eliminated after cross-hybridization with their respective cDNA probes and/or after restriction mapping of the cDNA clones. The remaining cDNA clones were classified into different groups after analysis by restriction digestion and cross-hybridization. Two of the largest cDNA clones from each group were inserted into an expression vector in both orientations. The linearized plasmids were used as templates in in vitro transcription with T7 RNA polymerase. Subsequent in vitro translation of the purified transcripts in rabbit reticulocyte lysate followed by sodium dodecylsulfate polyacrylamide gel electrophoresis (SDS-PAGE) permitted the characterization of the encoded polyeptides. The expressed proteins larger than 30,000 Da were analyzed for their ability to bind tritium-labelled dihydrotestosterone ([ 3 H] DHT) with high affinity and specificity

  2. The natriuretic peptide/helokinestatin precursor from Mexican beaded lizard (Heloderma horridum) venom: Amino acid sequence deduced from cloned cDNA and identification of two novel encoded helokinestatins.

    Science.gov (United States)

    Ma, Chengbang; Yang, Mu; Zhou, Mei; Wu, Yuxin; Wang, Lei; Chen, Tianbao; Ding, Anwei; Shaw, Chris

    2011-06-01

    Natriuretic peptides are common components of reptile venoms and molecular cloning of their biosynthetic precursors has revealed that in snakes, they co-encode bradykinin-potentiating peptides and in venomous lizards, some co-encode bradykinin inhibitory peptides such as the helokinestatins. The common natriuretic peptide/helokinestatin precursor of the Gila Monster, Heloderma suspectum, encodes five helokinestatins of differing primary structures. Here we report the molecular cloning of a natriuretic peptide/helokinestatin precursor cDNA from a venom-derived cDNA library of the Mexican beaded lizard (Heloderma horridum). Deduction of the primary structure of the encoded precursor protein from this cloned cDNA template revealed that it consisted of 196 amino acid residues encoding a single natriuretic peptide and five helokinestatins. While the natriuretic peptide was of identical primary structure to its Gila Monster (H. suspectum) homolog, the encoded helokinestatins were not, with this region of the common precursor displaying some significant differences to its H. suspectum homolog. The helokinestatin-encoding region contained a single copy of helokinestatin-1, 2 copies of helokinestatin-3 and single copies of 2 novel peptides, (Phe)(5)-helokinestatin-2 (VPPAFVPLVPR) and helokinestatin-6 (GPPFNPPPFVDYEPR). All predicted peptides were found in reverse phase HPLC fractions of the same venom. Synthetic replicates of both novel helokinestatins were found to antagonize the relaxing effect of bradykinin on rat tail artery smooth muscle. Thus lizard venom continues to provide a source of novel biologically active peptides. Copyright © 2011. Published by Elsevier Inc.

  3. Germacrene C synthase from Lycopersicon esculentum cv. VFNT cherry tomato: cDNA isolation, characterization, and bacterial expression of the multiple product sesquiterpene cyclase.

    Science.gov (United States)

    Colby, S M; Crock, J; Dowdle-Rizzo, B; Lemaux, P G; Croteau, R

    1998-03-03

    Germacrene C was found by GC-MS and NMR analysis to be the most abundant sesquiterpene in the leaf oil of Lycopersicon esculentum cv. VFNT Cherry, with lesser amounts of germacrene A, guaia-6,9-diene, germacrene B, beta-caryophyllene, alpha-humulene, and germacrene D. Soluble enzyme preparations from leaves catalyzed the divalent metal ion-dependent cyclization of [1-3H]farnesyl diphosphate to these same sesquiterpene olefins, as determined by radio-GC. To obtain a germacrene synthase cDNA, a set of degenerate primers was constructed based on conserved amino acid sequences of related terpenoid cyclases. With cDNA prepared from leaf epidermis-enriched mRNA, these primers amplified a 767-bp fragment that was used as a hybridization probe to screen the cDNA library. Thirty-one clones were evaluated for functional expression of terpenoid cyclase activity in Escherichia coli by using labeled geranyl, farnesyl, and geranylgeranyl diphosphates as substrates. Nine cDNA isolates expressed sesquiterpene synthase activity, and GC-MS analysis of the products identified germacrene C with smaller amounts of germacrene A, B, and D. None of the expressed proteins was active with geranylgeranyl diphosphate; however, one truncated protein converted geranyl diphosphate to the monoterpene limonene. The cDNA inserts specify a deduced polypeptide of 548 amino acids (Mr = 64,114), and sequence comparison with other plant sesquiterpene cyclases indicates that germacrene C synthase most closely resembles cotton delta-cadinene synthase (50% identity).

  4. Establishing a framework for comparative analysis of genome sequences

    Energy Technology Data Exchange (ETDEWEB)

    Bansal, A.K.

    1995-06-01

    This paper describes a framework and a high-level language toolkit for comparative analysis of genome sequence alignment The framework integrates the information derived from multiple sequence alignment and phylogenetic tree (hypothetical tree of evolution) to derive new properties about sequences. Multiple sequence alignments are treated as an abstract data type. Abstract operations have been described to manipulate a multiple sequence alignment and to derive mutation related information from a phylogenetic tree by superimposing parsimonious analysis. The framework has been applied on protein alignments to derive constrained columns (in a multiple sequence alignment) that exhibit evolutionary pressure to preserve a common property in a column despite mutation. A Prolog toolkit based on the framework has been implemented and demonstrated on alignments containing 3000 sequences and 3904 columns.

  5. Bioinformatic analysis of whole genome sequencing data

    OpenAIRE

    Maqbool, Khurram

    2014-01-01

    Evolution has shaped the life forms for billion of years. Domestication is an accelerated process that can be used as a model for evolutionary changes. The aim of this thesis project has been to carry out extensive bioinformatic analyses of whole genome sequencing data to reveal SNPs, InDels and selective sweeps in the chicken, pig and dog genome. Pig genome sequencing revealed loci under selection for elongation of back and increased number of vertebrae, associated with the NR6A1, PLAG1,...

  6. Expressed sequence tag analysis of adult human optic nerve for NEIBank: Identification of cell type and tissue markers

    Directory of Open Access Journals (Sweden)

    Peterson Katherine

    2009-09-01

    Full Text Available Abstract Background The optic nerve is a pure white matter central nervous system (CNS tract with an isolated blood supply, and is widely used in physiological studies of white matter response to various insults. We examined the gene expression profile of human optic nerve (ON and, through the NEIBANK online resource, to provide a resource of sequenced verified cDNA clones. An un-normalized cDNA library was constructed from pooled human ON tissues and was used in expressed sequence tag (EST analysis. Location of an abundant oligodendrocyte marker was examined by immunofluorescence. Quantitative real time polymerase chain reaction (qRT-PCR and Western analysis were used to compare levels of expression for key calcium channel protein genes and protein product in primate and rodent ON. Results Our analyses revealed a profile similar in many respects to other white matter related tissues, but significantly different from previously available ON cDNA libraries. The previous libraries were found to include specific markers for other eye tissues, suggesting contamination. Immune/inflammatory markers were abundant in the new ON library. The oligodendrocyte marker QKI was abundant at the EST level. Immunofluorescence revealed that this protein is a useful oligodendrocyte cell-type marker in rodent and primate ONs. L-type calcium channel EST abundance was found to be particularly low. A qRT-PCR-based comparative mammalian species analysis reveals that L-type calcium channel expression levels are significantly lower in primate than in rodent ON, which may help account for the class-specific difference in responsiveness to calcium channel blocking agents. Several known eye disease genes are abundantly expressed in ON. Many genes associated with normal axonal function, mRNAs associated with axonal transport, inflammation and neuroprotection are observed. Conclusion We conclude that the new cDNA library is a faithful representation of human ON and EST data

  7. [Construction and identification of the expression library of album pollen allergens cDNA].

    Science.gov (United States)

    Zhang, Jie; Sun, Xiu-zhen; Yan, Hong; Zhang, Ni; Feng, Xiang-li

    2011-05-01

    To construct and identify the express library of album pollen allergens cDNA. Total RNA were extracted from the album pollen with TRIzol reagent and the mRNA was isolate for the amplify followed. A double stranded cDNA (ds cDNA) was synthesized using primers containing Xho I and Poly(dT) sequence by ZAP Express®cDNA synthesis kit. The ds cDNA was modified and purified by gel chromatography, and then the cDNA fragment with the length of more than 400 bp containing sticky ends was obtained. The cDNA fragment was ligated with Uni-ZAP XR vector and subsequently treated with in vitro packaging using phage by ZAP-cDNA express GigapackIII Gold cloning kit. The express library of album pollen cDNA was constructed by in vitro packaging. The recombination rate and the lengths of fragments inserted of the cDNA library were detected by polymerase chain reaction. The titer and the recombination rate of cDNA expression library constructed were 9.7×10(5) and 100%, respectively. The capacity of the library was 4.85 Pfu. The average length of cDNA fragments inserted was about 1.0 kb. Based on the capacity of cDNA expression library constructed and the length of cDNA insertion fragments, the cDNA expression library constructed is qualified to screening target cDNA clone, laying the foundation for preparation of gene recombinant allergen pollen vaccine.

  8. [Tabular excel editor for analysis of aligned nucleotide sequences].

    Science.gov (United States)

    Demkin, V V

    2010-01-01

    Excel platform was used for transition of results of multiple aligned nucleotide sequences obtained using the BLAST network service to the form appropriate for visual analysis and editing. Two macros operators for MS Excel 2007 were constructed. The array of aligned sequences transformed into Excel table and processed using macros operators is more appropriate for analysis than initial html data.

  9. SEQUENCE ANALYSIS OF MATURASE K (MATK): A ...

    African Journals Online (AJOL)

    Global Journal

    The application and utilization of sequence data has been found very informative in the characterization and phylogenetic relationship of different crops species. This study aimed to use bioinformatics tools to characterize the. matK gene in some selected legumes with special reference to pigeon pea [cajanus cajan ...

  10. Genomic and cDNA cloning of a novel mouse lipoxygenase gene

    NARCIS (Netherlands)

    Willems van Dijk, K.; Steketee, K.; Havekes, L.; Frants, R.; Hofker, M.

    1995-01-01

    A novel 12- and 15-lipoxygenase related gene was isolated from a mouse strain 129 genomic phage library in a screen with a human 15-lipoxygenase cDNA probe. The complete genomic sequence revealed 14 exons and 13 introns covering 7.3 kb of DNA. The splice junctions were verified from the cDNA

  11. Analysis of cellular responses to aflatoxin B{sub 1} in yeast expressing human cytochrome P450 1A2 using cDNA microarrays

    Energy Technology Data Exchange (ETDEWEB)

    Guo Yingying [Departmental of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA (United States); Fred Hutchinson Cancer Research Center, Seattle, WA (United States); Breeden, Linda L. [Fred Hutchinson Cancer Research Center, Seattle, WA (United States); Fan, Wenhong [Fred Hutchinson Cancer Research Center, Seattle, WA (United States); Zhao Lueping [Fred Hutchinson Cancer Research Center, Seattle, WA (United States); Eaton, David L. [Departmental of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA (United States); Fred Hutchinson Cancer Research Center, Seattle, WA (United States); Zarbl, Helmut [Departmental of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA (United States) and Fred Hutchinson Cancer Research Center, Seattle, WA (United States)]. E-mail: hzarbl@fhcrc.org

    2006-01-29

    Aflatoxin B1 (AFB{sub 1}) is a potent human hepatotoxin and hepatocarcinogen produced by the mold Aspergillus flavus. In human, AFB{sub 1} is bioactivated by cytochrome P450 (CYP450) enzymes, primarily CYP1A2, to the genotoxic epoxide that forms N{sup 7}-guanine DNA adducts. To characterize the transcriptional responses to genotoxic insults from AFB{sub 1}, a strain of Saccharomyces cerevisiae engineered to express human CYP1A2 was exposed to doses of AFB{sub 1} that resulted in minimal lethality, but substantial genotoxicity. Flow cytometric analysis demonstrated a dose and time dependent S phase delay under the same treatment conditions, indicating a checkpoint response to DNA damage. Replicate cDNA microarray analyses of AFB{sub 1} treated cells showed that about 200 genes were significantly affected by the exposure. The genes activated by AFB{sub 1}-treatment included RAD51, DUN1 and other members of the DNA damage response signature reported in a previous study with methylmethane sulfonate and ionizing radiation [A.P. Gasch, M. Huang, S. Metzner, D. Botstein, S.J. Elledge, P.O. Brown, Genomic expression responses to DNA-damaging agents and the regulatory role of the yeast ATR homolog Mec1p, Mol. Biol. Cell 12 (2001) 2987-3003]. However, unlike previous studies using highly cytotoxic doses, environmental stress response genes [A.P. Gasch, P.T. Spellman, C.M. Kao, O. Carmel-Harel, M.B. Eisen, G. Storz, D. Botstein, P.O. Brown, Genomic expression programs in the response of yeast cells to environmental changes, Mol. Biol. Cell 11 (2000) 4241-4257] were largely unaffected by our dosing regimen. About half of the transcripts affected are also known to be cell cycle regulated. The most strongly repressed transcripts were those encoding the histone genes and a group of genes that are cell cycle regulated and peak in M phase and early G1. These include most of the known daughter-specific genes. The rapid and coordinated repression of histones and M/G1-specific

  12. Deep Sequencing Analysis of the Ixodes ricinus Haemocytome.

    Directory of Open Access Journals (Sweden)

    Michalis Kotsyfakis

    2015-05-01

    Full Text Available Ixodes ricinus is the main tick vector of the microbes that cause Lyme disease and tick-borne encephalitis in Europe. Pathogens transmitted by ticks have to overcome innate immunity barriers present in tick tissues, including midgut, salivary glands epithelia and the hemocoel. Molecularly, invertebrate immunity is initiated when pathogen recognition molecules trigger serum or cellular signalling cascades leading to the production of antimicrobials, pathogen opsonization and phagocytosis. We presently aimed at identifying hemocyte transcripts from semi-engorged female I. ricinus ticks by mass sequencing a hemocyte cDNA library and annotating immune-related transcripts based on their hemocyte abundance as well as their ubiquitous distribution.De novo assembly of 926,596 pyrosequence reads plus 49,328,982 Illumina reads (148 nt length from a hemocyte library, together with over 189 million Illumina reads from salivary gland and midgut libraries, generated 15,716 extracted coding sequences (CDS; these are displayed in an annotated hyperlinked spreadsheet format. Read mapping allowed the identification and annotation of tissue-enriched transcripts. A total of 327 transcripts were found significantly over expressed in the hemocyte libraries, including those coding for scavenger receptors, antimicrobial peptides, pathogen recognition proteins, proteases and protease inhibitors. Vitellogenin and lipid metabolism transcription enrichment suggests fat body components. We additionally annotated ubiquitously distributed transcripts associated with immune function, including immune-associated signal transduction proteins and transcription factors, including the STAT transcription factor.This is the first systems biology approach to describe the genes expressed in the haemocytes of this neglected disease vector. A total of 2,860 coding sequences were deposited to GenBank, increasing to 27,547 the number so far deposited by our previous transcriptome studies

  13. Google matrix analysis of DNA sequences.

    Science.gov (United States)

    Kandiah, Vivek; Shepelyansky, Dima L

    2013-01-01

    For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  14. Google matrix analysis of DNA sequences.

    Directory of Open Access Journals (Sweden)

    Vivek Kandiah

    Full Text Available For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW. At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  15. Sequencing cDNAs: An Introduction to DNA Sequence Analysis in the Undergraduate Molecular Genetics Course.

    Science.gov (United States)

    Galewsky, Samuel

    2000-01-01

    Introduces a series of molecular genetics laboratories where students pick a single colony from a Drosophila melanogester embryo cDNA library and purify the plasmid, then analyze the insert through restriction digests and gel electrophoresis. (Author/YDS)

  16. Cloning a Chymotrypsin-Like 1 (CTRL-1 Protease cDNA from the Jellyfish Nemopilema nomurai

    Directory of Open Access Journals (Sweden)

    Yunwi Heo

    2016-07-01

    Full Text Available An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His69, Asp117, and Ser216. The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita or Hydrozoan (Hydra vulgaris. The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5′ donor splice (GT and 3′ acceptor splice sequences (AG are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai.

  17. Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai

    Science.gov (United States)

    Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung

    2016-01-01

    An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His69, Asp117, and Ser216. The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5′ donor splice (GT) and 3′ acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai. PMID:27399771

  18. Purification, characterization, cDNA cloning, and expression of a xyloglucan endoglucanase from Geotrichum sp. M128.

    Science.gov (United States)

    Yaoi, Katsuro; Mitsuishi, Yasushi

    2004-02-27

    A novel xyloglucan-specific endo-beta-1,4-glucanase (XEG), xyloglucanase, with a molecular mass of 80 kDa and a pI of 4.8, was isolated from the fungus Geotrichum sp. M128. It was found to be an endoglucanase active toward xyloglucan and not active toward carboxymethylcellulose, Avicel, or barley 1,3-1,4-beta-glucan. Analysis of the precise substrate specificity using various xyloglucan oligosaccharide structures revealed that XEG has at least four subsites (-2 to +2) and specifically recognizes xylose branching at the +1 and +2 sites. The full-length cDNA encoding XEG was cloned and sequenced. It consists of a 2436-bp open reading frame encoding a 776-amino acid protein. From its deduced amino acid sequence, XEG can be classified as a family 74 glycosyl hydrolase. The cDNA encoding XEG was then expressed in Escherichia coli, and enzymatically active recombinant XEG was obtained.

  19. Error Analysis of Deep Sequencing of Phage Libraries: Peptides Censored in Sequencing

    Directory of Open Access Journals (Sweden)

    Wadim L. Matochko

    2013-01-01

    Full Text Available Next-generation sequencing techniques empower selection of ligands from phage-display libraries because they can detect low abundant clones and quantify changes in the copy numbers of clones without excessive selection rounds. Identification of errors in deep sequencing data is the most critical step in this process because these techniques have error rates >1%. Mechanisms that yield errors in Illumina and other techniques have been proposed, but no reports to date describe error analysis in phage libraries. Our paper focuses on error analysis of 7-mer peptide libraries sequenced by Illumina method. Low theoretical complexity of this phage library, as compared to complexity of long genetic reads and genomes, allowed us to describe this library using convenient linear vector and operator framework. We describe a phage library as N×1 frequency vector n=ni, where ni is the copy number of the ith sequence and N is the theoretical diversity, that is, the total number of all possible sequences. Any manipulation to the library is an operator acting on n. Selection, amplification, or sequencing could be described as a product of a N×N matrix and a stochastic sampling operator (Sa. The latter is a random diagonal matrix that describes sampling of a library. In this paper, we focus on the properties of Sa and use them to define the sequencing operator (Seq. Sequencing without any bias and errors is Seq=Sa IN, where IN is a N×N unity matrix. Any bias in sequencing changes IN to a nonunity matrix. We identified a diagonal censorship matrix (CEN, which describes elimination or statistically significant downsampling, of specific reads during the sequencing process.

  20. Characterization of cDNA for PMT: a Partial Nicotine Biosynthesis-Related Gene Isolated from Indonesian Local Tobacco (Nicotiana tabacum cv. Sindoro1

    Directory of Open Access Journals (Sweden)

    SESANTI BASUKI

    2013-12-01

    Full Text Available Nicotine is the major alkaloid compound in cultivated tobacco (Nicotiana tabacum that could potentially be converted into carcinogenic compound (nor-nicotine. The PMT gene encoding putrescine N-methyltransferase (PMT is one of the two key genes that play a prominent role in nicotine biosynthesis. The aimed of this study was to isolate and characterize the cDNA sequence originated from Indonesian local tobacco cv. Sindoro1 (Ntpmt_Sindoro1. The results showed that the Ntpmt_Sindoro1 was 1124 bp in length. This cDNA fragment encodes for 374 amino acid residues. The predicted polypeptide from the cDNA is a hidrophilic protein, and has a predicted molecular weight of 40.95 kDa. The predicted amino acids sequence also showed high similarity to the PMT gene product Nicotiana sp. available in the GenBank data base. The amino acid sequences also exert conserved residues specifically exhibited only by PMT gene originated from N. tabacum. Clustering analysis revealed that Ntpmt_Sindoro1 belongs to the same clade as the PMT3 gene, a member of the N. tabacum PMT gene family. The Ntpmt_Sindoro1 cDNA sequence covering exon1-exon8 of the PMT gene fragment has been registered in the GenBank data base, under the accession number JX978277.

  1. Chromosomal Localization of DNA Amplifications in Neuroblastoma Tumors Using cDNA Microarray Comparative Genomic Hybridization

    Directory of Open Access Journals (Sweden)

    Ben Beheshti

    2003-01-01

    Full Text Available Conventional comparative genomic hybridization (CGH profiling of neuroblastomas has identified many genomic aberrations, although the limited resolution has precluded a precise localization of sequences of interest within amplicons. To map high copy number genomic gains in clinically matched stage IV neuroblastomas, CGH analysis using a 19,200-feature cDNA microarray was used. A dedicated (freely available algorithm was developed for rapid in silico determination of chromosomal localizations of microarray cDNA targets, and for generation of an ideogram-type profile of copy number changes. Using these methodologies, novel gene amplifications undetectable by chromosome CGH were identified, and larger MYCN amplicon sizes (in one tumor up to 6 Mb than those previously reported in neuroblastoma were identified. The genes HPCAL1, LPIN1/KIAA0188, NAG, and NSE1/LOC151354 were found to be coamplified with MYCN. To determine whether stage IV primary tumors could be further subclassified based on their genomic copy number profiles, hierarchical clustering was performed. Cluster analysis of microarray CGH data identified three groups: 1 no amplifications evident, 2 a small MYCN amplicon as the only detectable imbalance, and 3 a large MYCN amplicon with additional gene amplifications. Application of CGH to cDNA microarray targets will help to determine both the variation of amplicon size and help better define amplification-dependent and independent pathways of progression in neuroblastoma.

  2. Phylogenetic analysis of the genus Hordeum using repetitive DNA sequences

    DEFF Research Database (Denmark)

    Svitashev, S.; Bryngelsson, T.; Vershinin, A.

    1994-01-01

    A set of six cloned barley (Hordeum vulgare) repetitive DNA sequences was used for the analysis of phylogenetic relationships among 31 species (46 taxa) of the genus Hordeum, using molecular hybridization techniques. In situ hybridization experiments showed dispersed organization of the sequences...

  3. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    In Pakistan, more than 10 million people are living with hepatitis C virus (HCV) with high morbidity and mortality. The aims of the present study are to report HCV core gene sequences from Pakistani population and perform their sequence comparison/phylogenetic analysis. The core gene of HCV has been cloned from six ...

  4. Cloning and sequence analysis of the Antheraea pernyi ...

    Indian Academy of Sciences (India)

    A genomic library was generated using HindIII and the positive clones were sequenced and analysed. The gp64 gene, encoding the baculovirus envelope protein GP64, was found in an insert. The nucleotide sequence analysis indicated that the AnpeNPV gp64 gene consists of a 1530 nucleotide open reading frame ...

  5. Sequence analysis corresponding to the PPE and PE proteins in ...

    Indian Academy of Sciences (India)

    Amino acid sequence analysis corresponding to the PPE proteins in H37Rv and CDC1551 strains of the Mycobacterium tuberculosis genomes resulted in the identification of a previously uncharacterized 225 amino acidresidue common region in 22 proteins. The pairwise sequence identities were as low as 18%.

  6. Biological sequence analysis: probabilistic models of proteins and nucleic acids

    National Research Council Canada - National Science Library

    Durbin, Richard

    1998-01-01

    ... analysis methods are now based on principles of probabilistic modelling. Examples of such methods include the use of probabilistically derived score matrices to determine the significance of sequence alignments, the use of hidden Markov models as the basis for profile searches to identify distant members of sequence families, and the inference...

  7. Mining the bitter melon (momordica charantia l. seed transcriptome by 454 analysis of non-normalized and normalized cDNA populations for conjugated fatty acid metabolism-related genes

    Directory of Open Access Journals (Sweden)

    Shipp Matthew J

    2010-11-01

    Full Text Available Abstract Background Seeds of Momordica charantia (bitter melon produce high levels of eleostearic acid, an unusual conjugated fatty acid with industrial value. Deep sequencing of non-normalized and normalized cDNAs from developing bitter melon seeds was conducted to uncover key genes required for biotechnological transfer of conjugated fatty acid production to existing oilseed crops. It is expected that these studies will also provide basic information regarding the metabolism of other high-value novel fatty acids. Results Deep sequencing using 454 technology with non-normalized and normalized cDNA libraries prepared from bitter melon seeds at 18 DAP resulted in the identification of transcripts for the vast majority of known genes involved in fatty acid and triacylglycerol biosynthesis. The non-normalized library provided a transcriptome profile of the early stage in seed development that highlighted the abundance of transcripts for genes encoding seed storage proteins as well as for a number of genes for lipid metabolism-associated polypeptides, including Δ12 oleic acid desaturases and fatty acid conjugases, class 3 lipases, acyl-carrier protein, and acyl-CoA binding protein. Normalization of cDNA by use of a duplex-specific nuclease method not only increased the overall discovery of genes from developing bitter melon seeds, but also resulted in the identification of 345 contigs with homology to 189 known lipid genes in Arabidopsis. These included candidate genes for eleostearic acid metabolism such as diacylglycerol acyltransferase 1 and 2, and a phospholipid:diacylglycerol acyltransferase 1-related enzyme. Transcripts were also identified for a novel FAD2 gene encoding a functional Δ12 oleic acid desaturase with potential implications for eleostearic acid biosynthesis. Conclusions 454 deep sequencing, particularly with normalized cDNA populations, was an effective method for mining of genes associated with eleostearic acid metabolism in

  8. cDNA library construction of two human Demodexspecies.

    Science.gov (United States)

    Niu, DongLing; Wang, RuiLing; Zhao, YaE; Yang, Rui; Hu, Li; Lei, YuYang; Dan, WeiChao

    2017-06-01

    The research of Demodex, a type of pathogen causing various dermatoses in animals and human beings, is lacking at RNA level. This study aims at extracting RNA and constructing cDNA library for Demodex. First, P. cuniculiand D. farinaewere mixed to establish homogenization method for RNA extraction. Second, D. folliculorumand D. breviswere collected and preserved in Trizol, which were mixed with D. farinaerespectively to extract RNA. Finally, cDNA library was constructed and its quality was assessed. The results indicated that for D. folliculorum& D. farinae, the recombination rate of cDNA library was 90.67% and the library titer was 7.50 × 104 pfu/ml. 17 of the 59 positive clones were predicted to be of D. folliculorum; For D. brevis& D. farinae, the recombination rate was 90.96% and the library titer was 7.85 x104 pfu/ml. 40 of the 59 positive clones were predicted to be of D. brevis. Further detection by specific primers demonstrated that mtDNA cox1, cox3and ATP6 detected from cDNA libraries had 96.52%-99.73% identities with the corresponding sequences in GenBank. In conclusion, the cDNA libraries constructed for Demodexmixed with D. farinaewere successful and could satisfy the requirements for functional genes detection.

  9. Strategy for the sequence analysis of heparin.

    Science.gov (United States)

    Liu, J; Desai, U R; Han, X J; Toida, T; Linhardt, R J

    1995-12-01

    The versatile biological activities of proteoglycans are mainly mediated by their glycosaminoglycan (GAG) components. Unlike proteins and nucleic acids, no satisfactory method for sequencing GAGs has been developed. This paper describes a strategy to sequence the GAG chains of heparin. Heparin, prepared from animal tissue, and processed by proteinases and endoglucuronidases, is 90% GAG heparin and 10% peptidoglycan heparin (containing small remnants of core protein). Raw porcine mucosal heparin was labelled on the amino termini of these core protein remnants with a hydrophobic, fluorescent tag [N-4-(6-dimethylamino-2-benzofuranyl) phenyl (NDBP)-isothiocyanate]. Enrichment of the NDBP-heparin using phenyl-Sepharose chromatography, followed by treatment with a mixture of heparin lyase I and III, resulted in a single NDBP-linkage region tetrasaccharide, which was characterized as deltaUAp(1-->3)-beta-D-Galp(1-->3)-beta-D-Galp(1-->4)-beta-Xylp -(1-->O-Ser-NDBP (deltaUAp is 4-deoxy-alpha-L-threo-hex-4-enopyranosyl uronic acid). Several NDBP-octasaccharides were isolated when NDBP-heparin was treated with only heparin lyase I. The structure of one of these NDBP-octasaccharides, deltaUAp2S(1-->4)-alpha-D-GlcNpAc(1-->4)-alpha-L-IdoAp (1-->4)-alpha-D-GlcNpAc6S(1-->4)-beta-D-GlcAp(1-->3)-beta-D- Galp(1-->3)-beta-D-Galp(1-->4)-beta-Xylp-(1-->O-Ser NDBP (S is sulphate, Ac is acetate), was determined by 1H-NMR and enzymatic methods. Enriched NDBP-heparin was treated with lithium hydroxide to release heparin, and the GAG chain was then labelled at xylose with 7-amino-1,3-naphthalene disulphonic acid (AGA). The resulting AGA-Xyl-heparin was sequenced on gradient PAGE using heparin lyase I and heparin lyase III. A predominant sequence in heparin at the protein core attachment site was deduced to be -D-GlcNp2S6S(or 6OH)(1-->4)-alpha-L-IdoAp2S-(1-->4)-alpha-D-GlcNp2S6S (or60H) (1-->4)-alpha-L-IdoAp2S(1-->4)-alpha-D-GlcNp2S6S( or 6OH)(1-->4)-alpha-L-IdoAp2S(1-->4)-alpha-D-GlcNpAc (1

  10. Editorial: Special Issue on Algorithms for Sequence Analysis and Storage

    Directory of Open Access Journals (Sweden)

    Veli Mäkinen

    2014-03-01

    Full Text Available This special issue of Algorithms is dedicated to approaches to biological sequence analysis that have algorithmic novelty and potential for fundamental impact in methods used for genome research.

  11. Initial sequencing and comparative analysis of the mouse genome.

    Science.gov (United States)

    Waterston, Robert H; Lindblad-Toh, Kerstin; Birney, Ewan; Rogers, Jane; Abril, Josep F; Agarwal, Pankaj; Agarwala, Richa; Ainscough, Rachel; Alexandersson, Marina; An, Peter; Antonarakis, Stylianos E; Attwood, John; Baertsch, Robert; Bailey, Jonathon; Barlow, Karen; Beck, Stephan; Berry, Eric; Birren, Bruce; Bloom, Toby; Bork, Peer; Botcherby, Marc; Bray, Nicolas; Brent, Michael R; Brown, Daniel G; Brown, Stephen D; Bult, Carol; Burton, John; Butler, Jonathan; Campbell, Robert D; Carninci, Piero; Cawley, Simon; Chiaromonte, Francesca; Chinwalla, Asif T; Church, Deanna M; Clamp, Michele; Clee, Christopher; Collins, Francis S; Cook, Lisa L; Copley, Richard R; Coulson, Alan; Couronne, Olivier; Cuff, James; Curwen, Val; Cutts, Tim; Daly, Mark; David, Robert; Davies, Joy; Delehaunty, Kimberly D; Deri, Justin; Dermitzakis, Emmanouil T; Dewey, Colin; Dickens, Nicholas J; Diekhans, Mark; Dodge, Sheila; Dubchak, Inna; Dunn, Diane M; Eddy, Sean R; Elnitski, Laura; Emes, Richard D; Eswara, Pallavi; Eyras, Eduardo; Felsenfeld, Adam; Fewell, Ginger A; Flicek, Paul; Foley, Karen; Frankel, Wayne N; Fulton, Lucinda A; Fulton, Robert S; Furey, Terrence S; Gage, Diane; Gibbs, Richard A; Glusman, Gustavo; Gnerre, Sante; Goldman, Nick; Goodstadt, Leo; Grafham, Darren; Graves, Tina A; Green, Eric D; Gregory, Simon; Guigó, Roderic; Guyer, Mark; Hardison, Ross C; Haussler, David; Hayashizaki, Yoshihide; Hillier, LaDeana W; Hinrichs, Angela; Hlavina, Wratko; Holzer, Timothy; Hsu, Fan; Hua, Axin; Hubbard, Tim; Hunt, Adrienne; Jackson, Ian; Jaffe, David B; Johnson, L Steven; Jones, Matthew; Jones, Thomas A; Joy, Ann; Kamal, Michael; Karlsson, Elinor K; Karolchik, Donna; Kasprzyk, Arkadiusz; Kawai, Jun; Keibler, Evan; Kells, Cristyn; Kent, W James; Kirby, Andrew; Kolbe, Diana L; Korf, Ian; Kucherlapati, Raju S; Kulbokas, Edward J; Kulp, David; Landers, Tom; Leger, J P; Leonard, Steven; Letunic, Ivica; Levine, Rosie; Li, Jia; Li, Ming; Lloyd, Christine; Lucas, Susan; Ma, Bin; Maglott, Donna R; Mardis, Elaine R; Matthews, Lucy; Mauceli, Evan; Mayer, John H; McCarthy, Megan; McCombie, W Richard; McLaren, Stuart; McLay, Kirsten; McPherson, John D; Meldrim, Jim; Meredith, Beverley; Mesirov, Jill P; Miller, Webb; Miner, Tracie L; Mongin, Emmanuel; Montgomery, Kate T; Morgan, Michael; Mott, Richard; Mullikin, James C; Muzny, Donna M; Nash, William E; Nelson, Joanne O; Nhan, Michael N; Nicol, Robert; Ning, Zemin; Nusbaum, Chad; O'Connor, Michael J; Okazaki, Yasushi; Oliver, Karen; Overton-Larty, Emma; Pachter, Lior; Parra, Genís; Pepin, Kymberlie H; Peterson, Jane; Pevzner, Pavel; Plumb, Robert; Pohl, Craig S; Poliakov, Alex; Ponce, Tracy C; Ponting, Chris P; Potter, Simon; Quail, Michael; Reymond, Alexandre; Roe, Bruce A; Roskin, Krishna M; Rubin, Edward M; Rust, Alistair G; Santos, Ralph; Sapojnikov, Victor; Schultz, Brian; Schultz, Jörg; Schwartz, Matthias S; Schwartz, Scott; Scott, Carol; Seaman, Steven; Searle, Steve; Sharpe, Ted; Sheridan, Andrew; Shownkeen, Ratna; Sims, Sarah; Singer, Jonathan B; Slater, Guy; Smit, Arian; Smith, Douglas R; Spencer, Brian; Stabenau, Arne; Stange-Thomann, Nicole; Sugnet, Charles; Suyama, Mikita; Tesler, Glenn; Thompson, Johanna; Torrents, David; Trevaskis, Evanne; Tromp, John; Ucla, Catherine; Ureta-Vidal, Abel; Vinson, Jade P; Von Niederhausern, Andrew C; Wade, Claire M; Wall, Melanie; Weber, Ryan J; Weiss, Robert B; Wendl, Michael C; West, Anthony P; Wetterstrand, Kris; Wheeler, Raymond; Whelan, Simon; Wierzbowski, Jamey; Willey, David; Williams, Sophie; Wilson, Richard K; Winter, Eitan; Worley, Kim C; Wyman, Dudley; Yang, Shan; Yang, Shiaw-Pyng; Zdobnov, Evgeny M; Zody, Michael C; Lander, Eric S

    2002-12-05

    The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

  12. Laser Desorption Mass Spectrometry for DNA Sequencing and Analysis

    Science.gov (United States)

    Chen, C. H. Winston; Taranenko, N. I.; Golovlev, V. V.; Isola, N. R.; Allman, S. L.

    1998-03-01

    Rapid DNA sequencing and/or analysis is critically important for biomedical research. In the past, gel electrophoresis has been the primary tool to achieve DNA analysis and sequencing. However, gel electrophoresis is a time-consuming and labor-extensive process. Recently, we have developed and used laser desorption mass spectrometry (LDMS) to achieve sequencing of ss-DNA longer than 100 nucleotides. With LDMS, we succeeded in sequencing DNA in seconds instead of hours or days required by gel electrophoresis. In addition to sequencing, we also applied LDMS for the detection of DNA probes for hybridization LDMS was also used to detect short tandem repeats for forensic applications. Clinical applications for disease diagnosis such as cystic fibrosis caused by base deletion and point mutation have also been demonstrated. Experimental details will be presented in the meeting. abstract.

  13. Tools for integrated sequence-structure analysis with UCSF Chimera

    Directory of Open Access Journals (Sweden)

    Huang Conrad C

    2006-07-01

    Full Text Available Abstract Background Comparing related structures and viewing the structures in the context of sequence alignments are important tasks in protein structure-function research. While many programs exist for individual aspects of such work, there is a need for interactive visualization tools that: (a provide a deep integration of sequence and structure, far beyond mapping where a sequence region falls in the structure and vice versa; (b facilitate changing data of one type based on the other (for example, using only sequence-conserved residues to match structures, or adjusting a sequence alignment based on spatial fit; (c can be used with a researcher's own data, including arbitrary sequence alignments and annotations, closely or distantly related sets of proteins, etc.; and (d interoperate with each other and with a full complement of molecular graphics features. We describe enhancements to UCSF Chimera to achieve these goals. Results The molecular graphics program UCSF Chimera includes a suite of tools for interactive analyses of sequences and structures. Structures automatically associate with sequences in imported alignments, allowing many kinds of crosstalk. A novel method is provided to superimpose structures in the absence of a pre-existing sequence alignment. The method uses both sequence and secondary structure, and can match even structures with very low sequence identity. Another tool constructs structure-based sequence alignments from superpositions of two or more proteins. Chimera is designed to be extensible, and mechanisms for incorporating user-specific data without Chimera code development are also provided. Conclusion The tools described here apply to many problems involving comparison and analysis of protein structures and their sequences. Chimera includes complete documentation and is intended for use by a wide range of scientists, not just those in the computational disciplines. UCSF Chimera is free for non-commercial use and is

  14. Cloning and chromosomal assignment of a human cDNA encoding a T cell- and natural killer cell-specific trypsin-like serine protease

    International Nuclear Information System (INIS)

    Gershenfeld, H.K.; Hershberger, R.J.; Shows, T.B.; Weissman, I.L.

    1988-01-01

    A cDNA clone encoding a human T cell- and natural killer cell-specific serine protease was obtained by screening a phage λgt10 cDNA library from phytohemagglutinin-stimulated human peripheral blood lymphocytes with the mouse Hanukah factor cDNA clone. In an RNA blot-hybridization analysis, this human Hanukah factor cDNA hybridized with a 1.3-kilobase band in allogeneic-stimulated cytotoxic T cells and the Jurkat cell line, but this transcript was not detectable in normal muscle, liver, tonsil, or thymus. By dot-blot hybridization, this cDNA hybridized with RNA from three cytolytic T-cell clones and three noncytolytic T-cell clones grown in vitro as well as with purified CD16 + natural killer cells and CD3 + , CD16 - T-cell large granular lymphocytes from peripheral blood lymphocytes (CD = cluster designation). The nucleotide sequence of this cDNA clone encodes a predicted serine protease of 262 amino acids. The active enzyme is 71% and 77% similar to the mouse sequence at the amino acid and DNA level, respectively. The human and mouse sequences conserve the active site residues of serine proteases--the trypsin-specific Asp-189 and all 10 cysteine residues. The gene for the human Hanukah factor serine protease is located on human chromosome 5. The authors propose that this trypsin-like serine protease may function as a common component necessary for lysis of target cells by cytotoxic T lymphocytes and natural killer cells

  15. Categorizing accident sequences in the external radiotherapy for risk analysis

    OpenAIRE

    Kim, Jonghyun

    2013-01-01

    Purpose This study identifies accident sequences from the past accidents in order to help the risk analysis application to the external radiotherapy. Materials and Methods This study reviews 59 accidental cases in two retrospective safety analyses that have collected the incidents in the external radiotherapy extensively. Two accident analysis reports that accumulated past incidents are investigated to identify accident sequences including initiating events, failure of safety measures, and co...

  16. Sequencing and Analysis of Neanderthal Genomic DNA

    Energy Technology Data Exchange (ETDEWEB)

    Noonan, James P.; Coop, Graham; Kudaravalli, Sridhar; Smith,Doug; Krause, Johannes; Alessi, Joe; Chen, Feng; Platt, Darren; Paabo,Svante; Pritchard, Jonathan K.; Rubin, Edward M.

    2006-06-13

    Recovery and analysis of multiple Neanderthal autosomalsequences using a metagenomic approach reveals that modern humans andNeanderthals split ~;400,000 years ago, without significant evidence ofsubsequent admixture.

  17. DNA Sequence Analysis in Clinical Medicine, Proceeding Cautiously

    Directory of Open Access Journals (Sweden)

    Moyra Smith

    2017-05-01

    Full Text Available Delineation of underlying genomic and genetic factors in a specific disease may be valuable in establishing a definitive diagnosis and may guide patient management and counseling. In addition, genetic information may be useful in identification of at risk family members. Gene mapping and initial genome sequencing data enabled the development of microarrays to analyze genomic variants. The goal of this review is to consider different generations of sequencing techniques and their application to exome sequencing and whole genome sequencing and their clinical applications. In recent decades, exome sequencing has primarily been used in patient studies. Discussed in some detail, are important measures that have been developed to standardize variant calling and to assess pathogenicity of variants. Examples of cases where exome sequencing has facilitated diagnosis and led to improved medical management are presented. Whole genome sequencing and its clinical relevance are presented particularly in the context of analysis of nucleotide and structural genomic variants in large population studies and in certain patient cohorts. Applications involving analysis of cell free DNA in maternal blood for prenatal diagnosis of specific autosomal trisomies are reviewed. Applications of DNA sequencing to diagnosis and therapeutics of cancer are presented. Also discussed are important recent diagnostic applications of DNA sequencing in cancer, including analysis of tumor derived cell free DNA and exosomes that are present in body fluids. Insights gained into underlying pathogenetic mechanisms of certain complex common diseases, including schizophrenia, macular degeneration, neurodegenerative disease are presented. The relevance of different types of variants, rare, uncommon, and common to disease pathogenesis, and the continuum of causality, are addressed. Pharmogenetic variants detected by DNA sequence analysis are gaining in importance and are particularly relevant

  18. cDNA fingerprinting of osteoprogenitor cells to isolate differentiation stage-specific genes.

    OpenAIRE

    Candeliere, G A; Rao, Y; Floh, A; Sandler, S D; Aubin, J E

    1999-01-01

    A cDNA fingerprinting strategy was developed to identify genes based on their differential expression pattern during osteoblast development. Preliminary biological and molecular staging of cDNA pools prepared by global amplification PCR allowed discrim-inating choices to be made in selection of expressed sequence tags (ESTs) to be isolated. Sequencing of selected ESTs confirmed that both known and novel genes can be isolated from any developmental stage of interest, e.g. from primitive progen...

  19. CDNA cloning, characterization and expression of an endosperm-specific barley peroxidase

    DEFF Research Database (Denmark)

    Rasmussen, Søren Kjærsgård; Welinder, K.G.; Hejgaard, J.

    1991-01-01

    A barley peroxidase (BP 1) of pI ca. 8.5 and M(r) 37000 has been purified from mature barley grains. Using antibodies towards peroxidase BP 1, a cDNA clone (pcR7) was isolated from cDNA expression library. The nucleotide sequence of pcR7 gave a derived amino acid sequence identical to the 158 C...

  20. Cloning and Sequence Analysis of N Gene of Transmissible Gastroenteritis Virus HYM-09 Isolated from Dog in China.

    Science.gov (United States)

    Man, Chaolai; Yu, Xiaolong

    2012-12-01

    Transmissible gastroenteritis virus (TGEV) is the etiological agent of TGE, and dogs are potential carriers of TGEV. In this study, genomic RNA were extracted from TGEV designated HYM-09 isolated from dog naturally infected with TGEV. The nucleocapsid (N) gene of HYM-09 was amplified by RT-PCR and cloned into pMD18-T vector. The N gene cDNA was sequenced and encompassed an open reading frame of 1,149 nucleotides, encoding a 382-amino acids protein. Sequence analyses of the N genes were performed, including homologous comparison, phylogenetic tree analysis and residue substitution analysis. The results showed that there existed some unique mutations in the HYM-09 isolate N gene, but HYM-09 N gene shared over 96 % homologous identities compared with 12 TGEV reference strains derived from other regions or countries respectively. The phylogenetic tree analysis revealed that the HYM-09 branched into the most strains group. This study shows that the nucleotide sequence analysis can form a base or further study on the mutation trend of non-porcine TGEV.

  1. Method for RNA extraction and cDNA library construction from microbes in crop rhizosphere soil.

    Science.gov (United States)

    Fang, Changxun; Xu, Tiecheng; Ye, Changliang; Huang, Likun; Wang, Qingshui; Lin, Wenxiong

    2014-02-01

    Techniques to analyze the transcriptome of the soil rhizosphere are essential to reveal the interactions and communications between plants and microorganisms in the soil ecosystem. In this study, different volumes of Al₂(SO₄)₃ were added to rhizosphere soil samples to precipitate humic substances, which interfere with most procedures of RNA and DNA analyses. After humic substances were precipitated, cells of soil microorganisms were broken by vortexing with glass beads, and then DNA and RNA were recovered using Tris-HCl buffer with LiCl, SDS, and EDTA. The crude extract was precipitated and dissolved in RNAse-free water, and then separated by agarose gel electrophoresis. We determined the optimum volume of Al₂(SO₄)₃ for treating rhizosphere soil of rice, tobacco, sugarcane, Rehmannia glutinosa, and Pseudostellaria heterophylla. The crude nucleic acids extract from rice soil was treated with DNase I and then RNA was purified using a gel filtration column. The purified RNA was reverse-transcribed into single-strand cDNA and then ligated with an adaptor at each end before amplifying ds cDNA. The ds cDNA was sub-cloned for subsequent gene sequence analysis. We conducted qPCR to amplify 16S ribosomal DNA and observed highly efficient amplification. These results show that the extraction method can be optimized to isolate and obtain high-quality nucleic acids from microbes in different rhizosphere soils, suitable for genomic and post-genomic analyses.

  2. cDNA analyses of CAPN3 enhance mutation detection and reveal a low prevalence of LGMD2A patients in Denmark

    DEFF Research Database (Denmark)

    Duno, M.; Sveen, M.L.; Schwartz, M.

    2008-01-01

    suspected to have LGMD2A, based on western blot results. Four of these patients were shown to have LGMD2I upon molecular analysis, whereas 16 of the remaining 42 patients harbored mutations in CAPN3 by both direct genomic sequencing and cDNA analyses. In 10 patients, we identified both mutant alleles....... In three other, only one heterozygous mutation could be identified on the genomic level; however, CAPN3 cDNA analyses demonstrated homozygosity for the mutant allele, indicating the presence of an unidentified allele that somehow compromise correct CAPN3 RNA processing. In the three remaining patients......, only a single heterozygous mutation could be identified both at the genomic level and on full-length CAPN3 cDNA. All three patients exhibited a highly abnormal western blot for calpain-3 and clinical characteristics of LGMD2A. Only three of the genetically confirmed LGMD2A patients were of Danish...

  3. Quantiprot - a Python package for quantitative analysis of protein sequences.

    Science.gov (United States)

    Konopka, Bogumił M; Marciniak, Marta; Dyrka, Witold

    2017-07-17

    The field of protein sequence analysis is dominated by tools rooted in substitution matrices and alignments. A complementary approach is provided by methods of quantitative characterization. A major advantage of the approach is that quantitative properties defines a multidimensional solution space, where sequences can be related to each other and differences can be meaningfully interpreted. Quantiprot is a software package in Python, which provides a simple and consistent interface to multiple methods for quantitative characterization of protein sequences. The package can be used to calculate dozens of characteristics directly from sequences or using physico-chemical properties of amino acids. Besides basic measures, Quantiprot performs quantitative analysis of recurrence and determinism in the sequence, calculates distribution of n-grams and computes the Zipf's law coefficient. We propose three main fields of application of the Quantiprot package. First, quantitative characteristics can be used in alignment-free similarity searches, and in clustering of large and/or divergent sequence sets. Second, a feature space defined by quantitative properties can be used in comparative studies of protein families and organisms. Third, the feature space can be used for evaluating generative models, where large number of sequences generated by the model can be compared to actually observed sequences.

  4. Genome sequencing and analysis conference grant

    Energy Technology Data Exchange (ETDEWEB)

    Venter, J.C. [ed.

    1995-10-01

    The 14 plenary session presentations focused on nematode; yeast; fruit fly; plants; mycobacteria; and man. In addition there were presentations on a variety of technical innovations including database developments and refinements, bioelectronic genesensors, computer-assisted multiplex techniques, and hybridization analysis with DNA chip technology. This document includes a list of exhibitors and abstracts of sessions.

  5. Desiccation survival in an Antarctic nematode: molecular analysis using expressed sequenced tags

    Directory of Open Access Journals (Sweden)

    Wall Diana H

    2009-02-01

    Full Text Available Abstract Background Nematodes are the dominant soil animals in Antarctic Dry Valleys and are capable of surviving desiccation and freezing in an anhydrobiotic state. Genes induced by desiccation stress have been successfully enumerated in nematodes; however we have little knowledge of gene regulation by Antarctic nematodes which can survive multiple environmental stresses. To address this problem we investigated the genetic responses of a nematode species, Plectus murrayi, that is capable of tolerating Antarctic environmental extremes, in particular desiccation and freezing. In this study, we provide the first insight into the desiccation induced transcriptome of an Antarctic nematode through cDNA library construction and suppressive subtractive hybridization. Results We obtained 2,486 expressed sequence tags (ESTs from 2,586 clones derived from the cDNA library of desiccated P. murrayi. The 2,486 ESTs formed 1,387 putative unique transcripts of which 523 (38% had matches in the model-nematode Caenorhabditis elegans, 107 (7% in nematodes other than C. elegans, 153 (11% in non-nematode organisms and 605 (44% had no significant match to any sequences in the current databases. The 1,387 unique transcripts were functionally classified by using Gene Ontology (GO hierarchy and the Kyoto Encyclopedia of Genes and Genomes (KEGG database. The results indicate that the transcriptome contains a group of transcripts from diverse functional areas. The subtractive library of desiccated nematodes showed 80 transcripts differentially expressed during desiccation stress, of which 28% were metabolism related, 19% were involved in environmental information processing, 28% involved in genetic information processing and 21% were novel transcripts. Expression profiling of 14 selected genes by quantitative Real-time PCR showed 9 genes significantly up-regulated, 3 down-regulated and 2 continuously expressed in response to desiccation. Conclusion The establishment of a

  6. Categorizing accident sequences in the external radiotherapy for risk analysis

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Jong Hyun [KEPCO International Nuclear Graduate School (KINGS), Ulsan (Korea, Republic of)

    2013-06-15

    This study identifies accident sequences from the past accidents in order to help the risk analysis application to the external radiotherapy. This study reviews 59 accidental cases in two retrospective safety analyses that have collected the incidents in the external radiotherapy extensively. Two accident analysis reports that accumulated past incidents are investigated to identify accident sequences including initiating events, failure of safety measures, and consequences. This study classifies the accidents by the treatments stages and sources of errors for initiating events, types of failures in the safety measures, and types of undesirable consequences and the number of affected patients. Then, the accident sequences are grouped into several categories on the basis of similarity of progression. As a result, these cases can be categorized into 14 groups of accident sequence. The result indicates that risk analysis needs to pay attention to not only the planning stage, but also the calibration stage that is committed prior to the main treatment process. It also shows that human error is the largest contributor to initiating events as well as to the failure of safety measures. This study also illustrates an event tree analysis for an accident sequence initiated in the calibration. This study is expected to provide sights into the accident sequences for the prospective risk analysis through the review of experiences.

  7. Categorizing accident sequences in the external radiotherapy for risk analysis.

    Science.gov (United States)

    Kim, Jonghyun

    2013-06-01

    This study identifies accident sequences from the past accidents in order to help the risk analysis application to the external radiotherapy. This study reviews 59 accidental cases in two retrospective safety analyses that have collected the incidents in the external radiotherapy extensively. Two accident analysis reports that accumulated past incidents are investigated to identify accident sequences including initiating events, failure of safety measures, and consequences. This study classifies the accidents by the treatments stages and sources of errors for initiating events, types of failures in the safety measures, and types of undesirable consequences and the number of affected patients. Then, the accident sequences are grouped into several categories on the basis of similarity of progression. As a result, these cases can be categorized into 14 groups of accident sequence. The result indicates that risk analysis needs to pay attention to not only the planning stage, but also the calibration stage that is committed prior to the main treatment process. It also shows that human error is the largest contributor to initiating events as well as to the failure of safety measures. This study also illustrates an event tree analysis for an accident sequence initiated in the calibration. This study is expected to provide sights into the accident sequences for the prospective risk analysis through the review of experiences.

  8. Isolation of full-length putative rat lysophospholipase cDNA using improved methods for mRNA isolation and cDNA cloning

    International Nuclear Information System (INIS)

    Han, J.H.; Stratowa, C.; Rutter, W.J.

    1987-01-01

    The authors have cloned a full-length putative rat pancreatic lysophospholipase cDNA by an improved mRNA isolation method and cDNA cloning strategy using [ 32 P]-labelled nucleotides. These new methods allow the construction of a cDNA library from the adult rat pancreas in which the majority of recombinant clones contained complete sequences for the corresponding mRNAs. A previously recognized but unidentified long and relatively rare cDNA clone containing the entire sequence from the cap site at the 5' end to the poly(A) tail at the 3' end of the mRNA was isolated by single-step screening of the library. The size, amino acid composition, and the activity of the protein expressed in heterologous cells strongly suggest this mRNA codes for lysophospholipase

  9. cDNA cloning of a major allergen from timothy grass (Phleum pratense) pollen; characterization of the recombinant Phl pV allergen

    NARCIS (Netherlands)

    Vrtala, S.; Sperr, W. R.; Reimitzer, I.; van Ree, R.; Laffer, S.; Müller, W. D.; Valent, P.; Lechner, K.; Rumpold, H.; Kraft, D.

    1993-01-01

    We isolated a cDNA encoding a major grass pollen allergen from a timothy grass (Phleum pratense) pollen expression cDNA library using allergic patients' IgE. The complete cDNA encoded an allergen that binds IgE from about 80% of grass pollen-allergic patients. Significant sequence homology was found

  10. Construction of C35 gene bait recombinants and T47D cell cDNA library.

    Science.gov (United States)

    Yin, Kun; Xu, Chao; Zhao, Gui-Hua; Liu, Ye; Xiao, Ting; Zhu, Song; Yan, Ge

    2017-11-20

    C35 is a novel tumor biomarker associated with metastasis progression. To investigate the interaction factors of C35 in its high expressed breast cancer cell lines, we constructed bait recombinant plasmids of C35 gene and T47D cell cDNA library for yeast two-hybrid screening. Full length C35 sequences were subcloned using RT-PCR from cDNA template extracted from T47D cells. Based on functional domain analysis, the full-length C35 1-348bp was also truncated into two fragments C351-153bp and C35154-348bp to avoid auto-activation. The three kinds of C35 genes were successfully amplified and inserted into pGBKT7 to construct bait recombinant plasmids pGBKT7-C351-348bp, pGBKT7-C351-153bp and pGBKT7-C35154-348bp, then transformed into Y187 yeast cells by the lithium acetate method. Auto-activation and toxicity of C35 baits were detected using nutritional deficient medium and X-α-Gal assays. The T47D cell ds cDNA was generated by SMART TM technology and the library was constructed using in vivo recombination-mediated cloning in the AH109 yeast strain using a pGADT7-Rec plasmid. The transformed Y187/pGBKT7-C351-348bp line was intensively inhibited while the truncated Y187/pGBKT7-C35 lines had no auto-activation and toxicity in yeast cells. The titer of established cDNA library was 2 × 10 7 pfu/mL with high transformation efficiency of 1.4 × 10 6 , and the insert size of ds cDNA was distributed homogeneously between 0.5-2.0 kb. Our research generated a T47D cell cDNA library with high titer, and the constructed two C35 "baits" contained a respective functional immunoreceptor tyrosine based activation motif (ITAM) and the conserved last four amino acids Cys-Ile-Leu-Val (CILV) motif, and therefore laid a foundation for screening the C35 interaction factors in a BC cell line.

  11. cDNA and deduced primary structure of basic phospholipase A2 with neurotoxic activity from the venom secretion of the Crotalus durissus collilineatus rattlesnake

    Directory of Open Access Journals (Sweden)

    F.H.R. Fagundes

    2010-03-01

    Full Text Available To illustrate the construction of precursor complementary DNAs, we isolated mRNAs from whole venom samples. After reverse transcription polymerase chain reaction (RT-PCR, we amplified the cDNA coding for a neurotoxic protein, phospholipase A2 D49 (PLA2 D49, from the venom of Crotalus durissus collilineatus (Cdc PLA2. The cDNA encoding Cdc PLA2 from whole venom was sequenced. The deduced amino acid sequence of this cDNA has high overall sequence identity with the group II PLA2 protein family. Cdc PLA2 has 14 cysteine residues capable of forming seven disulfide bonds that characterize this group of PLA2 enzymes. Cdc PLA2 was isolated using conventional Sephadex G75 column chromatography and reverse-phase high performance liquid chromatography (RP-HPLC. The molecular mass was estimated using matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF mass spectrometry. We tested the neuromuscular blocking activities on chick biventer cervicis neuromuscular tissue. Phylogenetic analysis of Cdc PLA2 showed the existence of two lines of N6-PLA2, denominated F24 and S24. Apparently, the sequences of the New World’s N6-F24-PLA2 are similar to those of the agkistrodotoxin from the Asian genus Gloydius. The sequences of N6-S24-PLA2 are similar to the sequence of trimucrotoxin from the genus Protobothrops, found in the Old World.

  12. Searching for cellular partners of hantaviral nonstructural protein NSs: Y2H screening of mouse cDNA library and analysis of cellular interactome.

    Directory of Open Access Journals (Sweden)

    Tuomas Rönnberg

    Full Text Available Hantaviruses (Bunyaviridae are negative-strand RNA viruses with a tripartite genome. The small (S segment encodes the nucleocapsid protein and, in some hantaviruses, also the nonstructural protein (NSs. The aim of this study was to find potential cellular partners for the hantaviral NSs protein. Toward this aim, yeast two-hybrid (Y2H screening of mouse cDNA library was performed followed by a search for potential NSs protein counterparts via analyzing a cellular interactome. The resulting interaction network was shown to form logical, clustered structures. Furthermore, several potential binding partners for the NSs protein, for instance ACBD3, were identified and, to prove the principle, interaction between NSs and ACBD3 proteins was demonstrated biochemically.

  13. [Construction and Identification of the cDNA Expression Library for Human Esophageal Cancer Cells].

    Science.gov (United States)

    Zhang, Zhe; Wu, Xiang-Yu; Feng, Lu; Huang, Shang-Ke; Luo, Min-Na; Shao, Shan; Zhao, Xin-Han

    2016-09-01

    To construct a cDNA phage expression library for human esophageal cancer cells. After the total RNA were obtained from esophageal cancer cells, the mRNA were separated with magnetic beads adsorption method, and the single-strand and double-strand cDNA were synthesized through reverse transcription. With the undesirable cDNA fragments removed, the remaining cDNA (linked with Eco R1 aptamer and phosphorylated its 5'end) combined with the carrier of T7 Select10-3b. The recombinant phage were packaged in vitro for preliminary cDNA library. PCR was used to identify the size of inserted cDNA. The constructed original cDNA phage expression library for human esophageal cancer cells was consisted of 2.01×10⁶ pfu/mL bacteriophages with a recombination rate of 100%. The length of the inserted cDNA fragments were range from 300 bp to 1 500 bp. The cDNA phage expression library of human esophageal cell is successfully constructed to meet the currently recognized standards, and can be well used to screen cDNA-cloned genes of human esophageal cancer antigens by serological analysis of recombinantly expressed cDNA clone (SEREX).

  14. Nonlinear analysis of river flow time sequences

    Science.gov (United States)

    Porporato, Amilcare; Ridolfi, Luca

    1997-06-01

    Within the field of chaos theory several methods for the analysis of complex dynamical systems have recently been proposed. In light of these ideas we study the dynamics which control the behavior over time of river flow, investigating the existence of a low-dimension deterministic component. The present article follows the research undertaken in the work of Porporato and Ridolfi [1996a] in which some clues as to the existence of chaos were collected. Particular emphasis is given here to the problem of noise and to nonlinear prediction. With regard to the latter, the benefits obtainable by means of the interpolation of the available time series are reported and the remarkable predictive results attained with this nonlinear method are shown.

  15. Deep Sequencing Analysis of Nucleolar Small RNAs: Bioinformatics.

    Science.gov (United States)

    Bai, Baoyan; Laiho, Marikki

    2016-01-01

    Small RNAs (size 20-30 nt) of various types have been actively investigated in recent years, and their subcellular compartmentalization and relative concentrations are likely to be of importance to their cellular and physiological functions. Comprehensive data on this subset of the transcriptome can only be obtained by application of high-throughput sequencing, which yields data that are inherently complex and multidimensional, as sequence composition, length, and abundance will all inform to the small RNA function. Subsequent data analysis, hypothesis testing, and presentation/visualization of the results are correspondingly challenging. We have constructed small RNA libraries derived from different cellular compartments, including the nucleolus, and asked whether small RNAs exist in the nucleolus and whether they are distinct from cytoplasmic and nuclear small RNAs, the miRNAs. Here, we present a workflow for analysis of small RNA sequencing data generated by the Ion Torrent PGM sequencer from samples derived from different cellular compartments.

  16. Generation and analysis of expressed sequence tags from six developing xylem libraries in Pinus radiata D. Don

    Directory of Open Access Journals (Sweden)

    Dillon Shannon K

    2009-01-01

    Full Text Available Abstract Background Wood is a major renewable natural resource for the timber, fibre and bioenergy industry. Pinus radiata D. Don is the most important commercial plantation tree species in Australia and several other countries; however, genomic resources for this species are very limited in public databases. Our primary objective was to sequence a large number of expressed sequence tags (ESTs from genes involved in wood formation in radiata pine. Results Six developing xylem cDNA libraries were constructed from earlywood and latewood tissues sampled at juvenile (7 yrs, transition (11 yrs and mature (30 yrs ages, respectively. These xylem tissues represent six typical development stages in a rotation period of radiata pine. A total of 6,389 high quality ESTs were collected from 5,952 cDNA clones. Assembly of 5,952 ESTs from 5' end sequences generated 3,304 unigenes including 952 contigs and 2,352 singletons. About 97.0% of the 5,952 ESTs and 96.1% of the unigenes have matches in the UniProt and TIGR databases. Of the 3,174 unigenes with matches, 42.9% were not assigned GO (Gene Ontology terms and their functions are unknown or unclassified. More than half (52.1% of the 5,952 ESTs have matches in the Pfam database and represent 772 known protein families. About 18.0% of the 5,952 ESTs matched cell wall related genes in the MAIZEWALL database, representing all 18 categories, 91 of all 174 families and possibly 557 genes. Fifteen cell wall-related genes are ranked in the 30 most abundant genes, including CesA, tubulin, AGP, SAMS, actin, laccase, CCoAMT, MetE, phytocyanin, pectate lyase, cellulase, SuSy, expansin, chitinase and UDP-glucose dehydrogenase. Based on the PlantTFDB database 41 of the 64 transcription factor families in the poplar genome were identified as being involved in radiata pine wood formation. Comparative analysis of GO term abundance revealed a distinct transcriptome in juvenile earlywood formation compared to other stages of

  17. Identification of some unknown transcripts from SSH cDNA library of buffalo follicular oocytes.

    Science.gov (United States)

    Rajput, S K; Kumar, P; Roy, B; Verma, A; Pandey, H P; Singh, D; De, S; Datta, T K

    2013-03-01

    A buffalo oocyte-specific subtracted cDNA library was constructed to identify exclusively or preferentially oocyte-expressed genes. The library represented an enriched population of transcripts obtained from oocytes of diverse ovarian follicular origin and at different stages of in vitro maturation. A total of 1173 high-quality sequences of oocyte-specific genes were clustered into 645 unique sequences, out of which 65.76% were represented as singlets and 34.26% as contig expressed sequence tags (ESTs; clusters). Analysis of sequences revealed that 498 of these sequences were identified as a known sequence in mammalian species including buffalo, 103 as uncharacterized ESTs and 44 unknown sequences including 1 novel EST, so far not reported in any species. Gene ontology annotation classified these sequences into functional categories of cellular events and biological processes associated with oocyte competence. Expression status of the isolated unknown ESTs confirmed that many of these are expressed in oocytes exclusively and in others preferentially, some in excess of 80-fold greater in comparison with a variety of somatic tissues. The isolated novel EST was detected to be expressed exclusively in oocytes and testicular cells only. To our knowledge, this is the first report giving a detailed transcriptome account of oocyte-expressed genes in buffalo. This study will provide important information on the physiological control of oocyte development, as well as many questions yet to be addressed on the reproductive process of buffalo.

  18. Molecular cloning and sequence analysis of the cat myostatin gene ...

    African Journals Online (AJOL)

    ... MEF3, MTBF, PAX3, SMAD, HBOX, HOMF and TEAF motifs. Comparative analysis for some motifs showed both conservations and differences among cat, horse, porcine and human. Key words: Cat, myostatin 5'-regulatory region, molecular cloning, sequence analysis and comparison, transcription factor binding sites.

  19. Food Fish Identification from DNA Extraction through Sequence Analysis

    Science.gov (United States)

    Hallen-Adams, Heather E.

    2015-01-01

    This experiment exposed 3rd and 4th y undergraduates and graduate students taking a course in advanced food analysis to DNA extraction, polymerase chain reaction (PCR), and DNA sequence analysis. Students provided their own fish sample, purchased from local grocery stores, and the class as a whole extracted DNA, which was then subjected to PCR,…

  20. An optimum analysis sequence for environmental gamma-ray spectrometry

    International Nuclear Information System (INIS)

    De la Torre, F.; Rios M, C.; Ruvalcaba A, M. G.; Mireles G, F.; Saucedo A, S.; Davila R, I.; Pinedo, J. L.

    2010-10-01

    This work aims to obtain an optimum analysis sequence for environmental gamma-ray spectroscopy by means of Genie 2000 (Canberra). Twenty different analysis sequences were customized using different peak area percentages and different algorithms for: 1) peak finding, and 2) peak area determination, and with or without the use of a library -based on evaluated nuclear data- of common gamma-ray emitters in environmental samples. The use of an optimum analysis sequence with certified nuclear information avoids the problems originated by the significant variations in out-of-date nuclear parameters of commercial software libraries. Interference-free gamma ray energies with absolute emission probabilities greater than 3.75% were included in the customized library. The gamma-ray spectroscopy system (based on a Ge Re-3522 Canberra detector) was calibrated both in energy and shape by means of the IAEA-2002 reference spectra for software intercomparison. To test the performance of the analysis sequences, the IAEA-2002 reference spectrum was used. The z-score and the reduced χ 2 criteria were used to determine the optimum analysis sequence. The results show an appreciable variation in the peak area determinations and their corresponding uncertainties. Particularly, the combination of second derivative peak locate with simple peak area integration algorithms provides the greater accuracy. Lower accuracy comes from the combination of library directed peak locate algorithm and Genie's Gamma-M peak area determination. (Author)

  1. An optimum analysis sequence for environmental gamma-ray spectrometry

    Energy Technology Data Exchange (ETDEWEB)

    De la Torre, F.; Rios M, C.; Ruvalcaba A, M. G.; Mireles G, F.; Saucedo A, S.; Davila R, I.; Pinedo, J. L., E-mail: fta777@hotmail.co [Universidad Autonoma de Zacatecas, Centro Regional de Estudis Nucleares, Calle Cipres No. 10, Fracc. La Penuela, 98068 Zacatecas (Mexico)

    2010-10-15

    This work aims to obtain an optimum analysis sequence for environmental gamma-ray spectroscopy by means of Genie 2000 (Canberra). Twenty different analysis sequences were customized using different peak area percentages and different algorithms for: 1) peak finding, and 2) peak area determination, and with or without the use of a library -based on evaluated nuclear data- of common gamma-ray emitters in environmental samples. The use of an optimum analysis sequence with certified nuclear information avoids the problems originated by the significant variations in out-of-date nuclear parameters of commercial software libraries. Interference-free gamma ray energies with absolute emission probabilities greater than 3.75% were included in the customized library. The gamma-ray spectroscopy system (based on a Ge Re-3522 Canberra detector) was calibrated both in energy and shape by means of the IAEA-2002 reference spectra for software intercomparison. To test the performance of the analysis sequences, the IAEA-2002 reference spectrum was used. The z-score and the reduced {chi}{sup 2} criteria were used to determine the optimum analysis sequence. The results show an appreciable variation in the peak area determinations and their corresponding uncertainties. Particularly, the combination of second derivative peak locate with simple peak area integration algorithms provides the greater accuracy. Lower accuracy comes from the combination of library directed peak locate algorithm and Genie's Gamma-M peak area determination. (Author)

  2. Isolation of an ATP synthase cDNA from Sinonovacula constricta ...

    African Journals Online (AJOL)

    Yomi

    2012-01-24

    Jan 24, 2012 ... Complete cDNA sequence of ScATPase and its deduced amino acid sequence. Nucleotides were numbered from the first base at the 5'end. The canonical polyadenylation signal-sequence was italic and underlined. The asterisk indicated the stop codon. The domain for ATP synthase C was underlined.

  3. Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers.

    Directory of Open Access Journals (Sweden)

    Stephan Pabinger

    Full Text Available Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM. Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage

  4. [Construction and characterization of normalized cDNA library of maize inbred Mo17 from multiple tissues and developmental stages].

    Science.gov (United States)

    Zhang, Z X; Zhang, F D; Tang, W H; Pi, Y J; Zheng, Y L

    2005-01-01

    Comprehensive complementary DNA (cDNA) library is a valuable resource for functional genomics. In this study, we set up a normalized cDNA library of Mo17 (MONL) by saturation hybridization with genomic DNA, which contained expressed genes of eight tissues and organs from inbred Mo17 of maize (Zea mays L.). In this library, the insert sizes range from 0.4 kb to 4 kb and the average size is 1.18 kb. 10.830 clones were spotted on nylon membrane to make a cDNA microarray. Randomly picked 300 clones from the cDNA library were sequenced. The cDNA microarry was hybridized with pooled tissue mRNA probes or housekeeping gene cDNA probes. The results showed the normalized cDNA library comprehensively includes tissue-specific genes in which 71% are unique ESTs (expressed sequence tags) based on the 300 sequences analyzed. Using BLAST program to compare the sequences against online nucleotide databases, 88% sequences were found in ZmDB or NCBI, and 12% sequences were not found in existing nucleotide databases. More than 73% sequences are of unknown function. The library could be extensively used in developing DNA markers, sequencing ESTs, mining new genes, identifying positional cloning and candidate gene, and developing microarrays in maize genomics research.

  5. High Throughput Plasmid Sequencing with Illumina and CLC Bio (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    Energy Technology Data Exchange (ETDEWEB)

    Athavale, Ajay

    2012-06-01

    Ajay Athavale (Monsanto) presents "High Throughput Plasmid Sequencing with Illumina and CLC Bio" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  6. Isolation and preliminary function analysis of a Na + /H + antiporter ...

    African Journals Online (AJOL)

    A full-length cDNA Na+/H+ antiporter gene (MzNHX1) was isolated from Malus zumi according to the homologous Na+/H+ antiporter gene region in plants. Sequence analysis indicated that the cDNA was 2062 bp in length, including an open reading frame (ORF) of 1629 bp, which encoded a predicted polypeptide of 542 ...

  7. Bioinformatics analysis of circulating cell-free DNA sequencing data.

    Science.gov (United States)

    Chan, Landon L; Jiang, Peiyong

    2015-10-01

    The discovery of cell-free DNA molecules in plasma has opened up numerous opportunities in noninvasive diagnosis. Cell-free DNA molecules have become increasingly recognized as promising biomarkers for detection and management of many diseases. The advent of next generation sequencing has provided unprecedented opportunities to scrutinize the characteristics of cell-free DNA molecules in plasma in a genome-wide fashion and at single-base resolution. Consequently, clinical applications of circulating cell-free DNA analysis have not only revolutionized noninvasive prenatal diagnosis but also facilitated cancer detection and monitoring toward an era of blood-based personalized medicine. With the remarkably increasing throughput and lowering cost of next generation sequencing, bioinformatics analysis becomes increasingly demanding to understand the large amount of data generated by these sequencing platforms. In this Review, we highlight the major bioinformatics algorithms involved in the analysis of cell-free DNA sequencing data. Firstly, we briefly describe the biological properties of these molecules and provide an overview of the general bioinformatics approach for the analysis of cell-free DNA. Then, we discuss the specific upstream bioinformatics considerations concerning the analysis of sequencing data of circulating cell-free DNA, followed by further detailed elaboration on each key clinical situation in noninvasive prenatal diagnosis and cancer management where downstream bioinformatics analysis is heavily involved. We also discuss bioinformatics analysis as well as clinical applications of the newly developed massively parallel bisulfite sequencing of cell-free DNA. Finally, we offer our perspectives on the future development of bioinformatics in noninvasive diagnosis. Copyright © 2015 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.

  8. Molecular cloning of the cDNA encoding aspartate aminotransferase from bean root nodules and determination of its role in nodule nitrogen metabolism.

    Science.gov (United States)

    Silvente, Sonia; Camas, Alberto; Lara, Miguel

    2003-06-01

    A cDNA clone encoding aspartate aminotransferase (PVAAT-2) (EC 2.6.1.1) was isolated from the common bean Phaseolus vulgaris nodule cDNA library. The nucleotide sequence analysis of the full-length cDNA allowed its identification by comparison with sequence databases. The amino acid sequence of the bean PvAAT-2 showed high similarity with the AAT-2 isoforms described in other leguminous plants. The amino-terminal region of the PvAAT-2 contains a sequence, which shares common features of plastid transit peptides. Southern blot analysis showed that the PvAAT-2 clone is encoded by a single gene in the P. vulgaris genome. Analysis of the PvAAT-2 mRNA levels suggests that the expression of this gene is nodule enhanced. The PvAAT-2 transcript is more abundant in nodules with increased synthesis of amides and is down-regulated in conditions where ureides accumulate. When plants were supplemented with ureides or with amides, PvAAT-2 expression was reduced, while it was not affected when plants were treated with allopurinol, an inhibitor of ureide synthesis. On the other hand, the expression of asparagine synthetase (another enzyme involved in the synthesis of amides) is not affected either by ureides or amides. These data suggest a role for AAT-2 in the mechanism involved in the synthesis of nitrogen compounds in bean nodules.

  9. [A novel vector for construction of a cDNA library].

    Science.gov (United States)

    Fedchenko, V I; Kaloshin, A A; Medvedev, A E

    2010-01-01

    A new original vector pEM-(dT)40(f+) has been prepared. It can be used for cDNA library construction from polyadenylated mRNA, isolated from various sources. The pGEM-(dT)40f(+) is initially transformed into single stranded and then into a linear form and its (dT)40 tail at 3'-end is used as the vector-primer for synthesis of the first strand cDNA. The use of a synthetic oligonucleotide complementary to the vector and recombinant DNA results in vector cyclization and synthesis of the second strand cDNA. This approach significantly simplifies cDNA library construction, it does not require PCR reaction (which can induce artifact mutations in cDNA sequences) and restrictase treatment.

  10. Molecular characterization of a cDNA encoding vitellogenin in the banana shrimp, Penaeus (Litopenaeus) merguiensis and sites of vitellogenin mRNA expression.

    Science.gov (United States)

    Phiriyangkul, Pharima; Utarabhand, Prapaporn

    2006-04-01

    In order to determine the primary structure of banana shrimp, Penaeus merguiensis, vitellogenin (Vg), we previously purified vitellin (Vt) from the ovaries of vitellogenic females, and chemically analyzed the N-terminal amino acid sequence of its 78 kDa subunit. In this study, a cDNA from this species encoding Vg was cloned based on the N-terminal amino acid sequence of the major 78 kDa subunit of Vt and conserved sequences of Vg/Vt from other crustacean species. The complete nucleotide sequence of Vg cDNA was achieved by RT-PCR and 5' and 3' rapid amplification of cDNA ends (RACE) approaches. The full-length Vg cDNA consisted of 7,961 nucleotides. The open reading frame of this cDNA encoding a precursor peptide was comprised of 2,586 amino acid residues, with a putative processing site, R-X-K/R-R, recognized by subtilisin-like endoproteases. The deduced amino acid sequence was obtained from the Vg cDNA and its amino acid composition showed a high similarity to that of purified Vt. The deduced primary structure, of P. merguiensis Vg was 91.4% identical to the Vg of Penaeus semisulcatus and was also related to the Vg sequences of six other crustacean species with identities that ranged from 86.9% to 36.6%. In addition, the amino acid sequences corresponding to the signal peptide, N-terminal region and C-terminal region of P. merguiensis Vg were almost identical to the same sequences of the seven other reported crustacean species. Results from RT-PCR analysis showed that Vg mRNA expression was present in both the ovary and hepatopancreas of vitellogenic females but was not detected in other tissues including muscle, heart, and intestine of females or in the hepatopancreas of mature males. These results indicate that the Vg gene may be expressed only by mature P. merguiensis females and that both the ovary and hepatopancreas are possible sites for Vg synthesis in this species of shrimp. Copyright 2006 Wiley-Liss, Inc.

  11. Construction and characterization of a goat mammary gland cDNA library.

    Science.gov (United States)

    Han, Xue Feng; Luo, Jun; Wu, Ning; Matand, Kanyand; Yang, Bao Jin; Wu, Hui Juan; Zhang, Li Juan; Wang, Hai Bin

    2008-03-01

    A lactating goat mammary gland cDNA library was constructed by using a modified commercially available cDNA library construction kit protocol. The resulting clones were sequenced and functionally analyzed through cross-species genomic comparison to assess (1) the capacity and functional quality of the constructed library for subsequent research and (2) the efficiency of the procedural modifications. The study resulted in the construction of a high-quality mammary gland cDNA library, which was characterized by (1) the total recombinants number of 1.4 x 10(7) colony-forming units (cfus) that was at least 10 times greater than the number expected from the application of the standard kit protocol, (2) the recombinants rate of 96%, and (3) the average insert size of 1,082 bp. BLAST analysis of sequenced clones against GenBank databases determined 55.7% of clone redundancy, 22 known function gene clusters, and 29 novel gene clusters. The analysis of the primary gene expression profile showed that 59% of the tested clones were genes that coded for milk proteins while 16% of the clones coded for ribosomal, metabolism, immune response, and translation proteins. The remaining 25% of the tested clones were described as novel genes. Cross-species comparison showed that 77% of characterized gene clusters were successfully identified by using resources from other ruminants and unrelated species. This outcome is in consonance with the common belief that the genomic resources that have been generated across species are potentially powerful tools that could be used for enhancing the molecular understanding of less genomically studied species, such as goat.

  12. Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing

    Science.gov (United States)

    Rocher, Solen; Jean, Martine; Castonguay, Yves; Belzile, François

    2015-01-01

    Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids. PMID:26115486

  13. Next generation sequencing (NGS)technologies and applications

    Energy Technology Data Exchange (ETDEWEB)

    Vuyisich, Momchilo [Los Alamos National Laboratory

    2012-09-11

    NGS technology overview: (1) NGS library preparation - Nucleic acids extraction, Sample quality control, RNA conversion to cDNA, Addition of sequencing adapters, Quality control of library; (2) Sequencing - Clonal amplification of library fragments, (except PacBio), Sequencing by synthesis, Data output (reads and quality); and (3) Data analysis - Read mapping, Genome assembly, Gene expression, Operon structure, sRNA discovery, and Epigenetic analyses.

  14. Molecular cloning of a cDNA encoding human calumenin, expression in Escherichia coli and analysis of its Ca2+-binding activity

    DEFF Research Database (Denmark)

    Vorum, H; Liu, X; Madsen, Peder

    1998-01-01

    -terminal signal sequence, 7 EF-hand domains and, at the C-terminus, a HDEF sequence which has been reported to function as retrieval signal to the ER. The calumenin transcript is ubiquitously expressed in human tissue, at high levels in heart, placenta and skeletal muscle, at lower levels in lung, kidney...

  15. [Screening of specifically expressed genes in amphioxus neurula by construction of a subtractive cDNA library].

    Science.gov (United States)

    Zhang, Lei; Yang, Yong-Jie; Zhang, Yan-Jun

    2010-12-01

    To screen specifically expressed genes in the development of nerve, muscle, and body axis of amphioxus, Branchiostoma belcheri tsingtauenese. A subtractive cDNA library was constructed from the 12-hour amphioxus neurula cDNA after subtractively hybridized with the 6-hour amphioxus gastrula cDNA. The total RNA was extracted from the 12-hour neurula and 6-hour gastrula, then reverse transcribed into cDNA. The 12-hour neurula cDNA was designated as the experimental group (the tester) and the 6-hour gastrula cDNA as the control group (the driver). The differentially expressed sequences were exponentially amplified using suppression PCR. Background was subtracted and differentially expressed sequences were further enriched. The PCR products were ligated to the T Vector. After transformation of the recombinant plasmid carrying inserted amphioxus cDNA into E.coli host cells, the cDNA library was constructed successfully. Two hundred randomly chosen positive clones were sequenced and some of neurula-specifically expressed genes were obtained. SSH is an effective method for searching differentially expressed genes. The subtractive cDNA library we generated provides a tool for further study of regulatory mechanisms of amphioxus early embryonic development.

  16. Succinyl-CoA:3-oxoacid transferase (SCOT): Human cDNA and genomic cloning and chromosomal mapping of the human gene

    Energy Technology Data Exchange (ETDEWEB)

    Kassovska-Bratinova, S.; Mitchell, G.A. [Hopital Ste-Justine, Montreal, Quebec (Canada); Duncan, A. [Kingston General Hospital, Kingston, Ontario (Canada)] [and others

    1994-09-01

    SCOT (EC 2.8.3.5) mediates the activation and utilization of ketone bodies in extrahepatic tissues, especially brain, heart and kidney. Children with hereditary SCOT deficiency have episodes of severe ketoacidosis. Using a partial human heart SCOT cDNA, hSCOT-G, we detected a single {approximately}3 Kb mRNA in human heart and leukocytes, but not in liver. The length of the mouse SCOT mRNA detected with hSCOT-G in muscle, heart, kidney and brain is {approximately}3 Kb. We mapped the human SCOT gene to chromosome 5p13 by in situ hybridization. To date we have isolated human heart cDNAs spanning 2.9 Kb and including a 1248 bp open reading frame. The 3{prime} nontranslated region of the human SCOT mRNA extends at least 1712 bp, in contrast to the 209 bp sequence reported for pig SCOT cDNA. In one heart cDNA clone we detected a 58 bp insertion 258 bp downstream from the stop codon. We performed RT-PCR using a 5{prime} degenerate-sequence primer designed from the pig SCOT leader peptide sequence and 3{prime} human-specific primers. We obtained a fragment of the expected 320 bp length which strongly hybridizes to an internal oligonucleotide and which we are now characterizing. Human genomic Southern blot analysis with a partial human cDNA as probe suggests that the length of the human SCOT gene is about 40 K. Using hSCOT-G as a probe, we screened a human leukocyte genomic library in EMBL-3 phage and isolated two genomic clones. One of them contains a processed pseudogene. The other contains at least two exons of the human SCOT gene spanning cDNA residues 431 to 734. These findings will be useful for mutation analysis in SCOT-deficient patients.

  17. Cloning, Expression, Sequence Analysis and Homology Modeling of the Prolyl Endoprotease from Eurygaster integriceps Puton

    Directory of Open Access Journals (Sweden)

    Ravi Chandra Yandamuri

    2014-10-01

    Full Text Available eurygaster integriceps Puton, commonly known as sunn pest, is a major pest of wheat in Northern Africa, the Middle East and Eastern Europe. This insect injects a prolyl endoprotease into the wheat, destroying the gluten. The purpose of this study was to clone the full length cDNA of the sunn pest prolyl endoprotease (spPEP for expression in E. coli and to compare the amino acid sequence of the enzyme to other known PEPs in both phylogeny and potential tertiary structure. Sequence analysis shows that the 5ꞌ UTR contains several putative transcription factor binding sites for transcription factors known to be expressed in Drosophila that might be useful targets for inhibition of the enzyme. The spPEP was first identified as a prolyl endoprotease by Darkoh et al., 2010. The enzyme is a unique serine protease of the S9A family by way of its substrate recognition of the gluten proteins, which are greater than 30 kD in size. At 51% maximum identity to known PEPs, homology modeling using SWISS-MODEL, the porcine brain PEP (PDB: 2XWD was selected in the database of known PEP structures, resulting in a predicted tertiary structure 99% identical to the porcine brain PEP structure. A Km for the recombinant spPEP was determined to be 210 ± 53 µM for the zGly-Pro-pNA substrate in 0.025 M ethanolamine, pH 8.5, containing 0.1 M NaCl at 37 °C with a turnover rate of 172 ± 47 µM Gly-Pro-pNA/s/µM of enzyme.

  18. Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR Marker Resources for Diversity Analysis of Mango (Mangifera indica L.

    Directory of Open Access Journals (Sweden)

    Natalie L. Dillon

    2014-01-01

    Full Text Available In this study, a collection of 24,840 expressed sequence tags (ESTs generated from five mango (Mangifera indica L. cDNA libraries was mined for EST-based simple sequence repeat (SSR markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties, and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11 SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.

  19. Cloning and functional expression of a human pancreatic islet glucose-transporter cDNA

    International Nuclear Information System (INIS)

    Permutt, M.A.; Koranyi, L.; Keller, K.; Lacy, P.E.; Scharp, D.W.; Mueckler, M.

    1989-01-01

    Previous studies have suggested that pancreatic islet glucose transport is mediated by a high-K m , low-affinity facilitated transporter similar to that expressed in liver. To determine the relationship between islet and liver glucose transporters, liver-type glucose-transporter cDNA clones were isolated from a human liver cDNA library. The liver-type glucose-transporter cDNA clone hybridized to mRNA transcripts of the same size in human liver and pancreatic islet RNA. A cDNA library was prepared from purified human pancreatic islet tissue and screened with human liver-type glucose-transporter cDNA. The authors isolated two overlapping cDNA clones encompassing 2600 base pairs, which encode a pancreatic islet protein identical in sequence to that of the putative liver-type glucose-transporter protein. Xenopus oocytes injected with synthetic mRNA transcribed from a full-length cDNA construct exhibited increased uptake of 2-deoxyglucose, confirming the functional identity of the clone. These cDNA clones can now be used to study regulation of expression of the gene and to assess the role of inherited defects in this gene as a candidate for inherited susceptibility to non-insulin-dependent diabetes mellitus

  20. Laser capture microdissection and cDNA array analysis of endometrium identify CCL16 and CCL21 as epithelial-derived inflammatory mediators associated with endometriosis

    Directory of Open Access Journals (Sweden)

    Jones Rebecca L

    2007-05-01

    Full Text Available Abstract Background Understanding the pathophysiology of chemokine secretion in endometriosis may offer a novel area of therapeutic intervention. This study aimed to identify chemokines differentially expressed in epithelial glands in eutopic endometrium from normal women and those with endometriosis, and to establish the expression profiles of key chemokines in endometriotic lesions. Methods Laser capture microdissection isolated epithelial glands from endometrial eutopic tissue from women with and without endometriosis in the mid-secretory phase of their menstrual cycles. Gene profiling of the excised glands used a human chemokine and receptor cDNA array. Selected chemokines were further examined using real-time PCR and immunohistochemistry. Results 22 chemokine/receptor genes were upregulated and two downregulated in pooled endometrial epithelium of women with endometriosis compared with controls. CCL16 and CCL21 mRNA was confirmed as elevated in some women with endometriosis compared to controls on individual samples. Immunoreactive CCL16 and CCL21 were predominantly confined to glands in eutopic and ectopic endometrium: leukocytes also stained. Immunoreactive CCL16 was overall higher in glands in ectopic vs. eutopic endometrium from the same woman (P Conclusion This study provides novel candidate molecules and suggests a potential local role for CCL16 and CCL21 as mediators contributing to the inflammatory events associated with endometriosis.

  1. Restriction landmark cDNA scanning (RLCS): a novel cDNA display system using two-dimensional gel electrophoresis.

    Science.gov (United States)

    Suzuki, H; Yaoi, T; Kawai, J; Hara, A; Kuwajima, G; Wantanabe, S

    1996-01-01

    We have developed a new method, designated restriction landmark cDNA scanning (RLCS), which displays many cDNA species quantitatively and simultaneously as two-dimensional gel spots. In this method cDNA species of uniform length were prepared for each mRNA species using restriction enzymes. After the restriction enzyme sites were radiolabeled as landmarks, the labeled fragments were subjected to high resolution two-dimensional gel electrophoresis. In analyses of cDNA samples from adult mouse liver and brain (cerebral cortex, cerebellum and brain stem) we detected approximately 500 and >1000 discrete gel spots respectively of various intensities at a time. The spot patterns of the three brain regions were very similar, although not identical, but were quite different from the pattern for the liver. RNA blot hybridization analysis using several cloned spot DNAs as probes showed that differences in intensity of the spots among RLCS profiles correlated well with expression levels of the corresponding mRNA species in the brain regions. Because the spots and their intensities reflect distinct mRNA species and their expression level respectively, the RLCS is a novel cDNA display system which provides a great deal of information and should be useful for systematic documentation of differentially expressed genes. PMID:8628652

  2. Cloning of a nitrate reductase inactivator (NRI) cDNA from Spinacia oleracea L. and expression of mRNA and protein of NRI in cultured spinach cells.

    Science.gov (United States)

    Sonoda, Masatoshi; Ide, Hiroaki; Nakayama, Shinya; Sasaki, Asako; Kitazaki, Shinei; Sato, Takahide; Nakagawa, Hiroki

    2003-04-01

    The spinach ( Spinacia oleracea L. (cv. Hoyo) nitrate reductase inactivator (NRI) is a novel protein that irreversibly inactivates NR. Using degenerate primers based on an N-terminal amino acid sequence of NRI purified from spinach leaves and a cDNA library, we isolated a full-length NRI cDNA from spinach that contains an open reading frame encoding 479 amino acid residues. This protein shares 67.4% and 51.1-68.3% amino acid sequence similarities with a nucleotide pyrophosphatase (EC 3.6.1.9) from rice and three types of the nucleotide pyrophosphatase-like protein from Arabidopsis thaliana, respectively. Immunoblot analysis revealed that NRI was constitutively expressed in suspension-cultured spinach cells; however, its expression level is quite low in 1-day-subcultured cells. Moreover, northern blot analysis indicated that this expression was regulated at the mRNA level. These results suggest that NRI functions in mature cells.

  3. Cloning of a cDNA that encodes farnesyl diphosphate synthase and the blue-light-induced expression of the corresponding gene in the leaves of rice plants.

    Science.gov (United States)

    Sanmiya, K; Iwasaki, T; Matsuoka, M; Miyao, M; Yamamoto, N

    1997-02-28

    A cDNA encoding farnesyl diphosphate synthase (FPPS), a key enzyme in isoprenoid biosynthesis, was isolated from a cDNA library constructed from mRNA that had been prepared from etiolated rice (Oriza sativa L. variety Nipponbare) seedlings after three hours of illumination by a subtraction method. The putative polypeptide deduced from the 1289 bp nucleotide sequence consisted of 353 amino acids and had a molecular mass of 40 676 Da. The predicted amino acid sequence exhibited high homology to those of FPPS from Arabidopsis (73% to type 1, 72% to type 2) and white lupin (74%). Southern blot analysis showed that the rice genome might contain only one gene for FPPS. The highest level of expression of the gene was demonstrated in leaves by RNA blot analysis. Moreover, light, in particular blue light, effectively enhanced expression of the gene.

  4. Characterization of MMP-9 gene from a normalized cDNA library of kidney tissue of yellow catfish (Pelteobagrus fulvidraco).

    Science.gov (United States)

    Ke, Fei; Wang, Yun; Hong, Jun; Xu, Chen; Chen, Huan; Zhou, Shuai-Bang

    2015-08-01

    Matrix metalloproteinase-9 (MMP-9), one of members of the MMP family, is important for the cleaving of structural extracellular matrix (ECM) molecules and involved in inflammatory processes. In this study, MMP-9 cDNA was isolated and characterized from a normalized cDNA library of kidney tissue of yellow catfish (designated as YcMMP-9). The complete sequence of YcMMP-9 cDNA consisted of 2561 nucleotides. The open reading frame potentially encoded a protein of 685 amino acids with a calculated molecular mass of approximately 77.182 kDa. Amino acid sequence of YcMMP-9 have typical characteristics of MMP-9 family and showed highest identity (85.3%) to channel catfish MMP-9. The YcMMP-9 genomic DNA contains 13 exons and 12 introns. Quantitative RT-PCR (qRT-PCR) analysis showed that YcMMP-9 mRNA was constitutively expressed in all examined tissues in normal fish with high expression in head kidney, trunk kidney, blood, and spleen. However, expression of YcMMP-9 mRNA was induced by Aeromonas hydrophila stimulation, especially in these four tissues mentioned above. It indicated that YcMMP-9 was involved in innate immune responses against bacterial infection. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. [cDNA library constructing and specific antigen expression of Streptomyces thermohydroscopicus].

    Science.gov (United States)

    Xu, Lei; Wang, Ling-ling; Liu, Shuo; Ling, Yuan; Ma, Lie; Wang, Qun; Zhang, Li-jiao; He, Xiao-yu; Zhao, Ming-jing; Wang, Xiao-ge

    2012-03-01

    To construct a cDNA library from Streptomyces thermohydroscopicus and screen genes with virulence, obtain the recombinant fusion virulence proteins by prokaryotic expression system. The Streptomyces thermohydroscopicus cDNA library was constructed by switching mechanism at 5'end of RNA transcript approach. A total of 1020 clones randomly selected from the cDNA library were sequenced and these expressed sequence tags (EST) were further analyzed for the screen of antigen-specific genes. The two candidate genes were subcloned into expression vector pET-28a. The recombinants were transformed into BL2 and proteins were expressed by the induction of isopropyl-β-D-1-thiogalactopyranoside (IPTG). A high-quality cDNA library from Streptomyces thermohydroscopicus was constructed and a set of 978 valid sequences were obtained. Clustering and assembly of these cDNA sequences resulted in 347 unique genes, among which 2 potential antigen-specific genes were highly allied with outer membrane lipoprotein (51%) and transferring-binding protein B (42%) from Actinobacillus pleuropneumoniae serotype (APP). The open reading frame (ORF) of the two candidate genes are 1554 bp and 726 bp, which coded two peptides with 517 and 241 amino acids, respectively. The molecular weights of the recombinant fusion proteins were 63 000 and 30 000. The cDNA library of Streptomyces thermohydroscopicus reached the quality requirement of gene library. EST database in the library would greatly facilitate further screening of virulence genes.

  6. cDNA cloning of human DNA topoisomerase I. Catalytic activity of a 67.7-kDa carboxyl-terminal fragment

    International Nuclear Information System (INIS)

    D'Arpa, P.; Machlin, P.S.; Ratrie, H. III; Rothfield, N.F.; Cleveland, D.W.; Earnshaw, W.C.

    1988-01-01

    cDNA clones encoding human topoisomerase I were isolated from an expression vector library (λgt11) screened with autoimmune anti-topoisomerase I serum. One of these clones has been expressed as a fusion protein comprised of a 32-kDa fragment of the bacterial TrpE protein linked to 67.7 kDa of protein encoded by the cDNA. Three lines of evidence indicate that the cloned cDNA encodes topoisomerase I. (i) Proteolysis maps of the fusion protein and human nuclear topoisomerase I are essentially identical. (ii) The fusion protein relaxes supercoiled DNA, an activity that can be immunoprecipitated by anti-topoisomerase I serum. (iii) Sequence analysis has revealed that the longest cDNA clone (3645 base pairs) encodes a protein of 765 amino acids that shares 42% identity with Saccharomyces cerevisiae topoisomerase I. The sequence data also show that the catalytically active 67.7-kDa fragment is comprised of the carboxyl terminus

  7. Full-length cDNA cloning of Toll-like receptor 4 in dogs and cats.

    Science.gov (United States)

    Asahina, Yuka; Yoshioka, Noriyuki; Kano, Rui; Moritomo, Tadaaki; Hasegawa, Atsuhiko

    2003-12-15

    In the present study, full length of canine and feline Toll-like receptor 4 (TLR4) cDNAs were sequenced, and the expression of canine and feline TLR4 mRNAs in dog and cat tissues were investigated. The full-length cDNA of TLR4 of dog and cat was 2709 bp encoding 637 amino acids and 3113 bp encoding 833 amino acids, respectively. The similarity of canine and feline TLR4 were 83.6% at the nucleotide sequence level and 77.6% at the amino acid sequence level. At the amino acid sequence level, canine and feline TLR4 showed sequence similarities of approximately 62-78% with those of Homo sapiens, Mus musculus, Bos taurus and Equus caballus, respectively. Southern hybridization analyses with TLR4 cDNA probes gave one distinct band in BamHI, EcoRI and HindIII digests of genomic DNA from dogs and cats, respectively, indicating the likely presence of a single TLR4 gene in each species. By RT-PCR analysis, mRNA of canine TLR4 was expressed highly in peripheral blood leukocytes (PBL), moderately in spleen, stomach and small intestine, at low levels in liver, with no expression in kidney, large intestine and skin. On the other hand, mRNA of feline TLR4 was expressed highly in lung, bladder and PBL, moderately in kidney, liver, spleen and large intestine and at low levels in pancreas and small intestine.

  8. RNA-Seq analysis of Cocos nucifera: transcriptome sequencing and de novo assembly for subsequent functional genomics approaches.

    Science.gov (United States)

    Fan, Haikuo; Xiao, Yong; Yang, Yaodong; Xia, Wei; Mason, Annaliese S; Xia, Zhihui; Qiao, Fei; Zhao, Songlin; Tang, Haoru

    2013-01-01

    Cocos nucifera (coconut), a member of the Arecaceae family, is an economically important woody palm grown in tropical regions. Despite its agronomic importance, previous germplasm assessment studies have relied solely on morphological and agronomical traits. Molecular biology techniques have been scarcely used in assessment of genetic resources and for improvement of important agronomic and quality traits in Cocos nucifera, mostly due to the absence of available sequence information. To provide basic information for molecular breeding and further molecular biological analysis in Cocos nucifera, we applied RNA-seq technology and de novo assembly to gain a global overview of the Cocos nucifera transcriptome from mixed tissue samples. Using Illumina sequencing, we obtained 54.9 million short reads and conducted de novo assembly to obtain 57,304 unigenes with an average length of 752 base pairs. Sequence comparison between assembled unigenes and released cDNA sequences of Cocos nucifera and Elaeis guineensis indicated that the assembled sequences were of high quality. Approximately 99.9% of unigenes were novel compared to the released coconut EST sequences. Using BLASTX, 68.2% of unigenes were successfully annotated based on the Genbank non-redundant (Nr) protein database. The annotated unigenes were then further classified using the Gene Ontology (GO), Clusters of Orthologous Groups (COG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Our study provides a large quantity of novel genetic information for Cocos nucifera. This information will act as a valuable resource for further molecular genetic studies and breeding in coconut, as well as for isolation and characterization of functional genes involved in different biochemical pathways in this important tropical crop species.

  9. Molecular cloning, sequence analysis and structure prediction of the ...

    African Journals Online (AJOL)

    Molecular cloning, sequence analysis and structure prediction of the related to b 0,+ amino acid transporter (rBAT) in Cyprinus carpio L. ... The amplified product was 2370 bp, including a 42 bp 5'-untranslated region, a 288 bp 3'-untranslated region, and a 2040 bp open reading frame (ORF), which encoded 679 amino acids ...

  10. Sequence analysis corresponding to the PPE and PE proteins in ...

    Indian Academy of Sciences (India)

    Unknown

    AB repeats; Mycobacterium tuberculosis genome; PE-PPE domain; PPE, PE proteins; sequence analysis; surface antigens. J. Biosci. | Vol. ... bacterium tuberculosis genomes resulted in the identification of a previously uncharacterized 225 amino acid- ...... Vega Lopez F, Brooks L A, Dockrell H M, De Smet K A,. Thompson ...

  11. Sequence analysis corresponding to the PPE and PE proteins in ...

    Indian Academy of Sciences (India)

    Unknown

    Amino acid sequence analysis corresponding to the PE proteins resulted in the identification of tandem repeats comprising 41–43 amino acid ..... Q9RAP8. Leptospira interrogans serovar ictero- haemorrhagiae-B. ORFC protein. 5. P71003. Bacillus subtilis-B. Hypothetical 49⋅4 kDa protein. 5. ALL2941. Anabaena sp.

  12. Sequence symmetry analysis in pharmacovigilance and pharmacoepidemiologic studies

    DEFF Research Database (Denmark)

    Lai, Edward Chia Cheng; Pratt, Nicole; Hsieh, Cheng Yang

    2017-01-01

    Sequence symmetry analysis (SSA) is a method for detecting adverse drug events by utilizing computerized claims data. The method has been increasingly used to investigate safety concerns of medications and as a pharmacovigilance tool to identify unsuspected side effects. Validation studies have i...

  13. Inter simple sequence repeat analysis of genetic diversity of five ...

    African Journals Online (AJOL)

    This paper studied the genetic diversity of five cultivated pepper species using inter simple sequence repeat (ISSR) analysis. The amplicons of 13 out of 15 designed primers were stable polymorphic and therefore were used as genetic biomarkers. 135 total clear bands were obtained, of which 102 were polymorphic bands ...

  14. Molecular cloning, expression analysis and sequence prediction of ...

    African Journals Online (AJOL)

    CCAAT/enhancer-binding protein beta as an essential transcriptional factor, regulates the differentiation of adipocytes and the deposition of fat. Herein, we cloned the whole open reading frame (ORF) of bovine C/EBPβ gene and analyzed its putative protein structures via DNA cloning and sequence analysis. Then, the ...

  15. Sequence analysis of mitochondrial 16S ribosomal RNA gene ...

    Indian Academy of Sciences (India)

    Unknown

    Sequence analysis of mitochondrial 16S ribosomal RNA gene fragment from seven mosquito species. YOGESH S SHOUCHE* and MILIND S PATOLE. National Center for Cell Science, Pune University Campus, Pune 411 007, India. *Corresponding author (Fax, 91-20-5672259; Email, yogesh@nccs.res.in). Mosquitoes are ...

  16. Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.

    Science.gov (United States)

    Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier

    2014-01-01

    Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, the sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) are the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationship using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.

  17. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis

    OpenAIRE

    Paniego Norma; Hopp H Esteban; Fernandez Luis; Di Rienzo Julio; Fernandez Paula; Heinz Ruth A

    2008-01-01

    Abstract Background Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses as drought, low temperatures and salinity arises as one of the main constrains nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags) were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profi...

  18. The SCALE criticality safety analysis sequences: Status and future directions

    International Nuclear Information System (INIS)

    Parks, C.V.

    1993-01-01

    The Standardized Computer Analyses for Licensing Evaluation (SCALE) code system. Was originally conceived and developed in the late 1970s for the US Nuclear Regulatory Commission. The goal was to provide easy-to-use, yet accurate, analysis capabilities for use in evaluating the criticality safety, shielding, and heat transfer aspects of transportation packages for radioactive material. The Criticality Safety Analysis Sequences (CSAS) for SCALE were developed to ''automate'' problem-dependent cross-section and material processing prior to execution of the wellestablished XSDRNPM or KENO codes for calculation of k eff . The criticality analysis sequences provided in SCALE-4 are summarized. The SCALE system continues to be maintained and enhanced by staff of the Computing Applications Division at Oak Ridge National Laboratory (ORNL). The purpose of this paper is to discuss recent work to improve system portability and user interfaces and to provide information on ongoing work to enhance the analysis capabilities

  19. Complete genome sequence analysis of chicken astrovirus isolate from India.

    Science.gov (United States)

    Patel, Amrutlal K; Pandit, Ramesh J; Thakkar, Jalpa R; Hinsu, Ankit T; Pandey, Vinod C; Pal, Joy K; Prajapati, Kantilal S; Jakhesara, Subhash J; Joshi, Chaitanya G

    2017-03-01

    Chicken astroviruses have been known to cause severe disease in chickens leading to increased mortality and "white chicks" condition. Here we aim to characterize the causative agent of visceral gout suspected for astrovirus infection in broiler breeder chickens. Total RNA isolated from allantoic fluid of SPF embryo passaged with infected chicken sample was sequenced by whole genome shotgun sequencing using ion-torrent PGM platform. The sequence was analysed for the presence of coding and non-coding features, its similarity with reported isolates and epitope analysis of capsid structural protein. The consensus length of 7513 bp genome sequence of Indian isolate of chicken astrovirus was obtained after assembly of 14,121 high quality reads. The genome was comprised of 13 bp 5'-UTR, three open reading frames (ORFs) including ORF1a encoding serine protease, ORF1b encoding RNA dependent RNA polymerase (RdRp) and ORF2 encoding capsid protein, and 298 bp of 3'-UTR which harboured two corona virus stem loop II like "s2m" motifs and a poly A stretch of 19 nucleotides. The genetic analysis of CAstV/INDIA/ANAND/2016 suggested highest sequence similarity of 86.94% with the chicken astrovirus isolate CAstV/GA2011 followed by 84.76% with CAstV/4175 and 74.48%% with CAstV/Poland/G059/2014 isolates. The capsid structural protein of CAstV/INDIA/ANAND/2016 showed 84.67% similarity with chicken astrovirus isolate CAstV/GA2011, 81.06% with CAstV/4175 and 41.18% with CAstV/Poland/G059/2014 isolates. However, the capsid protein sequence showed high degree of sequence identity at nucleotide level (98.64-99.32%) and at amino acids level (97.74-98.69%) with reported sequences of Indian isolates suggesting their common origin and limited sequence divergence. The epitope analysis by SVMTriP identified two unique epitopes in our isolate, seven shared epitopes among Indian isolates and two shared epitopes among all isolates except Poland isolate which carried all distinct epitopes.

  20. Phylogeny and classification of Dickeya based on multilocus sequence analysis.

    Science.gov (United States)

    Marrero, Glorimar; Schneider, Kevin L; Jenkins, Daniel M; Alvarez, Anne M

    2013-09-01

    Bacterial heart rot of pineapple reported in Hawaii in 2003 and reoccurring in 2006 was caused by an undetermined species of Dickeya. Classification of the bacterial strains isolated from infected pineapple to one of the recognized Dickeya species and their phylogenetic relationships with Dickeya were determined by a multilocus sequence analysis (MLSA), based on the partial gene sequences of dnaA, dnaJ, dnaX, gyrB and recN. Individual and concatenated gene phylogenies revealed that the strains form a clade with reference Dickeya sp. isolated from pineapple in Malaysia and are closely related to D. zeae; however, previous DNA-DNA reassociation values suggest that these strains do not meet the genomic threshold for consideration in D. zeae, and require further taxonomic analysis. An analysis of the markers used in this MLSA determined that recN was the best overall marker for resolution of species within Dickeya. Differential intraspecies resolution was observed with the other markers, suggesting that marker selection is important for defining relationships within a clade. Phylogenies produced with gene sequences from the sequenced genomes of strains D. dadantii Ech586, D. dadantii Ech703 and D. zeae Ech1591 did not place the sequenced strains with members of other well-characterized members of their respective species. The average nucleotide identity (ANI) and tetranucleotide frequencies determined for the sequenced strains corroborated the results of the MLSA that D. dadantii Ech586 and D. dadantii Ech703 should be reclassified as Dickeya zeae Ech586 and Dickeya paradisiaca Ech703, respectively, whereas D. zeae Ech1591 should be reclassified as Dickeya chrysanthemi Ech1591.

  1. Purification, characterisation and cDNA cloning of an antimicrobial peptide from Macadamia integrifolia.

    Science.gov (United States)

    Marcus, J P; Goulter, K C; Green, J L; Harrison, S J; Manners, J M

    1997-03-15

    An antimicrobial peptide with no significant amino acid sequence similarity to previously described peptides has been isolated from the nut kernels of Macadcamia integrifolia. The peptide, termed MiAMP1, is highly basic with an estimated pI of 10.1, a mass of 8.1 kDa and contains 76 amino acids including 6 cysteine residues. A cDNA clone containing the entire coding region corresponding to the peptide was obtained. The deduced amino acid sequence of the cDNA indicated a 26-amino-acid signal peptide at the N-terminus of the preprotein. Purified MiAMP1 inhibited the growth of a variety of fungal, oomycete and gram-positive bacterial phytopathogens in vitro. Some pathogens exhibited close to 100% inhibition in less than 1 microM peptide (5 microg/ml). Antimicrobial activity was diminished against most, but not all, microbes in the presence of calcium and potassium chloride salts (1 mM and 50 mM, respectively). MiAMP1 was active against bakers yeast, was inactive against Escherichia coli and was non-toxic to plant and mammalian cells. Analysis of genomic DNA indicated that MiAMP1 was encoded on a single copy gene containing no introns. The MiAMP1 gene may prove useful in genetic manipulations to increase disease resistance in transgenic plants.

  2. Construction of an integrated database to support genomic sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Gilbert, W.; Overbeek, R.

    1994-11-01

    The central goal of this project is to develop an integrated database to support comparative analysis of genomes including DNA sequence data, protein sequence data, gene expression data and metabolism data. In developing the logic-based system GenoBase, a broader integration of available data was achieved due to assistance from collaborators. Current goals are to easily include new forms of data as they become available and to easily navigate through the ensemble of objects described within the database. This report comments on progress made in these areas.

  3. Cloning and sequence analysis of benzo-a-pyreneinducible ...

    African Journals Online (AJOL)

    Abstract. Polycyclic aromatic hydrocarbons (PAHs), dioxins, dibenzofurans and polychlorinated biphenyls (PCBs) present in polluted environment induce cytochrome P4501A ... The full-length cDNA was 2530 bp long and contained an open reading frame of 1566 bp encoding a protein of 521 amino acids and a stop codon.

  4. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Science.gov (United States)

    Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

    2009-01-01

    Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new EST collection denotes an

  5. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Directory of Open Access Journals (Sweden)

    Alamar Santiago

    2009-09-01

    Full Text Available Abstract Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new

  6. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus.

    Science.gov (United States)

    Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

    2009-09-11

    Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. The new EST collection denotes an important step towards the

  7. Analysis of Sequence Diagram Layout in Advanced UML Modelling Tools

    Directory of Open Access Journals (Sweden)

    Ņikiforova Oksana

    2016-05-01

    Full Text Available System modelling using Unified Modelling Language (UML is the task that should be solved for software development. The more complex software becomes the higher requirements are stated to demonstrate the system to be developed, especially in its dynamic aspect, which in UML is offered by a sequence diagram. To solve this task, the main attention is devoted to the graphical presentation of the system, where diagram layout plays the central role in information perception. The UML sequence diagram due to its specific structure is selected for a deeper analysis on the elements’ layout. The authors research represents the abilities of modern UML modelling tools to offer automatic layout of the UML sequence diagram and analyse them according to criteria required for the diagram perception.

  8. Evolutionary analysis of hepatitis C virus gene sequences from 1953

    Science.gov (United States)

    Gray, Rebecca R.; Tanaka, Yasuhito; Takebe, Yutaka; Magiorkinis, Gkikas; Buskell, Zelma; Seeff, Leonard; Alter, Harvey J.; Pybus, Oliver G.

    2013-01-01

    Reconstructing the transmission history of infectious diseases in the absence of medical or epidemiological records often relies on the evolutionary analysis of pathogen genetic sequences. The precision of evolutionary estimates of epidemic history can be increased by the inclusion of sequences derived from ‘archived’ samples that are genetically distinct from contemporary strains. Historical sequences are especially valuable for viral pathogens that circulated for many years before being formally identified, including HIV and the hepatitis C virus (HCV). However, surprisingly few HCV isolates sampled before discovery of the virus in 1989 are currently available. Here, we report and analyse two HCV subgenomic sequences obtained from infected individuals in 1953, which represent the oldest genetic evidence of HCV infection. The pairwise genetic diversity between the two sequences indicates a substantial period of HCV transmission prior to the 1950s, and their inclusion in evolutionary analyses provides new estimates of the common ancestor of HCV in the USA. To explore and validate the evolutionary information provided by these sequences, we used a new phylogenetic molecular clock method to estimate the date of sampling of the archived strains, plus the dates of four more contemporary reference genomes. Despite the short fragments available, we conclude that the archived sequences are consistent with a proposed sampling date of 1953, although statistical uncertainty is large. Our cross-validation analyses suggest that the bias and low statistical power observed here likely arise from a combination of high evolutionary rate heterogeneity and an unstructured, star-like phylogeny. We expect that attempts to date other historical viruses under similar circumstances will meet similar problems. PMID:23938759

  9. cDNA library generation from ribonucleoprotein particles.

    Science.gov (United States)

    Rederstorff, Mathieu; Hüttenhofer, Alexander

    2011-02-01

    Most, if not all, known noncoding RNAs (ncRNAs) are associated with RNA binding proteins, thus forming ribonucleoprotein particles or RNPs. Here we describe a protocol for the generation of a specialized cDNA library from RNPs, thereby increasing the proportion of functional ncRNA species in the library. To that end, cellular extracts are fractionated on 10-30% glycerol gradients. Subsequently, RNP-derived ncRNAs are isolated and 3'-tailed by cytidine triphosphate and poly(A) polymerase; this is followed by 5' adapter ligation by T4 RNA ligase. Reverse transcription of ncRNAs into cDNAs is carried out with an oligo-d(G) anchor primer. The generated cDNA libraries are subsequently submitted to high-throughput sequencing. This RNP selection procedure increases the probability of the presence of biologically relevant ncRNA species in the library compared with libraries generation methods that use size-selected, protein-devoid ncRNAs. The protocol enables the generation of deep-sequencing-compatible cDNA libraries that code for functional ncRNAs within 1 week.

  10. A general sequence processing and analysis program for protein engineering.

    Science.gov (United States)

    Stafford, Ryan L; Zimmerman, Erik S; Hallam, Trevor J; Sato, Aaron K

    2014-10-27

    Protein engineering projects often amass numerous raw DNA sequences, but no readily available software combines sequence processing and activity correlation required for efficient lead identification. XLibraryDisplay is an open source program integrated into Microsoft Excel for Windows that automates batch sequence processing via a simple step-by-step, menu-driven graphical user interface. XLibraryDisplay accepts any DNA template which is used as a basis for trimming, filtering, translating, and aligning hundreds to thousands of sequences (raw, FASTA, or Phred PHD file formats). Key steps for library characterization through lead discovery are available including library composition analysis, filtering by experimental data, graphing and correlating to experimental data, alignment to structural data extracted from PDB files, and generation of PyMOL visualization scripts. Though larger data sets can be handled, the program is best suited for analyzing approximately 10 000 or fewer leads or naïve clones which have been characterized using Sanger sequencing and other experimental approaches. XLibraryDisplay can be downloaded for free from sourceforge.net/projects/xlibrarydisplay/ .

  11. Now and next-generation sequencing techniques: future of sequence analysis using cloud computing.

    Science.gov (United States)

    Thakur, Radhe Shyam; Bandopadhyay, Rajib; Chaudhary, Bratati; Chatterjee, Sourav

    2012-01-01

    Advances in the field of sequencing techniques have resulted in the greatly accelerated production of huge sequence datasets. This presents immediate challenges in database maintenance at datacenters. It provides additional computational challenges in data mining and sequence analysis. Together these represent a significant overburden on traditional stand-alone computer resources, and to reach effective conclusions quickly and efficiently, the virtualization of the resources and computation on a pay-as-you-go concept (together termed "cloud computing") has recently appeared. The collective resources of the datacenter, including both hardware and software, can be available publicly, being then termed a public cloud, the resources being provided in a virtual mode to the clients who pay according to the resources they employ. Examples of public companies providing these resources include Amazon, Google, and Joyent. The computational workload is shifted to the provider, which also implements required hardware and software upgrades over time. A virtual environment is created in the cloud corresponding to the computational and data storage needs of the user via the internet. The task is then performed, the results transmitted to the user, and the environment finally deleted after all tasks are completed. In this discussion, we focus on the basics of cloud computing, and go on to analyze the prerequisites and overall working of clouds. Finally, the applications of cloud computing in biological systems, particularly in comparative genomics, genome informatics, and SNP detection are discussed with reference to traditional workflows.

  12. Now And Next Generation Sequencing Techniques: Future of Sequence Analysis using Cloud Computing

    Directory of Open Access Journals (Sweden)

    Radhe Shyam Thakur

    2012-12-01

    Full Text Available Advancements in the field of sequencing techniques resulted in the huge sequenced data to be produced at a very faster rate. It is going cumbersome for the datacenter to maintain the databases. Data mining and sequence analysis approaches needs to analyze the databases several times to reach any efficient conclusion. To cope with such overburden on computer resources and to reach efficient and effective conclusions quickly, the virtualization of the resources and computation on pay as you go concept was introduced and termed as cloud computing. The datacenter’s hardware and software is collectively known as cloud which when available publicly is termed as public cloud. The datacenter’s resources are provided in a virtual mode to the clients via a service provider like Amazon, Google and Joyent which charges on pay as you go manner. The workload is shifted to the provider which is maintained by the required hardware and software upgradation. The service provider manages it by upgrading the requirements in the virtual mode. Basically a virtual environment is created according to the need of the user by taking permission from datacenter via internet, the task is performed and the environment is deleted after the task is over. In this discussion, we are focusing on the basics of cloud computing, the prerequisites and overall working of clouds. Furthermore, briefly the applications of cloud computing in biological systems, especially in comparative genomics, genome informatics and SNP detection with reference to traditional workflow are discussed.

  13. SEQUENCING AND SEQUENCE ANALYSIS OF MYOSTATIN GENE IN THE EXON 1 OF THE CAMEL (CAMELUS DROMEDARIUS

    Directory of Open Access Journals (Sweden)

    M. G. SHAH, A. S. QURESHI1, M. REISSMANN2 AND H. J. SCHWARTZ3

    2006-10-01

    Full Text Available Myostatin, also called growth differentiation factor-8 (GDF-8, is a member of the mammalian growth transforming family (TGF-beta superfamily, which is expressed specifically in developing an adult skeletal muscle. Muscular hypertrophy allele (mh allele in the double muscle breeds involved mutation within the myostatin gene. Genomic DNA was isolated from the camel hair using NucleoSpin Tissue kit. Two animals of each of the six breeds namely, Marecha, Dhatti, Larri, Kohi, Sakrai and Cambelpuri were used for sequencing. For PCR amplification of the gene, a primer pair was designed from homolog regions of already published sequences of farm animals from GenBank. Results showed that camel myostatin possessed more than 90% homology with that of cattle, sheep and pig. Camel formed separate cluster from the pig in spite of having high homology (98% and showed 94% homology with cattle and sheep as reported in literature. Sequence analysis of the PCR amplified part of exon 1 (256 bp of the camel myostatin was identical among six camel breeds.

  14. Infrared thermal facial image sequence registration analysis and verification

    Science.gov (United States)

    Chen, Chieh-Li; Jian, Bo-Lin

    2015-03-01

    To study the emotional responses of subjects to the International Affective Picture System (IAPS), infrared thermal facial image sequence is preprocessed for registration before further analysis such that the variance caused by minor and irregular subject movements is reduced. Without affecting the comfort level and inducing minimal harm, this study proposes an infrared thermal facial image sequence registration process that will reduce the deviations caused by the unconscious head shaking of the subjects. A fixed image for registration is produced through the localization of the centroid of the eye region as well as image translation and rotation processes. Thermal image sequencing will then be automatically registered using the two-stage genetic algorithm proposed. The deviation before and after image registration will be demonstrated by image quality indices. The results show that the infrared thermal image sequence registration process proposed in this study is effective in localizing facial images accurately, which will be beneficial to the correlation analysis of psychological information related to the facial area.

  15. Multilocus sequence analysis of Treponema denticola strains of diverse origin

    Science.gov (United States)

    2013-01-01

    Background The oral spirochete bacterium Treponema denticola is associated with both the incidence and severity of periodontal disease. Although the biological or phenotypic properties of a significant number of T. denticola isolates have been reported in the literature, their genetic diversity or phylogeny has never been systematically investigated. Here, we describe a multilocus sequence analysis (MLSA) of 20 of the most highly studied reference strains and clinical isolates of T. denticola; which were originally isolated from subgingival plaque samples taken from subjects from China, Japan, the Netherlands, Canada and the USA. Results The sequences of the 16S ribosomal RNA gene, and 7 conserved protein-encoding genes (flaA, recA, pyrH, ppnK, dnaN, era and radC) were successfully determined for each strain. Sequence data was analyzed using a variety of bioinformatic and phylogenetic software tools. We found no evidence of positive selection or DNA recombination within the protein-encoding genes, where levels of intraspecific sequence polymorphism varied from 18.8% (flaA) to 8.9% (dnaN). Phylogenetic analysis of the concatenated protein-encoding gene sequence data (ca. 6,513 nucleotides for each strain) using Bayesian and maximum likelihood approaches indicated that the T. denticola strains were monophyletic, and formed 6 well-defined clades. All analyzed T. denticola strains appeared to have a genetic origin distinct from that of ‘Treponema vincentii’ or Treponema pallidum. No specific geographical relationships could be established; but several strains isolated from different continents appear to be closely related at the genetic level. Conclusions Our analyses indicate that previous biological and biophysical investigations have predominantly focused on a subset of T. denticola strains with a relatively narrow range of genetic diversity. Our methodology and results establish a genetic framework for the discrimination and phylogenetic analysis of T

  16. Multilocus sequence analysis of Treponema denticola strains of diverse origin

    Directory of Open Access Journals (Sweden)

    Mo Sisu

    2013-02-01

    Full Text Available Abstract Background The oral spirochete bacterium Treponema denticola is associated with both the incidence and severity of periodontal disease. Although the biological or phenotypic properties of a significant number of T. denticola isolates have been reported in the literature, their genetic diversity or phylogeny has never been systematically investigated. Here, we describe a multilocus sequence analysis (MLSA of 20 of the most highly studied reference strains and clinical isolates of T. denticola; which were originally isolated from subgingival plaque samples taken from subjects from China, Japan, the Netherlands, Canada and the USA. Results The sequences of the 16S ribosomal RNA gene, and 7 conserved protein-encoding genes (flaA, recA, pyrH, ppnK, dnaN, era and radC were successfully determined for each strain. Sequence data was analyzed using a variety of bioinformatic and phylogenetic software tools. We found no evidence of positive selection or DNA recombination within the protein-encoding genes, where levels of intraspecific sequence polymorphism varied from 18.8% (flaA to 8.9% (dnaN. Phylogenetic analysis of the concatenated protein-encoding gene sequence data (ca. 6,513 nucleotides for each strain using Bayesian and maximum likelihood approaches indicated that the T. denticola strains were monophyletic, and formed 6 well-defined clades. All analyzed T. denticola strains appeared to have a genetic origin distinct from that of ‘Treponema vincentii’ or Treponema pallidum. No specific geographical relationships could be established; but several strains isolated from different continents appear to be closely related at the genetic level. Conclusions Our analyses indicate that previous biological and biophysical investigations have predominantly focused on a subset of T. denticola strains with a relatively narrow range of genetic diversity. Our methodology and results establish a genetic framework for the discrimination and phylogenetic

  17. Construction of multiple-epitope tag sequence by PCR for sensitive Western blot analysis.

    OpenAIRE

    Nakajima, K; Yaoita, Y

    1997-01-01

    Epitope tagging is a powerful technique to characterize a recombinantly expressed protein encoded by cDNA without the purification of the protein and the immunization of animals. In some cases, however, the expression of a tagged protein is too low to analyze by Western blot. We have developed a simple method to generate tandem repetitive nucleotide sequence by PCR, which allows us to label a protein of interest with a multiple-epitope tag. When five myc epitopes were attached to vaccinia vir...

  18. Automated sequence analysis of atmospheric oxidation pathways: SEQUENCE version 1.0

    Directory of Open Access Journals (Sweden)

    T. M. Butler

    2009-10-01

    Full Text Available An algorithm for the sequential analysis of the atmospheric oxidation of chemical species using output from a photochemical model is presented. Starting at a "root species", the algorithm traverses all possible reaction sequences which consume this species, and lead, via intermediate products, to final products. The algorithm keeps track of the effects of all of these reactions on their respective reactants and products. Upon completion, the algorithm has built a detailed picture of the effects of the oxidation of the root species on its chemical surroundings. The output of the algorithm can be used to determine product yields, radical recycling fractions, and ozone production potentials of arbitrary chemical species.

  19. Analysis of expressed sequence tags for Frankliniella occidentalis, the western flower thrips.

    Science.gov (United States)

    Rotenberg, D; Whitfield, A E

    2010-08-01

    Thrips are members of the insect order Thysanoptera and Frankliniella occidentalis (the western flower thrips) is the most economically important pest within this order. F. occidentalis is both a direct pest of crops and an efficient vector of plant viruses, including Tomato spotted wilt virus (TSWV). Despite the world-wide importance of thrips in agriculture, there is little knowledge of the F. occidentalis genome or gene functions at this time. A normalized cDNA library was constructed from first instar thrips and 13 839 expressed sequence tags (ESTs) were obtained. Our EST data assembled into 894 contigs and 11 806 singletons (12 700 nonredundant sequences). We found that 31% of these sequences had significant similarity (Eoccidentalis and other thrips species with regards to vital biological processes, studying the mechanism of interactions with the viruses harboured and transmitted by the vector, and identifying new insect gene-centred targets for plant disease and insect control.

  20. A stochastic model for EEG microstate sequence analysis.

    Science.gov (United States)

    Gärtner, Matthias; Brodbeck, Verena; Laufs, Helmut; Schneider, Gaby

    2015-01-01

    The analysis of spontaneous resting state neuronal activity is assumed to give insight into the brain function. One noninvasive technique to study resting state activity is electroencephalography (EEG) with a subsequent microstate analysis. This technique reduces the recorded EEG signal to a sequence of prototypical topographical maps, which is hypothesized to capture important spatio-temporal properties of the signal. In a statistical EEG microstate analysis of healthy subjects in wakefulness and three stages of sleep, we observed a simple structure in the microstate transition matrix. It can be described with a first order Markov chain in which the transition probability from the current state (i.e., map) to a different map does not depend on the current map. The resulting transition matrix shows a high agreement with the observed transition matrix, requiring only about 2% of mass transport (1/2 L1-distance). In the second part, we introduce an extended framework in which the simple Markov chain is used to make inferences on a potential underlying time continuous process. This process cannot be directly observed and is therefore usually estimated from discrete sampling points of the EEG signal given by the local maxima of the global field power. Therefore, we propose a simple stochastic model called sampled marked intervals (SMI) model that relates the observed sequence of microstates to an assumed underlying process of background intervals and thus, complements approaches that focus on the analysis of observable microstate sequences. Copyright © 2014 Elsevier Inc. All rights reserved.

  1. Congruence analysis of point clouds from unstable stereo image sequences

    Directory of Open Access Journals (Sweden)

    C. Jepping

    2014-06-01

    Full Text Available This paper deals with the correction of exterior orientation parameters of stereo image sequences over deformed free-form surfaces without control points. Such imaging situation can occur, for example, during photogrammetric car crash test recordings where onboard high-speed stereo cameras are used to measure 3D surfaces. As a result of such measurements 3D point clouds of deformed surfaces are generated for a complete stereo sequence. The first objective of this research focusses on the development and investigation of methods for the detection of corresponding spatial and temporal tie points within the stereo image sequences (by stereo image matching and 3D point tracking that are robust enough for a reliable handling of occlusions and other disturbances that may occur. The second objective of this research is the analysis of object deformations in order to detect stable areas (congruence analysis. For this purpose a RANSAC-based method for congruence analysis has been developed. This process is based on the sequential transformation of randomly selected point groups from one epoch to another by using a 3D similarity transformation. The paper gives a detailed description of the congruence analysis. The approach has been tested successfully on synthetic and real image data.

  2. Germacrene C synthase from Lycopersicon esculentum cv. VFNT Cherry tomato: cDNA isolation, characterization, and bacterial expression of the multiple product sesquiterpene cyclase

    Science.gov (United States)

    Colby, Sheila M.; Crock, John; Dowdle-Rizzo, Barbara; Lemaux, Peggy G.; Croteau, Rodney

    1998-01-01

    Germacrene C was found by GC-MS and NMR analysis to be the most abundant sesquiterpene in the leaf oil of Lycopersicon esculentum cv. VFNT Cherry, with lesser amounts of germacrene A, guaia-6,9-diene, germacrene B, β-caryophyllene, α-humulene, and germacrene D. Soluble enzyme preparations from leaves catalyzed the divalent metal ion-dependent cyclization of [1-3H]farnesyl diphosphate to these same sesquiterpene olefins, as determined by radio-GC. To obtain a germacrene synthase cDNA, a set of degenerate primers was constructed based on conserved amino acid sequences of related terpenoid cyclases. With cDNA prepared from leaf epidermis-enriched mRNA, these primers amplified a 767-bp fragment that was used as a hybridization probe to screen the cDNA library. Thirty-one clones were evaluated for functional expression of terpenoid cyclase activity in Escherichia coli by using labeled geranyl, farnesyl, and geranylgeranyl diphosphates as substrates. Nine cDNA isolates expressed sesquiterpene synthase activity, and GC-MS analysis of the products identified germacrene C with smaller amounts of germacrene A, B, and D. None of the expressed proteins was active with geranylgeranyl diphosphate; however, one truncated protein converted geranyl diphosphate to the monoterpene limonene. The cDNA inserts specify a deduced polypeptide of 548 amino acids (Mr = 64,114), and sequence comparison with other plant sesquiterpene cyclases indicates that germacrene C synthase most closely resembles cotton δ-cadinene synthase (50% identity). PMID:9482865

  3. Comparative genomic mapping of the bovine Fragile Histidine Triad (FHIT tumour suppressor gene: characterization of a 2 Mb BAC contig covering the locus, complete annotation of the gene, analysis of cDNA and of physiological expression profiles

    Directory of Open Access Journals (Sweden)

    Boussaha Mekki

    2006-05-01

    Full Text Available Abstract Background The Fragile Histidine Triad gene (FHIT is an oncosuppressor implicated in many human cancers, including vesical tumors. FHIT is frequently hit by deletions caused by fragility at FRA3B, the most active of human common fragile sites, where FHIT lays. Vesical tumors affect also cattle, including animals grazing in the wild on bracken fern; compounds released by the fern are known to induce chromosome fragility and may trigger cancer with the interplay of latent Papilloma virus. Results The bovine FHIT was characterized by assembling a contig of 78 BACs. Sequence tags were designed on human exons and introns and used directly to select bovine BACs, or compared with sequence data in the bovine genome database or in the trace archive of the bovine genome sequencing project, and adapted before use. FHIT is split in ten exons like in man, with exons 5 to 9 coding for a 149 amino acids protein. VISTA global alignments between bovine genomic contigs retrieved from the bovine genome database and the human FHIT region were performed. Conservation was extremely high over a 2 Mb region spanning the whole FHIT locus, including the size of introns. Thus, the bovine FHIT covers about 1.6 Mb compared to 1.5 Mb in man. Expression was analyzed by RT-PCR and Northern blot, and was found to be ubiquitous. Four cDNA isoforms were isolated and sequenced, that originate from an alternative usage of three variants of exon 4, revealing a size very close to the major human FHIT cDNAs. Conclusion A comparative genomic approach allowed to assemble a contig of 78 BACs and to completely annotate a 1.6 Mb region spanning the bovine FHIT gene. The findings confirmed the very high level of conservation between human and bovine genomes and the importance of comparative mapping to speed the annotation process of the recently sequenced bovine genome. The detailed knowledge of the genomic FHIT region will allow to study the role of FHIT in bovine cancerogenesis

  4. CISAPS: Complex Informational Spectrum for the Analysis of Protein Sequences

    Directory of Open Access Journals (Sweden)

    Charalambos Chrysostomou

    2015-01-01

    Full Text Available Complex informational spectrum analysis for protein sequences (CISAPS and its web-based server are developed and presented. As recent studies show, only the use of the absolute spectrum in the analysis of protein sequences using the informational spectrum analysis is proven to be insufficient. Therefore, CISAPS is developed to consider and provide results in three forms including absolute, real, and imaginary spectrum. Biologically related features to the analysis of influenza A subtypes as presented as a case study in this study can also appear individually either in the real or imaginary spectrum. As the results presented, protein classes can present similarities or differences according to the features extracted from CISAPS web server. These associations are probable to be related with the protein feature that the specific amino acid index represents. In addition, various technical issues such as zero-padding and windowing that may affect the analysis are also addressed. CISAPS uses an expanded list of 611 unique amino acid indices where each one represents a different property to perform the analysis. This web-based server enables researchers with little knowledge of signal processing methods to apply and include complex informational spectrum analysis to their work.

  5. Molecular cloning of a cDNA and chromosomal localization of a human theta-class glutathione S-transferase gene (GSTT2) to chromosome 22

    Energy Technology Data Exchange (ETDEWEB)

    Tan, K.L.; Baker, R.T.; Board, P.G. [Australian National Univ., Canberra (Australia)] [and others

    1995-01-20

    Until recently the Theta-class glutathione S-transferases (GSTs) were largely overlooked due to their low activity with the model substrate 1-chloro-2,4-dinitrobenzene (CDNB) and their failure to bind to immobilized glutathione affinity matrices. Little is known about the number of genes in this class. Recently, Pemble et al. reported the cDNA cloning of a human Theta-class GST, termed GSTT1. In this study, we describe the molecular cloning of a cDNA encoding a second human Theta-class GST (GSTT2) from a {lambda}gt11 human liver 5{prime}-stretch cDNA library. The encoded protein contains 244 amino acids and has 78.3% sequence identity with the rat subunit 12 and only 55.0% identity with human GSTT1. GSTT2 has been mapped to chromosome 22 by somatic cell hybrid analysis. The precise position of the gene was localized to subband 22q11.2 by in situ hybridization. The absence of other regions of hybridization suggests that there are no closely related sequences (e.g., reverse transcribed pseudogenes) scattered throughout the genome and that if there are closely related genes, they must be clustered near GSTT2. Southern blot analysis of human DNA digested with BamHI shows that the size of the GSTT2 gene is relatively small, as the coding sequence falls within a 3.6-kb BamHI fragment. 35 refs., 6 figs.

  6. Identification of a cDNA encoding a parathyroid hormone-like peptide from a human tumor associated with humoral hypercalcemia of malignancy

    International Nuclear Information System (INIS)

    Mangin, M.; Webb, A.C.; Dreyer, B.E.

    1988-01-01

    Humoral hypercalcemia of malignancy is a common paraneoplastic syndrome that appears to be mediated in many instances by a parathyroid hormone-like peptide. Poly(A) + RNA from a human renal carcinoma associated with this syndrome was enriched by preparative electrophoresis and used to construct an enriched cDNA library in phage λgt10. The library was screened with a codon-preference oligonucleotide synthesized on the basis of a partial N-terminal amino acid sequence from a human tumor-derived peptide, and a 2.0 kilo-base cDNA was identified. The cDNA encodes a 177 amino acid protein consisting of a 36 amino acid leader sequence and a 141 amino acid mature peptide. The first 13 amino acids of the deduced sequence of the mature peptide display strong homology to human PTH, with complete divergence thereafter. RNA blot-hybridization analysis revealed multiple transcripts in mRNA from tumors associated with the humor syndrome and also in mRNA from normal human keratinocytes. Southern blot analysis of genomic DNA from humans and rodents revealed a simple pattern compatible with a single-copy gene. The gene has been mapped to chromosome 12

  7. Transcriptome analysis of the model protozoan, Tetrahymena thermophila, using Deep RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Jie Xiong

    Full Text Available BACKGROUND: The ciliated protozoan Tetrahymena thermophila is a well-studied single-celled eukaryote model organism for cellular and molecular biology. However, the lack of extensive T. thermophila cDNA libraries or a large expressed sequence tag (EST database limited the quality of the original genome annotation. METHODOLOGY/PRINCIPAL FINDINGS: This RNA-seq study describes the first deep sequencing analysis of the T. thermophila transcriptome during the three major stages of the life cycle: growth, starvation and conjugation. Uniquely mapped reads covered more than 96% of the 24,725 predicted gene models in the somatic genome. More than 1,000 new transcribed regions were identified. The great dynamic range of RNA-seq allowed detection of a nearly six order-of-magnitude range of measurable gene expression orchestrated by this cell. RNA-seq also allowed the first prediction of transcript untranslated regions (UTRs and an updated (larger size estimate of the T. thermophila transcriptome: 57 Mb, or about 55% of the somatic genome. Our study identified nearly 1,500 alternative splicing (AS events distributed over 5.2% of T. thermophila genes. This percentage represents a two order-of-magnitude increase over previous EST-based estimates in Tetrahymena. Evidence of stage-specific regulation of alternative splicing was also obtained. Finally, our study allowed us to completely confirm about 26.8% of the genes originally predicted by the gene finder, to correct coding sequence boundaries and intron-exon junctions for about a third, and to reassign microarray probes and correct earlier microarray data. CONCLUSIONS/SIGNIFICANCE: RNA-seq data significantly improve the genome annotation and provide a fully comprehensive view of the global transcriptome of T. thermophila. To our knowledge, 5.2% of T. thermophila genes with AS is the highest percentage of genes showing AS reported in a unicellular eukaryote. Tetrahymena thus becomes an excellent unicellular

  8. The Arabidopsis Root Transcriptome by Serial Analysis of Gene Expression. Gene Identification Using the Genome Sequence1

    Science.gov (United States)

    Fizames, Cécile; Muños, Stéphane; Cazettes, Céline; Nacry, Philippe; Boucherez, Jossia; Gaymard, Frédéric; Piquemal, David; Delorme, Valérie; Commes, Thérèse; Doumas, Patrick; Cooke, Richard; Marti, Jacques; Sentenac, Hervé; Gojon, Alain

    2004-01-01

    Large-scale identification of genes expressed in roots of the model plant Arabidopsis was performed by serial analysis of gene expression (SAGE), on a total of 144,083 sequenced tags, representing at least 15,964 different mRNAs. For tag to gene assignment, we developed a computational approach based on 26,620 genes annotated from the complete sequence of the genome. The procedure selected warrants the identification of the genes corresponding to the majority of the tags found experimentally, with a high level of reliability, and provides a reference database for SAGE studies in Arabidopsis. This new resource allowed us to characterize the expression of more than 3,000 genes, for which there is no expressed sequence tag (EST) or cDNA in the databases. Moreover, 85% of the tags were specific for one gene. To illustrate this advantage of SAGE for functional genomics, we show that our data allow an unambiguous analysis of most of the individual genes belonging to 12 different ion transporter multigene families. These results indicate that, compared with EST-based tag to gene assignment, the use of the annotated genome sequence greatly improves gene identification in SAGE studies. However, more than 6,000 different tags remained with no gene match, suggesting that a significant proportion of transcripts present in the roots originate from yet unknown or wrongly annotated genes. The root transcriptome characterized in this study markedly differs from those obtained in other organs, and provides a unique resource for investigating the functional specificities of the root system. As an example of the use of SAGE for transcript profiling in Arabidopsis, we report here the identification of 270 genes differentially expressed between roots of plants grown either with NO3- or NH4NO3 as N source. PMID:14730065

  9. Frame to Frame Diffeomorphic Motion Analysis from Echocardiographic Sequences

    OpenAIRE

    Zhang, Zhijun; Sahn, David; Song, Xubo

    2011-01-01

    International audience; Quantitative motion analysis from echocardiography is an important yet challenging problem. We develop a motion estimation algorithm for echocardiographic image sequences based on diffeomorphic image registration in which the velocity field is spatiotemporally smooth. The novelty of this work is that instead of optimizing a functional of velocity field which consists of similarity metrics between a reference image to each of the following images (\\textitfirst-to-follow...

  10. The sequence and analysis of Trypanosoma brucei chromosome II

    OpenAIRE

    El-Sayed, Najib M. A.; Ghedin, Elodie; Song, Jinming; MacLeod, Annette; Bringaud, Frederic; Larkin, Christopher; Wanless, David; Peterson, Jeremy; Hou, Lihua; Taylor, Sonya; Tweedie, Alison; Biteau, Nicolas; Khalak, Hanif G.; Lin, Xiaoying; Mason, Tanya

    2003-01-01

    We report here the sequence of chromosome II from Trypanosoma brucei, the causative agent of African sleeping sickness. The 1.2-Mb pairs encode about 470 predicted genes organised in 17 directional clusters on either strand, the largest cluster of which has 92 genes lined up over a 284-kb region. An analysis of the GC skew reveals strand compositional asymmetries that coincide with the distribution of protein-coding genes, suggesting these asymmetries may be the result of transcription-couple...

  11. Analysis on Response of Dynamic Systems to Pulse Sequences Excitation

    Directory of Open Access Journals (Sweden)

    Xie Lili

    2009-07-01

    Full Text Available Near-fault ground motions with long-period pulses can place severe demands on s