WorldWideScience

Sample records for cdna deep sequencing

  1. cDNA sequence quality data - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Budding yeast cDNA sequencing project: cDNA sequence quality data. DOI: 10.18908/lsdba.nbdc00838-003. Description of data contents: Phred's quality score.
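
    The Phred quality score named in this record relates a score Q to an estimated base-call error probability P by Q = -10 log10(P). A minimal sketch of that conversion follows; the function names are illustrative and are not taken from the archive.

```python
import math


def phred_to_error_prob(q: float) -> float:
    """Estimated probability that a base call with Phred score q is wrong."""
    return 10 ** (-q / 10)


def error_prob_to_phred(p: float) -> float:
    """Phred score corresponding to an error probability p (0 < p <= 1)."""
    return -10 * math.log10(p)


# Each 10-point increase in Q lowers the error probability tenfold.
for q in (10, 20, 30, 40):
    print(q, phred_to_error_prob(q))
```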

  2. Sequence of human protamine 2 cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Domenjoud, L; Fronia, C; Uhde, F; Engel, W [Universitaet Goettingen (West Germany)]

    1988-08-11

    The authors report the cloning and sequencing of a cDNA clone for human protamine 2 (hp2), isolated from a human testis cDNA library cloned in the vector λgt11. A 66-mer oligonucleotide corresponding to an amino acid sequence that is highly conserved between hp2 and mouse protamine 2 (mp2) served as the hybridization probe. The homology between the amino acid sequence deduced from the cDNA and the published amino acid sequence for hp2 is 100%.

  3. Mouse tetranectin: cDNA sequence, tissue-specific expression, and chromosomal mapping

    DEFF Research Database (Denmark)

    Ibaraki, K; Kozak, C A; Wewer, U M

    1995-01-01

    regulation, mouse tetranectin cDNA was cloned from a 16-day-old mouse embryo library. Sequence analysis revealed a 992-bp cDNA with an open reading frame of 606 bp, which is identical in length to the human tetranectin cDNA. The deduced amino acid sequence showed high homology to the human cDNA with 76......(s) of tetranectin. The sequence analysis revealed a difference in both sequence and size of the noncoding regions between mouse and human cDNAs. Northern analysis of the various tissues from mouse, rat, and cow showed the major transcript(s) to be approximately 1 kb, which is similar in size to that observed...

  4. Cloning, sequencing, and expression of cDNA for human β-glucuronidase

    International Nuclear Information System (INIS)

    Oshima, A.; Kyle, J.W.; Miller, R.D.

    1987-01-01

    The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH2-terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.

  5. Sequence of a cDNA encoding turtle high mobility group 1 protein.

    Science.gov (United States)

    Zheng, Jifang; Hu, Bi; Wu, Duansheng

    2005-07-01

    To obtain sequence information about the turtle HMG1 gene, a cDNA encoding the HMG1 protein of the Chinese soft-shell turtle (Pelodiscus sinensis) was amplified by RT-PCR from kidney total RNA, then cloned, sequenced and analyzed. The results revealed that the open reading frame (ORF) of turtle HMG1 cDNA is 606 bp long. The ORF encodes 202 amino acid residues, from which two DNA-binding domains and one polyacidic region are derived. The DNA-binding domains share higher amino acid identity with the homologous sequences of chicken (96.5%) and mammals (74%) than with the homologous sequence of rainbow trout (67%). The polyacidic region shows 84.6% amino acid homology with the equivalent region of chicken HMG1 cDNA. Turtle HMG1 protein contains 3 Cys residues located at completely conserved positions. Conservation in sequence and structure suggests that the functions of turtle HMG1 cDNA may be highly conserved during evolution. To our knowledge, this is the first report of an HMG1 cDNA sequence in any reptile.

  6. The function analysis of full-length cDNA sequence from IRM-2 mouse cDNA library

    International Nuclear Information System (INIS)

    Wang Qin; Liu Xiaoqiu; Xu Chang; Du Liqing; Sun Zhijuan; Wang Yan; Liu Qiang; Song Li; Li Jin; Fan Feiyue

    2013-01-01

    Objective: To identify the function of full-length cDNA sequences from an IRM-2 mouse cDNA library. Methods: Full-length cDNA products were amplified by PCR from the IRM-2 mouse cDNA library based on twenty-one expressed sequence tags. Expression of the full-length cDNAs was measured after mouse embryonic fibroblasts were exposed to 6.5 Gy of γ-ray radiation, and the effect on the growth of radiosensitive AT5B1VA cells transfected with the full-length cDNAs was investigated. Results: Expression of full-length cDNAs No. 4, 5 and 2 from IRM-2 mice was higher than in the parental ICR and 615 mice after mouse embryonic fibroblasts were irradiated with γ-rays, and the survival rate of AT5B1VA cells transfected with full-length cDNAs No. 4, 5 and 2 was high. Conclusion: Full-length cDNAs No. 4, 5 and 2 of the IRM-2 mouse are associated with high radioresistance. (authors)

  7. The cDNA sequence of a neutral horseradish peroxidase.

    Science.gov (United States)

    Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

    1991-02-16

    A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail, and the deduced protein contains 327 amino acids, including a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C), and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines, of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence, three fewer than in the previously described HRP-C. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight several thousand lower than that of the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group, and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, previously undescribed horseradish peroxidase group.

  8. Sequence of a cloned cDNA encoding human ribosomal protein S11

    Energy Technology Data Exchange (ETDEWEB)

    Lott, J B; Mackie, G A

    1988-02-11

    The authors have isolated a cloned cDNA that encodes human ribosomal protein (rp) S11 by screening a human fibroblast cDNA library with a labelled 204 bp DNA fragment encompassing residues 212-416 of pRS11, a rat rp S11 cDNA clone. The human rp S11 cloned cDNA consists of 15 residues of the 5' leader, the entire coding sequence and all 51 residues of the 3' untranslated region. The predicted amino acid sequence of 158 residues is identical to rat rpS11. The nucleotide sequence in the coding region differs, however, from that in rat in the first position in two codons and in the third position in 44 codons.
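
    The codon counts in this abstract imply an approximate nucleotide identity for the coding region, which can be worked out as follows. This is a rough sketch under two assumptions not stated in the abstract: the stop codon is not counted, and each listed codon differs at exactly one position. The function name is illustrative.

```python
def coding_identity(n_residues, differing_codons):
    """Return (coding nucleotides, differing nucleotides, fractional identity).

    Assumes one nucleotide difference per listed codon and excludes the
    stop codon from the coding length.
    """
    coding_nt = 3 * n_residues
    n_diff = sum(differing_codons.values())
    return coding_nt, n_diff, (coding_nt - n_diff) / coding_nt


# 158 residues; differences at the first position in 2 codons and at the
# third position in 44 codons (figures taken from the abstract).
coding_nt, n_diff, identity = coding_identity(158, {"first": 2, "third": 44})
print(coding_nt, n_diff, round(100 * identity, 1))
```

    Because the predicted proteins are identical, all of these differences would be synonymous substitutions.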

  9. cDNA sequences of two inducible T-cell genes

    Energy Technology Data Exchange (ETDEWEB)

    Kwon, B.S. (Indiana Univ. School of Medicine, Indianapolis (USA) Guthrie Research Institute, Sayre, PA (USA)); Weissman, S.M. (Yale Univ., New Haven, CT (USA))

    1989-03-01

    The authors have previously described a set of human T-lymphocyte-specific cDNA clones isolated by a modified differential screening procedure. Apparent full-length cDNAs containing the sequences of 14 of the 16 initial isolates were sequenced and were found to represent five different species of mRNA; three of the five species were identical to previously reported cDNA sequences of preproenkephalin, T-cell-replacing factor, and a serine esterase, respectively. The other two species, 4-1BB and L2G25B, were inducible sequences found in mRNA from both a cytolytic T-lymphocyte and a helper T-lymphocyte clone and were not previously described in T-cell mRNA; these mRNA sequences encode peptides of 256 and 92 amino acids, respectively. Both peptides contain putative leader sequences. The protein encoded by 4-1BB also has a potential membrane anchor segment and other features also seen in known receptor proteins.

  10. cDNA encoding a polypeptide including a hevein sequence

    Science.gov (United States)

    Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

    1995-03-21

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  11. The nucleotide sequence of human transition protein 1 cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Luerssen, H; Hoyer-Fender, S; Engel, W [Universitaet Goettingen (West Germany)]

    1988-08-11

    The authors have screened a human testis cDNA library with an 81-mer oligonucleotide prepared according to part of the published nucleotide sequence of the rat transition protein TP 1. They have isolated a cDNA clone 441 bp in length containing the 162 bp coding region for human transition protein 1. The coding region shows about 84% homology to the rat sequence. The human cDNA clone encodes a polypeptide of 54 amino acids, of which 7 differ from those of rat.

  12. Molecular cloning and nucleotide sequence of cDNA for human liver arginase

    International Nuclear Information System (INIS)

    Haraguchi, Y.; Takiguchi, M.; Amaya, Y.; Kawamoto, S.; Matsuda, I.; Mori, M.

    1987-01-01

    Arginase (EC 3.5.3.1) catalyzes the last step of the urea cycle in the liver of ureotelic animals. Inherited deficiency of the enzyme results in argininemia, an autosomal recessive disorder characterized by hyperammonemia. To facilitate investigation of the enzyme and gene structures and to elucidate the nature of the mutation in argininemia, the authors isolated cDNA clones for human liver arginase. Oligo(dT)-primed and random primer human liver cDNA libraries in λgt11 were screened using isolated rat arginase cDNA as a probe. Two of the positive clones, designated λhARG6 and λhARG109, contained an overlapping cDNA sequence with an open reading frame encoding a polypeptide of 322 amino acid residues (predicted M_r, 34,732), a 5'-untranslated sequence of 56 base pairs, a 3'-untranslated sequence of 423 base pairs, and a poly(A) segment. Arginase activity was detected in Escherichia coli cells transformed with the plasmid carrying the λhARG6 cDNA insert. RNA gel blot analysis of human liver RNA showed a single mRNA of 1.6 kilobases. The predicted amino acid sequence of human liver arginase is 87% and 41% identical with those of the rat liver and yeast enzymes, respectively. There are several highly conserved segments among the human, rat, and yeast enzymes.

  13. cDNA sequences of two apolipoproteins from lamprey

    International Nuclear Information System (INIS)

    Pontes, M.; Xu, X.; Graham, D.; Riley, M.; Doolittle, R.F.

    1987-01-01

    The messages for two small but abundant apolipoproteins found in lamprey blood plasma were cloned with the aid of oligonucleotide probes based on amino-terminal sequences. In both cases, numerous clones were identified in a lamprey liver cDNA library, consistent with the great abundance of these proteins in lamprey blood. One of the cDNAs (LAL1) has a coding region of 105 amino acids that corresponds to a 21-residue signal peptide, a putative 8-residue propeptide, and the 76-residue mature protein found in blood. The other cDNA (LAL2) codes for a total of 191 residues, the first 23 of which constitute a signal peptide. The two proteins, which occur in the high-density lipoprotein fraction of ultracentrifuged plasma, have amino acid compositions similar to those of apolipoproteins found in mammalian blood; computer analysis indicates that the sequences are largely helix-permissive. When the sequences were searched against an amino acid sequence data base, rat apolipoprotein IV was the best matching candidate in both cases. Although a reasonable alignment can be made with that sequence and LAL1, definitive assignment of the two lamprey proteins to typical mammalian classes cannot be made at this point

  14. cDNA sequencing improves the detection of P53 missense mutations in colorectal cancer

    International Nuclear Information System (INIS)

    Szybka, Malgorzata; Kordek, Radzislaw; Zakrzewska, Magdalena; Rieske, Piotr; Pasz-Walczak, Grazyna; Kulczycka-Wojdala, Dominika; Zawlik, Izabela; Stawski, Robert; Jesionek-Kupnicka, Dorota; Liberski, Pawel P

    2009-01-01

    Recently published data showed discrepancies between P53 cDNA and DNA sequencing in glioblastomas. We hypothesised that similar discrepancies may be observed in other human cancers. To this end, we analyzed 23 colorectal cancers for P53 mutations and gene expression using both DNA and cDNA sequencing, real-time PCR and immunohistochemistry. We found P53 gene mutations in 16 cases (15 missense and 1 nonsense). Two of the 15 cases with missense mutations showed alterations based only on cDNA, and not DNA sequencing. Moreover, in 6 of the 15 cases with a cDNA mutation those mutations were difficult to detect in the DNA sequencing, so the results of DNA analysis alone could be misinterpreted if the cDNA sequencing results had not also been available. In all those 15 cases, we observed a higher ratio of the mutated to the wild type template by cDNA analysis, but not by the DNA analysis. Interestingly, a similar overexpression of P53 mRNA was present in samples with and without P53 mutations. In terms of colorectal cancer, those discrepancies might be explained under three conditions: 1, overexpression of mutated P53 mRNA in cancer cells as compared with normal cells; 2, a higher content of cells without P53 mutation (normal cells and cells showing K-RAS and/or APC but not P53 mutation) in samples presenting P53 mutation; 3, heterozygous or hemizygous mutations of P53 gene. Additionally, for heterozygous mutations, unknown mechanism(s) causing selective overproduction of the mutated allele should also be considered. Our data offer new clues for studying discrepancy in P53 cDNA and DNA sequencing analysis.

  15. Complete cDNA sequence coding for human docking protein

    Energy Technology Data Exchange (ETDEWEB)

    Hortsch, M; Labeit, S; Meyer, D I

    1988-01-11

    Docking protein (DP, or SRP receptor) is a rough endoplasmic reticulum (ER)-associated protein essential for the targeting and translocation of nascent polypeptides across this membrane. It specifically interacts with a cytoplasmic ribonucleoprotein complex, the signal recognition particle (SRP). The nucleotide sequence of cDNA encoding the entire human DP and its deduced amino acid sequence are given.

  16. Human tissue factor: cDNA sequence and chromosome localization of the gene

    International Nuclear Information System (INIS)

    Scarpati, E.M.; Wen, D.; Broze, G.J. Jr.; Miletich, J.P.; Flandermeyer, R.R.; Siegel, N.R.; Sadler, J.E.

    1987-01-01

    A human placenta cDNA library in λgt11 was screened for the expression of tissue factor antigens with rabbit polyclonal anti-human tissue factor immunoglobulin G. Among 4 million recombinant clones screened, one positive, λHTF8, expressed a protein that shared epitopes with authentic human brain tissue factor. The 1.1-kilobase cDNA insert of λHTF8 encoded a peptide that contained the amino-terminal protein sequence of human brain tissue factor. Northern blotting identified a major mRNA species of 2.2 kilobases and a minor species of ∼3.2 kilobases in poly(A)+ RNA of placenta. Only 2.2-kilobase mRNA was detected in human brain and in the human monocytic U937 cell line. In U937 cells, the quantity of tissue factor mRNA was increased several fold by exposure of the cells to phorbol 12-myristate 13-acetate. Additional cDNA clones were selected by hybridization with the cDNA insert of λHTF8. These overlapping isolates span 2177 base pairs of the tissue factor cDNA sequence that includes a 5'-noncoding region of 75 base pairs, an open reading frame of 885 base pairs, a stop codon, a 3'-noncoding region of 1141 base pairs, and a poly(A) tail. The open reading frame encodes a 33-kilodalton protein of 295 amino acids. The predicted sequence includes a signal peptide of 32 or 34 amino acids, a probable extracellular factor VII binding domain of 217 or 219 amino acids, a transmembrane segment of 23 amino acids, and a cytoplasmic tail of 21 amino acids. There are three potential glycosylation sites with the sequence Asn-X-Thr/Ser. The 3'-noncoding region contains an inverted Alu family repetitive sequence. The tissue factor gene was localized to chromosome 1 by hybridization of the cDNA insert of λHTF8 to flow-sorted human chromosomes.
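
    The lengths quoted in this abstract can be cross-checked with simple arithmetic: an 885-bp open reading frame corresponds to 295 codons, and the alternative signal-peptide and extracellular-domain lengths fit the 295-residue protein only in certain pairings. A small sketch of that check follows; the variable names are illustrative.

```python
from itertools import product

ORF_BP = 885            # open reading frame length from the abstract
PROTEIN_AA = 295        # protein length from the abstract
TRANSMEMBRANE_AA = 23   # transmembrane segment
CYTOPLASMIC_AA = 21     # cytoplasmic tail

# Three nucleotides per codon.
codons = ORF_BP // 3

# Try every pairing of the two signal-peptide lengths with the two
# extracellular-domain lengths and keep those that sum to the full protein.
consistent = [
    (signal, extracellular)
    for signal, extracellular in product((32, 34), (217, 219))
    if signal + extracellular + TRANSMEMBRANE_AA + CYTOPLASMIC_AA == PROTEIN_AA
]

print(codons)       # 295
print(consistent)   # [(32, 219), (34, 217)]
```

    Only the pairings 32 + 219 and 34 + 217 add up to 295 residues, which is consistent with the two alternatives reflecting uncertainty about a single signal-peptide cleavage site.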

  17. Cloning and sequence analysis of cDNA coding for rat nucleolar protein C23

    International Nuclear Information System (INIS)

    Ghaffari, S.H.; Olson, M.O.J.

    1986-01-01

    Using synthetic oligonucleotides as primers and probes, the authors have isolated and sequenced cDNA clones encoding protein C23, a putative nucleolus organizer protein. Poly(A)+ RNA was isolated from rat Novikoff hepatoma cells and enriched in C23 mRNA by sucrose density gradient ultracentrifugation. Two deoxyoligonucleotides, a 48-mer and a 27-mer, were synthesized on the basis of amino acid sequence from the C-terminal half of protein C23 and cDNA sequence data from CHO cell protein. The 48-mer was used as a primer for synthesis of cDNA, which was then inserted into plasmid pUC9. Transformed bacterial colonies were screened by hybridization with the 32P-labeled 27-mer. Two clones among 5000 gave a strong positive signal. Plasmid DNAs from these clones were purified and characterized by blotting and nucleotide sequence analysis. The length of C23 mRNA was estimated to be 3200 bases in a northern blot analysis. The sequence of a 267-bp insert shows high homology with the CHO cDNA, with only 9 nucleotide differences and an identical amino acid sequence. These studies indicate that this region of the protein is highly conserved.

  18. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    2000-07-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  19. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    1999-05-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  20. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    1999-05-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  1. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

    1995-03-21

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  2. Cloning, sequencing and expression of cDNA encoding growth ...

    Indian Academy of Sciences (India)

    Unknown

    of medicine, animal husbandry, fish farming and animal ..... northern pike (Esox lucius) growth hormone; Mol. Mar. Biol. ... prolactin 1-luciferase fusion gene in African catfish and ... 1988 Cloning and sequencing of cDNA that encodes goat.

  3. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    1993-02-16

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a pu... Government rights: This application was funded under Department of Energy Contract DE-AC02-76ER01338. The U.S. Government has certain rights under this application and any patent issuing thereon.

  4. Cloning and cDNA sequence of the dihydrolipoamide dehydrogenase component of human α-ketoacid dehydrogenase complexes

    International Nuclear Information System (INIS)

    Pons, G.; Raefsky-Estrin, C.; Carothers, D.J.; Pepin, R.A.; Javed, A.A.; Jesse, B.W.; Ganapathi, M.K.; Samols, D.; Patel, M.S.

    1988-01-01

    cDNA clones comprising the entire coding region for human dihydrolipoamide dehydrogenase have been isolated from a human liver cDNA library. The cDNA sequence of the largest clone consisted of 2082 base pairs and contained a 1527-base open reading frame that encodes a precursor dihydrolipoamide dehydrogenase of 509 amino acid residues. The first 35 amino acid residues of the open reading frame probably correspond to a typical mitochondrial import leader sequence. The predicted amino acid sequence of the mature protein, starting at residue number 36 of the open reading frame, is almost identical (>98% homology) with the known partial amino acid sequence of the pig heart dihydrolipoamide dehydrogenase. The cDNA clone also contains a 3' untranslated region of 505 bases with an unusual polyadenylylation signal (TATAAA) and a short poly(A) tract. By blot-hybridization analysis with the cDNA as probe, two mRNAs, 2.2 and 2.4 kilobases in size, have been detected in human tissues and fibroblasts, whereas only one mRNA (2.4 kilobases) was detected in rat tissues.

  5. Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.

    Science.gov (United States)

    Kuroshu, Reginaldo M; Watanabe, Junichi; Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka; Kasahara, Masahiro

    2010-05-07

    Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.
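
    To give a feel for why one lane of short reads can cover hundreds of pooled clones, the mean sequencing depth can be estimated as (number of reads × read length) / (number of clones × mean clone length). The sketch below uses the 36-nucleotide read length and roughly 800 clones from the abstract; the read count and mean clone length are hypothetical placeholders chosen only for illustration, and vector sequence and uneven clone representation are ignored.

```python
def mean_coverage(n_reads, read_len, n_clones, mean_clone_len):
    """Average depth if reads were spread evenly over all pooled clones."""
    return n_reads * read_len / (n_clones * mean_clone_len)


READ_LEN = 36             # from the abstract
N_CLONES = 800            # from the abstract (approximate)
N_READS = 5_000_000       # HYPOTHETICAL: the abstract only says "millions"
MEAN_CLONE_LEN = 2_000    # HYPOTHETICAL mean insert length in bp

depth = mean_coverage(N_READS, READ_LEN, N_CLONES, MEAN_CLONE_LEN)
print(f"{depth:.1f}x")
```

    In practice depth varies widely between clones, which is consistent with the abstract's note that some clones were insufficiently represented in the shotgun library.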

  6. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

    2000-07-04

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.

  7. Full-length cDNA sequences from Rhesus monkey placenta tissue: analysis and utility for comparative mapping

    Directory of Open Access Journals (Sweden)

    Lee Sang-Rae

    2010-07-01

    Background: Rhesus monkeys (Macaca mulatta) are widely used as experimental animals in biomedical research and are closely related to other laboratory macaques, such as cynomolgus monkeys (Macaca fascicularis), and to humans, sharing a last common ancestor from about 25 million years ago. Although rhesus monkeys have been studied extensively under field and laboratory conditions, research has been limited by the lack of genetic resources. The present study generated placenta full-length cDNA libraries, characterized the resulting expressed sequence tags, and described their utility for comparative mapping with human RefSeq mRNA transcripts. Results: From rhesus monkey placenta full-length cDNA libraries, 2000 full-length cDNA sequences were determined and 1835 rhesus placenta cDNA sequences longer than 100 bp were collected. These sequences were annotated based on homology to human genes. Homology search against human RefSeq mRNAs revealed that our collection included the sequences of 1462 putative rhesus monkey genes. Moreover, we identified 207 genes containing exon alterations in the coding region and the untranslated region of rhesus monkey transcripts, despite the highly conserved structure of the coding regions. Approximately 10% (187) of all full-length cDNA sequences did not represent any public human RefSeq mRNAs. Intriguingly, two rhesus monkey specific exons derived from the transposable elements of AluYRa2 (SINE family) and MER11B (LTR family) were also identified. Conclusion: The 1835 rhesus monkey placenta full-length cDNA sequences described here could expand genomic resources and information of rhesus monkeys. This increased genomic information will greatly contribute to the development of evolutionary biology and biomedical research.

  8. Benchmarking of the Oxford Nanopore MinION sequencing for quantitative and qualitative assessment of cDNA populations.

    Science.gov (United States)

    Oikonomopoulos, Spyros; Wang, Yu Chang; Djambazian, Haig; Badescu, Dunarel; Ragoussis, Jiannis

    2016-08-24

    To assess the performance of the Oxford Nanopore Technologies MinION sequencing platform, cDNAs from the External RNA Controls Consortium (ERCC) RNA Spike-In mix were sequenced. This mix mimics mammalian mRNA species and consists of 92 polyadenylated transcripts with known concentration. cDNA libraries were generated using a template switching protocol to facilitate the direct comparison between different sequencing platforms. The MinION performance was assessed for its ability to sequence the cDNAs directly with good accuracy in terms of abundance and full length. The abundance of the ERCC cDNA molecules sequenced by MinION agreed with their expected concentration. No length or GC content bias was observed. The majority of cDNAs were sequenced as full length. Additionally, a complex cDNA population derived from a human HEK-293 cell line was sequenced on Illumina HiSeq 2500, PacBio RS II and ONT MinION platforms. We observed that there was a good agreement in the measured cDNA abundance between PacBio RS II and ONT MinION (Pearson r = 0.82, isoforms with length more than 700 bp) and between Illumina HiSeq 2500 and ONT MinION (Pearson r = 0.75). This indicates that the ONT MinION can sequence quantitatively both long and short full-length cDNA molecules.
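    The platform agreement reported above is summarized as a Pearson correlation of measured cDNA abundance. As a minimal sketch of that kind of comparison, the snippet below computes Pearson's r between two sets of per-isoform read counts. The log10 transform with a pseudocount and all count values are assumptions made for illustration; the abstract does not state how abundances were transformed, and the numbers are not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def log_abundance(counts, pseudocount=1.0):
    """log10-transform read counts; the pseudocount (an assumption) avoids log(0)."""
    return [math.log10(c + pseudocount) for c in counts]


# Hypothetical per-isoform read counts, NOT data from the study.
minion_counts = [12, 150, 0, 980, 43]
pacbio_counts = [9, 170, 2, 1100, 30]

r = pearson_r(log_abundance(minion_counts), log_abundance(pacbio_counts))
print(f"Pearson r on log10 counts: {r:.2f}")
```

    Restricting the input lists to isoforms above a length cutoff (such as the 700 bp threshold quoted above) would simply mean filtering both lists by the same isoform set before calling pearson_r.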

  9. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase

    International Nuclear Information System (INIS)

    Finocchiaro, G.; Taroni, F.; Martin, A.L.; Colombo, I.; Tarelli, G.T.; DiDonato, S.; Rocchi, M.

    1991-01-01

    The authors have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase (CPTase), an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA encoding such a peptide. A 60-base-pair (bp) oligonucleotide synthesized on the basis of the sequence from this fragment was used for the screening of a cDNA library from human liver and hybridized to a cDNA insert of 2255 bp. This cDNA contains an open reading frame of 1974 bp that encodes a protein of 658 amino acid residues including 25 residues of an NH2-terminal leader peptide. The assignment of this open reading frame to human liver CPTase is confirmed by matches to seven different amino acid sequences of tryptic peptides derived from pure human CPTase and by the 82.2% homology with the amino acid sequence of rat CPTase. The NH2-terminal region of CPTase contains a leucine-proline motif that is shared by carnitine acetyl- and octanoyltransferases and by choline acetyltransferase. The gene encoding CPTase was assigned to human chromosome 1, region 1q12-1pter, by hybridization of CPTase cDNA with a DNA panel of 19 human-hamster somatic cell hybrids.

  10. cDNA, genomic cloning and sequence analysis of ribosomal protein ...

    African Journals Online (AJOL)

    enoh

    2012-03-13

    Mar 13, 2012 ... cDNA and the genomic sequence of RPS4X were cloned successfully from ... S4 genes plays a role in Turner syndrome; however, this ..... Project of Educational Committee of Sichuan Province ... Molecular biology of the cell.

  11. A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

    Science.gov (United States)

    Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

    2008-12-01

    A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerate primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BnPA (GenBank Accession No. AY423440, named as podC). The full-length cDNA is 1167 bp long and contains a 1077 bp open reading frame (ORF) encoding a deduced peroxidase polypeptide of 358 amino acids. The putative peroxidase (BnPA) showed a calculated Mr of 34 kDa and an isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have significant identity with BnPA, namely AtP29a (84%) and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.
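    The ORF and protein lengths quoted in this record are consistent if the 1077 bp count includes the stop codon: 1077 / 3 = 359 codons, that is, 358 residues plus one stop. The sketch below makes that check explicit. Whether a reported ORF length includes the stop codon varies between papers, so the includes_stop flag is an assumption the reader has to set per record, and the function name is hypothetical.

```python
def residues_from_orf(orf_bp, includes_stop=True):
    """Number of amino acid residues encoded by an ORF of orf_bp nucleotides.

    includes_stop=True means the quoted length counts the terminal stop
    codon, which is not translated into a residue.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# Lengths taken from the record above: a 1077 bp ORF and a 358-residue protein.
print(residues_from_orf(1077, includes_stop=True))
```

    A mismatch between the two conventions (off by exactly one residue) is a common reason a quoted ORF length and protein length appear not to agree.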

  12. cDNA, genomic sequence cloning and overexpression of ribosomal ...

    African Journals Online (AJOL)

    RPS16 of eukaryote is a component of the 40S small ribosomal subunit encoded by RPS16 gene and is also a homolog of prokaryotic RPS9. The cDNA and genomic sequence of RPS16 was cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) using reverse transcription-polymerase chain ...

  13. Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

    KAUST Repository

    Sugumar, Thennarasu; Harishankar, M.; Dhinakar Raj, G.

    2011-01-01

    Full-length cDNA (435 bp) of the interleukin-3 (IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine

  14. cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity

    International Nuclear Information System (INIS)

    Taira, M.; Yoshida, T.; Miyagawa, K.; Sakamoto, H.; Terada, M.; Sugimura, T.

    1987-01-01

    The hst gene was originally identified as a transforming gene in DNAs from human stomach cancers and from a noncancerous portion of stomach mucosa by DNA-mediated transfection assay using NIH3T3 cells. cDNA clones of hst were isolated from the cDNA library constructed from poly(A)+ RNA of a secondary transformant induced by the DNA from a stomach cancer. The sequence analysis of the hst cDNA revealed the presence of two open reading frames. When this cDNA was inserted into an expression vector containing the simian virus 40 promoter, it efficiently induced the transformation of NIH3T3 cells upon transfection. It was found that one of the reading frames, which coded for 206 amino acids, was responsible for the transforming activity.

  15. Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project

    Directory of Open Access Journals (Sweden)

    Heinz Ruth A

    2003-09-01

    Background: Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue, avoiding redundant sequencing of clones representing the same expressed genes and maximizing detection of low-abundance transcripts, thus affecting the efficiency and cost effectiveness of small-scale cDNA sequencing projects aimed at the specific identification of useful genes for breeding purposes. The objective of this work is to evaluate alternative strategies to high-throughput sequencing projects for the identification of novel genes differentially expressed in sunflower as a source of organ-specific genetic markers that can be functionally associated with important traits. Results: Differential organ-specific ESTs were generated from leaf, stem, root and flower bud at two developmental stages (R1 and R4). The use of different sources of RNA as tester and driver cDNA for the construction of differential libraries was evaluated as a tool for detection of rare or low-abundance transcripts. Organ-specificity ranged from 75 to 100% of non-redundant sequences in the different cDNA libraries. Sequence redundancy varied according to the target and driver cDNA used in each case. The R4 flower cDNA library was the least redundant library, with 62% unique sequences. Out of a total of 919 sequences that were edited and annotated, 318 were non-redundant sequences. Comparison against sequences in public databases showed that 60% of non-redundant sequences showed significant similarity to known sequences. The number of predicted novel genes varied among the different cDNA libraries, ranging from 56% in the R4 flower to 16% in the R1 flower bud library. Comparison with sunflower ESTs in public databases showed that 197 of the non-redundant sequences (60%) did not exhibit significant similarity to previously reported sunflower ESTs. This approach helped to successfully isolate a significant number of new reported sequences

  16. Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

    Science.gov (United States)

    Andreassen, Rune; Lunner, Sigbjørn; Høyheim, Bjørn

    2009-01-01

    Background: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to being helpful in distinguishing between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full length of the cDNA insert has been determined has been small. Results: High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion: This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA

  17. Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

    Directory of Open Access Journals (Sweden)

    Lunner Sigbjørn

    2009-10-01

    Background: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to being helpful in distinguishing between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full length of the cDNA insert has been determined has been small. Results: High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion: This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This

  18. Molecular cloning and sequence analysis of growth hormone cDNA of Neotropical freshwater fish Pacu (Piaractus mesopotamicus)

    Directory of Open Access Journals (Sweden)

    Janeth Silva Pinheiro

    2008-01-01

    RT-PCR was used for amplifying Piaractus mesopotamicus growth hormone (GH) cDNA obtained from mRNA extracted from pituitary cells. The amplified fragment was cloned and the complete cDNA sequence was determined. The cloned cDNA encompassed a sequence of 543 nucleotides that encoded a polypeptide of 178 amino acids corresponding to mature P. mesopotamicus GH. Comparison with other GH sequences showed a gap of 10 amino acids localized in the N terminus of the putative polypeptide of P. mesopotamicus. This same gap was also observed in other members of the family. Neighbor-joining tree analysis with GH sequences from fishes belonging to different taxonomic groups placed the P. mesopotamicus GH within the Otophysi group. To our knowledge, this is the first GH sequence of a Neotropical characiform fish deposited in GenBank.

  19. Microarray and cDNA sequence analysis of transcription during nerve-dependent limb regeneration

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2009-01-01

    Background: Microarray analysis and 454 cDNA sequencing were used to investigate a centuries-old problem in regenerative biology: the basis of nerve-dependent limb regeneration in salamanders. Innervated (NR) and denervated (DL) forelimbs of Mexican axolotls were amputated and transcripts were sampled after 0, 5, and 14 days of regeneration. Results: Considerable similarity was observed between NR and DL transcriptional programs at 5 and 14 days post amputation (dpa). Genes with extracellular functions that are critical to wound healing were upregulated while muscle-specific genes were downregulated. Thus, many processes that are regulated during early limb regeneration do not depend upon nerve-derived factors. The majority of the transcriptional differences between NR and DL limbs were correlated with blastema formation; cell numbers increased in NR limbs after 5 dpa and this yielded distinct transcriptional signatures of cell proliferation in NR limbs at 14 dpa. These transcriptional signatures were not observed in DL limbs. Instead, gene expression changes within DL limbs suggest more diverse and protracted wound-healing responses. 454 cDNA sequencing complemented the microarray analysis by providing deeper sampling of transcriptional programs and associated biological processes. Assembly of new 454 cDNA sequences with existing expressed sequence tag (EST) contigs from the Ambystoma EST database more than doubled (3935 to 9411) the number of non-redundant human-A. mexicanum orthologous sequences. Conclusion: Many new candidate gene sequences were discovered for the first time and these will greatly enable future studies of wound healing, epigenetics, genome stability, and nerve-dependent blastema formation and outgrowth using the axolotl model.

  20. Molecular cloning of a human glycophorin B cDNA: nucleotide sequence and genomic relationship to glycophorin A

    International Nuclear Information System (INIS)

    Siebert, P.D.; Fukuda, M.

    1987-01-01

    The authors describe the isolation and nucleotide sequence of a human glycophorin B cDNA. The cDNA was identified by differential hybridization of synthetic oligonucleotide probes to a human erythroleukemic cell line (K562) cDNA library constructed in phage vector λgt10. The nucleotide sequence of the glycophorin B cDNA was compared with that of a previously cloned glycophorin A cDNA. The nucleotide sequences encoding the NH2-terminal leader peptide and first 26 amino acids of the two proteins are nearly identical. This homologous region is followed by areas specific to either glycophorin A or B and a number of small regions of homology, which in turn are followed by a very homologous region encoding the presumed membrane-spanning portion of the proteins. They used RNA blot hybridization with both cDNA and synthetic oligonucleotide probes to prove their previous hypothesis that glycophorin B is encoded by a single 0.5- to 0.6-kb mRNA and to show that glycophorins A and B are negatively and coordinately regulated by a tumor-promoting phorbol ester, phorbol 12-myristate 13-acetate. They established the intron/exon structure of the glycophorin A and B genes by oligonucleotide mapping; the results suggest a complex evolution of the glycophorin genes.

  1. cDNA, genomic sequence cloning and analysis of the ribosomal ...

    African Journals Online (AJOL)

    Ribosomal protein L37A (RPL37A) is a component of 60S large ribosomal subunit encoded by the RPL37A gene, which belongs to the family of ribosomal L37AE proteins, located in the cytoplasm. The complementary deoxyribonucleic acid (cDNA) and the genomic sequence of RPL37A were cloned successfully from giant ...

  2. Nucleotide sequence of a cDNA coding for the amino-terminal region of human prepro-α1(III) collagen

    Energy Technology Data Exchange (ETDEWEB)

    Toman, P D; Ricca, G A [Rorer Biotechnology, Inc., Springfield, VA (USA); de Crombrugghe, B [National Institutes of Health, Bethesda, MD (USA)

    1988-07-25

    Type III collagen is synthesized in a variety of tissues as a precursor macromolecule containing a leader sequence, an N-propeptide, an N-telopeptide, the triple helical region, a C-telopeptide, and a C-propeptide. To further characterize the human type III collagen precursor, a human placental cDNA library was constructed in λgt11 using an oligonucleotide derived from a partial cDNA sequence corresponding to the carboxy-terminal part of the α1(III) collagen. A cDNA was identified which contains the leader sequence, the N-propeptide and N-telopeptide regions. The DNA sequence of these regions is presented here. The triple helical, C-telopeptide and C-propeptide amino acid sequence for human type III collagen has been determined previously. A comparison of the human amino acid sequence with mouse, chicken, and calf sequences shows 81%, 81%, and 92% similarity, respectively. At the DNA level, the sequence similarity between human and mouse or chicken type III collagen sequences in this area is 82% and 77%, respectively.

  3. Human pro-α1(III) collagen: cDNA sequence for the 3' end

    Energy Technology Data Exchange (ETDEWEB)

    Mankoo, B S; Dalgleish, R

    1988-03-25

    The authors have previously isolated two overlapping cDNA clones, pIII-21 and pIII-33, which encode the C-terminal end of human type III procollagen. They now present the sequence of 2520 bases encoded in these cDNAs which overlaps other previously published sequences for the same gene. The sequence presented differs from previously published sequences at five positions.

  4. cDNA cloning and nucleotide sequence comparison of Chinese hamster metallothionein I and II mRNAs

    Energy Technology Data Exchange (ETDEWEB)

    Griffith, B B; Walters, R A; Enger, M D; Hildebrand, C E; Griffith, J K

    1983-01-01

    Polyadenylated RNA was extracted from a cadmium resistant Chinese hamster (CHO) cell line, enriched for metal-induced, abundant RNA sequences and cloned as double-stranded cDNA in the plasmid pBR322. Two cDNA clones, pCHMT1 and pCHMT2, encoding two Chinese hamster isometallothioneins were identified, and the nucleotide sequence of each insert was determined. The two Chinese hamster metallothioneins show nucleotide sequence homologies of 80% in the protein coding region and approximately 35% in both the 5' and 3' untranslated regions. Interestingly, an 8 nucleotide sequence (TGTAAATA) has been conserved in sequence and position in the 3' untranslated regions of each metallothionein mRNA sequenced thus far. Estimated nucleotide substitution rates derived from interspecies comparisons were used to calculate a metallothionein gene duplication time of 45 to 120 million years ago. 39 references, 1 figure, 1 table.
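    The observation that an 8-nucleotide motif is conserved "in sequence and position" in the 3' untranslated regions can be checked by locating the motif in each UTR and comparing its offsets. The sketch below illustrates the idea; the two UTR strings are invented placeholders, not the pCHMT1 or pCHMT2 sequences, and the function name is hypothetical.

```python
MOTIF = "TGTAAATA"  # motif quoted in the record above


def motif_positions(seq, motif=MOTIF):
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    seq = seq.upper()
    motif = motif.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


# Invented 3' UTR fragments for illustration only.
utrs = {
    "utr_a": "GCCTTGTAAATACCAGTTTC",
    "utr_b": "ACCATGTAAATAGGCTTAAC",
}

for name, utr in utrs.items():
    hits = motif_positions(utr)
    # Distance from the motif start to the UTR end is one way to compare position.
    from_end = [len(utr) - p for p in hits]
    print(name, hits, from_end)
```

    Reporting the offset from the 3' end as well as from the start is a design choice: UTRs of different lengths may place a signal at a fixed distance from the polyadenylation site rather than from the stop codon.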

  5. Isolation and characterization of human glycophorin A cDNA clones by a synthetic oligonucleotide approach: nucleotide sequence and mRNA structure

    International Nuclear Information System (INIS)

    Siebert, P.D.; Fukuda, M.

    1986-01-01

    In an effort to understand the relationships among and the regulation of human glycophorins, the authors have isolated and characterized several glycophorin A-specific cDNA clones obtained from a human erythroleukemic K562 cell cDNA library. This was accomplished by using mixed synthetic oligonucleotides, corresponding to various regions of the known amino acid sequence, to prime the synthesis of the cDNA as well as to screen the cDNA library. They also used synthetic oligonucleotides to sequence the largest of the glycophorin cDNAs. The nucleotide sequence obtained suggests the presence of a potential leader peptide, consistent with the membrane localization of this glycoprotein. Examination of the structure of glycophorin mRNA by blot hybridization revealed the existence of several electrophoretically distinct mRNAs numbering three or four, depending on the size of the glycophorin cDNA used as a hybridization probe. The smaller cDNA hybridized to three mRNAs of approximately 2.8, 1.7, and 1.0 kilobases. In contrast, the larger cDNA hybridized to an additional mRNA of approximately 0.6 kilobases. Further examination of the relationships between these multiple mRNAs by blot hybridization was conducted with the use of exact-sequence oligonucleotide probes constructed from various regions of the cDNA representing portions of the amino acid sequence of glycophorin A with or without known homology with glycophorin B. In total, the results obtained are consistent with the hypothesis that the three larger mRNAs represent glycophorin A gene transcripts and that the smallest (0.6 kilobase) mRNA may be specific for glycophorin B

  6. Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.

    Science.gov (United States)

    Hargreaves, Adam D; Mulley, John F

    2015-01-01

    Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.

  7. Nucleotide sequence of Phaseolus vulgaris L. alcohol dehydrogenase encoding cDNA and three-dimensional structure prediction of the deduced protein.

    Science.gov (United States)

    Amelia, Kassim; Khor, Chin Yin; Shah, Farida Habib; Bhore, Subhash J

    2015-01-01

    Common beans (Phaseolus vulgaris L.) are widely consumed as a source of proteins and natural products. However, their yield needs to be increased. In line with the agenda of Phaseomics (an international consortium), work on generating expressed sequence tags (ESTs) from bean pods was initiated. Altogether, 5972 ESTs have been isolated. Alcohol dehydrogenase (AD) encoding gene cDNA was a noticeable transcript among the generated ESTs. This AD is an important enzyme; therefore, this study was undertaken to understand more about it. The objective of this study was to elucidate the P. vulgaris L. AD (PvAD) gene cDNA sequence and to predict the three-dimensional (3D) structure of the deduced protein. Positive and negative strands of the PvAD cDNA clone were sequenced using M13 forward and M13 reverse primers to elucidate the nucleotide sequence. The deduced PvAD cDNA and protein sequences were analyzed for their basic features using online bioinformatics tools. Sequence comparison was carried out using the bl2seq program, and the tree-view program was used to construct a phylogenetic tree. The secondary structures and 3D structure of PvAD protein were predicted by using the PHYRE automatic fold recognition server. The sequencing results analysis showed that PvAD cDNA is 1294 bp in length. Its open reading frame encodes a protein that contains 371 amino acids. Deduced protein sequence analysis showed the presence of putative substrate binding, catalytic Zn binding, and NAD binding sites. Results indicate that the predicted 3D structure of PvAD protein is analogous to the experimentally determined crystal structure of S-nitrosoglutathione reductase from an Arabidopsis species. The 1294 bp long PvAD cDNA encodes a 371 amino acid long protein that contains conserved domains required for biological functions of AD. The predicted 3D structure of the deduced PvAD protein is analogous to the crystal structure of Arabidopsis thaliana S-nitrosoglutathione reductase. Further study is required

  8. Genome-wide detection and analysis of hippocampus core promoters using DeepCAGE

    DEFF Research Database (Denmark)

    Valen, Eivind; Pascarella, Giovanni; Chalk, Alistair

    2009-01-01

    in a given tissue. Here, we present a new method for high-throughput sequencing of 5' cDNA tags-DeepCAGE: merging the Cap Analysis of Gene Expression method with ultra-high-throughput sequence technology. We apply DeepCAGE to characterize 1.4 million sequenced TSS from mouse hippocampus and reveal a wealth...

  9. Molecular cloning of chicken metallothionein. Deduction of the complete amino acid sequence and analysis of expression using cloned cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Wei, D; Andrews, G K

    1988-01-25

    A cDNA library was constructed using RNA isolated from the livers of chickens which had been treated with zinc. This library was screened with an RNA probe complementary to mouse metallothionein-I (MT), and eight chicken MT cDNA clones were obtained. All of the cDNA clones contained nucleotide sequences homologous to regions of the longest (375 bp) cDNA clone. The latter contained an open reading frame of 189 bp, and the deduced amino acid sequence indicates a protein of 63 amino acids of which 20 are cysteine residues. Amino acid composition and partial amino acid sequence analyses of purified chicken MT protein agreed with the amino acid composition and sequence deduced from the cloned cDNA. Amino acid sequence comparisons establish that chicken MT shares extensive homology with mammalian MTs. Southern blot analysis of chicken DNA indicates that the chicken MT gene is not a part of a large family of related sequences, but rather is likely to be a unique gene sequence. In the chicken liver, levels of chicken MT mRNA were rapidly induced by metals (Cd²⁺, Zn²⁺, Cu²⁺), glucocorticoids and lipopolysaccharide. MT mRNA was present in low levels in embryonic liver and increased to high levels during the first week after hatching before decreasing again to the basal levels found in adult liver. The results of this study establish that MT is highly conserved between birds and mammals and is regulated in the chicken by agents which also regulate expression of mammalian MT genes. However, in contrast to the mammals, the results suggest the existence of a single isoform of MT in the chicken.

  10. Detection of reverse transcriptase termination sites using cDNA ligation and massive parallel sequencing

    DEFF Research Database (Denmark)

    Kielpinski, Lukasz J; Boyd, Mette; Sandelin, Albin

    2013-01-01

    Detection of reverse transcriptase termination sites is important in many different applications, such as structural probing of RNAs, rapid amplification of cDNA 5' ends (5' RACE), cap analysis of gene expression, and detection of RNA modifications and protein-RNA cross-links. The throughput...... of these methods can be increased by applying massive parallel sequencing technologies. Here, we describe a versatile method for detection of reverse transcriptase termination sites based on ligation of an adapter to the 3' end of cDNA with bacteriophage TS2126 RNA ligase (CircLigase™). In the following PCR...

  11. Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing

    Directory of Open Access Journals (Sweden)

    Adam D. Hargreaves

    2015-11-01

    Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provide full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.

  12. DeepSimulator: a deep simulator for Nanopore sequencing

    KAUST Repository

    Li, Yu

    2017-12-23

    Motivation: Oxford Nanopore sequencing has developed rapidly in recent years. To keep pace with the explosion of downstream data analysis tools, a versatile Nanopore sequencing simulator is needed to complement the experimental data as well as to benchmark those newly developed tools. However, all the currently available simulators are based on simple statistics of the produced reads, which have difficulty in capturing the complex nature of the Nanopore sequencing procedure, the main task of which is the generation of raw electrical current signals. Results: Here we propose a deep learning based simulator, DeepSimulator, to mimic the entire pipeline of Nanopore sequencing. Starting from a given reference genome or assembled contigs, we simulate the electrical current signals by a context-dependent deep learning model, followed by a base-calling procedure to yield simulated reads. This workflow mimics the sequencing procedure more naturally. Thorough experiments performed across four species show that the signals generated by our context-dependent model are more similar to the experimentally obtained signals than the ones generated by the official context-independent pore model. In terms of the simulated reads, we provide a parameter interface to users so that they can obtain reads with different accuracies ranging from 83% to 97%. The reads generated with the default parameters have almost the same properties as the real data. Two case studies demonstrate the application of DeepSimulator to benefit the development of tools in de novo assembly and in low coverage SNP detection. Availability: The software can be accessed freely at: https://github.com/lykaust15/DeepSimulator.

  13. Construction and evaluation of normalized cDNA libraries enriched with full-length sequences for rapid discovery of new genes from Sisal (Agave sisalana Perr.) different developmental stages.

    Science.gov (United States)

    Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

    2012-10-12

    To provide a resource of sisal-specific expressed sequence data and to facilitate new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from multiple Agave sisalana tissues, to increase the efficiency of normalization and maximize the number of independent genes, using the SMART™ method and duplex-specific nuclease (DSN). This procedure preserved the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones from the libraries revealed 3320 unigenes with an average insert length of about 1.2 kb, indicating that the non-redundancy of the libraries was about 85.7%. The functions of these unigenes were predicted by comparing their sequences to functional domain databases, and the unigenes were extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and a knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing.
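
    The non-redundancy figure in this abstract can be reproduced from the two counts it reports. The sketch below assumes that non-redundancy means unigenes divided by sequenced clones; the abstract does not define the term, and the function name is illustrative only.

```python
def non_redundancy(unigenes: int, clones: int) -> float:
    """Return unique sequences as a percentage of all sequenced clones."""
    if clones <= 0:
        raise ValueError("clones must be a positive count")
    return 100.0 * unigenes / clones


# Counts reported in the abstract: 3320 unigenes from 3875 sequenced clones.
sisal_pct = non_redundancy(3320, 3875)
print(f"non-redundancy: {sisal_pct:.1f}%")
```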

  14. An analysis of expressed sequence tags of developing castor endosperm using a full-length cDNA library

    Directory of Open Access Journals (Sweden)

    Wallis James G

    2007-07-01

    Background: Castor seeds are a major source of ricinoleate, an important industrial raw material. Genomics studies of the castor plant will provide critical information for understanding seed metabolism, for effectively engineering ricinoleate production in transgenic oilseeds, or for genetically improving castor plants by eliminating toxic and allergenic proteins in seeds. Results: Full-length cDNAs are useful resources in annotating genes and in providing functional analysis of genes and their products. We constructed a full-length cDNA library from developing castor endosperm, and obtained 4,720 ESTs from 5'-ends of the cDNA clones representing 1,908 unique sequences. The most abundant transcripts are genes encoding storage proteins, ricin, agglutinin and oleosins. Several other sequences are also very numerous, including two acidic triacylglycerol lipases, and the oleate hydroxylase (FAH12) gene that is responsible for ricinoleate biosynthesis. The role(s) of the lipases in developing castor seeds are not clear, and co-expression of a lipase and FAH12 did not result in significant changes in hydroxy fatty acid accumulation in transgenic Arabidopsis seeds. Only one oleate desaturase (FAD2) gene was identified in our cDNA sequences. Sequence and functional analyses of the castor FAD2 were carried out since it had not been characterized previously. Overexpression of castor FAD2 in a FAH12-expressing Arabidopsis line resulted in decreased accumulation of hydroxy fatty acids in transgenic seeds. Conclusion: Our results suggest that transcriptional regulation of FAD2 and FAH12 genes may be one of the mechanisms that contribute to a high level of ricinoleate accumulation in castor endosperm. The full-length cDNA library will be used to search for additional genes that affect ricinoleate accumulation in seed oils. Our EST sequences will also be useful to annotate the castor genome, whose whole sequence is being generated by shotgun sequencing at

  15. Human thyroid peroxidase: complete cDNA and protein sequence, chromosome mapping, and identification of two alternately spliced mRNAs

    International Nuclear Information System (INIS)

    Kimura, S.; Kotani, T.; McBride, O.W.; Umeki, K.; Hirai, K.; Nakayama, T.; Ohtaki, S.

    1987-01-01

    Two forms of human thyroid peroxidase cDNAs were isolated from a λgt11 cDNA library, prepared from Graves disease thyroid tissue mRNA, by use of oligonucleotides. The longest complete cDNA, designated phTPO-1, has 3048 nucleotides and an open reading frame consisting of 933 amino acids, which would encode a protein with a molecular weight of 103,026. Five potential asparagine-linked glycosylation sites are found in the deduced amino acid sequence. The second peroxidase cDNA, designated phTPO-2, is almost identical to phTPO-1 beginning 605 base pairs downstream except that it contains a 1-base-pair difference and lacks 171 base pairs in the middle of the sequence. This results in a loss of 57 amino acids corresponding to a molecular weight of 6282. Interestingly, this 171-nucleotide sequence has GT and AG at its 5' and 3' boundaries, respectively, that are in good agreement with donor and acceptor splice site consensus sequences. Using specific oligonucleotide probes for the mRNAs derived from the cDNA sequences hTPO-1 and hTPO-2, the authors show that both are expressed in all thyroid tissues examined and that the relative level of the two mRNAs is different in each sample. The results suggest that two thyroid peroxidase proteins might be generated through alternate splicing of the same gene. By using somatic cell hybrid lines, the thyroid peroxidase gene was mapped to the short arm of human chromosome 2.
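
    The link between the 171-base-pair gap and the loss of 57 amino acids is a matter of codon arithmetic: a deletion whose length is a multiple of three removes whole codons and leaves the downstream reading frame intact. A minimal Python sketch of that check, using only figures quoted in the abstract (the helper name is hypothetical):

```python
def codons_removed(deleted_bp: int) -> int:
    """Return the number of whole codons lost; reject frame-shifting lengths."""
    if deleted_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3, so the frame would shift")
    return deleted_bp // 3


# 171 bp are missing from phTPO-2 relative to phTPO-1.
lost_residues = codons_removed(171)

# The abstract attributes a molecular weight of 6282 to the lost segment.
mean_residue_mass = 6282 / lost_residues

print(lost_residues, round(mean_residue_mass, 1))
```

    The resulting mean of roughly 110 per residue is in line with the usual rule-of-thumb average for amino acid residues in proteins.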

  16. Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

    KAUST Repository

    Sugumar, Thennarasu

    2011-12-12

    Full-length cDNA (435 bp) of the interleukin-3 (IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine IL-3. There are 10 amino acid substitutions in buffalo IL-3 compared with bovine IL-3. The amino acid sequence of buffalo IL-3 also showed very high identity with that of other ruminants, indicating functional cross-reactivity. Structural homology modelling of buffalo IL-3 protein with human IL-3 showed the presence of five helical structures.

  17. Quantitative phenotyping via deep barcode sequencing.

    Science.gov (United States)

    Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey

    2009-10-01

    Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.

  18. Human liver phosphatase 2A: cDNA and amino acid sequence of two catalytic subunit isotypes

    International Nuclear Information System (INIS)

    Arino, J.; Woon, Chee Wai; Brautigan, D.L.; Miller, T.B. Jr.; Johnson, G.L.

    1988-01-01

    Two cDNA clones were isolated from a human liver library that encode two phosphatase 2A catalytic subunits. The two cDNAs differed in eight amino acids (97% identity) with three nonconservative substitutions. All of the amino acid substitutions were clustered in the amino-terminal domain of the protein. Amino acid sequence of one human liver clone (HL-14) was identical to the rabbit skeletal muscle phosphatase 2A cDNA (with 97% nucleotide identity). The second human liver clone (HL-1) is encoded by a separate gene, and RNA gel blot analysis indicates that both mRNAs are expressed similarly in several human clonal cell lines. Sequence comparison with phosphatase 1 and 2A indicates highly divergent amino acid sequences at the amino and carboxyl termini of the proteins and identifies six highly conserved regions between the two proteins that are predicted to be important for phosphatase enzymatic activity

  19. Molecular cloning of lupin leghemoglobin cDNA

    DEFF Research Database (Denmark)

    Konieczny, A; Jensen, E O; Marcker, K A

    1987-01-01

    Poly(A)+ RNA isolated from root nodules of yellow lupin (Lupinus luteus, var. Ventus) has been used as a template for the construction of a cDNA library. The ds cDNA was synthesized and inserted into the Hind III site of plasmid pBR 322 using synthetic Hind III linkers. Clones containing sequences...... specific for nodules were selected by differential colony hybridization using 32P-labeled cDNA synthesized either from nodule poly(A)+ RNA or from poly(A)+ RNA of uninfected root as probes. Among the recombinant plasmids, the cDNA gene for leghemoglobin was identified. The protein structure derived from...... its nucleotide sequence was consistent with known amino acid sequence of lupin Lb II. The cloned lupin Lb cDNA hybridized to poly(A)+ RNA from nodules only, which is in accordance with the general concept, that leghemoglobin is expressed exclusively in nodules.

  20. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

    Directory of Open Access Journals (Sweden)

    Bendahmane Abdelhafid

    2011-05-01

    Background: Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family, which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Results: We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but

  1. Fiscal 2000 report on result of the full-length cDNA structure analysis; 2000 nendo kanzen cho cDNA kozo kaiseki seika hokokusho

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2001-03-01

    This paper explains the results of research on full-length cDNA structure analysis for the period from April 2000 to March 2001. The outline of the human genome sequence was published in June 2000. In Japan, human gene analysis was designated a millennium project in the fiscal 2000 budget as a basic technology of the bio-industry, and full-length cDNA structure analysis is the core of that project. cDNA libraries were prepared by the oligo-capping method using full-length cDNAs more than 4-5 kbp long. Work began by determining partial sequence data at the cDNA ends; new clones were then selected from these, and full-length human cDNA sequence data were determined. By fiscal 2000, partial sequence data had been determined for 1,035,000 clones and full-length sequence data for 12,144 clones. The sequence data obtained were analyzed by homology search and translated into amino acid coding sequences, and protein functions were predicted. A clustering method for selecting new clones from partial sequences was examined. A database of gene expression profiles and disease-related gene sequence data was constructed. (NEDO)

  2. Generation and Analysis of Full-length cDNA Sequences from Elephant Shark (Callorhinchus milii)

    KAUST Repository

    Kodzius, Rimantas

    2009-03-17

    Cartilaginous fishes are the oldest living group of jawed vertebrates and are therefore an important group for understanding the evolution of vertebrate genomes, including the human genome. Our laboratory has proposed the elephant shark (C. milii) as a model cartilaginous fish genome because of its relatively small genome size (910 Mb). The whole genome of C. milii is being sequenced (the first cartilaginous fish genome to be sequenced completely). To characterize the transcriptome of C. milii and to assist in annotating exon-intron boundaries, transcriptional start sites and alternatively spliced transcripts, we are generating full-length cDNA sequences from C. milii.

  3. [cDNA library construction from panicle meristem of finger millet].

    Science.gov (United States)

    Radchuk, V; Pirko, Ia V; Isaenkov, S V; Emets, A I; Blium, Ia B

    2014-01-01

    The protocol for producing full-length cDNA using the SuperScript Full-Length cDNA Library Construction Kit II (Invitrogen) was tested, and a high-quality cDNA library was created from meristematic tissue of the finger millet panicle (Eleusine coracana (L.) Gaertn). The titer of the resulting cDNA library averaged 3.01 × 10^5 CFU/ml. The average cDNA insert length was about 1070 base pairs, and the efficiency of cDNA fragment insertion was 99.5%. Selected cDNA clones from the library were sequenced, and their sequences were identified by BLAST search. The results of the library analysis and selective sequencing demonstrate the good functionality and full-length character of the inserted cDNA clones. The cDNA library from meristematic tissue of the finger millet panicle is a valuable source for isolating and identifying key genes regulating metabolism and meristematic development, and for mining new molecular markers for high-quality genetic investigations and molecular breeding.

  4. Molecular cloning and sequence analysis of hamster CENP-A cDNA

    Directory of Open Access Journals (Sweden)

    Valdivia Manuel M

    2002-05-01

    Background: The centromere is a specialized locus that mediates chromosome movement during mitosis and meiosis. This chromosomal domain comprises a uniquely packaged form of heterochromatin that acts as a nucleus for the assembly of the kinetochore, a trilaminar proteinaceous structure on the surface of each chromatid at the primary constriction. Kinetochores mediate interactions with the spindle fibers of the mitotic apparatus. Centromere protein A (CENP-A) is a histone H3-like protein specifically localized to the inner plate of the kinetochore at active centromeres. CENP-A works as a component of specialized nucleosomes at centromeres bound to arrays of repeat satellite DNA. Results: We have cloned the hamster homologue of human and mouse CENP-A. The cDNA isolated was found to contain an open reading frame encoding a polypeptide consisting of 129 amino acid residues with a C-terminal histone fold domain highly homologous to those of CENP-A and H3 sequences previously released. However, significant sequence divergence was found at the N-terminal region of hamster CENP-A, which is five and eleven residues shorter than those of mouse and human, respectively. Further, a human serine 7 residue, a target site for Aurora B kinase phosphorylation involved in the mechanism of cytokinesis, was not found in the hamster protein. A human autoepitope at the N-terminal region of CENP-A described in autoimmune diseases is not conserved in the hamster protein. Conclusions: We have cloned the hamster cDNA for the centromeric protein CENP-A. Significant differences in protein sequence were found at the N-terminal tail of hamster CENP-A in comparison with that of human and mouse. Our results show a high degree of evolutionary divergence of kinetochore CENP-A proteins in mammals. This is related to the highly diverse nucleotide repeat sequences found at the centromere DNA among species and supports a current centromere model for kinetochore function and structural

  5. Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

    Science.gov (United States)

    Pietrowski, D; Förster, M

    2000-01-01

    The complete cDNA sequence of a ribonuclease k6 gene of Bos taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acid exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).

  6. Nucleotide sequence of a cDNA for branched chain acyltransferase with analysis of the deduced protein structure

    International Nuclear Information System (INIS)

    Hummel, K.B.; Litwer, S.; Bradford, A.P.; Aitken, A.; Danner, D.J.; Yeaman, S.J.

    1988-01-01

    Nucleotide sequence was determined for a 1.6-kilobase human cDNA putative for the branched chain acyltransferase protein of the branched chain α-ketoacid dehydrogenase complex. Translation of the sequence reveals an open reading frame encoding a 315-amino acid protein of molecular weight 35,759 followed by 560 bases of 3'-untranslated sequence. Three repeats of the polyadenylation signal hexamer ATTAAA are present prior to the polyadenylate tail. Within the open reading frame is a 10-amino acid fragment which matches exactly the amino acid sequence around the lipoate-lysine residue in bovine kidney branched chain acyltransferase, thus confirming the identity of the cDNA. Analysis of the deduced protein structure for the human branched chain acyltransferase revealed an organization into domains similar to that reported for the acyltransferase proteins of the pyruvate and α-ketoglutarate dehydrogenase complexes. This similarity in organization suggests that a more detailed analysis of the proteins will be required to explain the individual substrate and multienzyme complex specificity shown by these acyltransferases

  7. Normalized cDNA libraries

    Science.gov (United States)

    Soares, Marcelo B.; Efstratiadis, Argiris

    1997-01-01

    This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.

  8. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250...

  9. Determination of cDNA and genomic DNA sequences of hevamine, a chitinase from the rubber tree Hevea brasiliensis

    NARCIS (Netherlands)

    Bokma, E; Spiering, M; Chow, KS; Mulder, PPMFA; Subroto, T; Beintema, JJ

    Hevamine is a chitinase from the rubber tree Hevea brasiliensis and belongs to the family 18 glycosyl hydrolases. This paper describes the cloning of hevamine DNA and cDNA sequences. Hevamine contains a signal peptide at the N-terminus and a putative vacuolar targeting sequence at the C-terminus

  10. [Cloning and sequencing of KIR2DL1 framework gene cDNA and identification of a novel allele].

    Science.gov (United States)

    Sun, Ge; Wang, Chang; Zhen, Jianxin; Zhang, Guobin; Xu, Yunping; Deng, Zhihui

    2016-10-01

    To develop an assay for cDNA cloning and haplotype sequencing of the KIR2DL1 framework gene and determine the genotype of an ethnic Han individual from southern China. Total RNA was isolated from a peripheral blood sample, and complementary DNA (cDNA) was synthesized by RT-PCR. The entire coding sequence of the KIR2DL1 framework gene was amplified with a pair of KIR2DL1-specific PCR primers. The PCR products, with a length of approximately 1.2 kb, were then subjected to cloning and haplotype sequencing. A specific target fragment of the KIR2DL1 framework gene was obtained. Following allele separation, a wild-type KIR2DL1*00302 allele and a novel variant allele, KIR2DL1*031, were identified. Sequence alignment with KIR2DL1 alleles from the IPD-KIR Database showed that the novel allele KIR2DL1*031 differs from the closest allele, KIR2DL1*00302, by a non-synonymous mutation at CDS nt 188A>G (codon 42 GAG>GGG) in exon 4, which causes the amino acid change Glu42Gly. The sequence of the novel allele KIR2DL1*031 was submitted to GenBank under the accession number KP025960 and to the IPD-KIR Database under the submission number IWS40001982. The name KIR2DL1*031 has been officially assigned by the World Health Organization (WHO) Nomenclature Committee. An assay for cDNA cloning and haplotype sequencing of KIR2DL1 has been established, which has broad applications in KIR studies at the allelic level.
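
    The two coordinates given for the variant (CDS nt 188 and codon 42) can be reconciled with a short calculation. The Python sketch below assumes that codons are numbered from the first residue of the mature protein and that a 21-residue signal peptide precedes it; the abstract states neither point, so the leader length is an assumption chosen because it makes the two coordinates agree. The function names and the two-entry codon table are illustrative only.

```python
SIGNAL_PEPTIDE_LEN = 21  # assumption: leader residues not counted in codon numbering


def locate_cds_base(cds_pos: int, leader_len: int = SIGNAL_PEPTIDE_LEN):
    """Map a 1-based CDS position to (mature codon number, position in codon)."""
    codon_from_start = (cds_pos - 1) // 3 + 1
    pos_in_codon = (cds_pos - 1) % 3 + 1
    return codon_from_start - leader_len, pos_in_codon


def substitute(codon: str, pos_in_codon: int, new_base: str) -> str:
    """Return the codon with one base replaced (pos_in_codon is 1-based)."""
    return codon[:pos_in_codon - 1] + new_base + codon[pos_in_codon:]


# Only the two codons named in the abstract, from the standard genetic code.
CODON_TO_RESIDUE = {"GAG": "Glu", "GGG": "Gly"}

codon_number, base_index = locate_cds_base(188)
variant_codon = substitute("GAG", base_index, "G")
print(codon_number, base_index, variant_codon, CODON_TO_RESIDUE[variant_codon])
```

    Under these assumptions nt 188 falls on the second base of codon 42, and replacing A with G there turns GAG (Glu) into GGG (Gly), matching the reported Glu42Gly change.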

  11. Human uroporphyrinogen III synthase: Molecular cloning, nucleotide sequence, and expression of a full-length cDNA

    International Nuclear Information System (INIS)

    Tsai, Shihfeng; Bishop, D.F.; Desnick, R.J.

    1988-01-01

    Uroporphyrinogen III synthase, the fourth enzyme in the heme biosynthetic pathway, is responsible for conversion of the linear tetrapyrrole, hydroxymethylbilane, to the cyclic tetrapyrrole, uroporphyrinogen III. The deficient activity of URO-synthase is the enzymatic defect in the autosomal recessive disorder congenital erythropoietic porphyria. To facilitate the isolation of a full-length cDNA for human URO-synthase, the human erythrocyte enzyme was purified to homogeneity and 81 nonoverlapping amino acids were determined by microsequencing the N terminus and four tryptic peptides. Two synthetic oligonucleotide mixtures were used to screen 1.2 × 10^6 recombinants from a human adult liver cDNA library. Eight clones were positive with both oligonucleotide mixtures. Of these, dideoxy sequencing of the 1.3 kilobase insert from clone pUROS-2 revealed 5' and 3' untranslated sequences of 196 and 284 base pairs, respectively, and an open reading frame of 798 base pairs encoding a protein of 265 amino acids with a predicted molecular mass of 28,607 Da. The isolation and expression of this full-length cDNA for human URO-synthase should facilitate studies of the structure, organization, and chromosomal localization of this heme biosynthetic gene as well as the characterization of the molecular lesions causing congenital erythropoietic porphyria.
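
    The lengths reported for clone pUROS-2 can be cross-checked against each other. The sketch below assumes that the 798-base-pair open reading frame includes the stop codon, which the abstract does not say explicitly but which makes the stated 265 residues come out exactly; the variable names are illustrative.

```python
# Figures quoted in the abstract for clone pUROS-2.
UTR5_BP, ORF_BP, UTR3_BP = 196, 798, 284
PREDICTED_MASS_DA = 28_607

# Assumption: the 798 bp include the stop codon, which encodes no residue.
codons = ORF_BP // 3
residues = codons - 1

# The three segments should add up to roughly the 1.3 kb insert.
insert_bp = UTR5_BP + ORF_BP + UTR3_BP

mean_residue_mass = PREDICTED_MASS_DA / residues
print(residues, insert_bp, round(mean_residue_mass, 1))
```

    The sum of 1278 bp agrees with the 1.3 kilobase insert size given in the abstract.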

  12. Molecular cloning of growth hormone encoding cDNA of Indian

    Indian Academy of Sciences (India)

    A modified rapid amplification of cDNA ends (RACE) strategy has been developed for cloning highly conserved cDNA sequences. Using this modified method, the growth hormone (GH) encoding cDNA sequences of Labeo rohita, Cirrhina mrigala and Catla catla have been cloned, characterized and overexpressed in ...

  13. Brain cDNA clone for human cholinesterase

    International Nuclear Information System (INIS)

    McTiernan, C.; Adkins, S.; Chatonnet, A.; Vaughan, T.A.; Bartels, C.F.; Kott, M.; Rosenberry, T.L.; La Du, B.N.; Lockridge, O.

    1987-01-01

    A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum. The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase

  14. Nucleotide sequence of cloned cDNA for human sphingolipid activator protein 1 precursor

    International Nuclear Information System (INIS)

    Dewji, N.N.; Wenger, D.A.; O'Brien, J.S.

    1987-01-01

    Two cDNA clones encoding prepro-sphingolipid activator protein 1 (SAP-1) were isolated from a λgt11 human hepatoma expression library using polyclonal antibodies. These had inserts of ≅ 2 kilobases (λ-S-1.2 and λ-S-1.3) and both were homologous with a previously isolated clone (λ-S-1.1) for mature SAP-1. The authors report here the nucleotide sequence of the longer two EcoRI fragments of S-1.2 and S-1.3 that were not the same and the derived amino acid sequences of mature SAP-1 and its prepro form. The open reading frame encodes 19 amino acids, which are colinear with the amino-terminal sequence of mature SAP-1, and extends far beyond the predicted carboxyl terminus of mature SAP-1, indicating extensive carboxyl-terminal processing. The nucleotide sequence of cDNA encoding prepro-SAP-1 includes 1449 bases from the assigned initiation codon ATG at base-pair 472 to the stop codon TGA at base-pair 1921. The first 23 amino acids coded after the initiation ATG are characteristic of a signal peptide. The calculated molecular mass for a polypeptide encoded by 1449 bases is ≅ 53 kDa, in keeping with the reported value for pro-SAP-1. The data indicate that after removal of the signal peptide mature SAP-1 is generated by removing an additional 7 amino acids from the amino terminus and ≅ 373 amino acids from the carboxyl terminus. One potential glycosylation site was previously found in mature SAP-1. Three additional potential glycosylation sites are present in the processed carboxyl-terminal polypeptide, which they designate as P-2.
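
    The ≅ 53 kDa estimate follows from the coordinates quoted in the abstract. The sketch below assumes an average residue mass of 110 Da, a common rule of thumb that is not taken from the paper, and treats the 1449 bases as 483 complete codons. The constant and variable names are illustrative.

```python
AVG_RESIDUE_MASS_DA = 110  # assumption: rule-of-thumb average, not from the abstract

START_BP, STOP_BP = 472, 1921  # positions of the ATG and TGA given in the abstract

coding_bases = STOP_BP - START_BP
codons = coding_bases // 3
mass_kda = codons * AVG_RESIDUE_MASS_DA / 1000

# Residues left after the processing steps described in the abstract; the
# carboxyl-terminal figure (373) is itself approximate, so this is a rough value.
approx_mature_residues = codons - 23 - 7 - 373

print(coding_bases, codons, round(mass_kda, 1), approx_mature_residues)
```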

  15. Development of polymorphic genic-SSR markers by cDNA library sequencing in boxwood, Buxus spp. (Buxaceae)

    Science.gov (United States)

    Genic microsatellites or simple sequence repeat (genic-SSR) markers were developed in boxwood (Buxus taxa) for genetic diversity analysis, identification of taxa, and to facilitate breeding. cDNA libraries were developed from mRNA extracted from leaves of Buxus sempervirens ‘Vardar Valley’ and seque...

  16. Full-Length Venom Protein cDNA Sequences from Venom-Derived mRNA: Exploring Compositional Variation and Adaptive Multigene Evolution.

    Science.gov (United States)

    Modahl, Cassandra M; Mackessy, Stephen P

    2016-06-01

    Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. 
    This approach, requiring only venom, provides...

  17. cDNA encoding a polypeptide including a hevein sequence

    Energy Technology Data Exchange (ETDEWEB)

    Raikhel, N.V.; Broekaert, W.F.; Namhai Chua; Kush, A.

    1993-02-16

    A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids.

  18. Characterization of the porcine carboxypeptidase E cDNA

    DEFF Research Database (Denmark)

    Hreidarsdóttir, G.E.; Cirera, Susanna; Fredholm, Merete

    2007-01-01

    the sequence of the cDNA for the porcine CPE gene including all the coding region and the 3'-UTR region was generated. Comparisons with bovine, human, mouse, and rat CPE cDNA sequences showed that the coding regions of the gene are highly conserved both at the nucleotide and at the amino acid level. A very low...

  19. Isolation of full-length putative rat lysophospholipase cDNA using improved methods for mRNA isolation and cDNA cloning

    International Nuclear Information System (INIS)

    Han, J.H.; Stratowa, C.; Rutter, W.J.

    1987-01-01

    The authors have cloned a full-length putative rat pancreatic lysophospholipase cDNA by an improved mRNA isolation method and cDNA cloning strategy using [32P]-labelled nucleotides. These new methods allow the construction of a cDNA library from the adult rat pancreas in which the majority of recombinant clones contained complete sequences for the corresponding mRNAs. A previously recognized but unidentified long and relatively rare cDNA clone containing the entire sequence from the cap site at the 5' end to the poly(A) tail at the 3' end of the mRNA was isolated by single-step screening of the library. The size, amino acid composition, and the activity of the protein expressed in heterologous cells strongly suggest this mRNA codes for lysophospholipase.

  20. cDNA sequence analysis of a 29-kDa cysteine-rich surface antigen of pathogenic Entamoeba histolytica

    International Nuclear Information System (INIS)

    Torian, B.E.; Stroeher, V.L.; Stamm, W.E.; Flores, B.M.; Hagen, F.S.

    1990-01-01

    A λgt11 cDNA library was constructed from poly(U)-Sepharose-selected Entamoeba histolytica trophozoite RNA in order to clone and identify surface antigens. The library was screened with rabbit polyclonal anti-E. histolytica serum. A 700-base-pair cDNA insert was isolated and the nucleotide sequence was determined. The deduced amino acid sequence of the cDNA revealed a cysteine-rich protein. DNA hybridizations showed that the gene was specific to E. histolytica since the cDNA probe reacted with DNA from four axenic strains of E. histolytica but did not react with DNA from Entamoeba invadens, Acanthamoeba castellanii, or Trichomonas vaginalis. The insert was subcloned into the expression vector pGEX-1 and the protein was expressed as a fusion with the C terminus of glutathione S-transferase. Purified fusion protein was used to generate 22 monoclonal antibodies (mAbs) and a mouse polyclonal antiserum specific for the E. histolytica portion of the fusion protein. A 29-kDa protein was identified as a surface antigen when mAbs were used to immunoprecipitate the antigen from metabolically 35S-labeled live trophozoites. The surface location of the antigen was corroborated by mAb immunoprecipitation of a 29-kDa protein from surface-125I-labeled whole trophozoites as well as by the reaction of mAbs with live trophozoites in an indirect immunofluorescence assay performed at 4 °C. Immunoblotting with mAbs demonstrated that the antigen was present on four axenic isolates tested. mAbs recognized epitopes on the 29-kDa native antigen on some but not all clinical isolates tested.

  1. cDNA sequence analysis of a 29-kDa cysteine-rich surface antigen of pathogenic Entamoeba histolytica

    Energy Technology Data Exchange (ETDEWEB)

    Torian, B.E.; Stroeher, V.L.; Stamm, W.E. (Univ. of Washington, Seattle (USA)); Flores, B.M. (Louisiana State Univ. Medical Center, New Orleans (USA)); Hagen, F.S. (Zymogenetics Incorporated, Seattle, WA (USA))

    1990-08-01

    A λgt11 cDNA library was constructed from poly(U)-Sepharose-selected Entamoeba histolytica trophozoite RNA in order to clone and identify surface antigens. The library was screened with rabbit polyclonal anti-E. histolytica serum. A 700-base-pair cDNA insert was isolated and the nucleotide sequence was determined. The deduced amino acid sequence of the cDNA revealed a cysteine-rich protein. DNA hybridizations showed that the gene was specific to E. histolytica since the cDNA probe reacted with DNA from four axenic strains of E. histolytica but did not react with DNA from Entamoeba invadens, Acanthamoeba castellanii, or Trichomonas vaginalis. The insert was subcloned into the expression vector pGEX-1 and the protein was expressed as a fusion with the C terminus of glutathione S-transferase. Purified fusion protein was used to generate 22 monoclonal antibodies (mAbs) and a mouse polyclonal antiserum specific for the E. histolytica portion of the fusion protein. A 29-kDa protein was identified as a surface antigen when mAbs were used to immunoprecipitate the antigen from metabolically 35S-labeled live trophozoites. The surface location of the antigen was corroborated by mAb immunoprecipitation of a 29-kDa protein from surface-125I-labeled whole trophozoites as well as by the reaction of mAbs with live trophozoites in an indirect immunofluorescence assay performed at 4 °C. Immunoblotting with mAbs demonstrated that the antigen was present on four axenic isolates tested. mAbs recognized epitopes on the 29-kDa native antigen on some but not all clinical isolates tested.

  2. Hybridization-based antibody cDNA recovery for the production of recombinant antibodies identified by repertoire sequencing.

    Science.gov (United States)

    Valdés-Alemán, Javier; Téllez-Sosa, Juan; Ovilla-Muñoz, Marbella; Godoy-Lozano, Elizabeth; Velázquez-Ramírez, Daniel; Valdovinos-Torres, Humberto; Gómez-Barreto, Rosa E; Martinez-Barnetche, Jesús

    2014-01-01

    High-throughput sequencing of the antibody repertoire is enabling a thorough analysis of B cell diversity and clonal selection, which may improve the novel antibody discovery process. Theoretically, an adequate bioinformatic analysis could allow identification of candidate antigen-specific antibodies, requiring their recombinant production for experimental validation of their specificity. Gene synthesis is commonly used for the generation of recombinant antibodies identified in silico. Novel strategies that bypass gene synthesis could offer more accessible antibody identification and validation alternatives. We developed a hybridization-based recovery strategy that targets the complementarity-determining region 3 (CDRH3) for the enrichment of cDNA of candidate antigen-specific antibody sequences. Ten clonal groups of interest were identified through bioinformatic analysis of the heavy chain antibody repertoire of mice immunized with hen egg white lysozyme (HEL). cDNA from eight of the targeted clonal groups was recovered efficiently, leading to the generation of recombinant antibodies. One representative heavy chain sequence from each clonal group recovered was paired with previously reported anti-HEL light chains to generate full antibodies, later tested for HEL-binding capacity. The recovery process proposed represents a simple and scalable molecular strategy that could enhance antibody identification and specificity assessment, enabling a more cost-efficient generation of recombinant antibodies.

  3. Use of Non-Normalized, Non-Amplified cDNA for 454-Based RNA Sequencing of Fleshy Melon Fruit

    Directory of Open Access Journals (Sweden)

    Vitaly Portnoy

    2011-03-01

    The melon (Cucumis melo L.) fruit is an important crop and model system for the genomic study of both fleshy fruit development and the Cucurbitaceae family. To obtain an accurate representation of the melon fruit transcriptome based on expressed sequence tag (EST) abundance in 454-pyrosequencing data, we prepared double-stranded complementary DNA (cDNA) of melon without the usual amplification and normalization steps. A purification step was also included to eliminate small fragments. Complementary DNAs were obtained from 14 individual fruit libraries derived from two genotypes, separated into flesh and peel tissues, and sampled throughout fruit development. Pyrosequencing was performed using Genome Sequencer FLX (GS FLX) technology, resulting in 1,215,359 reads, with mean length of >200 nucleotides. The global digital expression data was validated by comparative reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) of 40 selected genes and expression patterns were similar for the two methods. The results indicate that high-quality, nonbiased cDNA for next-generation sequencing can be prepared from mature, fleshy fruit, which are notorious for difficulties in ribonucleic acid (RNA) preparation.
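
    Digital expression from read abundance usually comes down to scaling each gene's read count by sequencing depth. The sketch below shows one common scaling, reads per million. It is an illustration only: the record does not say which normalization the authors used, the gene names and counts are hypothetical, and in practice the total would be taken per library rather than from the pooled figure of 1,215,359 reads used here.

```python
def reads_per_million(gene_reads: int, total_reads: int) -> float:
    """Scale a raw read count to reads per million sequenced reads."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return gene_reads * 1_000_000 / total_reads


TOTAL_READS = 1_215_359  # pooled read count quoted in the abstract

# Hypothetical per-gene read counts, for illustration only.
hypothetical_counts = {"gene_A": 243, "gene_B": 12}

for name, count in hypothetical_counts.items():
    print(name, round(reads_per_million(count, TOTAL_READS), 1))
```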

  4. DeepBase: annotation and discovery of microRNAs and other noncoding RNAs from deep-sequencing data.

    Science.gov (United States)

    Yang, Jian-Hua; Qu, Liang-Hu

    2012-01-01

    Recent advances in high-throughput deep-sequencing technology have produced large numbers of short and long RNA sequences and enabled the detection and profiling of known and novel microRNAs (miRNAs) and other noncoding RNAs (ncRNAs) at unprecedented sensitivity and depth. In this chapter, we describe the use of deepBase, a database that we have developed to integrate all public deep-sequencing data and to facilitate the comprehensive annotation and discovery of miRNAs and other ncRNAs from these data. deepBase provides an integrative, interactive, and versatile web graphical interface to evaluate miRBase-annotated miRNA genes and other known ncRNAs, explores the expression patterns of miRNAs and other ncRNAs, and discovers novel miRNAs and other ncRNAs from deep-sequencing data. deepBase also provides a deepView genome browser to comparatively analyze these data at multiple levels. deepBase is available at http://deepbase.sysu.edu.cn/.

  5. DeepSimulator: a deep simulator for Nanopore sequencing

    KAUST Repository

    Li, Yu; Han, Renmin; Bi, Chongwei; Li, Mo; Wang, Sheng; Gao, Xin

    2017-01-01

    or assembled contigs, we simulate the electrical current signals by a context-dependent deep learning model, followed by a base-calling procedure to yield simulated reads. This workflow mimics the sequencing procedure more naturally. The thorough experiments

  6. Generation and analysis of large-scale expressed sequence tags (ESTs) from a full-length enriched cDNA library of porcine backfat tissue

    Directory of Open Access Journals (Sweden)

    Lee Hae-Young

    2006-02-01

    Background: Genome research in farm animals will expand our basic knowledge of the genetic control of complex traits, and the results will be applied in the livestock industry to improve meat quality and productivity, as well as to reduce the incidence of disease. A combination of quantitative trait locus mapping and microarray analysis is a useful approach to reduce the overall effort needed to identify genes associated with quantitative traits of interest. Results: We constructed a full-length enriched cDNA library from porcine backfat tissue. The estimated average size of the cDNA inserts was 1.7 kb, and the cDNA fullness ratio was 70%. In total, we deposited 16,110 high-quality sequences in the dbEST division of GenBank (accession numbers: DT319652-DT335761). For all the expressed sequence tags (ESTs), approximately 10.9 Mb of porcine sequence were generated with an average length of 674 bp per EST (range: 200–952 bp). Clustering and assembly of these ESTs resulted in a total of 5,008 unique sequences with 1,776 contigs (35.46%) and 3,232 singleton (64.54%) ESTs. From a total of 5,008 unique sequences, 3,154 (62.98%) were similar to other sequences, and 1,854 (37.02%) were identified as having no hit or low identity (Sus scrofa. Gene ontology (GO) annotation of unique sequences showed that approximately 31.7, 32.3, and 30.8% were assigned molecular function, biological process, and cellular component GO terms, respectively. A total of 1,854 putative novel transcripts resulted after comparison and filtering with the TIGR SsGI; these included a large percentage of singletons (80.64%) and a small proportion of contigs (13.36%). Conclusion: The sequence data generated in this study will provide valuable information for studying expression profiles using EST-based microarrays and assist in the condensation of current pig TCs into clusters representing longer stretches of cDNA sequences. The isolation of genes expressed in backfat tissue is the
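
    The counts in this abstract can be cross-checked with a few lines of arithmetic. The sketch below assumes the accession range is contiguous and that each accession is a two-letter prefix followed by a number; the helper names are illustrative. Computed from the stated counts, the contig share is 35.46% and the singleton share is 64.54% of the 5,008 unique sequences.

```python
def accession_count(first: str, last: str) -> int:
    """Number of accessions in a contiguous range such as DT319652-DT335761."""
    if first[:2] != last[:2]:
        raise ValueError("accessions must share the same prefix")
    return int(last[2:]) - int(first[2:]) + 1


def share(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


UNIQUE = 5008  # unique sequences after clustering and assembly

print(accession_count("DT319652", "DT335761"))   # deposited ESTs
print(share(1776, UNIQUE), share(3232, UNIQUE))  # contigs, singletons
print(share(3154, UNIQUE), share(1854, UNIQUE))  # similar, no hit or low identity
```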

  7. Cloning of the cDNA for human 12-lipoxygenase

    International Nuclear Information System (INIS)

    Izumi, T.; Hoshiko, S.; Radmark, O.; Samuelsson, B.

    1990-01-01

    A full-length cDNA clone encoding 12-lipoxygenase was isolated from a human platelet cDNA library by using a cDNA for human reticulocyte 15-lipoxygenase as probe for the initial screening. The cDNA had an open reading frame encoding 662 amino acid residues with a calculated molecular weight of 75,590. Three independent clones revealed minor heterogeneities in their DNA sequences. Thus, in three positions of the deduced amino acid sequence, there is a choice between two different amino acids. The deduced sequence from the clone plT3 showed 65% identity with human reticulocyte 15-lipoxygenase and 42% identity with human leukocyte 5-lipoxygenase. The 12-lipoxygenase cDNA recognized a 3.0-kilobase mRNA species in platelets and human erythroleukemia cells (HEL cells). Phorbol 12-tetradecanoyl 13-acetate induced megakaryocytic differentiation of HEL cells and 12-lipoxygenase activity and increased mRNA for 12-lipoxygenase. The identity of the cloned 12-lipoxygenase was assured by expression in a mammalian cell line (COS cells). Human platelet 12-lipoxygenase has been difficult to purify to homogeneity. The cloning of this cDNA will increase the possibilities to elucidate the structure and function of this enzyme

  8. PMS2 gene mutational analysis: direct cDNA sequencing to circumvent pseudogene interference.

    Science.gov (United States)

    Wimmer, Katharina; Wernstedt, Annekatrin

    2014-01-01

    The presence of highly homologous pseudocopies can compromise the mutation analysis of a gene of interest. In particular, when using PCR-based strategies, pseudogene co-amplification has to be effectively prevented. This is often achieved by using primers designed to be parental gene specific according to the reference sequence and by applying stringent PCR conditions. However, there are cases in which this approach is of limited utility. For example, it has been shown that the PMS2 gene exchanges sequences with one of its pseudogenes, named PMS2CL. This results in functional PMS2 alleles containing pseudogene-derived sequences at their 3'-end and in nonfunctional PMS2CL pseudogene alleles that contain gene-derived sequences. Hence, the paralogues cannot be distinguished according to the reference sequence. This shortcoming can be effectively circumvented by using direct cDNA sequencing. This approach is based on the selective amplification of PMS2 transcripts in two overlapping 1.6-kb RT-PCR products. In addition to avoiding pseudogene co-amplification and allele dropout, this method also has the advantage of effectively identifying deletions, splice mutations, and de novo retrotransposon insertions that escape detection by most DNA-based mutation analysis protocols.

  9. New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell-Culture Adapted Viral cDNA.

    Science.gov (United States)

    1987-10-13

    ...after multiple passages in vivo and in vitro. J. Gen. Virol. 67, 1741-1744. Sabin, A.B. (1985). Oral poliovirus vaccine: history of its development... Annual report: New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell-Culture Adapted Viral cDNA...

  10. Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh]

    Science.gov (United States)

    2011-01-01

    Background: Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results: In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified, for 2,877 of which PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4 to 10 and the polymorphism information content values ranged from 0.46 to 0.72. The neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion: We developed 550 validated genic...
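
    A minimal sketch of how SSR loci of this kind can be located in a contig, assuming a plain regular-expression scan rather than whatever tool the authors used (the record does not name one). It looks for perfect di- to hexanucleotide repeats spanning at least 18 bp and skips homopolymers and motifs that are themselves repeats of a shorter unit. Compound repeats are not detected or filtered here, and the function names and test sequence are hypothetical.

```python
import re


def is_reducible(motif: str) -> bool:
    """True if the motif is a repeat of a shorter unit (e.g. 'AA' or 'ATAT')."""
    k = len(motif)
    return any(k % d == 0 and motif == motif[:d] * (k // d) for d in range(1, k))


def find_ssrs(seq: str, min_len: int = 18, motif_sizes=(2, 3, 4, 5, 6)):
    """Return sorted (start, end, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k in motif_sizes:
        min_repeats = -(-min_len // k)  # ceiling division
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_reducible(motif):
                continue  # homopolymer, or already reported with a shorter motif
            repeats = (match.end() - match.start()) // k
            hits.append((match.start(), match.end(), motif, repeats))
    return sorted(hits)


# Hypothetical contig: an (AT)9 repeat, an (AAG)6 repeat, and a poly-A run.
contig = "GGC" + "AT" * 9 + "CCT" + "AAG" * 6 + "T" + "A" * 20
for hit in find_ssrs(contig):
    print(hit)
```

    Coordinates are 0-based and end-exclusive, following Python slicing, so `contig[start:end]` returns the repeat tract itself.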

  11. High-throughput verification of transcriptional starting sites by Deep-RACE

    DEFF Research Database (Denmark)

    Olivarius, Signe; Plessy, Charles; Carninci, Piero

    2009-01-01

    We present a high-throughput method for investigating the transcriptional starting sites of genes of interest, which we named Deep-RACE (Deep–rapid amplification of cDNA ends). Taking advantage of the latest sequencing technology, it allows the parallel analysis of multiple genes and is free...

  12. Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

    Science.gov (United States)

    Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

    2002-11-01

    The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.

  13. Heterogeneity of rat tropoelastin mRNA revealed by cDNA cloning

    International Nuclear Information System (INIS)

    Pierce, R.A.; Deak, S.B.; Stolle, C.A.; Boyd, C.D.

    1990-01-01

    A λgt11 library constructed from poly(A+) RNA isolated from aortic tissue of neonatal rats was screened for rat tropoelastin cDNAs. The first screen, utilizing a human tropoelastin cDNA clone, provided rat tropoelastin cDNAs spanning 2.3 kb of carboxy-terminal coding sequence and extended into the 3'-untranslated region. A subsequent screen using a 5' rat tropoelastin cDNA clone yielded clones extending into the amino-terminal signal sequence coding region. Sequence analysis of these clones has provided the complete derived amino acid sequence of rat tropoelastin and allowed alignment and comparison with published bovine cDNA sequence. While the overall structure of rat tropoelastin is similar to bovine sequence, numerous substitutions, deletions, and insertions demonstrated considerable heterogeneity between species. In particular, the pentapeptide repeat VPGVG, characteristic of all tropoelastins analyzed to date, is replaced in rat tropoelastin by a repeating pentapeptide, IPGVG. The hexapeptide repeat VGVAPG, the bovine elastin receptor binding peptide, is not encoded by rat tropoelastin cDNAs. Variations in coding sequence between rat tropoelastin cDNA clones were also found which may represent mRNA heterogeneity produced by alternative splicing of the rat tropoelastin pre-mRNA.

  14. 5'-end sequences of budding yeast full-length cDNA clones and quality scores - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available east_seq_qual.zip File URL: ftp://ftp.biosciencedbc.jp/archive/yeast_cdna/LATEST/...yeast_seq_qual.zip File size: 59.9MB Simple search URL http://togodb.biosciencedbc.jp/togodb/view/budding_yeast_cdna

  15. Cloning and expression of cDNA coding for bouganin.

    Science.gov (United States)

    den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

    2002-03-01

    Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had activity in a cell-free protein synthesis assay and toxicity on living cells comparable to those of the isolated native bouganin.

  16. Epitopes of human testis-specific lactate dehydrogenase deduced from a cDNA sequence

    International Nuclear Information System (INIS)

    Millan, J.L.; Driscoll, C.E.; LeVan, K.M.; Goldberg, E.

    1987-01-01

    The sequence and structure of human testis-specific L-lactate dehydrogenase [LDHC4, LDHX; (L)-lactate:NAD+ oxidoreductase, EC 1.1.1.27] has been derived from analysis of a complementary DNA (cDNA) clone comprising the complete protein coding region of the enzyme. From the deduced amino acid sequence, human LDHC4 is as different from rodent LDHC4 (73% homology) as it is from human LDHA4 (76% homology) and porcine LDHB4 (68% homology). Subunit homologies are consistent with the conclusion that the LDHC gene arose by at least two independent duplication events. Furthermore, the lower degree of homology between mouse and human LDHC4 and the appearance of this isozyme late in evolution suggests a higher rate of mutation in the mammalian LDHC genes than in the LDHA and -B genes. Comparison of exposed amino acid residues of discrete antigenic determinants of mouse and human LDHC4 reveals significant differences. Knowledge of the human LDHC4 sequence will help design human-specific peptides useful in the development of a contraceptive vaccine.

  17. Frameshift mutations in infectious cDNA clones of Citrus tristeza virus: a strategy to minimize the toxicity of viral sequences to Escherichia coli

    International Nuclear Information System (INIS)

    Satyanarayana, Tatineni; Gowda, Siddarame; Ayllon, Maria A.; Dawson, William O.

    2003-01-01

    The advent of reverse genetics revolutionized the study of positive-stranded RNA viruses that were amenable for cloning as cDNAs into high-copy-number plasmids of Escherichia coli. However, some viruses are inherently refractory to cloning in high-copy-number plasmids due to toxicity of viral sequences to E. coli. We report a strategy that is a compromise between infectivity of the RNA transcripts and toxicity to E. coli effected by introducing frameshift mutations into 'slippery sequences' near the viral 'toxicity sequences' in the viral cDNA. Citrus tristeza virus (CTV) has cDNA sequences that are toxic to E. coli. The original full-length infectious cDNA of CTV and a derivative replicon, CTV-ΔCla, cloned into pUC119, resulted in unusually limited E. coli growth. However, upon sequencing of these cDNAs, an additional uridinylate (U) was found in a stretch of U's between nts 3726 and 3731 that resulted in a change to a reading frame with a stop codon at nt 3734. Yet, in vitro produced RNA transcripts from these clones infected protoplasts, and the resulting progeny virus was repaired. Correction of the frameshift mutation in the CTV cDNA constructs resulted in increased infectivity of in vitro produced RNA transcripts, but also caused a substantial increase of toxicity to E. coli, now requiring 3 days to develop visible colonies. Frameshift mutations created in sequences not suspected to facilitate reading frame shifting and silent mutations introduced into oligo(U) regions resulted in complete loss of infectivity, suggesting that the oligo(U) region facilitated the repair of the frameshift mutation. Additional frameshift mutations introduced into other oligo(U) regions also resulted in transcripts with reduced infectivity similarly to the original clones with the +1 insertion. However, only the frameshift mutations introduced into oligo(U) regions that were near and before the toxicity region improved growth and stability in E. coli. 
    These data demonstrate that...
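
    The effect described in this abstract, where one extra residue in a homopolymeric run shifts the reading frame into a nearby stop codon, can be illustrated with a toy example. The sequence below is hypothetical and is not CTV sequence; it is written as cDNA, so the inserted uridylate appears as T. The codon table is the standard genetic code listed in TCAG order.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cdna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cdna) - 2, 3):
        amino_acid = CODON_TABLE[cdna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical ORF with a run of six T's (U's in the RNA).
original = "ATGGCATTTTTTGATAGCAAATAA"
run_end = original.index("TTTTTT") + 6
# +1 insertion at the end of the run shifts the downstream reading frame.
mutant = original[:run_end] + "T" + original[run_end:]

print(translate(original))  # full-length toy peptide
print(translate(mutant))    # truncated at the stop codon created just after the run
```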

  18. Increased mRNA expression of a laminin-binding protein in human colon carcinoma: Complete sequence of a full-length cDNA encoding the protein

    International Nuclear Information System (INIS)

    Yow, Hsiukang; Wong, Jau Min; Chen, Hai Shiene; Lee, C.; Steele, G.D. Jr.; Chen, Lanbo

    1988-01-01

    Reliable markers to distinguish human colon carcinoma from normal colonic epithelium are needed particularly for poorly differentiated tumors where no useful marker is currently available. To search for markers the authors constructed cDNA libraries from human colon carcinoma cell lines and screened for clones that hybridize to a greater degree with mRNAs of colon carcinomas than with their normal counterparts. Here they report one such cDNA clone that hybridizes with a 1.2-kilobase (kb) mRNA, the level of which is ∼9-fold greater in colon carcinoma than in adjacent normal colonic epithelium. Blot hybridization of total RNA from a variety of human colon carcinoma cell lines shows that the level of this 1.2-kb mRNA in poorly differentiated colon carcinomas is as high as or higher than that in well-differentiated carcinomas. Molecular cloning and complete sequencing of cDNA corresponding to the full-length open reading frame of this 1.2-kb mRNA unexpectedly show it to contain all the partial cDNA sequence encoding 135 amino acid residues previously reported for a human laminin receptor. The deduced amino acid sequence suggests that this putative laminin-binding protein from human colon carcinomas consists of 295 amino acid residues with interesting features. There is an unusual C-terminal 70-amino acid segment, which is trypsin-resistant and highly negatively charged

  19. Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22

    International Nuclear Information System (INIS)

    Tsai-Pflugfelder, M.; Liu, L.F.; Liu, A.A.; Tewey, K.M.; Whang-Peng, J.; Knutsen, T.; Huebner, K.; Croce, C.M.; Wang, J.C.

    1988-01-01

    Two overlapping cDNA clones encoding human DNA topoisomerase II were identified by two independent methods. In one, a human cDNA library in phage λ was screened by hybridization with a mixed oligonucleotide probe encoding a stretch of seven amino acids found in yeast and Drosophila DNA topoisomerase II; in the other, a different human cDNA library in a λgt11 expression vector was screened for the expression of antigenic determinants that are recognized by rabbit antibodies specific to human DNA topoisomerase II. The entire coding sequences of the human DNA topoisomerase II gene were determined from these and several additional clones, identified through the use of the cloned human TOP2 gene sequences as probes. Hybridization between the cloned sequences and mRNA and genomic DNA indicates that the human enzyme is encoded by a single-copy gene. The location of the gene was mapped to chromosome 17q21-22 by in situ hybridization of a cloned fragment to metaphase chromosomes and by hybridization analysis with a panel of mouse-human hybrid cell lines, each retaining a subset of human chromosomes

  20. Cloning and characterization of the human colipase cDNA

    International Nuclear Information System (INIS)

    Lowe, M.E.; Rosenblum, J.L.; McEwen, P.; Strauss, A.W.

    1990-01-01

    Pancreatic lipase hydrolyzes dietary triglycerides to monoglycerides and fatty acids. In the presence of bile salts, the activity of pancreatic lipase is markedly decreased. The activity can be restored by the addition of colipase, a low molecular weight protein secreted by the pancreas. The action of pancreatic lipase in the gut lumen is dependent upon its interaction with colipase. As a first step in elucidating the molecular events governing the interaction of lipase and colipase with each other and with fatty acids, a cDNA encoding human colipase was isolated from a λgt11 cDNA library with a rabbit polyclonal anti-human colipase antibody. The full-length 525 bp cDNA contained an open reading frame encoding 112 amino acids, including a 17 amino acid signal peptide. The predicted sequence contains 100% of the published protein sequence for human colipase determined by chemical methods, but predicts the presence of five additional NH2-terminal amino acids and four additional COOH-terminal amino acids. Comparison of the predicted protein sequence with the known sequences of colipase from other species reveals regions of extensive identity. The authors report, for the first time, a cDNA for colipase. The cDNA predicts a human procolipase and suggests that there may also be processing at the COOH-terminus. The regions of identity with colipase from other species will aid in defining the interaction with lipase and lipids through site-specific mutagenesis.

  1. Constructing and detecting a cDNA library for mites.

    Science.gov (United States)

    Hu, Li; Zhao, YaE; Cheng, Juan; Yang, YuanJun; Li, Chen; Lu, ZhaoHui

    2015-10-01

    RNA extraction and construction of complementary DNA (cDNA) libraries for mites have been quite challenging because tiny living mites are difficult to acquire and their hard chitin is difficult to break. The present study explores a better method of constructing a cDNA library for mites that will lay the foundation for transcriptome and molecular pathogenesis research. We selected Psoroptes cuniculi as the experimental subject and took the following steps to construct and verify the cDNA library. First, we combined liquid nitrogen grinding with TRIzol for total RNA extraction. Then, the switching mechanism at 5' end of the RNA transcript (SMART) technique was used to construct a full-length cDNA library. To evaluate the quality of the cDNA library, the library titer and recombination rate were calculated. The reliability of the cDNA library was assessed by sequencing and analyzing positive clones and genes amplified by specific primers. The results showed that the RNA concentration was 836 ng/μl and the absorbance ratio at 260/280 nm was 1.82. The library titer was 5.31 × 10^5 plaque-forming units (PFU)/ml and the recombination rate was 98.21%, indicating that the library was of good quality. Among the 33 expressed sequence tags (ESTs) of P. cuniculi, two clones of 1656 and 1658 bp were almost identical, with only three variable sites detected, and had an identity of 99.63% with that of Psoroptes ovis, indicating that the cDNA library was reliable. Further detection by specific primers demonstrated that the 553-bp Pso c II gene sequences of P. cuniculi had an identity of 98.56% with those of P. ovis, confirming that the cDNA library was not only reliable but also feasible.
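
    The two quality metrics named in this abstract, library titer and recombination rate, are simple ratios. The sketch below shows one common way to compute them. The function names, the plaque count, the dilution factor, the plated volume and the clone counts are all hypothetical values chosen for illustration; the abstract reports only the final figures, not the raw counts behind them.

```python
def library_titer(plaque_count, dilution_factor, plated_volume_ml):
    """Return the titer in plaque-forming units per ml.

    titer = plaques counted x dilution factor / volume plated (ml)
    """
    if plated_volume_ml <= 0:
        raise ValueError("plated volume must be positive")
    return plaque_count * dilution_factor / plated_volume_ml


def recombination_rate(recombinant_clones, total_clones):
    """Return the percentage of picked clones that carry an insert."""
    if total_clones <= 0:
        raise ValueError("total clone count must be positive")
    return 100.0 * recombinant_clones / total_clones


if __name__ == "__main__":
    # Hypothetical inputs, not taken from the study.
    titer = library_titer(plaque_count=531, dilution_factor=100,
                          plated_volume_ml=0.1)
    rate = recombination_rate(recombinant_clones=55, total_clones=56)
    print(f"titer: {titer:.3g} PFU/ml")
    print(f"recombination rate: {rate:.2f}%")
```

    Keeping the two calculations as separate functions makes it easy to substitute the counts from an actual plating experiment.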

  2. License - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Budding yeast cDNA sequencing project License to Use This Database Last updated : 2010/02/15 You may use this databas...ional License described below. The Standard License specifies the license terms regarding the use of this database... and the requirements you must follow in using this database. The Additiona...n the Standard License. Standard License The Standard License for this database is the license specified in ...the Creative Commons Attribution-Share Alike 2.1 Japan . If you use data from this database

  3. Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger

    Science.gov (United States)

    Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui

    2010-01-01

    In this study we successfully constructed a full-length cDNA library from the Siberian tiger, Panthera tigris altaica, the most well-known wild animal. Total RNA was extracted from Siberian tiger fibroblasts cultured in vitro. The titers of the primary and amplified libraries were 1.30×10^6 pfu/ml and 1.62×10^9 pfu/ml, respectively. The proportion of recombinants in the unamplified library was 90.5% and the average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142 bp were then analyzed. The BLASTX scores revealed that 53.9% of the sequences were classified as strong matches, 38.6% as nominal and 7.4% as weak matches. Of the ESTs, 28.0% were related to enzyme/catalytic protein, 20.9% to metabolism, 13.1% to transport, 12.1% to signal transducer/cell communication, 9.9% to structure protein, 3.9% to immunity protein/defense metabolism and 3.2% to cell cycle, and 8.9% were classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provides a useful platform for functional genomic research on Siberian tigers. PMID:20941376

  4. Nucleotide sequence of a cDNA coding for the barley seed protein CMa: an inhibitor of insect α-amylase

    DEFF Research Database (Denmark)

    Rasmussen, Søren Kjærsgård; Johansson, A.

    1992-01-01

    The primary structure of the insect alpha-amylase inhibitor CMa of barley seeds was deduced from a full-length cDNA clone pc43F6. Analysis of RNA from barley endosperm shows high levels 15 and 20 days after flowering. The cDNA predicts an amino acid sequence of 119 residues preceded by a signal...... peptide of 25 amino acids. Ala and Leu account for 55% of the signal peptide. CMa is 60-85% identical with alpha-amylase inhibitors of wheat, but shows less than 50% identity to trypsin inhibitors of barley and wheat. The 10 Cys residues are located in identical positions compared to the cereal inhibitor...

  5. High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus

    Directory of Open Access Journals (Sweden)

    Gomes Paula

    2010-10-01

    Background: Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments on the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity, we carried out a high-throughput sequence analysis of the freshly collected B. azoricus transcriptome, using gill tissue as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne, potentially infectious microorganisms. Additionally, a substantial EST data set was produced, from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent", the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results: A normalized cDNA library from gill tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino acid sequences, of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative Reverse-Transcription-Polymerase Chain Reactions (RT-PCR) and their RNA transcription level by quantitative PCR (q

  6. Geoseq: a tool for dissecting deep-sequencing datasets

    Directory of Open Access Journals (Sweden)

    Homann Robert

    2010-10-01

    Background: Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), the Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (DDBJ). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Results: Geoseq (http://geoseq.mssm.edu) provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Conclusions: Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to (a) identify differential isoform expression in mRNA-seq datasets, (b) identify miRNAs (microRNAs) in libraries and identify mature and star sequences in miRNAs, and (c) identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.

  7. Method for construction of normalized cDNA libraries

    Science.gov (United States)

    Soares, Marcelo B.; Efstratiadis, Argiris

    1998-01-01

    This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries.

  8. Analysis of a cDNA clone expressing a human autoimmune antigen: full-length sequence of the U2 small nuclear RNA-associated B antigen

    International Nuclear Information System (INIS)

    Habets, W.J.; Sillekens, P.T.G.; Hoet, M.H.; Schalken, J.A.; Roebroek, A.J.M.; Leunissen, J.A.M.; Van de Ven, W.J.M.; Van Venrooij, W.J.

    1987-01-01

    A U2 small nuclear RNA-associated protein, designated B'', was recently identified as the target antigen for autoimmune sera from certain patients with systemic lupus erythematosus and other rheumatic diseases. Such antibodies enabled them to isolate cDNA clone λHB''-1 from a phage λgt11 expression library. This clone appeared to code for the B'' protein as established by in vitro translation of hybrid-selected mRNA. The identity of clone λHB''-1 was further confirmed by partial peptide mapping and analysis of the reactivity of the recombinant antigen with monospecific and monoclonal antibodies. Analysis of the nucleotide sequence of the 1015-base-pair cDNA insert of clone λHB''-1 revealed a large open reading frame of 800 nucleotides containing the coding sequence for a polypeptide of 25,457 daltons. In vitro transcription of the λHB''-1 cDNA insert and subsequent translation resulted in a protein product with the molecular size of the B'' protein. These data demonstrate that clone λHB''-1 contains the complete coding sequence of this antigen. The deduced polypeptide sequence contains three very hydrophilic regions that might constitute RNA binding sites and/or antigenic determinants. These findings might have implications both for the understanding of the pathogenesis of rheumatic diseases as well as for the elucidation of the biological function of autoimmune antigens

  9. Isolation of a cDNA clone complementary to sequences for a 34-kilodalton protein which is a pp60v-src substrate.

    OpenAIRE

    Tomasiewicz, H G; Cook-Deegan, R; Chikaraishi, D M

    1984-01-01

    We have isolated a partial cDNA clone containing sequences complementary to a mRNA encoding a 34- to 36-kilodalton normal chicken cell protein which is a substrate for pp60v-src kinase activity. Using this 34-kilodalton cDNA clone as a probe, we determined that the size of the 34-kilodalton mRNA was 1,100 nucleotides and the level of the 34-kilodalton RNA was the same in various tissues of mature chickens but was significantly higher in chicken embryo fibroblast cells.

  10. Isolation and sequence of cDNA encoding a cytochrome P-450 from an insecticide-resistant strain of the house fly, Musca domestica.

    OpenAIRE

    Feyereisen, R; Koener, J F; Farnsworth, D E; Nebert, D W

    1989-01-01

    A cDNA expression library from phenobarbital-treated house fly (Musca domestica) was screened with rabbit antisera directed against partially purified house fly cytochrome P-450. Two overlapping clones with insert lengths of 1.3 and 1.5 kilobases were isolated. The sequence of a 1629-base-pair (bp) cDNA was obtained, with an open reading frame (nucleotides 81-1610) encoding a P-450 protein of 509 residues (Mr = 58,738). The insect P-450 protein contains a hydrophobic NH2 terminus and a 22-res...
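
    The reading-frame figures quoted in this abstract can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the quoted span includes the stop codon; neither assumption is stated in the abstract, and the function name is hypothetical.

```python
def orf_summary(start, end, includes_stop=True):
    """Return (length in nt, codon count, residue count) for an ORF.

    start and end are 1-based, inclusive nucleotide coordinates.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length // 3
    # A stop codon occupies one codon but adds no residue.
    residues = codons - 1 if includes_stop else codons
    return length, codons, residues


if __name__ == "__main__":
    length, codons, residues = orf_summary(81, 1610)
    print(length, codons, residues)
```

    Under these assumptions, nucleotides 81-1610 span 1530 nt, or 510 codons, which matches the 509 residues reported once the stop codon is subtracted.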

  11. Cloning of cDNA encoding steroid 11β-hydroxylase (P450c11)

    International Nuclear Information System (INIS)

    Chua, S.C.; Szabo, P.; Vitek, A.; Grzeschik, K.H.; John, M.; White, P.C.

    1987-01-01

    The authors have isolated bovine and human adrenal cDNA clones encoding the adrenal cytochrome P-450 specific for 11β-hydroxylation (P450c11). A bovine adrenal cDNA library constructed in the bacteriophage λ vector gt10 was probed with a previously isolated cDNA clone corresponding to part of the 3' untranslated region of the 4.2-kilobase (kb) mRNA encoding P450c11. Several clones with 3.2-kb cDNA inserts were isolated. Sequence analysis showed that they overlapped the original probe by 300 base pairs (bp). Combined cDNA and RNA sequence data demonstrated a continuous open reading frame of 1509 bases. P450c11 is predicted to contain 479 amino acid residues in the mature protein in addition to a 24-residue amino-terminal mitochondrial signal sequence. A bovine clone was used to isolate a homologous clone with a 3.5-kb insert from a human adrenal cDNA library. A region of 1100 bp was 81% homologous to 769 bp of the coding sequence of the bovine cDNA except for a 400-bp segment presumed to be an unprocessed intron. Hybridization of the human cDNA to DNA from a panel of human-rodent somatic cell hybrid lines and in situ hybridization to metaphase spreads of human chromosomes localized the gene to the middle of the long arm of chromosome 8. These data should be useful in developing reagents for heterozygote detection and prenatal diagnosis of 11β-hydroxylase deficiency, the second most frequent cause of congenital adrenal hyperplasia

  12. Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

    OpenAIRE

    Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

    1988-01-01

    Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding t...

  13. cDNA cloning, characterization and expression of an endosperm-specific barley peroxidase

    DEFF Research Database (Denmark)

    Rasmussen, Søren Kjærsgård; Welinder, K.G.; Hejgaard, J.

    1991-01-01

    A barley peroxidase (BP 1) of pI ca. 8.5 and M(r) 37000 has been purified from mature barley grains. Using antibodies towards peroxidase BP 1, a cDNA clone (pcR7) was isolated from a cDNA expression library. The nucleotide sequence of pcR7 gave a derived amino acid sequence identical to the 158 C...

  14. Nucleotide sequence of a human cDNA encoding a ras-related protein (rap1B)

    Energy Technology Data Exchange (ETDEWEB)

    Pizon, V; Lerosey, I; Chardin, P; Tavitian, A [INSERM, Paris (France)]

    1988-08-11

    The authors have previously characterized two human ras-related genes rap1 and rap2. Using the rap1 clone as probe they isolated and sequenced a new rap cDNA encoding the 184aa rap1B protein. The rap1B protein is 95% identical to rap1 and shares several properties with the ras protein suggesting that it could bind GTP/GDP and have a membrane location. As for rap1, the structural characteristics of rap1B suggest that the rap and ras proteins might interact on the same effector.

  15. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    International Nuclear Information System (INIS)

    Ahn, T.G.; Cohn, D.V.; Gorr, S.U.; Ornstein, D.L.; Kashdan, M.A.; Levine, M.A.

    1987-01-01

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary λgt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysogens demonstrated specific antibody binding to an SP4B/β-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lysates with SP-I antiserum yielded parallel displacement curves of 125I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene.

  16. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    Energy Technology Data Exchange (ETDEWEB)

    Ahn, T.G.; Cohn, D.V.; Gorr, S.U.; Ornstein, D.L.; Kashdan, M.A.; Levine, M.A.

    1987-07-01

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary λgt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysogens demonstrated specific antibody binding to an SP4B/β-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lysates with SP-I antiserum yielded parallel displacement curves of 125I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene.

  17. Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.

    Science.gov (United States)

    Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A

    2017-07-01

    Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.

  18. Cloning, characterization and heterologous expression of epoxide hydrolase-encoding cDNA sequences from yeasts belonging to the genera Rhodotorula and Rhodosporidium

    NARCIS (Netherlands)

    Visser, H.; Weijers, C.A.G.M.; Ooyen, van A.J.J.; Verdoes, J.C.

    2002-01-01

    Epoxide hydrolase-encoding cDNA sequences were isolated from the basidiomycetous yeast species Rhodosporidium toruloides CBS 349, Rhodosporidium toruloides CBS 14 and Rhodotorula araucariae CBS 6031 in order to evaluate the molecular data and potential application of this type of enzymes. The

  19. DNA Replication Profiling Using Deep Sequencing.

    Science.gov (United States)

    Saayman, Xanita; Ramos-Pérez, Cristina; Brown, Grant W

    2018-01-01

    Profiling of DNA replication during progression through S phase allows a quantitative snap-shot of replication origin usage and DNA replication fork progression. We present a method for using deep sequencing data to profile DNA replication in S. cerevisiae.

  20. Large-scale Identification of Expressed Sequence Tags (ESTs) from Nicotiana tabacum by Normalized cDNA Library Sequencing

    Directory of Open Access Journals (Sweden)

    Alvarez S Perez

    2014-12-01

    An expressed sequence tag (EST) resource for tobacco plants (Nicotiana tabacum) was established using high-throughput sequencing of randomly selected clones from one cDNA library representing a range of plant organs (leaf, stem, root and root base). Over 5000 ESTs were generated from the 3' ends of 8000 clones, analyzed by BLAST searches and categorized functionally. All annotated ESTs were classified into 18 functional categories; unique transcripts involved in energy were the largest group, accounting for 831 (32.32%) of the annotated ESTs. After excluding 2450 non-significant tentative unique transcripts (TUTs), 100 unique sequences (1.67% of total TUTs) were identified from the N. tabacum database. In the array result, two genes strongly related to the tobacco mosaic virus (TMV) were obtained: one basic form of pathogenesis-related protein 1 precursor (TBT012G08) and ubiquitin (TBT087G01). Both of them were found in the variety Hongda. Some other important genes were classified into two groups: one implicated in plant development, such as genes related to photosynthetic processes (chlorophyll a-b binding protein, photosystem I, ferredoxin I and III, ATP synthase), and a further group including genes related to plant stress response (ubiquitin, ubiquitin-like protein SMT3, glycine-rich RNA binding protein, histones and metallothionein). The interesting finding in this study is that two of these genes have never been reported before in N. tabacum (ubiquitin-like protein SMT3 and metallothionein). The array results were confirmed using quantitative PCR.

  1. Two human cDNA molecules coding for the Duchenne muscular dystrophy (DMD) locus are highly homologous

    Energy Technology Data Exchange (ETDEWEB)

    Rosenthal, A.; Speer, A.; Billwitz, H. (Zentralinstitut fuer Molekularbiologie, Berlin-Buch (Germany Democratic Republic)); Cross, G.S.; Forrest, S.M.; Davies, K.E. (Univ. of Oxford (England))

    1989-07-11

    Recently the complete sequence of the human fetal cDNA coding for the Duchenne muscular dystrophy (DMD) locus was reported and a 3,685 amino acid long, rod-shaped cytoskeletal protein (dystrophin) was predicted as the protein product. Independently, the authors have isolated and sequenced different DMD cDNA molecules from human adult and fetal muscle. The complete 12.5 kb long sequence of all their cDNA clones has now been determined and they report here the nucleotide (nt) and amino acid (aa) differences between the sequences of both groups. The cDNA sequence comprises the whole coding region but lacks the first 110 nt from the 5′-untranslated region and the last 1,417 nt of the 3′-untranslated region. They have found 11 nt differences (approximately 99.9% homology), of which 7 occurred at the aa level.
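
    The "approximately 99.9%" figure follows from the number of mismatches over the length compared. The sketch below reproduces that arithmetic. It assumes the comparison spans roughly 12,500 nt, based on the 12.5 kb quoted; the exact aligned length is not given in the abstract, and the function name is hypothetical.

```python
def percent_identity(mismatches, aligned_length):
    """Return the percentage of identical positions in an ungapped comparison."""
    if aligned_length <= 0:
        raise ValueError("aligned length must be positive")
    if not 0 <= mismatches <= aligned_length:
        raise ValueError("mismatch count must lie within the aligned length")
    return 100.0 * (aligned_length - mismatches) / aligned_length


if __name__ == "__main__":
    # 11 differences over an assumed 12,500 nt comparison.
    print(f"{percent_identity(11, 12_500):.2f}%")
```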

  2. An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

    Science.gov (United States)

    Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

    2011-01-01

    cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

  3. Construction and characterization of cDNA library for IRM-2 mice

    International Nuclear Information System (INIS)

    Wang Qin; Li Jin; Song Li; Liu Qiang; Yue Jingyin; Mu Chuanjie; Tang Weisheng; Fan Feiyue

    2010-01-01

    Objective: To screen and isolate the radioresistance related genes of IRM-2 mice. Methods: A cDNA library of IRM-2 mice was constructed by the SMART technique. Total RNA was isolated from spleens of IRM-2 male mice. The first-strand cDNA was synthesized by using PowerScript reverse transcriptase, and double-strand cDNA was synthesized and amplified by long PCR. The PCR products were purified and digested with restriction enzyme Sfi I. The ds-cDNA fragment less than 500 bp was fractionated and ligated to the Sfi I-digested pDNR-LIB vector. The ligation mixture was transformed into E. coli DH5α by electroporation to generate the unamplified cDNA library. The quality of the cDNA library was assessed by PCR. 130 clones from the cDNA library were sequenced and compared with the GenBank database. Results: The cDNA library contained 2.25 × 10^6 independent clones with an average insert size of 1.2 kb. The recombination and full-length ratios were 95% and 55%, respectively. Twenty-one EST sequences from the cDNA library differed from the known mouse genes and were registered in the GenBank EST database under accession numbers DW474856-DW474876. Conclusions: A cDNA library of IRM-2 mice has been constructed successfully. The 21 ESTs imply that radioresistance-related genes may be present in IRM-2 mice, which lays a foundation for isolating and identifying radioresistance related genes in further study. (authors)

  4. cDNA, genomic sequence cloning and overexpression of ribosomal protein S25 gene (RPS25) from the Giant Panda.

    Science.gov (United States)

    Hao, Yan-Zhe; Hou, Wan-Ru; Hou, Yi-Ling; Du, Yu-Jie; Zhang, Tian; Peng, Zheng-Song

    2009-11-01

    RPS25, a component of the 40S small ribosomal subunit, is encoded by the RPS25 gene, which is specific to eukaryotes. Few studies have examined the RPS25 gene in animals. The Giant Panda (Ailuropoda melanoleuca), known as a "living fossil", is of increasing concern to the world community. Studies on RPS25 of the Giant Panda could provide scientific data for inquiring into the hereditary traits of the gene and formulating a protective strategy for the Giant Panda. The cDNA of RPS25 cloned from the Giant Panda is 436 bp in size, containing an open reading frame of 378 bp encoding 125 amino acids. The genomic sequence is 1,992 bp long and possesses four exons and three introns. Alignment by BLAST analysis indicated that the coding sequence shows high homology to those of Homo sapiens, Bos taurus, Mus musculus and Rattus norvegicus: 92.6, 94.4, 89.2 and 91.5%, respectively. Primary structure analysis revealed that the molecular weight of the putative RPS25 protein is 13.7421 kDa with a theoretical pI of 10.12. Topology prediction showed one N-glycosylation site, one cAMP- and cGMP-dependent protein kinase phosphorylation site, two protein kinase C phosphorylation sites and one tyrosine kinase phosphorylation site in the RPS25 protein of the Giant Panda. The RPS25 gene was overexpressed in E. coli BL21, and Western blotting of the RPS25 protein was also performed. The results indicated that the RPS25 gene can be expressed in E. coli and that the RPS25 protein fused with an N-terminal His tag accumulated as the expected 17.4 kDa polypeptide. The cDNA and the genomic sequence of RPS25 were cloned successfully for the first time from the Giant Panda using RT-PCR and touchdown PCR, respectively, and both were sequenced and analyzed preliminarily; the cDNA of the RPS25 gene was then overexpressed in E. coli BL21 and immunoblotted, which is the first

  5. Serine Protease Variants Encoded by Echis ocellatus Venom Gland cDNA: Cloning and Sequencing Analysis

    Directory of Open Access Journals (Sweden)

    S. S. Hasson

    2010-01-01

    Envenoming by the Echis saw-scaled viper is the leading cause of death and morbidity in Africa due to snake bite. Despite its medical importance, there have been few investigations into the toxin composition of the venom of this viper. Here, we report the cloning of cDNA sequences encoding four groups or isoforms of the haemostasis-disruptive serine protease proteins (SPs) from the venom glands of Echis ocellatus. All these SP sequences encoded the cysteine residue scaffold that forms the six disulphide bonds responsible for the characteristic tertiary structure of venom serine proteases. All the Echis ocellatus EoSP groups showed varying degrees of sequence similarity to published viper venom SPs. However, these groups also showed marked intercluster sequence conservation, which was significantly different from that of previously published viper SPs. Because viper venom SPs exhibit a high degree of sequence similarity and yet exert profoundly different effects on the mammalian haemostatic system, no attempt was made to assign functionality to the new Echis ocellatus EoSPs on the basis of sequence alone. The extraordinary level of interspecific and intergeneric sequence conservation exhibited by the Echis ocellatus EoSPs and analogous serine proteases from other viper species leads us to speculate that antibodies to representative molecules should neutralise (that we will exploit, by epidermal DNA immunization) the biological function of this important group of venom toxins in vipers that are distributed throughout Africa, the Middle East, and the Indian subcontinent.

  6. An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

    Science.gov (United States)

    Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

    2004-01-01

    Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051

  7. Cloning and expression of a cDNA encoding human sterol carrier protein 2

    International Nuclear Information System (INIS)

    Yamamoto, Ritsu; Kallen, C.B.; Babalola, G.O.; Rennert, H.; Strauss, J.F. III; Billheimer, J.T.

    1991-01-01

    The authors report the cloning and expression of a cDNA encoding human sterol carrier protein 2 (SCP2). The 1.3-kilobase (kb) cDNA contains an open reading frame which encompasses a 143-amino acid sequence which is 89% identical to the rat SCP2 amino acid sequence. The deduced amino acid sequence of the polypeptide reveals a 20-residue amino-terminal leader sequence in front of the mature polypeptide, which contains a carboxyl-terminal tripeptide (Ala-Lys-Leu) related to the peroxisome targeting sequence. The expressed cDNA in COS-7 cells yields a 15.3-kDa polypeptide and increased amounts of a 13.2-kDa polypeptide, both reacting with a specific rabbit antiserum to rat liver SCP2. The cDNA insert hybridizes with 3.2- and 1.8-kb mRNA species in human liver poly(A)+ RNA. In human fibroblasts and placenta the 1.8-kb mRNA was most abundant. Southern blot analysis suggests either that there are multiple copies of the SCP2 gene in the human genome or that the SCP2 gene is very large. Coexpression of the SCP2 cDNA with expression vectors for cholesterol side-chain cleavage enzyme and adrenodoxin resulted in a 2.5-fold enhancement of progestin synthesis over that obtained with expression of the steroidogenic enzyme system alone. These findings are concordant with the notion that SCP2 plays a role in regulating steroidogenesis, among other possible functions.

  8. Cloning and expression of human deoxycytidine kinase cDNA

    International Nuclear Information System (INIS)

    Chottiner, E.G.; Shewach, D.S.; Datta, N.S.; Ashcraft, E.; Gribbin, D.; Ginsburg, D.; Fox, I.H.; Mitchell, B.S.

    1991-01-01

    Deoxycytidine (dCyd) kinase is required for the phosphorylation of several deoxyribonucleosides and certain nucleoside analogs widely employed as antiviral and chemotherapeutic agents. Detailed analysis of this enzyme has been limited, however, by its low abundance and instability. Using oligonucleotides based on primary amino acid sequence derived from purified dCyd kinase, the authors have screened T-lymphoblast cDNA libraries and identified a cDNA sequence that encodes a 30.5-kDa protein corresponding to the subunit molecular mass of the purified protein. Expression of the cDNA in Escherichia coli results in a 40-fold increase in dCyd kinase activity over control levels. Northern blot analysis reveals a single 2.8-kilobase mRNA expressed in T lymphoblasts at 5- to 10-fold higher levels than in B lymphoblasts, and decreased dCyd kinase mRNA levels are present in T-lymphoblast cell lines resistant to arabinofuranosylcytosine and dideoxycytidine. These findings document that this cDNA encodes the T-lymphoblast dCyd kinase responsible for the phosphorylation of dAdo and dGuo as well as dCyd and arabinofuranosylcytosine

  9. Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum

    Science.gov (United States)

    2011-01-01

    Background Parasitoid insects manipulate their hosts' physiology by injecting various factors into their host upon parasitization. Transcriptomic approaches provide a powerful way to study insect host-parasitoid interactions at the molecular level. In order to investigate the effects of parasitization by an ichneumonid wasp (Diadegma semiclausum) on the host (Plutella xylostella), the larval transcriptome profile was analyzed using a short-read deep sequencing method (Illumina). Symbiotic polydnaviruses (PDVs) associated with ichneumonid parasitoids, known as ichnoviruses, play significant roles in host immune suppression and developmental regulation. In the current study, D. semiclausum ichnovirus (DsIV) genes expressed in P. xylostella were identified and their sequences compared with other reported PDVs. Five of these genes encode proteins of unknown identity that have not previously been reported. Results De novo assembly of cDNA sequence data generated 172,660 contigs between 100 and 10,000 bp in length, with 35% of them > 200 bp in length. Parasitization had significant impacts on expression levels of 928 identified insect host transcripts. Gene ontology data illustrated that the majority of the differentially expressed genes are involved in binding, catalytic activity, and metabolic and cellular processes. In addition, the results show that transcription levels of antimicrobial peptides, such as gloverin, cecropin E and lysozyme, were up-regulated after parasitism. Expression of ichnovirus genes was detected in parasitized larvae, with 19 unique sequences identified from five PDV gene families including vankyrin, viral innexin, repeat elements, a cysteine-rich motif, and polar residue rich protein. Vankyrin 1 and repeat element 1 genes showed the highest transcription levels among the DsIV genes. Conclusion This study provides detailed information on differential expression of P. xylostella larval genes following parasitization, DsIV genes expressed in the

  10. Cloning and sequencing of the cDNA encoding a core protein of the paired helical filament of Alzheimer's disease: Identification as the microtubule-associated protein tau

    International Nuclear Information System (INIS)

    Goedert, M.; Wischik, C.M.; Crowther, R.A.; Walker, J.E.; Klug, A.

    1988-01-01

    Screening of cDNA libraries prepared from the frontal cortex of an Alzheimer's disease patient and from fetal human brain has led to isolation of the cDNA for a core protein of the paired helical filament of Alzheimer's disease. The partial amino acid sequence of this core protein was used to design synthetic oligonucleotide probes. The cDNA encodes a protein of 352 amino acids that contains a characteristic amino acid repeat in its carboxyl-terminal half. This protein is highly homologous to the sequence of the mouse microtubule-associated protein tau and thus constitutes the human equivalent of mouse tau. RNA blot analysis indicates the presence of two major transcripts, 6 and 2 kilobases long, with a wide distribution in normal human brain. Tau protein mRNAs were found in normal amounts in the frontal cortex from patients with Alzheimer's disease. The proof that at least part of tau protein forms a component of the paired helical filament core opens the way to understanding the mode of formation of paired helical filaments and thus, ultimately, the pathogenesis of Alzheimer's disease

  11. Fiscal 1998 achievement report. Industrial technology research and development project. (Strategic human cDNA genome application technology development); 1998 nendo senryakuteki hito cDNA genome oyo gijutsu kaihatsu seika hokokusho

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2000-03-01

    A human genome related project named above was started, and studies were conducted for base sequence determination and function analysis for approximately 10,000 kinds of full-length or long-chain human cDNA clones owned by research organizations in this country. The Institute of Medical Science of University of Tokyo and Helix Research Institute dealt with a full-length human cDNA library constructed by oligo-capping, and determined the base sequences of all specimens in the library. The Kazusa DNA Research Institute determined partial sequences for long-chain clones which are not shorter than 4-5kbp, and determined entire sequences for some bases. The obtained base sequence data were subjected to homology analysis, the base sequences were converted into amino acid sequences, and functions of proteins were predicted. In the analysis of gene functions, ATAC-PCR (adaptor tagged competitive-polymerase chain reaction) was applied to the clones covered by this project, and a database was prepared by use of the results of analyses of frequency-related information. For the preparation of a comprehensive gene expression profile, technologies for cDNA microarray construction were established. (NEDO)

  12. cDNA fingerprinting of osteoprogenitor cells to isolate differentiation stage-specific genes.

    OpenAIRE

    Candeliere, G A; Rao, Y; Floh, A; Sandler, S D; Aubin, J E

    1999-01-01

    A cDNA fingerprinting strategy was developed to identify genes based on their differential expression pattern during osteoblast development. Preliminary biological and molecular staging of cDNA pools prepared by global amplification PCR allowed discriminating choices to be made in selection of expressed sequence tags (ESTs) to be isolated. Sequencing of selected ESTs confirmed that both known and novel genes can be isolated from any developmental stage of interest, e.g. from primitive progen...

  13. A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries.

    Science.gov (United States)

    Asamizu, E; Nakamura, Y; Sato, S; Tabata, S

    2000-06-30

    For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.
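    The group counts quoted in this abstract can be cross-checked with simple arithmetic. The sketch below only restates the reported figures; the variable names are illustrative, and reading "923 regions hit, 499 by public ESTs" as a simple difference is an assumption about how the abstract's numbers relate.

```python
# EST group counts quoted in the abstract (variable names are illustrative).
groups = {
    "known function": 4816,
    "hypothetical genes": 1864,
    "novel sequences": 5348,
}
non_redundant_groups = 12028

# The three categories should add up to the number of non-redundant groups.
total = sum(groups.values())

# Share of each category, in percent of all non-redundant groups.
shares = {name: round(100 * n / non_redundant_groups, 1) for name, n in groups.items()}

# Assumption: regions hit by at least one EST, minus those also hit by
# public ESTs, gives the regions hit only by ESTs from this project.
regions_hit = 923
regions_hit_by_public = 499
regions_new = regions_hit - regions_hit_by_public

print(total, shares, regions_new)
```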

  14. Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor

    International Nuclear Information System (INIS)

    Antalis, T.M.; Clark, M.A.; Barnes, T.; Lehrbach, P.R.; Devine, P.L.; Schevzov, G.; Goss, N.H.; Stephens, R.W.; Tolstoshev, P.

    1988-01-01

    Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the λ PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.

  15. RICD: A rice indica cDNA database resource for rice functional genomics

    Directory of Open Access Journals (Sweden)

    Zhang Qifa

    2008-11-01

    Background The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties, Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones. Results Rice Indica cDNA Database (RICD) is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a range of information for each cDNA, including sequences, protein domain annotations, similarity search results, SNP and InDel information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB) and The TIGR Rice Genome Annotation Resource, the expression atlas in RiceGE, and the variation report in Gramene. Conclusion The online rice indica cDNA database provides a cDNA resource with comprehensive information to researchers for functional analysis of the indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.

  16. Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

    Science.gov (United States)

    Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

    1988-02-01

    Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.

  17. Predicting effects of noncoding variants with deep learning-based sequence model.

    Science.gov (United States)

    Zhou, Jian; Troyanskaya, Olga G

    2015-10-01

    Identifying functional effects of noncoding variants is a major challenge in human genetics. To predict the noncoding-variant effects de novo from sequence, we developed a deep learning-based algorithmic framework, DeepSEA (http://deepsea.princeton.edu/), that directly learns a regulatory sequence code from large-scale chromatin-profiling data, enabling prediction of chromatin effects of sequence alterations with single-nucleotide sensitivity. We further used this capability to improve prioritization of functional variants including expression quantitative trait loci (eQTLs) and disease-associated variants.

  18. Construction of Infectious cDNA Clone of a Chrysanthemum stunt viroid Korean Isolate

    Directory of Open Access Journals (Sweden)

    Ju-Yeon Yoon

    2014-03-01

    Chrysanthemum stunt viroid (CSVd), a noncoding infectious RNA molecule, causes serious economic losses in chrysanthemum for 3 or 4 years after its first infection. Monomeric cDNA clones of CSVd isolate SK1 (CSVd-SK1) were constructed in the plasmids pGEM-T easy vector and pUC19 vector. Linear positive-sense transcripts synthesized in vitro from the full-length monomeric cDNA clones of CSVd-SK1 could systemically infect tomato seedlings and chrysanthemum plants, suggesting that the linear CSVd RNA transcribed from the cDNA clones could be replicated as efficiently as circular CSVd in host species. However, direct inoculation of plasmid cDNA clones containing full-length monomeric cDNA of CSVd-SK1 failed to infect tomato and chrysanthemum, and linear negative-sense transcripts from the plasmid DNAs were not infectious in the two plant species. The cDNA sequences of progeny viroid in systemically infected tomato and chrysanthemum showed a few substitutions at a specific nucleotide position, but there were no deletions or insertions in the sequences of the CSVd progeny from tomato and chrysanthemum plants.

  19. Avoiding cross hybridization by choosing nonredundant targets on cDNA arrays

    DEFF Research Database (Denmark)

    Nielsen, Henrik Bjørn; Knudsen, Steen

    2002-01-01

    PROBEWIZ designs PCR primers for amplifying probes for cDNA arrays. The probes are designed to have minimal homology to other expressed sequences from a given organism. The primer selection is based on user-defined penalties for homology, primer quality, and proximity to the 3' end.
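    Selection by user-defined penalties can be pictured as minimising a weighted sum over candidate probes. The sketch below is a hypothetical illustration of that general idea, not PROBEWIZ's actual scoring: the `Candidate` class, the weights, and all penalty values are invented for the example.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A hypothetical candidate probe with three penalty terms (0 = ideal)."""
    name: str
    homology: float         # similarity to other expressed sequences
    primer_quality: float   # deviation from ideal primer properties
    distance_3prime: float  # distance from the 3' end, scaled to 0..1


def total_penalty(c: Candidate, weights: dict) -> float:
    """Weighted sum of the three penalties; lower is better."""
    return (weights["homology"] * c.homology
            + weights["primer_quality"] * c.primer_quality
            + weights["distance_3prime"] * c.distance_3prime)


def best_candidate(candidates: list, weights: dict) -> Candidate:
    """Return the candidate with the smallest weighted penalty."""
    return min(candidates, key=lambda c: total_penalty(c, weights))


# Invented weights: cross-hybridization risk is penalised most heavily.
weights = {"homology": 3.0, "primer_quality": 1.0, "distance_3prime": 0.5}
candidates = [
    Candidate("probe_a", homology=0.1, primer_quality=0.4, distance_3prime=0.8),
    Candidate("probe_b", homology=0.6, primer_quality=0.1, distance_3prime=0.1),
    Candidate("probe_c", homology=0.2, primer_quality=0.2, distance_3prime=0.3),
]
best = best_candidate(candidates, weights)
print(best.name, round(total_penalty(best, weights), 2))
```

    Raising the homology weight relative to the others shifts the choice toward probes with less similarity to other transcripts, at the cost of primer quality or 3' proximity.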

  20. Isolation of an insulin-like growth factor II cDNA with a unique 5' untranslated region from human placenta

    International Nuclear Information System (INIS)

    Shen, Shujane; Daimon, Makoto; Wang, Chunyeh; Ilan, J.; Jansen, M.

    1988-01-01

    Human insulin-like growth factor II (IGF-II) cDNA from a placental library was isolated and sequenced. The 5' untranslated region (5'-UTR) sequence of this cDNA differs completely from that of adult human liver and has considerable base sequence identity to the same region of an IGF-II cDNA of a rat liver cell line, BRL-3A. Human placental poly(A)+ RNA was probed with either the 5'-UTR of the isolated human placental IGF-II cDNA or the 5'-UTR of the IGF-II cDNA obtained from adult human liver. No transcripts were detected by using the 5'-UTR of the adult liver IGF-II as the probe. In contrast, three transcripts of 6.0, 3.2, and 2.2 kilobases were detected by using the 5'-UTR of the placental IGF-II cDNA as the probe or the probe from the coding sequence. A fourth IGF-II transcript of 4.9 kilobases, presumably containing a 5'-UTR consisting of a base sequence dissimilar to that of either IGF-II 5'-UTR, was apparent. Therefore, the IGF-II transcripts detected may be products of alternative splicing, as their 5'-UTR sequence is contained within the human IGF-II gene, or they may be a consequence of alternative promoter utilization in placenta.

  1. Transcriptome sequences resolve deep relationships of the grape family.

    Science.gov (United States)

    Wen, Jun; Xiong, Zhiqiang; Nie, Ze-Long; Mao, Likai; Zhu, Yabing; Kan, Xian-Zhao; Ickert-Bond, Stefanie M; Gerrath, Jean; Zimmer, Elizabeth A; Fang, Xiao-Dong

    2013-01-01

    Previous phylogenetic studies of the grape family (Vitaceae) yielded poorly resolved deep relationships, thus impeding our understanding of the evolution of the family. Next-generation sequencing now offers access to protein coding sequences very easily, quickly and cost-effectively. To improve upon earlier work, we extracted 417 orthologous single-copy nuclear genes from the transcriptomes of 15 species of the Vitaceae, covering its phylogenetic diversity. The resulting transcriptome phylogeny provides robust support for the deep relationships, showing the phylogenetic utility of transcriptome data for plants over a time scale at least since the mid-Cretaceous. The pros and cons of transcriptome data for phylogenetic inference in plants are also evaluated.

  2. Characterization of the cDNA encoding human nucleophosmin and studies of its role in normal and abnormal growth

    International Nuclear Information System (INIS)

    Chan, Waiyee; Liu, Qingrong; Borjigin, J.; Busch, H.; Rennert, O.M.; Tease, L.A.; Chan, Puikwong

    1989-01-01

    A cDNA encoding human nucleophosmin (protein B23) was obtained by screening a human placental cDNA library in λgt11 first with monoclonal antibody to rat nucleophosmin and then with confirmed partial cDNA of human nucleophosmin as probes. The cDNA had 1,311 bp with a coding sequence encoding a protein of 294 amino acids. The identity of the cDNA was confirmed by the presence of encoded amino acid sequences identical with those determined by sequencing pure rat nucleophosmin (a total of 138 amino acids). The most striking feature of the sequence is an acidic cluster located in the middle of the molecule. The cluster consists of 26 Asp/Glu and 1 Phe and Ala. Comparison of human nucleophosmin and Xenopus nucleolar protein NO38 shows 64.3% sequence identity. The N-terminal 130 amino acids of human nucleophosmin also bear 50% identity with that of Xenopus nucleoplasmin. Northern blot analysis of rat liver total RNA with a partial nucleophosmin cDNA as probe demonstrated a homogeneous mRNA band of about 1.6 kb. Similar observations were made in hypertrophic rat liver and Novikoff hepatoma. When the protein levels were compared with Western blot immunoassays, Novikoff hepatoma showed 20 times more nucleophosmin, while only about 5 times more nucleophosmin was observed in hypertrophic rat liver than in unstimulated normal liver.

  3. Exploring fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing

    Science.gov (United States)

    Zhang, Xiao-Yong; Wang, Guang-Hua; Xu, Xin-Ya; Nong, Xu-Hua; Wang, Jie; Amin, Muhammad; Qi, Shu-Hua

    2016-10-01

    The present study investigated the fungal diversity in four different deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing of the nuclear ribosomal internal transcribed spacer-1 (ITS1). A total of 40,297 fungal ITS1 sequences clustered into 420 operational taxonomic units (OTUs) at 97% sequence similarity, and 170 taxa were recovered from these sediments. Most ITS1 sequences (78%) belonged to the phylum Ascomycota, followed by Basidiomycota (17.3%), Zygomycota (1.5%) and Chytridiomycota (0.8%), and a small proportion (2.4%) belonged to unassigned fungal phyla. Compared with previous studies on fungal diversity of sediments from deep-sea environments by culture-dependent approaches and clone library analysis, the present result suggests that Illumina sequencing has dramatically accelerated the discovery of fungal communities in deep-sea sediments. Furthermore, our results revealed that Sordariomycetes was the most diverse and abundant fungal class in this study, challenging the traditional view that the diversity of Sordariomycetes phylotypes is low in deep-sea environments. In addition, more than 12 taxa, accounting for 21.5% of sequences, were found to be rarely reported as deep-sea fungi, suggesting that the deep-sea sediments from Okinawa Trough harbor a plethora of different fungal communities compared with other deep-sea environments. To our knowledge, this study is the first exploration of the fungal diversity in deep-sea sediments from Okinawa Trough using high-throughput Illumina sequencing.
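    The phylum-level percentages reported here can be checked for consistency and turned into approximate read counts. This is only an arithmetic sketch on the figures quoted in the abstract; the per-phylum counts are estimates derived from rounded percentages, not values reported by the study, and the variable names are illustrative.

```python
# Percent of ITS1 sequences per phylum, as quoted in the abstract.
phylum_percent = {
    "Ascomycota": 78.0,
    "Basidiomycota": 17.3,
    "Zygomycota": 1.5,
    "Chytridiomycota": 0.8,
    "unassigned": 2.4,
}
total_reads = 40297  # fungal ITS1 sequences
n_otus = 420         # OTUs at 97% sequence similarity

# The reported percentages should account for all sequences.
percent_sum = round(sum(phylum_percent.values()), 1)

# Approximate read counts per phylum (estimates from rounded percentages).
approx_reads = {p: round(total_reads * pct / 100) for p, pct in phylum_percent.items()}

# Average number of sequences per OTU.
mean_reads_per_otu = total_reads / n_otus

print(percent_sum, approx_reads, round(mean_reads_per_otu, 1))
```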

  4. Molecular cloning and sequence of cDNA encoding the plasma membrane proton pump (H+-ATPase) of Arabidopsis thaliana

    International Nuclear Information System (INIS)

    Harper, J.F.; Surowy, T.K.; Sussman, M.R.

    1989-01-01

    In plants, the transport of solutes across the plasma membrane is driven by a proton pump (H+-ATPase) that produces an electric potential and pH gradient. The authors isolated and sequenced a full-length cDNA clone that encodes this enzyme in Arabidopsis thaliana. The protein predicted from its nucleotide sequence contains 959 amino acids and has a molecular mass of 104,207 Da. The plant protein shows structural features common to a family of cation-translocating ATPases found in the plasma membrane of prokaryotic and eukaryotic cells, with the greatest overall identity in amino acid sequence (36%) to the H+-ATPase observed in the plasma membrane of fungi. The structure predicted from a hydropathy plot contains at least eight transmembrane segments, with most of the protein (73%) extending into the cytoplasm and only 5% of the residues exposed on the external surface. Unique features of the plant enzyme include diverged sequences at the amino and carboxyl termini as well as greater hydrophilic character in three extracellular loops.
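    A quick calculation relates the figures in this abstract to one another. The sketch below uses only the quoted numbers; the residue counts per compartment are rough estimates from rounded percentages, and the variable names are illustrative.

```python
# Values quoted in the abstract for the predicted H+-ATPase.
n_residues = 959
mass_da = 104207

# Mean mass per amino acid residue, in daltons.
mean_residue_mass = mass_da / n_residues

# Approximate residue counts implied by the reported topology fractions.
cytoplasmic_residues = round(0.73 * n_residues)
external_residues = round(0.05 * n_residues)

print(round(mean_residue_mass, 1), cytoplasmic_residues, external_residues)
```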

  5. miRBase: integrating microRNA annotation and deep-sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2011-01-01

    miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15,000 microRNA gene loci in over 140 species, and over 17,000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/.

  6. Three human alcohol dehydrogenase subunits: cDNA structure and molecular and evolutionary divergence

    International Nuclear Information System (INIS)

    Ikuta, T.; Szeto, S.; Yoshida, A.

    1986-01-01

    Class I human alcohol dehydrogenase (ADH; alcohol:NAD+ oxidoreductase, EC 1.1.1.1) consists of several homo- and heterodimers of α, β, and γ subunits that are governed by the ADH1, ADH2, and ADH3 loci. The authors previously cloned a full-length cDNA for the β subunit, and the complete sequence of 374 amino acid residues was established. cDNAs for the α and γ subunits were cloned and characterized. A human liver cDNA library, constructed in phage λgt11, was screened by using a synthetic oligonucleotide probe that was matched to the γ but not to the β sequence. Clone pUCADHγ21 and clone pUCADHα15L differed from β cDNA with respect to restriction sites and hybridization with the nucleotide probe. Clone pUCADHγ21 contained an insertion of 1.5 kilobase pairs (kbp) and encodes 374 amino acid residues compatible with the reported amino acid sequence of the γ subunit. Clone pUCADHα15L contained an insertion of 2.4 kbp and included nucleotide sequences that encode 374 amino acid residues for another subunit, the α subunit. In addition, this clone contained the sequences that encode the COOH-terminal part of the β subunit at its extended 5' region. The amino acid sequences and coding regions of the cDNAs of the three subunits are very similar. A high degree of resemblance is observed also in their 3' noncoding regions. However, distinctive differences exist in the vicinity of the Zn-binding cysteine residue at position 46. Based on the cDNA sequences and the deduced amino acid sequences of the three subunits, their structural and evolutionary relationships are discussed.

  7. Discovery radiomics via evolutionary deep radiomic sequencer discovery for pathologically proven lung cancer detection.

    Science.gov (United States)

    Shafiee, Mohammad Javad; Chung, Audrey G; Khalvati, Farzad; Haider, Masoom A; Wong, Alexander

    2017-10-01

    While lung cancer is the second most diagnosed form of cancer in men and women, a sufficiently early diagnosis can be pivotal in patient survival rates. Imaging-based, or radiomics-driven, detection methods have been developed to aid diagnosticians, but largely rely on hand-crafted features that may not fully encapsulate the differences between cancerous and healthy tissue. Recently, the concept of discovery radiomics was introduced, where custom abstract features are discovered from readily available imaging data. We propose an evolutionary deep radiomic sequencer discovery approach based on evolutionary deep intelligence. Motivated by patient privacy concerns and the idea of operational artificial intelligence, the evolutionary deep radiomic sequencer discovery approach organically evolves increasingly more efficient deep radiomic sequencers that produce significantly more compact yet similarly descriptive radiomic sequences over multiple generations. As a result, this framework improves operational efficiency and enables diagnosis to be run locally at the radiologist's computer while maintaining detection accuracy. We evaluated the evolved deep radiomic sequencer (EDRS) discovered via the proposed evolutionary deep radiomic sequencer discovery framework against state-of-the-art radiomics-driven and discovery radiomics methods using clinical lung CT data with pathologically proven diagnostic data from the LIDC-IDRI dataset. The EDRS shows improved sensitivity (93.42%), specificity (82.39%), and diagnostic accuracy (88.78%) relative to previous radiomics approaches.
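
    The three figures quoted above (sensitivity, specificity, diagnostic accuracy) are standard confusion-matrix ratios. A minimal sketch of how they are computed follows; the function name and the counts are hypothetical and are not taken from the LIDC-IDRI evaluation.

```python
def diagnostic_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Return sensitivity, specificity, and accuracy as fractions.

    tp/fn count diseased cases called positive/negative;
    tn/fp count healthy cases called negative/positive.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return {
        "sensitivity": tp / (tp + fn),                 # true-positive rate
        "specificity": tn / (tn + fp),                 # true-negative rate
        "accuracy": (tp + tn) / (tp + fp + tn + fn),   # overall agreement
    }


# Hypothetical counts for illustration only.
metrics = diagnostic_metrics(tp=90, fp=20, tn=80, fn=10)
```

    Because accuracy pools both classes, it always lies between sensitivity and specificity, weighted by how many diseased and healthy cases the test set contains.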

  8. Transcriptome sequences resolve deep relationships of the grape family.

    Directory of Open Access Journals (Sweden)

    Jun Wen

    Full Text Available Previous phylogenetic studies of the grape family (Vitaceae) yielded poorly resolved deep relationships, thus impeding our understanding of the evolution of the family. Next-generation sequencing now offers access to protein coding sequences very easily, quickly and cost-effectively. To improve upon earlier work, we extracted 417 orthologous single-copy nuclear genes from the transcriptomes of 15 species of the Vitaceae, covering its phylogenetic diversity. The resulting transcriptome phylogeny provides robust support for the deep relationships, showing the phylogenetic utility of transcriptome data for plants over a time scale at least since the mid-Cretaceous. The pros and cons of transcriptome data for phylogenetic inference in plants are also evaluated.

  9. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier

    KAUST Repository

    Kulmanov, Maxat

    2017-09-27

    Motivation A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional characterization of these proteins is time-consuming and expensive, and is often only done rigorously for few selected model organisms. Computational function prediction approaches have been suggested to fill this gap. The functions of proteins are classified using the Gene Ontology (GO), which contains over 40 000 classes. Additionally, proteins have multiple functions, making function prediction a large-scale, multi-class, multi-label problem. Results We have developed a novel method to predict protein function from sequence. We use deep learning to learn features from protein sequences as well as a cross-species protein–protein interaction network. Our approach specifically outputs information in the structure of the GO and utilizes the dependencies between GO classes as background information to construct a deep learning model. We evaluate our method using the standards established by the Computational Assessment of Function Annotation (CAFA) and demonstrate a significant improvement over baseline methods such as BLAST, in particular for predicting cellular locations.

  10. DeepProbe: Information Directed Sequence Understanding and Chatbot Design via Recurrent Neural Networks

    OpenAIRE

    Yin, Zi; Chang, Keng-hao; Zhang, Ruofei

    2017-01-01

    Information extraction and user intention identification are central topics in modern query understanding and recommendation systems. In this paper, we propose DeepProbe, a generic information-directed interaction framework which is built around an attention-based sequence to sequence (seq2seq) recurrent neural network. DeepProbe can rephrase, evaluate, and even actively ask questions, leveraging the generative ability and likelihood estimation made possible by seq2seq models. DeepProbe makes...

  11. Transcription profiling of the model cyanobacterium Synechococcus sp. strain PCC 7002 by NextGen (SOLiD™) Sequencing of cDNA

    Directory of Open Access Journals (Sweden)

    Marcus Ludwig

    2011-03-01

    Full Text Available The genome of the unicellular, euryhaline cyanobacterium Synechococcus sp. PCC 7002 encodes about 3200 proteins. Transcripts were detected for nearly all annotated open reading frames by a global transcriptomic analysis by Next-Generation (SOLiD™) sequencing of cDNA. In the cDNA samples sequenced, ~90% of the mapped sequences were derived from the 16S and 23S ribosomal RNAs and ~10% of the sequences were derived from mRNAs. In cells grown photoautotrophically under standard conditions (38 °C, 1% (v/v) CO2 in air, 250 µmol photons m⁻² s⁻¹), the highest transcript levels (up to 2% of the total mRNA for the most abundantly transcribed genes (e.g., cpcAB, psbA, psaA)) were generally derived from genes encoding structural components of the photosynthetic apparatus. High light exposure for one hour caused changes in transcript levels for genes encoding proteins of the photosynthetic apparatus, Type-1 NADH dehydrogenase complex and ATP synthase, whereas dark incubation for one hour resulted in a global decrease in transcript levels for photosynthesis-related genes and an increase in transcript levels for genes involved in carbohydrate degradation. Transcript levels for pyruvate kinase and the pyruvate dehydrogenase complex decreased sharply in cells incubated in the dark. Under dark anoxic (fermentative) conditions, transcript changes indicated a global decrease in transcripts for respiratory proteins and suggested that cells employ an alternative phosphoenolpyruvate degradation pathway via phosphoenolpyruvate synthase (ppsA) and the pyruvate:ferredoxin oxidoreductase (nifJ). Finally, the data suggested that an apparent operon involved in tetrapyrrole biosynthesis and fatty acid desaturation, acsF2-ho2-hemN2-desF, may be regulated by oxygen concentration.

  12. Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

    Science.gov (United States)

    Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

    1997-12-01

    A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues, were consistent with known, conserved vertebrate transferrin cDNA sequences. A single N-linked glycosylation site existed on each of the N- and C-lobes. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identity with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3%, Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). Long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.

  13. [Preparation of the cDNA microarray on the differential expressed cDNA of senescence-accelerated mouse's hippocampus].

    Science.gov (United States)

    Cheng, Xiao-Rui; Zhou, Wen-Xia; Zhang, Yong-Xiang

    2006-05-01

    Alzheimer's disease (AD) is the most common form of dementia in the elderly. AD is an invariably fatal neurodegenerative disorder with no effective treatment. Senescence-accelerated mouse prone 8 (SAMP8) is a model for studying age-related cognitive impairment and brain aging, and is one of the mouse models of AD. cDNA microarrays can monitor the expression levels of thousands of genes simultaneously and are suited to studying AD, which involves multiple mechanisms, targets, and pathways. To elucidate the mechanism of AD and find drug targets, a cDNA microarray containing 3136 cDNAs amplified from the suppression-subtracted cDNA library of the hippocampus of SAMP8 and SAMR1 was prepared with 16 blocks and 14 x 14 pins, with the housekeeping genes beta-actin and G3PDH as internal references. The background of the microarray was low and uniform, and the spots were evenly distributed. Hybridization and washing conditions were optimized during hybridization of probe and target molecules. After analysis of the hybridization data, the differentially expressed cDNAs were sequenced and analyzed by bioinformatics, some of the genes were quantified by real-time RT-PCR, and the reliability of the cDNA microarray was validated. This cDNA microarray may be a good means of selecting differentially expressed genes and elucidating the molecular mechanism of SAMP8 brain aging and AD.

  14. cDNA library construction of two human Demodex species.

    Science.gov (United States)

    Niu, DongLing; Wang, RuiLing; Zhao, YaE; Yang, Rui; Hu, Li; Lei, YuYang; Dan, WeiChao

    2017-06-01

    Research on Demodex, a type of pathogen causing various dermatoses in animals and human beings, is lacking at the RNA level. This study aims at extracting RNA and constructing a cDNA library for Demodex. First, P. cuniculi and D. farinae were mixed to establish a homogenization method for RNA extraction. Second, D. folliculorum and D. brevis were collected and preserved in Trizol, and each was mixed with D. farinae to extract RNA. Finally, the cDNA library was constructed and its quality was assessed. The results indicated that for D. folliculorum & D. farinae, the recombination rate of the cDNA library was 90.67% and the library titer was 7.50 × 10⁴ pfu/ml; 17 of the 59 positive clones were predicted to be of D. folliculorum. For D. brevis & D. farinae, the recombination rate was 90.96% and the library titer was 7.85 × 10⁴ pfu/ml; 40 of the 59 positive clones were predicted to be of D. brevis. Further detection by specific primers demonstrated that mtDNA cox1, cox3 and ATP6 detected from the cDNA libraries had 96.52%-99.73% identities with the corresponding sequences in GenBank. In conclusion, the cDNA libraries constructed for Demodex mixed with D. farinae were successful and could satisfy the requirements for functional gene detection.

  15. Gene expression in the deep biosphere.

    Science.gov (United States)

    Orsi, William D; Edgcomb, Virginia P; Christman, Glenn D; Biddle, Jennifer F

    2013-07-11

    Scientific ocean drilling has revealed a deep biosphere of widespread microbial life in sub-seafloor sediment. Microbial metabolism in the marine subsurface probably has an important role in global biogeochemical cycles, but deep biosphere activities are not well understood. Here we describe and analyse the first sub-seafloor metatranscriptomes from anaerobic Peru Margin sediment up to 159 metres below the sea floor, represented by over 1 billion complementary DNA (cDNA) sequence reads. Anaerobic metabolism of amino acids, carbohydrates and lipids seem to be the dominant metabolic processes, and profiles of dissimilatory sulfite reductase (dsr) transcripts are consistent with pore-water sulphate concentration profiles. Moreover, transcripts involved in cell division increase as a function of microbial cell concentration, indicating that increases in sub-seafloor microbial abundance are a function of cell division across all three domains of life. These data support calculations and models of sub-seafloor microbial metabolism and represent the first holistic picture of deep biosphere activities.

  16. Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

    Directory of Open Access Journals (Sweden)

    Hayashizaki Yoshihide

    2009-06-01

    Full Text Available Abstract Background Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs) for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals. Results As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks) of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues are under adaptive selection. Conclusion We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the

  17. deepTools2: a next generation web server for deep-sequencing data analysis.

    Science.gov (United States)

    Ramírez, Fidel; Ryan, Devon P; Grüning, Björn; Bhardwaj, Vivek; Kilpert, Fabian; Richter, Andreas S; Heyne, Steffen; Dündar, Friederike; Manke, Thomas

    2016-07-08

    We present an update to our Galaxy-based web server for processing and visualizing deeply sequenced data. Its core tool set, deepTools, allows users to perform complete bioinformatic workflows ranging from quality controls and normalizations of aligned reads to integrative analyses, including clustering and visualization approaches. Since we first described our deepTools Galaxy server in 2014, we have implemented new solutions for many requests from the community and our users. Here, we introduce significant enhancements and new tools to further improve data visualization and interpretation. deepTools continue to be open to all users and freely available as a web service at deeptools.ie-freiburg.mpg.de. The new deepTools2 suite can be easily deployed within any Galaxy framework via the toolshed repository, and we also provide source code for command line usage under Linux and Mac OS X. A public and documented API for access to deepTools functionality is also available. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. cDNA sequence and tissue expression analysis of glucokinase from ...

    African Journals Online (AJOL)

    Yomi

    2012-01-10

    Jan 10, 2012 ... distribution of GK mRNA in brain, mesenteric adipose tissue, spleen, white muscle and liver of grass ... expression profile of GK mRNA in liver normalized with β-actin level was 31, 454 and 649-fold compared .... Primers and expected products used for GK gene cDNA RT-PCR, RACE and real-time PCR.

  19. Molecular cloning and mammalian expression of human beta 2-glycoprotein I cDNA

    DEFF Research Database (Denmark)

    Kristensen, Torsten; Schousboe, Inger; Boel, Espen

    1991-01-01

    Human β2-glycoprotein (β2gpI) cDNA was isolated from a liver cDNA library and sequenced. The cDNA encoded a 19-residue hydrophobic signal peptide followed by the mature β2gpI of 326 amino acid residues. In liver and in the hepatoma cell line HepG2 there are two mRNA species of about 1.4 and 4.3 kb......, respectively, hybridizing specifically with the β2gpI cDNA. Upon isoelectric focusing, recombinant β2gpI obtained from expression of β2gpI cDNA in baby hamster kidney cells showed the same pattern of bands as β2gpI isolated from plasma, and at least 5 polypeptides were visible...

  20. Deep sequencing as a method of typing bluetongue virus isolates.

    Science.gov (United States)

    Rao, Pavuluri Panduranga; Reddy, Yella Narasimha; Ganesh, Kapila; Nair, Shreeja G; Niranjan, Vidya; Hegde, Nagendra R

    2013-11-01

    Bluetongue (BT) is an economically important endemic disease of livestock in tropics and subtropics. In addition, its recent spread to temperate regions like North America and Northern Europe is of serious concern. Rapid serotyping and characterization of BT virus (BTV) is an essential step in the identification of origin of the virus and for controlling the disease. Serotyping of BTV is typically performed by serum neutralization, and of late by nucleotide sequencing. This report describes the near complete genome sequencing and typing of two isolates of BTV using Illumina next generation sequencing platform. Two of the BTV RNAs were multiplexed with ten other unknown samples. Viral RNA was isolated and fragmented, reverse transcribed, the cDNA ends were repaired and ligated with a multiplex oligo. The genome library was amplified using primers complementary to the ligated oligo and subjected to single and paired end sequencing. The raw reads were assembled using a de novo method and reference-based assembly was performed based on the contig data. Near complete sequences of all segments of BTV were obtained with more than 20× coverage, and single read sequencing method was sufficient to identify the genotype and serotype of the virus. The two viruses used in this study were typed as BTV-1 and BTV-9E. Copyright © 2013 Elsevier B.V. All rights reserved.

  1. Complete cDNA sequence of the preproform of human pregnancy-associated plasma protein-A. Evidence for expression in the brain and induction by cAMP

    DEFF Research Database (Denmark)

    Haaning, Jesper; Oxvig, Claus; Overgaard, Michael Toft

    1996-01-01

    A cDNA that encodes the prepropeptide of pregnancy-associated plasma protein-A (preproPAPP-A), a putative metalloproteinase, has been cloned and sequenced. PAPP-A is synthesized in the placenta as a 1627-residue precursor preproprotein with a putative 22-residue signal peptide and a highly basic...

  2. Transcriptome analysis of the model protozoan, Tetrahymena thermophila, using Deep RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Jie Xiong

    Full Text Available BACKGROUND: The ciliated protozoan Tetrahymena thermophila is a well-studied single-celled eukaryote model organism for cellular and molecular biology. However, the lack of extensive T. thermophila cDNA libraries or a large expressed sequence tag (EST) database limited the quality of the original genome annotation. METHODOLOGY/PRINCIPAL FINDINGS: This RNA-seq study describes the first deep sequencing analysis of the T. thermophila transcriptome during the three major stages of the life cycle: growth, starvation and conjugation. Uniquely mapped reads covered more than 96% of the 24,725 predicted gene models in the somatic genome. More than 1,000 new transcribed regions were identified. The great dynamic range of RNA-seq allowed detection of a nearly six order-of-magnitude range of measurable gene expression orchestrated by this cell. RNA-seq also allowed the first prediction of transcript untranslated regions (UTRs) and an updated (larger) size estimate of the T. thermophila transcriptome: 57 Mb, or about 55% of the somatic genome. Our study identified nearly 1,500 alternative splicing (AS) events distributed over 5.2% of T. thermophila genes. This percentage represents a two order-of-magnitude increase over previous EST-based estimates in Tetrahymena. Evidence of stage-specific regulation of alternative splicing was also obtained. Finally, our study allowed us to completely confirm about 26.8% of the genes originally predicted by the gene finder, to correct coding sequence boundaries and intron-exon junctions for about a third, and to reassign microarray probes and correct earlier microarray data. CONCLUSIONS/SIGNIFICANCE: RNA-seq data significantly improve the genome annotation and provide a fully comprehensive view of the global transcriptome of T. thermophila. To our knowledge, 5.2% of T. thermophila genes with AS is the highest percentage of genes showing AS reported in a unicellular eukaryote. Tetrahymena thus becomes an excellent unicellular

  3. Accurate identification of RNA editing sites from primitive sequence with deep neural networks.

    Science.gov (United States)

    Ouyang, Zhangyi; Liu, Feng; Zhao, Chenghui; Ren, Chao; An, Gaole; Mei, Chuan; Bo, Xiaochen; Shu, Wenjie

    2018-04-16

    RNA editing is a post-transcriptional RNA sequence alteration. Current methods have identified editing sites and facilitated research but require sufficient genomic annotations and prior-knowledge-based filtering steps, resulting in a cumbersome, time-consuming identification process. Moreover, these methods have limited generalizability and applicability in species with insufficient genomic annotations or in conditions of limited prior knowledge. We developed DeepRed, a deep learning-based method that identifies RNA editing from primitive RNA sequences without prior-knowledge-based filtering steps or genomic annotations. DeepRed achieved 98.1% and 97.9% area under the curve (AUC) in training and test sets, respectively. We further validated DeepRed using experimentally verified U87 cell RNA-seq data, achieving 97.9% positive predictive value (PPV). We demonstrated that DeepRed offers better prediction accuracy and computational efficiency than current methods with large-scale, mass RNA-seq data. We used DeepRed to assess the impact of multiple factors on editing identification with RNA-seq data from the Association of Biomolecular Resource Facilities and Sequencing Quality Control projects. We explored developmental RNA editing pattern changes during human early embryogenesis and evolutionary patterns in Drosophila species and the primate lineage using DeepRed. Our work illustrates DeepRed's state-of-the-art performance; it may decipher the hidden principles behind RNA editing, making editing detection convenient and effective.

  4. Isolation and expression of a pea vicilin cDNA in the yeast Saccharomyces cerevisiae.

    OpenAIRE

    Watson, M D; Lambert, N; Delauney, A; Yarwood, J N; Croy, R R; Gatehouse, J A; Wright, D J; Boulter, D

    1988-01-01

    A cDNA clone containing the complete coding sequence for vicilin from pea (Pisum sativum L.) was isolated. It specifies a 50,000-Mr protein that in pea is neither post-translationally processed nor glycosylated. The cDNA clone was expressed in yeast from a 2 micron plasmid by using the yeast phosphoglycerate kinase promoter and initiator codon. The resultant fusion protein, which contains the first 16 amino acid residues of phosphoglycerate kinase in addition to the vicilin sequence, was puri...

  5. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds

    Directory of Open Access Journals (Sweden)

    Sugantham Priyanka Annabel

    2010-10-01

    Full Text Available Abstract Background Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with an objective of isolating as many functional genes as possible from J. curcas by large scale sequencing of expressed sequence tags (ESTs). Results A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 10⁶ clones and the average insert size of the clones was 2.1 kb. In total, 12,084 ESTs were sequenced to an average high-quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes, which did not show similarity with any genes in the public database, may encode unique genes. Functional classification revealed unigenes related to a broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes. Conclusions The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding

  6. DSAP: deep-sequencing small RNA analysis pipeline.

    Science.gov (United States)

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
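
    The first two DSAP steps described above (cleanup and clustering of a tab-delimited tag/count file) can be sketched as follows. This is an illustration only, not DSAP's code: the adaptor sequence, the eight-base homopolymer threshold, the minimum length, and all function names are assumptions, and steps (iii) and (iv) are omitted because they require the Rfam and miRBase libraries.

```python
import re
from collections import Counter

ADAPTOR = "TCGTATGCCG"  # hypothetical 3' adaptor prefix, not DSAP's actual setting
# Assumed rule: strip a trailing run of 8 or more identical A/C/G/T/N bases.
POLY_RUN = re.compile(r"(A{8,}|C{8,}|G{8,}|T{8,}|N{8,})$")


def parse_tags(text):
    """Parse 'sequence<TAB>copy number' lines into (sequence, count) pairs."""
    tags = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        seq, count = line.split("\t")
        tags.append((seq.upper(), int(count)))
    return tags


def clean_tag(seq):
    """Step (i): cut at the adaptor, then drop a trailing homopolymer run."""
    pos = seq.find(ADAPTOR)
    if pos != -1:
        seq = seq[:pos]
    return POLY_RUN.sub("", seq)


def cluster_tags(tags, min_length=15):
    """Step (ii): group cleaned tags into unique sequences, summing copies."""
    clusters = Counter()
    for seq, count in tags:
        cleaned = clean_tag(seq)
        if len(cleaned) >= min_length:  # assumed cutoff for usable tags
            clusters[cleaned] += count
    return clusters


raw = (
    "TGAGGTAGTAGGTTGTATAGTTTCGTATGCCG\t120\n"
    "TGAGGTAGTAGGTTGTATAGTT\t30\n"
    "AAAAAAAAAAAAAAAAAAAA\t5\n"
)
clusters = cluster_tags(parse_tags(raw))
```

    In this example the first two tags collapse into one cluster once the adaptor is trimmed, and the poly-A tag is discarded, so the cluster counts can then be matched against reference sequences.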

  7. Isolation and structure of a cDNA encoding the B1 (CD20) cell-surface antigen of human B lymphocytes

    International Nuclear Information System (INIS)

    Tedder, T.F.; Streuli, M.; Schlossman, S.F.; Saito, H.

    1988-01-01

    The B1 (CD20) molecule is a M_r 33,000 phosphoprotein on the surface of human B lymphocytes that may serve a central role in the humoral immune response by regulating B-cell proliferation and differentiation. In this report, a cDNA clone that encodes the B1 molecule was isolated and the amino acid sequence of B1 was determined. B-cell-specific cDNA clones were selected from a human tonsillar cDNA library by differential hybridization with labeled cDNA derived from either size-fractionated B-cell mRNA or size-fractionated T-cell mRNA. Of the 261 cDNA clones isolated, 3 cross-hybridizing cDNA clones were chosen as potential candidates for encoding B1 based on their selective hybridization to RNA from B1-positive cell lines. The longest clone, pB1-21, contained a 2.8-kilobase insert with an 891-base-pair open reading frame that encodes a protein of 33 kDa. mRNA synthesized from the pB1-21 cDNA clone in vitro was translated into a protein of the same apparent molecular weight as B1. Limited proteinase digestion of the pB1-21 translation product and B1 generated peptides of the same sizes, indicating that the pB1-21 cDNA encodes the B1 molecule. Gel blot analysis indicated that pB1-21 hybridized with two mRNA species of 2.8 and 3.4 kilobases only in B1-positive cell lines. The amino acid sequence deduced from the pB1-21 nucleotide sequence apparently lacks a signal sequence and contains three extensive hydrophobic regions. The deduced B1 amino acid sequence shows no significant homology with other known proteins.

  8. Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing

    Directory of Open Access Journals (Sweden)

    Thierry-Mieg Danielle

    2009-06-01

    Full Text Available Abstract Background Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC) reference RNA samples using Roche's 454 Genome Sequencer FLX. Results We generated more than 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values ≤ 10⁻²⁰. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR) from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.
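
    The abstract does not spell out how read counts were turned into expression levels. As a generic illustration of the idea, the sketch below tallies reads per gene and scales them to reads per million mapped reads; the function name, the scaling choice, and the gene hits are assumptions, not the authors' pipeline.

```python
from collections import Counter


def reads_per_million(gene_hits):
    """Count reads per gene and scale to reads per million mapped reads.

    gene_hits is an iterable with one gene identifier per mapped read.
    """
    counts = Counter(gene_hits)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gene: n * 1_000_000 / total for gene, n in counts.items()}


# Hypothetical mapping results: one entry per read assigned to a gene.
hits_a = ["GAPDH"] * 6 + ["ACTB"] * 3 + ["TP53"]
rpm_a = reads_per_million(hits_a)
```

    Scaling by the total number of mapped reads lets runs of different sequencing depth be compared, which is the property the reproducibility comparison across multiple runs depends on.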

  9. Characterization and immunological identification of cDNA clones encoding two human DNA topoisomerase II isozymes

    International Nuclear Information System (INIS)

    Chung, T.D.Y.; Drake, F.H.; Tan, K.B.; Per, S.R.; Crooke, S.T.; Mirabelli, C.K.

    1989-01-01

    Several DNA topoisomerase II partial cDNA clones obtained from a human Raji-HN2 cDNA library were sequenced and two classes of nucleotide sequences were found. One member of the first class, SP1, was identical to an internal fragment of human HeLa cell Topo II cDNA described earlier. A member of the second class, SP11, shared extensive nucleotide (75%) and predicted peptide (92%) sequence similarities with the first two-thirds of HeLa Topo II. Each class of cDNAs hybridized to unique, nonoverlapping restriction enzyme fragments of genomic DNA from several human cell lines. Synthetic 24-mer oligonucleotide probes specific for each cDNA class hybridized to 6.5-kilobase mRNAs; furthermore, hybridization of probe specific for one class was not blocked by probe specific for the other. Antibodies raised against a synthetic SP1-encoded dodecapeptide specifically recognized the 170-kDa form of Topo II, while antibodies raised against the corresponding SP11-encoded dodecapeptide, or a second unique SP11-encoded tridecapeptide, selectively recognized the 180-kDa form of Topo II. These data provide genetic and immunochemical evidence for two Topo II isozymes

  10. 3G vector-primer plasmid for constructing full-length-enriched cDNA libraries.

    Science.gov (United States)

    Zheng, Dong; Zhou, Yanna; Zhang, Zidong; Li, Zaiyu; Liu, Xuedong

    2008-09-01

    We designed a 3G vector-primer plasmid for the generation of full-length-enriched complementary DNA (cDNA) libraries. By employing the terminal transferase activity of reverse transcriptase and the modified strand replacement method, this plasmid (assembled with a polydT end and a deoxyguanosine [dG] end) combines priming full-length cDNA strand synthesis and directional cDNA cloning. As a result, the number of steps involved in cDNA library preparation is decreased while simplifying downstream gene manipulation, sequencing, and subcloning. The 3G vector-primer plasmid method yields fully represented plasmid primed libraries that are equivalent to those made by the SMART (switching mechanism at 5' end of RNA transcript) approach.

  11. Molecular characterization of MHC-DRB cDNA in water buffalo (Bubalus bubalis)

    Directory of Open Access Journals (Sweden)

    Soumen Naskar

    2012-01-01

    Full Text Available In the present study, water buffalo MHC (Bubu-DRB) cDNA was cloned and characterized. The 1022 base long-amplified cDNA product encompassed a single open reading frame of 801 bases that coded for 266 amino acids. The Bubu-DRB sequence showed maximum homology with the BoLA-DRB3*0101 allele of cattle. A total of seven amino acid residues were found to be unique for the Bubu-DRB sequence. The majority of amino acid substitutions was observed in the β1 domain. Residues associated with important functions were mostly conserved. Water buffalo DRB was phylogenetically closer to goat DRB*A.

  12. Cloning and functional expression of a human pancreatic islet glucose-transporter cDNA

    International Nuclear Information System (INIS)

    Permutt, M.A.; Koranyi, L.; Keller, K.; Lacy, P.E.; Scharp, D.W.; Mueckler, M.

    1989-01-01

    Previous studies have suggested that pancreatic islet glucose transport is mediated by a high-Km, low-affinity facilitated transporter similar to that expressed in liver. To determine the relationship between islet and liver glucose transporters, liver-type glucose-transporter cDNA clones were isolated from a human liver cDNA library. The liver-type glucose-transporter cDNA clone hybridized to mRNA transcripts of the same size in human liver and pancreatic islet RNA. A cDNA library was prepared from purified human pancreatic islet tissue and screened with human liver-type glucose-transporter cDNA. The authors isolated two overlapping cDNA clones encompassing 2600 base pairs, which encode a pancreatic islet protein identical in sequence to that of the putative liver-type glucose-transporter protein. Xenopus oocytes injected with synthetic mRNA transcribed from a full-length cDNA construct exhibited increased uptake of 2-deoxyglucose, confirming the functional identity of the clone. These cDNA clones can now be used to study regulation of expression of the gene and to assess the role of inherited defects in this gene as a candidate for inherited susceptibility to non-insulin-dependent diabetes mellitus.

  13. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Science.gov (United States)

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 10^6 pfu/mL and 1.56 × 10^9 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bp were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coding for the TAT-Ferritin fusion protein with two 6× His-tags at the N- and C-termini. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  14. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Directory of Open Access Journals (Sweden)

    Changqing Liu

    2013-05-01

    Full Text Available In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 10^6 pfu/mL and 1.56 × 10^9 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bp were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coding for the TAT-Ferritin fusion protein with two 6× His-tags at the N- and C-termini. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.

  15. LookSeq: A browser-based viewer for deep sequencing data

    OpenAIRE

    Manske, Heinrich Magnus; Kwiatkowski, Dominic P.

    2009-01-01

    Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an ov...

  16. Sequencing and characterization of asclepain f: the first cysteine peptidase cDNA cloned and expressed from Asclepias fruticosa latex.

    Science.gov (United States)

    Trejo, Sebastián A; López, Laura M I; Caffini, Néstor O; Natalucci, Claudia L; Canals, Francesc; Avilés, Francesc X

    2009-07-01

    Asclepain f is a papain-like protease previously isolated and characterized from latex of Asclepias fruticosa. This enzyme is a member of the C1 family of cysteine proteases that are synthesized as preproenzymes. The enzyme belongs to the alpha + beta class of proteins, with two disulfide bridges (Cys22-Cys63 and Cys56-Cys95) in the alpha domain, and another one (Cys150-Cys201) in the beta domain, as was determined by molecular modeling. A full-length 1,152 bp cDNA was cloned by RT-RACE-PCR from latex mRNA. The sequence was predicted as an open reading frame of 340 amino acid residues, of which 16 residues belong to the signal peptide, 113 to the propeptide and 211 to the mature enzyme. The full-length cDNA was ligated to pPICZalpha vector and expressed in Pichia pastoris. Recombinant asclepain f showed endopeptidase activity on pGlu-Phe-Leu-p-nitroanilide and was identified by PMF-MALDI-TOF MS. Asclepain f is the first peptidase cloned and expressed from mRNA isolated from plant latex, confirming the presence of the preprocysteine peptidase in the latex.

  17. Deep amplicon sequencing reveals mixed phytoplasma infection within single grapevine plants

    DEFF Research Database (Denmark)

    Nicolaisen, Mogens; Contaldo, Nicoletta; Makarova, Olga

    2011-01-01

    The diversity of phytoplasmas within single plants has not yet been fully investigated. In this project, deep amplicon sequencing was used to generate 50,926 phytoplasma sequences from 11 phytoplasma-infected grapevine samples from a PCR amplicon in the 5' end of the 16S region. After clustering ...

  18. cDNA cloning and sequencing of human fibrillarin, a conserved nucleolar protein recognized by autoimmune antisera

    International Nuclear Information System (INIS)

    Aris, J.P.; Blobel, G.

    1991-01-01

    The authors have isolated a 1.1-kilobase cDNA clone that encodes human fibrillarin by screening a hepatoma library in parallel with DNA probes derived from the fibrillarin genes of Saccharomyces cerevisiae (NOP1) and Xenopus laevis. RNA blot analysis indicates that the corresponding mRNA is ∼1,300 nucleotides in length. Human fibrillarin expressed in vitro migrates on SDS gels as a 36-kDa protein that is specifically immunoprecipitated by antisera from humans with scleroderma autoimmune disease. Human fibrillarin contains an amino-terminal repetitive domain ∼75-80 amino acids in length that is rich in glycine and arginine residues and is similar to amino-terminal domains in the yeast and Xenopus fibrillarins. The occurrence of a putative RNA-binding domain and an RNP consensus sequence within the protein is consistent with the association of fibrillarin with small nucleolar RNAs. Protein sequence alignments show that 67% of amino acids from human fibrillarin are identical to those in yeast fibrillarin and that 81% are identical to those in Xenopus fibrillarin. This identity suggests the evolutionary conservation of an important function early in the pathway for ribosome biosynthesis

  19. Molecular cloning and expression of cDNA encoding a lumenal calcium binding glycoprotein from sarcoplasmic reticulum

    International Nuclear Information System (INIS)

    Leberer, E.; Charuk, J.H.M.; MacLennan, D.H.; Green, N.M.

    1989-01-01

    Antibody screening was used to isolate a cDNA encoding the 160-kDa glycoprotein of rabbit skeletal muscle sarcoplasmic reticulum. The cDNA is identical to that encoding the 53-kDa glycoprotein except that it contains an in-frame insertion of 1,308 nucleotides near its 5' end, apparently resulting from alternative splicing. The protein encoded by the cDNA would contain a 19-residue NH₂-terminal signal sequence and a 453-residue COOH-terminal sequence identical to the 53-kDa glycoprotein. It would also contain a 436-amino acid insert between these sequences. This insert would be highly acidic, suggesting that it might bind Ca²⁺. The purified 160-kDa glycoprotein and the glycoprotein expressed in COS-1 cells transfected with cDNA encoding the 160-kDa glycoprotein were shown to bind ⁴⁵Ca²⁺ in a gel overlay assay. The protein was shown to be located in the lumen of the sarcoplasmic reticulum and to be associated through Ca²⁺ with the membrane. The authors propose that this lumenal Ca²⁺ binding glycoprotein of the sarcoplasmic reticulum be designated sarcalumenin.

  20. A simple method for the parallel deep sequencing of full influenza A genomes

    DEFF Research Database (Denmark)

    Kampmann, Marie-Louise; Fordyce, Sarah Louise; Avila Arcos, Maria del Carmen

    2011-01-01

    Given the major threat of influenza A to human and animal health, and its ability to evolve rapidly through mutation and reassortment, tools that enable its timely characterization are necessary to help monitor its evolution and spread. For this purpose, deep sequencing can be a very valuable tool. ... This study reports a comprehensive method that enables deep sequencing of the complete genomes of influenza A subtypes using the Illumina Genome Analyzer IIx (GAIIx). By using this method, the complete genomes of nine viruses were sequenced in parallel, representing the 2009 pandemic H1N1 virus, H5N1 virus...

  1. Unified Deep Learning Architecture for Modeling Biology Sequence.

    Science.gov (United States)

    Wu, Hongjie; Cao, Chengyuan; Xia, Xiaoyan; Lu, Qiang

    2017-10-09

    Prediction of the spatial structure or function of biological macromolecules based on their sequence remains an important challenge in bioinformatics. When modeling biological sequences using traditional sequencing models, characteristics, such as long-range interactions between basic units, the complicated and variable output of labeled structures, and the variable length of biological sequences, usually lead to different solutions on a case-by-case basis. This study proposed the use of bidirectional recurrent neural networks based on long short-term memory or a gated recurrent unit to capture long-range interactions by designing the optional reshape operator to adapt to the diversity of the output labels and implementing a training algorithm to support the training of sequence models capable of processing variable-length sequences. Additionally, the merge and pooling operators enhanced the ability to capture short-range interactions between basic units of biological sequences. The proposed deep-learning model and its training algorithm might be capable of solving currently known biological sequence-modeling problems through the use of a unified framework. We validated our model on one of the most difficult biological sequence-modeling problems currently known, with our results indicating the ability of the model to obtain predictions of protein residue interactions that exceeded the accuracy of current popular approaches by 10% based on multiple benchmarks.
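The record above relies on bidirectional recurrent networks that handle sequences of variable length. The toy sketch below shows only the bidirectional idea: one pass reads the sequence left to right, a second pass reads it right to left, and the two hidden states are paired per position. It uses a plain scalar tanh recurrence rather than the LSTM or GRU cells named in the abstract, and the weights `w_in` and `w_rec` are arbitrary assumed values.

```python
import math


def rnn_pass(xs, w_in, w_rec):
    """Run a scalar tanh recurrence over xs and return every hidden state."""
    h = 0.0
    states = []
    for x in xs:
        h = math.tanh(w_in * x + w_rec * h)
        states.append(h)
    return states


def bidirectional_states(xs, w_in=0.5, w_rec=0.3):
    """Pair forward and backward hidden states for each position.

    Works for any sequence length, because the recurrence simply
    runs for as many steps as there are inputs.
    """
    forward = rnn_pass(xs, w_in, w_rec)
    # Process the reversed sequence, then flip back so index i lines up.
    backward = rnn_pass(xs[::-1], w_in, w_rec)[::-1]
    return list(zip(forward, backward))
```

Each position therefore sees context from both directions, which is what lets such models pick up long-range interactions on either side of a residue.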

  2. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.

    Science.gov (United States)

    Kulmanov, Maxat; Khan, Mohammed Asif; Hoehndorf, Robert; Wren, Jonathan

    2018-02-15

    A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional characterization of these proteins is time-consuming and expensive, and is often only done rigorously for few selected model organisms. Computational function prediction approaches have been suggested to fill this gap. The functions of proteins are classified using the Gene Ontology (GO), which contains over 40 000 classes. Additionally, proteins have multiple functions, making function prediction a large-scale, multi-class, multi-label problem. We have developed a novel method to predict protein function from sequence. We use deep learning to learn features from protein sequences as well as a cross-species protein-protein interaction network. Our approach specifically outputs information in the structure of the GO and utilizes the dependencies between GO classes as background information to construct a deep learning model. We evaluate our method using the standards established by the Computational Assessment of Function Annotation (CAFA) and demonstrate a significant improvement over baseline methods such as BLAST, in particular for predicting cellular locations. Web server: http://deepgo.bio2vec.net. Source code: https://github.com/bio-ontology-research-group/deepgo. Contact: robert.hoehndorf@kaust.edu.sa. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
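The record above stresses that predictions should respect the dependencies between GO classes. One common way to make multi-label scores consistent with an ontology is to ensure that a parent class never scores lower than any of its descendants. The sketch below illustrates that general rule only. It is not claimed to be how DeepGO is built, and the class names in the usage note are toy placeholders rather than real GO identifiers.

```python
def propagate_scores(scores, parents):
    """Raise each ancestor's score to at least that of its descendants.

    scores:  dict mapping class name -> predicted score in [0, 1]
    parents: dict mapping class name -> list of direct parent classes
    Returns a new dict; the input is left unchanged.
    """
    result = dict(scores)
    for term, score in scores.items():
        stack = list(parents.get(term, ()))
        seen = set()
        while stack:
            ancestor = stack.pop()
            if ancestor in seen:
                continue
            seen.add(ancestor)
            # An ancestor is at least as likely as any class below it.
            if result.get(ancestor, 0.0) < score:
                result[ancestor] = score
            stack.extend(parents.get(ancestor, ()))
    return result
```

For example, with `parents = {"protein_binding": ["binding"], "binding": ["molecular_function"]}`, a score of 0.9 for `protein_binding` lifts both `binding` and `molecular_function` to at least 0.9.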

  3. cDNA cloning and sequence determination of the pheromone biosynthesis activating neuropeptide from the seabuckthorn carpenterworm, Holcocerus hippophaecolus (Lepidoptera: Cossidae).

    Science.gov (United States)

    Li, Juan; Zhou, Jiao; Sun, Rongbo; Zhang, Haolin; Zong, Shixiang; Luo, Youqing; Sheng, Xia; Weng, Qiang

    2013-04-01

    The PBAN (pheromone biosynthesis activating neuropeptide)/pyrokinin peptides comprise a major neuropeptide family characterized by a common FXPRL amide at the C-terminus. These peptides are actively involved in many essential endocrine functions. For the first time, we reported the cDNA cloning and sequence determination of the PBAN from the seabuckthorn carpenterworm, Holcocerus hippophaecolus, by using rapid amplification of cDNA ends. The full-length cDNA of Hh-DH-PBAN contained five peptides: diapause hormone (DH) homolog, α-neuropeptide (NP), β-NP, PBAN, and γ-NP. All of the peptides were amidated at their C-terminus and shared a conserved motif, FXPR (or K) L. Moreover, Hh-DH-PBAN had high homology to the other members of the PBAN peptide family: 56% with Manduca sexta, 66% with Bombyx mori, 77% with Helicoverpa zea, and 47% with Plutella xylostella. Phylogenetic analysis revealed that Hh-DH-PBAN was closely related to PBANs from Noctuidae, demonstrated by the relatively higher similarity compared with H. zea. In addition, real-time quantitative PCR (qRT-PCR) analysis showed that Hh-DH-PBAN mRNA expression peaked in the brain-subesophageal ganglion (Br-SOG) complex, and was also detected at high levels during larval and adult stages. The expression decreased significantly after pupation. These results provided information concerning molecular structure characteristics of Hh-DH-PBAN, whose expression profile suggested that the Hh-DH-PBAN gene might be correlated with larval development and sex pheromone biosynthesis in females of the H. hippophaecolus. 2013 Wiley Periodicals, Inc

  4. Cloning and analysis of the mouse Fanconi anemia group A cDNA and an overlapping penta zinc finger cDNA.

    Science.gov (United States)

    Wong, J C; Alon, N; Norga, K; Kruyt, F A; Youssoufian, H; Buchwald, M

    2000-08-01

    Despite the cloning of four disease-associated genes for Fanconi anemia (FA), the molecular pathogenesis of FA remains largely unknown. To study FA complementation group A using the mouse as a model system, we cloned and characterized the mouse homolog of the human FANCA cDNA. The mouse cDNA (Fanca) encodes a 161-kDa protein that shares 65% amino acid sequence identity with human FANCA. Fanca is located at the distal region of mouse chromosome 8 and has a ubiquitous pattern of expression in embryonic and adult tissues. Expression of the mouse cDNA in human FA-A cells restores the cellular drug sensitivity to normal levels. Thus, the expression pattern, protein structure, chromosomal location, and function of FANCA are conserved in the mouse. We also isolated a novel zinc finger protein, Zfp276, which has five C2H2 domains. Interestingly, Zfp276 is situated in the Fanca locus, and the 3'UTR of its cDNA overlaps with the last four exons of Fanca in a tail-to-tail manner. Zfp276 is expressed in the same tissues as Fanca, but does not complement the mitomycin C (MMC)-sensitive phenotype of FA-A cells. The overlapping genomic organization between Zfp276 and Fanca may have relevance to the disease phenotype of FA. Copyright 2000 Academic Press.

  5. Display of a Maize cDNA library on baculovirus infected insect cells

    Directory of Open Access Journals (Sweden)

    Jones Ian M

    2008-08-01

    Full Text Available Abstract Background Maize is a good model system for cereal crop genetics and development because of its rich genetic heritage and well-characterized morphology. The sequencing of its genome is well advanced, and new technologies for efficient proteomic analysis are needed. Baculovirus expression systems have been used for the last twenty years to express in insect cells a wide variety of eukaryotic proteins that require complex folding or extensive posttranslational modification. More recently, baculovirus display technologies based on the expression of foreign sequences on the surface of Autographa californica (AcMNPV) have been developed. We investigated the potential of a display methodology for a cDNA library of maize young seedlings. Results We constructed a full-length cDNA library of young maize etiolated seedlings in the transfer vector pAcTMVSVG. The library contained a total of 2.5 × 10^5 independent clones. Expression of two known maize proteins, calreticulin and auxin binding protein (ABP1), was shown by western blot analysis of protein extracts from insect cells infected with the cDNA library. Display of the two proteins in infected insect cells was shown by selective biopanning using magnetic cell sorting and demonstrated proof of concept that the baculovirus maize cDNA display library could be used to identify and isolate proteins. Conclusion The maize cDNA library constructed in this study relies on the novel technology of baculovirus display and is unique in currently published cDNA libraries. Produced to demonstrate proof of principle, it opens the way for the development of a eukaryotic in vivo display tool which would be ideally suited for rapid screening of the maize proteome for binding partners, such as proteins involved in hormone regulation or defence.

  6. Cloning, sequencing and expression of a novel xylanase cDNA from ...

    African Journals Online (AJOL)

    2008-12-03

    Dec 3, 2008 ... First strand cDNA was synthesized by RT-PCR with Oligo(dT)15 using mRNA isolated ... 4°C. Single colonies were picked into 5 mL BMGY medium for preculture, and incubated ... to fold properly into a native conformation. Without the .... polymorphism is often used in taxonomy, but now, it is being well ...

  7. Isolation and characterization of human cDNA clones encoding the α and the α' subunits of casein kinase II

    International Nuclear Information System (INIS)

    Lozeman, F.J.; Litchfield, D.W.; Piening, C.; Takio, Koji; Walsh, K.A.; Krebs, E.G.

    1990-01-01

    Casein kinase II is a widely distributed protein serine/threonine kinase. The holoenzyme appears to be a tetramer, containing two α or α' subunits (or one of each) and two β subunits. Complementary DNA clones encoding the subunits of casein kinase II were isolated from a human T-cell λgt10 library using cDNA clones isolated from Drosophila melanogaster. One of the human cDNA clones (hT4.1) was 2.2 kb long, including a coding region of 1176 bp preceded by 156 bp (5' untranslated region) and followed by 871 bp (3' untranslated region). The hT4.1 clone was nearly identical in size and sequence with a cDNA clone from HepG2 human hepatoma cultured cells. Another of the human T-cell cDNA clones (hT9.1) was 1.8 kb long, containing a coding region of 1053 bp preceded by 171 bp (5' untranslated region) and followed by 550 bp (3' untranslated region). Amino acid sequences deduced from these two cDNA clones were about 85% identical. Most of the difference between the two encoded polypeptides was in the carboxy-terminal region, but heterogeneity was distributed throughout the molecules. Partial amino acid sequence was determined in a mixture of α and α' subunits from bovine lung casein kinase II. The bovine sequences aligned with the 2 human cDNA-encoded polypeptides with only 2 discrepancies out of 535 amino acid positions. This confirmed that the two human T-cell cDNA clones encoded the α and α' subunits of casein kinase II. These studies show that there are two distinct catalytic subunits for casein kinase II (α and α') and that the sequence of these subunits is largely conserved between the bovine and the human.
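The clone sizes quoted in the record above can be checked with simple arithmetic: the 5' untranslated region, the coding region, and the 3' untranslated region should add up to the stated clone length, and the coding region should be a whole number of codons. The sketch below does that check. It assumes, as is conventional but not stated in the abstract, that each coding-region length includes the stop codon; the function name is hypothetical.

```python
def summarize_cdna(utr5_bp, cds_bp, utr3_bp):
    """Summarize a cDNA clone from its three segment lengths in bp."""
    if cds_bp % 3 != 0:
        raise ValueError("coding region is not a whole number of codons")
    codons = cds_bp // 3
    return {
        "total_bp": utr5_bp + cds_bp + utr3_bp,
        "codons": codons,
        # Assumption: the last codon is a stop codon and encodes no residue.
        "residues": codons - 1,
    }


# Segment lengths as quoted in the abstract.
ht4_1 = summarize_cdna(156, 1176, 871)   # clone described as 2.2 kb
ht9_1 = summarize_cdna(171, 1053, 550)   # clone described as 1.8 kb
```

Under that assumption, hT4.1 totals 2203 bp with 391 residues and hT9.1 totals 1774 bp with 350 residues, consistent with the 2.2 kb and 1.8 kb sizes given.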

  8. Deep whole-genome sequencing of 90 Han Chinese genomes.

    Science.gov (United States)

    Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

    2017-09-01

    Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000

  9. Cloning the human lysozyme cDNA: Inverted Alu repeat in the mRNA and in situ hybridization for macrophages and Paneth cells

    International Nuclear Information System (INIS)

    Chung, L.P.; Keshav, S.; Gordon, S.

    1988-01-01

    Lysozyme is a major secretory product of human and rodent macrophages and a useful marker for myelomonocytic cells. Based on the known human lysozyme amino acid sequence, oligonucleotides were synthesized and used as probes to screen a phorbol 12-myristate 13-acetate-treated U937 cDNA library. A full-length human lysozyme cDNA clone, pHL-2, was obtained and characterized. Sequence analysis shows that human lysozyme, like chicken lysozyme, has an 18-amino-acid-long signal peptide, but unlike the chicken lysozyme cDNA, the human lysozyme cDNA has a >1-kilobase-long 3' nontranslated sequence. Interestingly, within this 3' region, an inverted repeat of the Alu family of repetitive sequences was discovered. In RNA blot analyses, DNA probes prepared from pHL-2 can be used to detect lysozyme mRNA not only from human but also from mouse and rat. Moreover, by in situ hybridization, complementary RNA transcripts have been used as probes to detect lysozyme mRNA in mouse macrophages and Paneth cells. This human lysozyme cDNA clone is therefore likely to be a useful molecular probe for studying macrophage distribution and gene expression.

  10. Isolation of cDNA clones coding for human tissue factor: primary structure of the protein and cDNA

    International Nuclear Information System (INIS)

    Spicer, E.K.; Horton, R.; Bloem, L.

    1987-01-01

    Tissue factor is a membrane-bound procoagulant protein that activates the extrinsic pathway of blood coagulation in the presence of factor VII and calcium. λ Phage containing the tissue factor gene were isolated from a human placental cDNA library. The amino acid sequence deduced from the nucleotide sequence of the cDNAs indicates that tissue factor is synthesized as a higher molecular weight precursor with a leader sequence of 32 amino acids, while the mature protein is a single polypeptide chain composed of 263 residues. The derived primary structure of tissue factor has been confirmed by comparison to protein and peptide sequence data. The sequence of the mature protein suggests that there are three distinct domains: extracellular, residues 1-219; hydrophobic, residues 220-242; and cytoplasmic, residues 243-263. Three potential N-linked carbohydrate attachment sites occur in the extracellular domain. The amino acid sequence of tissue factor shows no significant homology with the vitamin K-dependent serine proteases, coagulation cofactors, or any other protein in the National Biomedical Research Foundation sequence data bank (Washington, DC)

  11. Human proα1(I) collagen: cDNA sequence for the C-propeptide domain

    Energy Technology Data Exchange (ETDEWEB)

    Maekelae, J K; Raassina, M; Virta, A; Vuorio, E

    1988-01-11

    The authors have previously constructed a cDNA clone pHCAL1, covering most of the C-terminal propeptide domain of human proα1(I) collagen mRNA, by inserting a 678 bp EcoRI-XhoI fragment of cDNA into pBR322. Since the XhoI/SalI ligation prevented removal of the insert, they used the same strategy to obtain a similar clone in pUC8. RNA was isolated from fetal calvarial bones. The cDNA was digested with EcoRI and XhoI and fractionated on a 1% agarose gel. Fragments of 650-700 bp were cloned in pUC8 at the polylinker site, which now permits easy removal of the insert. The new clone was named pHCAL1U since the RNA was isolated from another individual. The approach outlined is useful for studies on individual variation which is important to recognize when searching for disease-related mutations in type I collagen.

  12. Error Analysis of Deep Sequencing of Phage Libraries: Peptides Censored in Sequencing

    Directory of Open Access Journals (Sweden)

    Wadim L. Matochko

    2013-01-01

    Next-generation sequencing techniques empower selection of ligands from phage-display libraries because they can detect low-abundance clones and quantify changes in the copy numbers of clones without excessive selection rounds. Identification of errors in deep sequencing data is the most critical step in this process because these techniques have error rates >1%. Mechanisms that yield errors in Illumina and other techniques have been proposed, but no reports to date describe error analysis in phage libraries. Our paper focuses on error analysis of 7-mer peptide libraries sequenced by the Illumina method. The low theoretical complexity of this phage library, as compared to the complexity of long genetic reads and genomes, allowed us to describe this library using a convenient linear vector and operator framework. We describe a phage library as an N×1 frequency vector n = (n_i), where n_i is the copy number of the ith sequence and N is the theoretical diversity, that is, the total number of all possible sequences. Any manipulation of the library is an operator acting on n. Selection, amplification, or sequencing can be described as a product of an N×N matrix and a stochastic sampling operator (Sa). The latter is a random diagonal matrix that describes sampling of a library. In this paper, we focus on the properties of Sa and use them to define the sequencing operator (Seq). Sequencing without any bias and errors is Seq = Sa I_N, where I_N is an N×N unity matrix. Any bias in sequencing changes I_N to a nonunity matrix. We identified a diagonal censorship matrix (CEN), which describes elimination, or statistically significant downsampling, of specific reads during the sequencing process.
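
    The operator framework in this abstract can be illustrated with a toy example. The sketch below is not the authors' code: the library size, the copy numbers, the binomial sampling model, and the entries of the censorship matrix are all assumptions chosen only to show how a random diagonal sampling matrix and a diagonal censorship matrix act on a frequency vector.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the sketch is reproducible

N = 5                                      # theoretical diversity (toy value)
n = np.array([100, 50, 20, 5, 1], float)   # copy number of each sequence


def sampling_operator(n, fraction, rng):
    """Return a random diagonal matrix Sa so that Sa @ n is a sampled library.

    Each copy is kept independently with probability `fraction`
    (a binomial model, assumed here for illustration).
    """
    counts = n.astype(int)
    sampled = rng.binomial(counts, fraction)
    # Diagonal entry i is the fraction of clone i that survived sampling.
    diag = np.divide(sampled, counts, out=np.zeros(len(n)), where=counts > 0)
    return np.diag(diag)


I_N = np.eye(N)
Sa = sampling_operator(n, 0.5, rng)

# Sequencing without bias or errors: the sampling operator times the identity.
seq_unbiased = Sa @ I_N

# Hypothetical censorship matrix: sequence 2 is eliminated, sequence 3 is
# downsampled tenfold, and all other sequences pass unchanged.
CEN = np.diag([1.0, 1.0, 0.0, 0.1, 1.0])
seq_censored = Sa @ CEN

observed_unbiased = seq_unbiased @ n
observed_censored = seq_censored @ n
print(observed_unbiased, observed_censored)
```

    Because every operator here is diagonal, each one rescales clones independently; a non-diagonal matrix would be needed to model errors that convert one sequence into another.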

  13. cDNA cloning and immunological characterization of the rye grass allergen Lol p I.

    Science.gov (United States)

    Perez, M; Ishioka, G Y; Walker, L E; Chesnut, R W

    1990-09-25

    The complete amino acid sequence of two "isoallergenic" forms of Lol p I, the major rye grass (Lolium perenne) pollen allergen, was deduced from cDNA sequence analysis. cDNA clones isolated from a Lolium perenne pollen library contained an open reading frame coding for a 240-amino acid protein. Comparison of the nucleotide and deduced amino acid sequence of two of these clones revealed four changes at the amino acid level and numerous nucleotide differences. Both clones contained one possible asparagine-linked glycosylation site. Northern blot analysis shows one RNA species of 1.2 kilobases. Based on the complete amino acid sequence of Lol p I, overlapping peptides covering the entire molecule were synthesized. Utilizing these peptides we have identified a determinant within the Lol p I molecule that is recognized by human leukocyte antigen class II-restricted T cells obtained from persons allergic to rye grass pollen.

  14. LEDGF/p75 Deficiency Increases Deletions at the HIV-1 cDNA Ends.

    Science.gov (United States)

    Bueno, Murilo T D; Reyes, Daniel; Llano, Manuel

    2017-09-15

    Processing of unintegrated linear HIV-1 cDNA by the host DNA repair system results in its degradation and/or circularization. As a consequence, deficient viral cDNA integration generally leads to an increase in the levels of HIV-1 cDNA circles containing one or two long terminal repeats (LTRs). Intriguingly, impaired HIV-1 integration in LEDGF/p75-deficient cells does not result in a corresponding increase in viral cDNA circles. We postulate that increased degradation of unintegrated linear viral cDNA in cells lacking the lens epithelium-derived growth factor (LEDGF/p75) accounts for this inconsistency. To evaluate this hypothesis, we characterized the nucleotide sequence spanning 2-LTR junctions isolated from LEDGF/p75-deficient and control cells. LEDGF/p75 deficiency resulted in a significant increase in the frequency of 2-LTRs harboring large deletions. Of note, these deletions were dependent on the 3' processing activity of integrase and did not originate from aberrant reverse transcription. Our findings suggest a novel role of LEDGF/p75 in protecting the unintegrated 3' processed linear HIV-1 cDNA from exonucleolytic degradation.

  15. Isolation, nucleotide sequence and expression of a cDNA encoding feline granulocyte colony-stimulating factor.

    Science.gov (United States)

    Dunham, S P; Onions, D E

    2001-06-21

    A cDNA encoding feline granulocyte colony stimulating factor (fG-CSF) was cloned from alveolar macrophages using the reverse transcriptase-polymerase chain reaction. The cDNA is 949 bp in length and encodes a predicted mature protein of 174 amino acids. Recombinant fG-CSF was expressed as a glutathione S-transferase fusion and purified by affinity chromatography. Biological activity of the recombinant protein was demonstrated using the murine myeloblastic cell line GNFS-60, which showed an ED50 for fG-CSF of approximately 2 ng/ml. Copyright 2001 Academic Press.

  16. Construction and characterization of the alpha form of a cardiac myosin heavy chain cDNA clone and its developmental expression in the Syrian hamster.

    OpenAIRE

    Liew, C C; Jandreski, M A

    1986-01-01

    A cDNA clone, pVHC1, was isolated from a Syrian hamster heart cDNA library and was compared to the rat alpha (pCMHC21) and beta (pCMHC5) ventricular myosin heavy chain cDNA clones. The DNA sequence and amino acid sequence deduced from the DNA show more homology with pCMHC21 than pCMHC5. This indicates that pVHC1 is an alpha ventricular myosin heavy chain cDNA clone. However, even though pVHC1 shows a high degree of nucleotide and amino acid conservation with the rat myosin heavy chain sequen...

  17. Isolation and Tissue Distribution of an Insulin-Like Androgenic Gland Hormone (IAG) of the Male Red Deep-Sea Crab, Chaceon quinquedens

    Directory of Open Access Journals (Sweden)

    Amanda Lawrence

    2017-08-01

    The insulin-like androgenic gland hormone (IAG) found in decapod crustaceans is known to regulate sexual development in males. IAG is produced in the male-specific endocrine tissue, the androgenic gland (AG); however, IAG expression has also been observed in other tissues of decapod crustacean species including Callinectes sapidus and Scylla paramamosain. This study aimed to isolate the full-length cDNA sequence of IAG from the AG of male red deep-sea crabs, Chaceon quinquedens (ChqIAG), and to examine its tissue distribution. To this end, we employed polymerase chain reaction cloning with degenerate primers and 5′ and 3′ rapid amplification of cDNA ends (RACE). The full-length ChqIAG cDNA sequence (1555 nt) includes a 366 nt 5′ untranslated region (UTR), a 453 nt open reading frame (ORF) encoding 151 amino acids, and a relatively long 3′ UTR of 733 nt. The ORF consists of a 19 aa signal peptide, 32 aa B chain, 56 aa C chain, and 44 aa A chain. The putative ChqIAG amino acid sequence is most similar to those found in other crab species, including C. sapidus and S. paramamosain, which are clustered together phylogenetically.
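
    The lengths quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers given above; the reading of the 3 nt remainder as a stop codon counted outside the 453 nt ORF is an assumption, not something the abstract states.

```python
# Lengths reported in the abstract (nucleotides unless noted).
utr5, orf, utr3, total = 366, 453, 733, 1555
n_aa = 151  # amino acids encoded by the ORF

# Domain lengths in amino acids, as listed in the abstract.
domains = {"signal peptide": 19, "B chain": 32, "C chain": 56, "A chain": 44}

# Three nucleotides per codon: 151 aa correspond to exactly 453 nt.
orf_matches_protein = (orf == 3 * n_aa)

# The four domains together account for the whole predicted protein.
domains_match_protein = (sum(domains.values()) == n_aa)

# Nucleotides not covered by the three reported regions.
# Assumption: these would be the stop codon, if it is excluded from the ORF count.
unaccounted = total - (utr5 + orf + utr3)

print(orf_matches_protein, domains_match_protein, unaccounted)
```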

  18. Structure and characterization of a cDNA clone for phenylalanine ammonia-lyase from cut-injured roots of sweet potato

    International Nuclear Information System (INIS)

    Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki; Ohashi, Yuko; Kano-Murakami, Yuriko; Ozeki, Yoshihiro

    1989-01-01

    A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The Mr of its subunit was 77,000. The cells converted [14C]-L-phenylalanine into [14C]-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading frame capable of coding for a polypeptide with 707 amino acids (Mr 77,137), a 22-bp 5'-noncoding region and a 207-bp 3'-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology.

  19. Human cDNA clones for an α subunit of Gi signal-transduction protein

    International Nuclear Information System (INIS)

    Bray, P.; Carter, A.; Guo, V.; Puckett, C.; Kamholz, J.; Spiegel, A.; Nirenberg, M.

    1987-01-01

    Two cDNA clones were obtained from a λgt11 cDNA human brain library that correspond to αi subunits of G signal-transduction proteins (where αi subunits refer to the α subunits of G proteins that inhibit adenylate cyclase). The nucleotide sequence of human brain αi is highly homologous to that of bovine brain αi and the predicted amino acid sequences are identical. However, human and bovine brain αi cDNAs differ significantly from αi cDNAs from human monocytes, rat glioma, and mouse macrophages in amino acid (88% homology) and nucleotide (71-75% homology) sequences. In addition, the nucleotide sequences of the 3' untranslated regions of human and bovine brain αi cDNAs differ markedly from the sequences of human monocyte, rat glioma, and mouse macrophage αi cDNAs. These results suggest there are at least two classes of αi mRNA.

  20. Evaluation and Adaptation of a Laboratory-Based cDNA Library Preparation Protocol for Retrospective Sequencing of Archived MicroRNAs from up to 35-Year-Old Clinical FFPE Specimens.

    Science.gov (United States)

    Loudig, Olivier; Wang, Tao; Ye, Kenny; Lin, Juan; Wang, Yihong; Ramnauth, Andrew; Liu, Christina; Stark, Azadeh; Chitale, Dhananjay; Greenlee, Robert; Multerer, Deborah; Honda, Stacey; Daida, Yihe; Spencer Feigelson, Heather; Glass, Andrew; Couch, Fergus J; Rohan, Thomas; Ben-Dov, Iddo Z

    2017-03-14

    Formalin-fixed paraffin-embedded (FFPE) specimens, when used in conjunction with patient clinical data history, represent an invaluable resource for molecular studies of cancer. Even though nucleic acids extracted from archived FFPE tissues are degraded, their molecular analysis has become possible. In this study, we optimized a laboratory-based next-generation sequencing barcoded cDNA library preparation protocol for analysis of small RNAs recovered from archived FFPE tissues. Using matched fresh and FFPE specimens, we evaluated the robustness and reproducibility of our optimized approach, as well as its applicability to archived clinical specimens stored for up to 35 years. We then evaluated this cDNA library preparation protocol by performing a miRNA expression analysis of archived breast ductal carcinoma in situ (DCIS) specimens, selected for their relation to the risk of subsequent breast cancer development and obtained from six different institutions. Our analyses identified six miRNAs (miR-29a, miR-221, miR-375, miR-184, miR-363, miR-455-5p) differentially expressed between DCIS lesions from women who subsequently developed an invasive breast cancer (cases) and women who did not develop invasive breast cancer within the same time interval (control). Our thorough evaluation and application of this laboratory-based miRNA sequencing analysis indicates that the preparation of small RNA cDNA libraries can reliably be performed on older, archived, clinically-classified specimens.

  1. PCR-based cDNA library construction: general cDNA libraries at the level of a few cells.

    OpenAIRE

    Belyavsky, A; Vinogradova, T; Rajewsky, K

    1989-01-01

    A procedure for the construction of general cDNA libraries is described which is based on the amplification of total cDNA in vitro. The first cDNA strand is synthesized from total RNA using an oligo(dT)-containing primer. After oligo(dG) tailing the total cDNA is amplified by PCR using two primers complementary to oligo(dA) and oligo(dG) ends of the cDNA. For insertion of the cDNA into a vector a controlled trimming of the 3' ends of the cDNA by Klenow enzyme was used. Starting from 10 J558L ...

  2. Pattern analysis approach reveals restriction enzyme cutting abnormalities and other cDNA library construction artifacts using raw EST data

    Directory of Open Access Journals (Sweden)

    Zhou Sun

    2012-05-01

    Background: Expressed Sequence Tag (EST) sequences are widely used in applications such as genome annotation, gene discovery and gene expression studies. However, some GenBank dbEST sequences have proven to be “unclean”. Identification of cDNA termini/ends and their structures in raw ESTs not only facilitates data quality control and accurate delineation of transcription ends, but also furthers our understanding of the potential sources of data abnormalities/errors present in the wet-lab procedures for cDNA library construction. Results: After analyzing a total of 309,976 raw Pinus taeda ESTs, we uncovered many distinct variations of cDNA termini, some of which prove to be good indicators of wet-lab artifacts, and characterized each raw EST by its cDNA terminus structure patterns. In contrast to the expected patterns, many ESTs displayed complex and/or abnormal patterns that represent potential wet-lab errors such as: a failure of one or both of the restriction enzymes to cut the plasmid vector; a failure of the restriction enzymes to cut the vector at the correct positions; the insertion of two cDNA inserts into a single vector; the insertion of multiple and/or concatenated adapters/linkers; the presence of 3′-end terminal structures in designated 5′-end sequences or vice versa; and so on. With a close examination of these artifacts, many problematic ESTs that have been deposited into public databases by conventional bioinformatics pipelines or tools could be cleaned or filtered by our methodology. We developed a software tool for Abnormality Filtering and Sequence Trimming for ESTs (AFST, http://code.google.com/p/afst/) using a pattern analysis approach. To compare AFST with other pipelines that submitted ESTs into dbEST, we reprocessed 230,783 Pinus taeda and 38,709 Arachis hypogaea GenBank ESTs. We found 7.4% of Pinus taeda and 29.2% of Arachis hypogaea GenBank ESTs are “unclean” or abnormal, all of which could be cleaned

  3. Sequence-based prediction of protein protein interaction using a deep-learning algorithm.

    Science.gov (United States)

    Sun, Tanlin; Zhou, Bo; Lai, Luhua; Pei, Jianfeng

    2017-05-25

    Protein-protein interactions (PPIs) are critical for many biological processes. It is therefore important to develop accurate high-throughput methods for identifying PPI to better understand protein function, disease occurrence, and therapy design. Though various computational methods for predicting PPI have been developed, their robustness for prediction with external datasets is unknown. Deep-learning algorithms have achieved successful results in diverse areas, but their effectiveness for PPI prediction has not been tested. We used a stacked autoencoder, a type of deep-learning algorithm, to study the sequence-based PPI prediction. The best model achieved an average accuracy of 97.19% with 10-fold cross-validation. The prediction accuracies for various external datasets ranged from 87.99% to 99.21%, which are superior to those achieved with previous methods. To our knowledge, this research is the first to apply a deep-learning algorithm to sequence-based PPI prediction, and the results demonstrate its potential in this field.

  4. Deep sequencing methods for protein engineering and design.

    Science.gov (United States)

    Wrenbeck, Emily E; Faber, Matthew S; Whitehead, Timothy A

    2017-08-01

    The advent of next-generation sequencing (NGS) has revolutionized protein science, and the development of complementary methods enabling NGS-driven protein engineering has followed. In general, these experiments address the functional consequences of thousands of protein variants in a massively parallel manner using genotype-phenotype linked high-throughput functional screens followed by DNA counting via deep sequencing. We highlight the use of information-rich datasets to engineer protein molecular recognition. Examples include the creation of multiple dual-affinity Fabs targeting structurally dissimilar epitopes and engineering of a broad germline-targeted anti-HIV-1 immunogen. Additionally, we highlight the generation of enzyme fitness landscapes for conducting fundamental studies of protein behavior and evolution. We conclude with discussion of technological advances. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Triazole-linked DNA as a primer surrogate in the synthesis of first-strand cDNA.

    Science.gov (United States)

    Fujino, Tomoko; Yasumoto, Ken-ichi; Yamazaki, Naomi; Hasome, Ai; Sogawa, Kazuhiro; Isobe, Hiroyuki

    2011-11-04

    A phosphate-eliminated nonnatural oligonucleotide serves as a primer surrogate in the reverse transcription reaction of mRNA. Despite the nonnatural triazole linkages in the surrogate, the reverse transcriptase effectively elongated cDNA sequences on the 3'-downstream of the primer by transcription of the complementary sequence of mRNA. A structure-activity comparison with the reference natural oligonucleotides shows the superior priming activity of the surrogate containing triazole linkages. The nonnatural linkages also protect the transcribed cDNA from digestion reactions with 5'-exonuclease and enable us to remove noise transcripts of unknown origins. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. A tobacco cDNA reveals two different transcription patterns in vegetative and reproductive organs

    Directory of Open Access Journals (Sweden)

    I. da Silva

    2002-08-01

    In order to identify genes expressed in the pistil that may have a role in the reproduction process, we have established an expressed sequence tags project to randomly sequence clones from a Nicotiana tabacum stigma/style cDNA library. A cDNA clone (MTL-8) showing high sequence similarity to genes encoding glycine-rich RNA-binding proteins was chosen for further characterization. Based on the extensive identity of MTL-8 to the RGP-1a sequence of N. sylvestris, a primer was defined to extend the 5' sequence of MTL-8 by RT-PCR from stigma/style RNAs. The amplification product was sequenced and it was confirmed that MTL-8 corresponds to an mRNA encoding a glycine-rich RNA-binding protein. Two transcripts of different sizes and expression patterns were identified when the MTL-8 cDNA insert was used as a probe in RNA blots. The largest is 1,100 nucleotides (nt) long and markedly predominant in ovaries. The smaller transcript, with 600 nt, is ubiquitous in the vegetative and reproductive organs analyzed (roots, stems, leaves, sepals, petals, stamens, stigmas/styles and ovaries). Plants submitted to stress (wounding, virus infection and ethylene treatment) presented an increased level of the 600-nt transcript in leaves, especially after tobacco necrosis virus infection. In contrast, the level of the 1,100-nt transcript seems to be unaffected by the stress conditions tested. Results of Southern blot experiments have suggested that MTL-8 is present in one or two copies in the tobacco genome. Our results suggest that the shorter transcript is related to stress while the larger one is a flower-predominant and nonstress-inducible messenger.

  7. Expression analysis of a Cucurbita cDNA encoding endonuclease

    International Nuclear Information System (INIS)

    Szopa, J.

    1995-01-01

    The nuclear matrices of plant cell nuclei display intrinsic nuclease activity which consists in nicking supercoiled DNA. A cDNA encoding a 32 kDa endonuclease has been cloned and sequenced. The nucleotide and deduced amino-acid sequences show high homology to known 14-3-3-protein sequences from other sources. The amino-acid sequence shows agreement with consensus sequences for potential phosphorylation by protein kinase A and C and for calcium, lipid and membrane-binding sites. The nucleotide-binding site is also present within the conserved part of the sequence. By Northern blot analysis, the differential expression of the corresponding mRNA was detected; it was the strongest in sink tissues. The endonuclease activity found on DNA-polyacrylamide gel electrophoresis coincided with mRNA content and was the highest in tuber. (author). 22 refs, 6 figs

  8. Cloning of the cDNA for murine von Willebrand factor and identification of orthologous genes reveals the extent of conservation among diverse species.

    Science.gov (United States)

    Chitta, Mohan S; Duhé, Roy J; Kermode, John C

    2007-05-01

    Interaction of von Willebrand factor (VWF) with circulating platelets promotes hemostasis when a blood vessel is injured. The A1 domain of VWF is responsible for the initial interaction with platelets and is well conserved among species. Knowledge of the cDNA and genomic DNA sequences for human VWF allowed us to predict the cDNA sequence for murine VWF in silico and amplify its entire coding region by RT-PCR. The murine VWF cDNA has an open reading frame of 8,442 bp, encoding a protein of 2,813 amino acid residues with 83% identity to human pre-pro-VWF. The same strategy was used to predict in silico the cDNA sequence for the ortholog of VWF in a further six species. Many of these predictions diverged substantially from the putative Reference Sequences derived by ab initio methods. Our predicted sequences indicated that the VWF gene has a conserved structure of 52 exons in all seven mammalian species examined, as well as in the chicken. There is a minor structural variation in the pufferfish Takifugu rubripes insofar as the VWF gene in this species has 53 exons. Comparison of the translated amino acid sequences also revealed a high degree of conservation. In particular, the cysteine residues are conserved precisely throughout both the pro-peptide and the mature VWF sequence in all species, with a minor exception in the pufferfish VWF ortholog where two adjacent cysteine residues are omitted. The marked conservation of cysteine residues emphasizes the importance of the intricate pattern of disulfide bonds in governing the structure of pro-VWF and regulating the function of the mature VWF protein. It should also be emphasized that many of the conserved features of the VWF gene and protein were obscured when the comparison among species was based on the putative Reference Sequences instead of our predicted cDNA sequences.

  9. Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics.

    Directory of Open Access Journals (Sweden)

    Ehsaneddin Asgari

    We introduce a new representation and feature extraction method for biological sequences. Named bio-vectors (BioVec) to refer to biological sequences in general, with protein-vectors (ProtVec) for proteins (amino-acid sequences) and gene-vectors (GeneVec) for gene sequences, this representation can be widely used in applications of deep learning in proteomics and genomics. In the present paper, we focus on protein-vectors that can be utilized in a wide array of bioinformatics investigations such as family classification, protein visualization, structure prediction, disordered protein identification, and protein-protein interaction prediction. In this method, we adopt artificial neural network approaches and represent a protein sequence with a single dense n-dimensional vector. To evaluate this method, we apply it in classification of 324,018 protein sequences obtained from Swiss-Prot belonging to 7,027 protein families, where an average family classification accuracy of 93%±0.06% is obtained, outperforming existing family classification methods. In addition, we use the ProtVec representation to distinguish disordered proteins from structured proteins. Two databases of disordered sequences are used: the DisProt database as well as a database featuring the disordered regions of nucleoporins rich with phenylalanine-glycine repeats (FG-Nups). Using support vector machine classifiers, FG-Nup sequences are distinguished from structured protein sequences found in the Protein Data Bank (PDB) with 99.8% accuracy, and unstructured DisProt sequences are differentiated from structured DisProt sequences with 100.0% accuracy. These results indicate that by only providing sequence data for various proteins to this model, accurate information about protein structure can be determined. Importantly, this model needs to be trained only once and can then be applied to extract a comprehensive set of information regarding proteins of interest.
Moreover, this representation can be

  10. Budding yeast cDNA sequencing project: S03036-05_I15 [Budding yeast cDNA sequencing project

    Lifescience Database Archive (English)

    Full Text Available EST - Link to UCSC Genome Browser - Sequence >S03036-05_I15.phd NNNTNNTNNNNCNCTCACATANAAGACGGANNAGNNNGCTGGGC...CAATGCGTTCCATATGCG AAAATTCTTGGNCAATGTATTCTCTAGCAATCTNTNCTTTTGTACANTCGGAGGNTTNTC ATGNTCCTTTCATANATTATANAAANNG

  11. Full-length cDNA sequence cloning and analysis of Ghrelin in Cervus nippon

    Institute of Scientific and Technical Information of China (English)

    张曼; 金鑫; 田巧珍; 刘骄; 王云鹤; 杨银凤

    2017-01-01

    To obtain the full-length cDNA of Ghrelin in Cervus nippon, RT-PCR and RACE were performed using total RNA from the abomasal mucosal epithelium of C. nippon as template. Sequence analysis revealed a 539 bp cDNA containing a 46 bp 5'-untranslated region (5'UTR), a 128 bp 3'-untranslated region (3'UTR), and a 351 bp open reading frame (ORF) encoding 116 amino acids. Alignment of the C. nippon Ghrelin cDNA with those of human and other animals showed sequence homology of 90.4%-99.1% with reindeer, goat, sheep and cattle; 66.9%-76.6% with rhesus monkey, human, pig and dog; and 36.4% and 35.4% with chicken and C. livia, respectively. These results indicate that the structure of Ghrelin displays obvious species specificity, suggesting that Ghrelin might play an important physiological role in ruminants.

  12. miRBase: annotating high confidence microRNAs using deep sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2014-01-01

    We describe an update of the miRBase database (http://www.mirbase.org/), the primary microRNA sequence repository. The latest miRBase release (v20, June 2013) contains 24 521 microRNA loci from 206 species, processed to produce 30 424 mature microRNA products. The rate of deposition of novel microRNAs and the number of researchers involved in their discovery continue to increase, driven largely by small RNA deep sequencing experiments. In the face of these increases, and a range of microRNA annotation methods and criteria, maintaining the quality of the microRNA sequence data set is a significant challenge. Here, we describe recent developments of the miRBase database to address this issue. In particular, we describe the collation and use of deep sequencing data sets to assign levels of confidence to miRBase entries. We now provide a high confidence subset of miRBase entries, based on the pattern of mapped reads. The high confidence microRNA data set is available alongside the complete microRNA collection at http://www.mirbase.org/. We also describe embedding microRNA-specific Wikipedia pages on the miRBase website to encourage the microRNA community to contribute and share textual and functional information.

  13. LookSeq: a browser-based viewer for deep sequencing data.

    Science.gov (United States)

    Manske, Heinrich Magnus; Kwiatkowski, Dominic P

    2009-11-01

    Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an overview of a genomic region to fine details such as heterogeneity within the sample. A specific problem, particularly if the sample is heterogeneous, is how to depict information about structural variation. LookSeq provides a simple graphical representation of paired sequence reads that is more revealing about potential insertions and deletions than are conventional methods.

  14. AUC-Maximized Deep Convolutional Neural Fields for Protein Sequence Labeling.

    Science.gov (United States)

    Wang, Sheng; Sun, Siqi; Xu, Jinbo

    2016-09-01

    Deep Convolutional Neural Networks (DCNN) have shown excellent performance in a variety of machine learning tasks. This paper presents Deep Convolutional Neural Fields (DeepCNF), an integration of DCNN with Conditional Random Field (CRF), for sequence labeling with an imbalanced label distribution. The widely used training methods, such as maximum-likelihood and maximum labelwise accuracy, do not work well on imbalanced data. To handle this, we present a new training algorithm called maximum-AUC for DeepCNF. That is, we train DeepCNF by directly maximizing the empirical Area Under the ROC Curve (AUC), which is an unbiased measurement for imbalanced data. To achieve this, we formulate AUC in a pairwise ranking framework, approximate it by a polynomial function and then apply a gradient-based procedure to optimize it. Our experimental results confirm that maximum-AUC greatly outperforms the other two training methods on 8-state secondary structure prediction and disorder prediction, whose label distributions are highly imbalanced, and performs similarly to the other two training methods on solvent accessibility prediction, which has three equally distributed labels. Furthermore, our experimental results show that our AUC-trained DeepCNF models greatly outperform existing popular predictors of these three tasks. The data and software related to this paper are available at https://github.com/realbigws/DeepCNF_AUC.
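
    The pairwise ranking view of AUC mentioned in this abstract can be shown on a binary toy problem. The sketch below is not the DeepCNF implementation: the function name, scores, and labels are made up for illustration, and it covers only the exact (non-differentiable) empirical AUC, not the polynomial approximation the authors optimize.

```python
import numpy as np


def pairwise_auc(scores, labels):
    """Empirical AUC as the fraction of (positive, negative) pairs ranked correctly.

    A pair counts 1 if the positive example scores higher than the negative,
    0.5 if the two scores tie, and 0 otherwise.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]   # every positive against every negative
    correct = np.sum(diff > 0) + 0.5 * np.sum(diff == 0)
    return correct / (len(pos) * len(neg))


# Small mixed example: 4 of the 6 positive/negative pairs are ordered correctly.
auc_small = pairwise_auc([0.1, 0.4, 0.35, 0.8, 0.3], [0, 0, 0, 1, 1])

# Imbalanced example: a constant predictor labels everything negative.
labels_imb = np.array([0] * 9 + [1])
scores_imb = np.zeros(10)
accuracy_imb = np.mean((scores_imb > 0.5).astype(int) == labels_imb)
auc_imb = pairwise_auc(scores_imb, labels_imb)

print(auc_small, accuracy_imb, auc_imb)
```

    In the imbalanced example the constant predictor reaches 90% accuracy while its AUC stays at chance level, which is the motivation the abstract gives for training on AUC. The step function in the pairwise count has zero gradient almost everywhere, so gradient-based training needs a smooth stand-in for it.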

  15. Characterization of cDNA encoding human placental anticoagulant protein (PP4): Homology with the lipocortin family

    International Nuclear Information System (INIS)

    Grundmann, U.; Abel, K.J.; Bohn, H.; Loebermann, H.; Lottspeich, F.; Kuepper, H.

    1988-01-01

    A cDNA library prepared from human placenta was screened for sequences encoding the placental protein 4 (PP4). PP4 is an anticoagulant protein that acts as an indirect inhibitor of the thromboplastin-specific complex, which is involved in the blood coagulation cascade. Partial amino acid sequence information from PP4-derived cyanogen bromide fragments was used to design three oligonucleotide probes for screening the library. From 10^6 independent recombinants, 18 clones were identified that hybridized to all three probes. These 18 recombinants contained cDNA inserts encoding a protein of 320 amino acid residues. In addition to the PP4 cDNA the authors identified 9 other recombinants encoding a protein with considerable similarity (74%) to PP4, which was termed PP4-X. PP4 and PP4-X belong to the lipocortin family, as judged by their homology to lipocortin I and calpactin I.

  16. cDNA cloning of human DNA topoisomerase I. Catalytic activity of a 67.7-kDa carboxyl-terminal fragment

    International Nuclear Information System (INIS)

    D'Arpa, P.; Machlin, P.S.; Ratrie, H. III; Rothfield, N.F.; Cleveland, D.W.; Earnshaw, W.C.

    1988-01-01

    cDNA clones encoding human topoisomerase I were isolated from an expression vector library (λgt11) screened with autoimmune anti-topoisomerase I serum. One of these clones has been expressed as a fusion protein comprised of a 32-kDa fragment of the bacterial TrpE protein linked to 67.7 kDa of protein encoded by the cDNA. Three lines of evidence indicate that the cloned cDNA encodes topoisomerase I. (i) Proteolysis maps of the fusion protein and human nuclear topoisomerase I are essentially identical. (ii) The fusion protein relaxes supercoiled DNA, an activity that can be immunoprecipitated by anti-topoisomerase I serum. (iii) Sequence analysis has revealed that the longest cDNA clone (3645 base pairs) encodes a protein of 765 amino acids that shares 42% identity with Saccharomyces cerevisiae topoisomerase I. The sequence data also show that the catalytically active 67.7-kDa fragment is comprised of the carboxyl terminus

  17. Isolation and sequencing of a cDNA coding for the human DF3 breast carcinoma-associated antigen

    International Nuclear Information System (INIS)

    Siddiqui, J.; Abe, M.; Hayes, D.; Shani, E.; Yunis, E.; Kufe, D.

    1988-01-01

    The murine monoclonal antibody (mAb) DF3 reacts with a high molecular weight glycoprotein detectable in human breast carcinomas. DF3 antigen expression correlates with human breast tumor differentiation, and the detection of a cross-reactive species in human milk has suggested that this antigen might be useful as a marker of differentiated mammary epithelium. To further characterize DF3 antigen expression, the authors have isolated a cDNA clone from a λgt11 library by screening with mAb DF3. The results demonstrate that this 309-base-pair cDNA, designated pDF9.3, codes for the DF3 epitope. Southern blot analyses of EcoRI-digested DNAs from six human tumor cell lines with 32P-labeled pDF9.3 have revealed a restriction fragment length polymorphism. Variations in size of the alleles detected by pDF9.3 were also identified in PstI, but not in HindIII, DNA digests. Furthermore, hybridization of 32P-labeled pDF9.3 with total cellular RNA from each of these cell lines demonstrated either one or two transcripts that varied from 4.1 to 7.1 kilobases in size. The presence of differently sized transcripts detected by pDF9.3 was also found to correspond with the polymorphic expression of DF3 glycoproteins. Nucleotide sequence analysis of pDF9.3 has revealed a highly conserved (G + C)-rich 60-base-pair tandem repeat. These findings suggest that the variation in size of alleles coding for the polymorphic DF3 glycoprotein may represent different numbers of repeats.

  18. Characterization of cDNA for human tripeptidyl peptidase II: The N-terminal part of the enzyme is similar to subtilisin

    International Nuclear Information System (INIS)

    Tomkinson, B.; Jonsson, A-K

    1991-01-01

    Tripeptidyl peptidase II is a high molecular weight serine exopeptidase, which has been purified from rat liver and human erythrocytes. Four clones, representing 4453 bp, or 90% of the mRNA of the human enzyme, have been isolated from two different cDNA libraries. One clone, designated A2, was obtained after screening a human B-lymphocyte cDNA library with a degenerate oligonucleotide mixture. The B-lymphocyte cDNA library, obtained from human fibroblasts, were rescreened with a 147 bp fragment from the 5' part of the A2 clone, whereby three different overlapping cDNA clones could be isolated. The deduced amino acid sequence, 1196 amino acid residues, corresponding to the longest open reading frame of the assembled nucleotide sequence, was compared to sequences of current databases. This revealed a 56% similarity between the bacterial enzyme subtilisin and the N-terminal part of tripeptidyl peptidase II. The enzyme was found to be represented by two different mRNAs of 4.2 and 5.0 kilobases, respectively, which probably result from the utilization of two different polyadenylation sites. Furthermore, cDNA corresponding to both the N-terminal and C-terminal part of tripeptidyl peptidase II hybridized with genomic DNA from mouse, horse, calf, and hen, even under fairly high stringency conditions, indicating that tripeptidyl peptidase II is highly conserved.

  19. Construction of cDNA libraries from Pseudocercospora fijiensis Morelet infected leaves of the cultivars Calcutta 4 and Niyarma Yik

    Directory of Open Access Journals (Sweden)

    Milady Mendoza-Rodríguez

    2004-01-01

    Molecular studies of plant-pathogen interaction are very important for identifying the gene(s) related to the pathogenic process, as well as to plant resistance. These gene(s) could be used in genetic improvement programs in order to obtain resistant cultivars. The aim of this work was to construct complementary DNA (cDNA) libraries from leaves of two banana cultivars (a resistant one, Calcutta 4, and a susceptible one, Niyarma Yik) infected with the Pseudocercospora fijiensis isolate CCIBP-Pf1. First-strand cDNA synthesis was carried out starting from one microgram of total RNA using an oligo dT primer, and cDNA quality was checked by polymerase chain reaction (PCR) with cytochrome b specific primers. Second-strand cDNA synthesis was performed by homopolymeric tailing with the dC-BamH I + dT-Not I primer combination. Four cDNA libraries of infected plants at different times of infection with the pathogen were obtained. Forty-one clones from one of the Niyarma Yik libraries were sequenced, and the sequences obtained correspond to genes related to fungi. Key words: Banana-Mycosphaerella fijiensis interaction, Black Sigatoka, Musa spp.

  20. CPSS: a computational platform for the analysis of small RNA deep sequencing data.

    Science.gov (United States)

    Zhang, Yuanwei; Xu, Bo; Yang, Yifan; Ban, Rongjun; Zhang, Huan; Jiang, Xiaohua; Cooke, Howard J; Xue, Yu; Shi, Qinghua

    2012-07-15

    Next generation sequencing (NGS) techniques have been widely used to document the small ribonucleic acids (RNAs) implicated in a variety of biological, physiological and pathological processes. An integrated computational tool is needed for handling and analysing the enormous datasets from small RNA deep sequencing approach. Herein, we present a novel web server, CPSS (a computational platform for the analysis of small RNA deep sequencing data), designed to completely annotate and functionally analyse microRNAs (miRNAs) from NGS data on one platform with a single data submission. Small RNA NGS data can be submitted to this server with analysis results being returned in two parts: (i) annotation analysis, which provides the most comprehensive analysis for small RNA transcriptome, including length distribution and genome mapping of sequencing reads, small RNA quantification, prediction of novel miRNAs, identification of differentially expressed miRNAs, piwi-interacting RNAs and other non-coding small RNAs between paired samples and detection of miRNA editing and modifications and (ii) functional analysis, including prediction of miRNA targeted genes by multiple tools, enrichment of gene ontology terms, signalling pathway involvement and protein-protein interaction analysis for the predicted genes. CPSS, a ready-to-use web server that integrates most functions of currently available bioinformatics tools, provides all the information wanted by the majority of users from small RNA deep sequencing datasets. CPSS is implemented in PHP/PERL+MySQL+R and can be freely accessed at http://mcg.ustc.edu.cn/db/cpss/index.html or http://mcg.ustc.edu.cn/sdap1/cpss/index.html.

  1. Automation of cDNA Synthesis and Labelling Improves Reproducibility

    Directory of Open Access Journals (Sweden)

    Daniel Klevebring

    2009-01-01

    Background. Several technologies, such as in-depth sequencing and microarrays, enable large-scale interrogation of genomes and transcriptomes. In this study, we assess reproducibility and throughput by moving all laboratory procedures to a robotic workstation, capable of handling superparamagnetic beads. Here, we describe a fully automated procedure for cDNA synthesis and labelling for microarrays, where the purification steps prior to and after labelling are based on precipitation of DNA on carboxylic acid-coated paramagnetic beads. Results. The fully automated procedure allows for samples arrayed on a microtiter plate to be processed in parallel without manual intervention and ensuring high reproducibility. We compare our results to a manual sample preparation procedure and, in addition, use a comprehensive reference dataset to show that the protocol described performs better than similar manual procedures. Conclusions. We demonstrate, in an automated gene expression microarray experiment, a reduced variance between replicates, resulting in an increase in the statistical power to detect differentially expressed genes, thus allowing smaller differences between samples to be identified. This protocol can with minor modifications be used to create cDNA libraries for other applications such as in-depth analysis using next-generation sequencing technologies.

  2. Isolation and Cloning of cDNA Fragment of Gene Encoding for Multidrug Resistance Associated Protein from M. affine.

    Directory of Open Access Journals (Sweden)

    Utut Widyastuti Suharsono

    2008-11-01

    M. affine can grow well in acid soil with a high level of soluble aluminum. One of the important proteins in detoxifying xenobiotic stress, including acid and Al stresses, is a multidrug resistance associated protein (MRP) encoded by the mrp gene. The objective of this research is to isolate and clone the cDNA fragment of MaMrp encoding MRP from M. affine. Total cDNA was synthesized by reverse transcription using total RNA as template. The MaMrp cDNA fragment was successfully isolated by PCR using total cDNA as template and mrp primers designed from A. thaliana, yeast, and human. This fragment was successfully inserted into pGEM-T Easy and the recombinant plasmid was successfully introduced into E. coli DH5α. Nucleotide sequence analysis showed that the length of the MaMrp fragment is 633 bp encoding 208 amino acids. Local alignment analysis based on the mRNA nucleotide sequence showed that the MaMrp fragment is 69% identical to AtMrp1 and 63% to AtMrp from A. thaliana. Based on the deduced amino acid sequence, MaMRP is 84% identical to part of AtMRP13, 77% to AtMRP12, and 73% to AtMRP1 from A. thaliana, respectively. Alignment analysis with AtMRP1 showed that the MaMRP fragment is located in the TM1 and NBF1 domains and has a specific amino acid sequence QCKAQLQNMEEE.
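
    Percent identity figures such as those reported in this abstract come from an alignment. A minimal sketch of the arithmetic follows, assuming the two sequences are already aligned to equal length with "-" marking gaps and that gap columns are excluded from the denominator. Other conventions exist, and the abstract does not state which one was used; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy alignment: one gap column is skipped, three of four columns match.
print(percent_identity("AC-GT", "ACTGA"))
```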

  3. Cloning and Sequencing of Protein Kinase cDNA from Harbor Seal (Phoca vitulina) Lymphocytes

    Directory of Open Access Journals (Sweden)

    Jennifer C. C. Neale

    2004-01-01

    Protein kinases (PKs) play critical roles in signal transduction and activation of lymphocytes. The identification of PK genes provides a tool for understanding mechanisms of immunotoxic xenobiotics. As part of a larger study investigating persistent organic pollutants in the harbor seal and their possible immunomodulatory actions, we sequenced harbor seal cDNA fragments encoding PKs. The procedure, using degenerate primers based on conserved motifs of human protein tyrosine kinases (PTKs), successfully amplified nine phocid PK gene fragments with high homology to human and rodent orthologs. We identified eight PTKs and one dual (serine/threonine and tyrosine) kinase. Among these were several PKs important in early signaling events through the B- and T-cell receptors (FYN, LYN, ITK and SYK) and a MAP kinase involved in downstream signal transduction. V-FGR, RET and DDR2 were also expressed. Sequential activation of protein kinases ultimately induces gene transcription leading to the proliferation and differentiation of lymphocytes critical to adaptive immunity. PKs are potential targets of bioactive xenobiotics, including persistent organic pollutants of the marine environment; characterization of these molecules in the harbor seal provides a foundation for further research illuminating mechanisms of action of contaminants speculated to contribute to large-scale die-offs of marine mammals via immunosuppression.

  4. Deep RNA Sequencing of the Skeletal Muscle Transcriptome in Swimming Fish

    NARCIS (Netherlands)

    Palstra, A.P.; Beltran, S.; Burgerhout, E.; Brittijn, S.A.; Magnoni, L.J.; Henkel, C.V.; Jansen, A.; Thillart, G.E.E.J.M.; Spaink, H.P.; Planas, J.V.

    2013-01-01

    Deep RNA sequencing (RNA-seq) was performed to provide an in-depth view of the transcriptome of red and white skeletal muscle of exercised and non-exercised rainbow trout (Oncorhynchus mykiss) with the specific objective to identify expressed genes and quantify the transcriptomic effects of

  5. Ultra-deep sequencing of intra-host rabies virus populations during cross-species transmission.

    Directory of Open Access Journals (Sweden)

    Monica K Borucki

    2013-11-01

    One of the hurdles to understanding the role of viral quasispecies in RNA virus cross-species transmission (CST) events is the need to analyze a densely sampled outbreak using deep sequencing in order to measure the amount of mutation occurring on a small time scale. In 2009, the California Department of Public Health reported a dramatic increase (350%) in the number of gray foxes infected with a rabies virus variant for which striped skunks serve as a reservoir host in Humboldt County. To better understand the evolution of rabies, deep sequencing was applied to 40 unpassaged rabies virus samples from the Humboldt outbreak. For each sample, approximately 11 kb of the 12 kb genome was amplified and sequenced using the Illumina platform. Average coverage was 17,448 and this allowed characterization of the rabies virus population present in each sample at unprecedented depths. Phylogenetic analysis of the consensus sequence data demonstrated that samples clustered according to date (1995 vs. 2009) and geographic location (northern vs. southern). A single amino acid change in the G protein distinguished a subset of northern foxes from a haplotype present in both foxes and skunks, suggesting this mutation may have played a role in the observed increased transmission among foxes in this region. Deep-sequencing data indicated that many genetic changes associated with the CST event occurred prior to 2009, since several nonsynonymous mutations that were present in the consensus sequences of skunk and fox rabies samples obtained from 2003-2010 were present at the sub-consensus level (as rare variants in the viral population) in skunk and fox samples from 1995. These results suggest that analysis of rare variants within a viral population may yield clues to ancestral genomes and identify rare variants that have the potential to be selected for if environmental conditions change.

  6. Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues

    International Nuclear Information System (INIS)

    Prody, C.A.; Zevin-Sonkin, D.; Gnatt, A.; Goldberg, O.; Soreq, H.

    1987-01-01

    To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase and Torpedo electric organ true acetylcholinesterase. Using these probes, the authors isolated several cDNA clones from λgt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments with a total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species.

  7. Cloning and chromosomal assignment of a human cDNA encoding a T cell- and natural killer cell-specific trypsin-like serine protease

    International Nuclear Information System (INIS)

    Gershenfeld, H.K.; Hershberger, R.J.; Shows, T.B.; Weissman, I.L.

    1988-01-01

    A cDNA clone encoding a human T cell- and natural killer cell-specific serine protease was obtained by screening a phage λgt10 cDNA library from phytohemagglutinin-stimulated human peripheral blood lymphocytes with the mouse Hanukah factor cDNA clone. In an RNA blot-hybridization analysis, this human Hanukah factor cDNA hybridized with a 1.3-kilobase band in allogeneic-stimulated cytotoxic T cells and the Jurkat cell line, but this transcript was not detectable in normal muscle, liver, tonsil, or thymus. By dot-blot hybridization, this cDNA hybridized with RNA from three cytolytic T-cell clones and three noncytolytic T-cell clones grown in vitro as well as with purified CD16+ natural killer cells and CD3+, CD16- T-cell large granular lymphocytes from peripheral blood lymphocytes (CD = cluster designation). The nucleotide sequence of this cDNA clone encodes a predicted serine protease of 262 amino acids. The active enzyme is 71% and 77% similar to the mouse sequence at the amino acid and DNA level, respectively. The human and mouse sequences conserve the active site residues of serine proteases, the trypsin-specific Asp-189, and all 10 cysteine residues. The gene for the human Hanukah factor serine protease is located on human chromosome 5. The authors propose that this trypsin-like serine protease may function as a common component necessary for lysis of target cells by cytotoxic T lymphocytes and natural killer cells.

  8. Growth hormone and prolactin in Andrias davidianus: cDNA cloning, tissue distribution and phylogenetic analysis.

    Science.gov (United States)

    Yang, Liping; Meng, Zining; Liu, Yun; Zhang, Yong; Liu, Xiaochun; Lu, Danqi; Huang, Junhai; Lin, Haoran

    2010-01-15

    The Chinese giant salamander (Andrias davidianus) is one of the largest amphibians and is regarded as a 'living fossil' species. To obtain genetic information for this species, the cDNAs encoding growth hormone (adGH) and prolactin (adPRL) were cloned from a pituitary cDNA library. The isolated adGH cDNA consisted of 864 bp and encoded a propeptide of 215 amino acids, while the cDNA of adPRL was 1106 bp in length and encoded a putative peptide of 229 amino acids. Expression of the GH and PRL mRNA was only detected in the pituitary. Phylogenetic analyses were performed based on the isolated pituitary hormone sequences using maximum parsimony and neighbor-joining algorithms. The clustering results are similar to those based on morphological characteristics or the rRNA genes, which indicate that the two amphibian orders (Anura and Caudata) are monophyletic and that A. davidianus diverged early in the Caudata clade. These results indicate that both the GH and PRL sequences might be useful for studying the phylogenies of moderately evolved groups.

  9. Determining mutant spectra of three RNA viral samples using ultra-deep sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Chen, H

    2012-06-06

    RNA viruses have extremely high mutation rates that enable the virus to adapt to new host environments and even jump from one species to another. As part of a viral transmission study, three viral samples collected from naturally infected animals were sequenced using Illumina paired-end technology at ultra-deep coverage. In order to determine the mutant spectra within the viral quasispecies, it is critical to understand the sequencing error rates and control for false positive calls of viral variants (point mutations). I will estimate the sequencing error rate from two control sequences and characterize the mutant spectra in the natural samples with this error rate.
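
    One way to picture the approach outlined in this abstract is sketched below. It assumes, purely for illustration, that the error rate is the fraction of mismatched bases in the control reads and that a candidate variant is accepted when its read count is improbably high under a binomial error model. The abstract does not specify the statistical test, and all function names and the threshold `alpha` are hypothetical.

```python
import math


def estimate_error_rate(mismatches, total_bases):
    """Per-base error rate observed in control sequences of known identity."""
    return mismatches / total_bases


def binom_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), with terms computed in log space."""
    if k <= 0:
        return 1.0
    if p <= 0.0:
        return 0.0
    if p >= 1.0:
        return 1.0
    total = 0.0
    for i in range(k, n + 1):
        log_term = (
            math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
            + i * math.log(p) + (n - i) * math.log1p(-p)
        )
        total += math.exp(log_term)
    return min(total, 1.0)


def is_variant(alt_count, depth, error_rate, alpha=0.01):
    """Flag a site when alt_count is unlikely to arise from errors alone."""
    return binom_tail(alt_count, depth, error_rate) < alpha


rate = estimate_error_rate(50, 100_000)  # hypothetical control counts
print(rate, is_variant(100, 10_000, rate))
```

    In practice a correction for testing many genome positions would also be needed; that step is omitted from this sketch.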

  10. Deep sequencing reveals double mutations in cis of MPL exon 10 in myeloproliferative neoplasms.

    Science.gov (United States)

    Pietra, Daniela; Brisci, Angela; Rumi, Elisa; Boggi, Sabrina; Elena, Chiara; Pietrelli, Alessandro; Bordoni, Roberta; Ferrari, Maurizio; Passamonti, Francesco; De Bellis, Gianluca; Cremonesi, Laura; Cazzola, Mario

    2011-04-01

    Somatic mutations of MPL exon 10, mainly involving a W515 substitution, have been described in JAK2 (V617F)-negative patients with essential thrombocythemia and primary myelofibrosis. We used direct sequencing and high-resolution melt analysis to identify mutations of MPL exon 10 in 570 patients with myeloproliferative neoplasms, and allele specific PCR and deep sequencing to further characterize a subset of mutated patients. Somatic mutations were detected in 33 of 221 patients (15%) with JAK2 (V617F)-negative essential thrombocythemia or primary myelofibrosis. Only one patient with essential thrombocythemia carried both JAK2 (V617F) and MPL (W515L). High-resolution melt analysis identified abnormal patterns in all the MPL mutated cases, while direct sequencing did not detect the mutant MPL in one fifth of them. In 3 cases carrying double MPL mutations, deep sequencing analysis showed identical load and location in cis of the paired lesions, indicating their simultaneous occurrence on the same chromosome.

  11. Amino acid sequence of bovine muzzle epithelial desmocollin derived from cloned cDNA: a novel subtype of desmosomal cadherins.

    Science.gov (United States)

    Koch, P J; Goldschmidt, M D; Walsh, M J; Zimbelmann, R; Schmelz, M; Franke, W W

    1991-05-01

    Desmosomes are cell-type-specific intercellular junctions found in epithelium, myocardium and certain other tissues. They consist of assemblies of molecules involved in the adhesion of specific cell types and in the anchorage of cell-type-specific cytoskeletal elements, the intermediate-size filaments, to the plasma membrane. To explore the individual desmosomal components and their functions we have isolated DNA clones encoding the desmosomal glycoprotein, desmocollin, using antibodies and a cDNA expression library from bovine muzzle epithelium. The cDNA-deduced amino-acid sequence of desmocollin (presently we cannot decide to which of the two desmocollins, DC I or DC II, this clone relates) defines a polypeptide with a calculated molecular weight of 85,000, with a single candidate sequence of 24 amino acids sufficiently long for a transmembrane arrangement, and an extracellular aminoterminal portion of 561 amino acid residues, compared to a cytoplasmic part of only 176 amino acids. Amino acid sequence comparisons have revealed that desmocollin is highly homologous to members of the cadherin family of cell adhesion molecules, including the previously sequenced desmoglein, another desmosome-specific cadherin. Using riboprobes derived from cDNAs for Northern-blot analyses, we have identified an mRNA of approximately 6 kb in stratified epithelia such as muzzle epithelium and tongue mucosa but not in two epithelial cell culture lines containing desmosomes and desmoplakins. The difference may indicate drastic differences in mRNA concentration or the existence of cell-type-specific desmocollin subforms. The molecular topology of desmocollin(s) is discussed in relation to possible functions of the individual molecular domains.

  12. Consistent errors in first strand cDNA due to random hexamer mispriming.

    Directory of Open Access Journals (Sweden)

    Thomas P van Gurp

    Priming of random hexamers in cDNA synthesis is known to show sequence bias, but in addition it has been suggested recently that mismatches in random hexamer priming could be a cause of mismatches between the original RNA fragment and observed sequence reads. To explore random hexamer mispriming as a potential source of these errors, we analyzed two independently generated RNA-seq datasets of synthetic ERCC spikes for which the reference is known. First strand cDNA synthesized by random hexamer priming on RNA showed consistent position and nucleotide-specific mismatch errors in the first seven nucleotides. The mismatch errors found in both datasets are consistent in distribution and thermodynamically stable mismatches are more common. This strongly indicates that RNA-DNA mispriming of specific random hexamers causes these errors. Due to their consistency and specificity, mispriming errors can have profound implications for downstream applications if not dealt with properly.
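
    The position-specific mismatch pattern described here can be tabulated with a short sketch. It assumes each read start has already been paired with the reference bases it aligns to, with no indels in the first seven positions; the function name and input layout are hypothetical and are not taken from the authors' pipeline.

```python
def mismatch_rate_by_position(pairs, n_positions=7):
    """Per-position mismatch rate over the first n_positions of each read.

    pairs: iterable of (read_prefix, reference_prefix) strings that are
    already aligned base-for-base.
    """
    mismatches = [0] * n_positions
    totals = [0] * n_positions
    for read, ref in pairs:
        for i in range(min(n_positions, len(read), len(ref))):
            totals[i] += 1
            if read[i] != ref[i]:
                mismatches[i] += 1
    # Positions with no observations are reported as 0.0.
    return [m / t if t else 0.0 for m, t in zip(mismatches, totals)]


example = [("ACGTACG", "ACGTACG"), ("TCGTACG", "ACGTACG")]
print(mismatch_rate_by_position(example))
```

    A rate that is elevated at the read start and falls to background beyond the primed region would be consistent with the mispriming explanation given in the abstract.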

  13. An alternative method for cDNA cloning from surrogate eukaryotic cells transfected with the corresponding genomic DNA.

    Science.gov (United States)

    Hu, Lin-Yong; Cui, Chen-Chen; Song, Yu-Jie; Wang, Xiang-Guo; Jin, Ya-Ping; Wang, Ai-Hua; Zhang, Yong

    2012-07-01

    cDNA is widely used in gene function elucidation and/or transgenics research but often suitable tissues or cells from which to isolate mRNA for reverse transcription are unavailable. Here, an alternative method for cDNA cloning is described and tested by cloning the cDNA of human LALBA (human alpha-lactalbumin) from genomic DNA. First, genomic DNA containing all of the coding exons was cloned from human peripheral blood and inserted into a eukaryotic expression vector. Next, by delivering the plasmids into either 293T or fibroblast cells, surrogate cells were constructed. Finally, the total RNA was extracted from the surrogate cells and cDNA was obtained by RT-PCR. The human LALBA cDNA that was obtained was compared with the corresponding mRNA published in GenBank. The comparison showed that the two sequences were identical. The novel method for cDNA cloning from surrogate eukaryotic cells described here uses well-established techniques that are feasible and simple to use. We anticipate that this alternative method will have widespread applications.

  14. Cloning and sequence analysis of serine proteinase of Gloydius ussuriensis venom gland

    International Nuclear Information System (INIS)

    Sun Dejun; Liu Shanshan; Yang Chunwei; Zhao Yizhuo; Chang Shufang; Yan Weiqun

    2005-01-01

    Objective: To construct a cDNA library using mRNA from the Gloydius ussuriensis (G. ussuriensis) venom gland, and to clone and analyze the serine proteinase gene from the cDNA library. Methods: Total RNA was isolated from the venom gland of G. ussuriensis, and mRNA was purified using an mRNA isolation kit. Full-length cDNA was synthesized by means of the SMART cDNA synthesis strategy and amplified by long-distance PCR; the cDNA was then cloned into the vector pBluescript-SK. The recombinant cDNA was transformed into E. coli DH5α. The cDNA of the serine proteinase gene in the venom gland of G. ussuriensis was detected and amplified using in situ hybridization. The cDNA fragment was inserted into the pGEMT vector and cloned, and its nucleotide sequence was determined. Results: The capacity of the venom gland cDNA library was above 2.3 × 10^6. The open reading frame of the cloned cDNA comprised 702 nucleotides and encoded a pre-zymogen of 234 amino acids containing 12 cysteine residues. Sequence analysis indicated that the deduced amino acid sequence of the cDNA fragment shared high identity with the thrombin-like enzyme genes of other snakes in GenBank. The query sequence exhibited strong amino acid sequence homology of 85% to the serine protease of T. gramineus, thrombin-like serine proteinase I of D. acutus and serine protease catroxase II of C. atrox, respectively. Based on the amino acid sequences of other thrombin-like enzymes, the catalytic residues and disulfide bridges of this thrombin-like enzyme were deduced as follows: catalytic residues His41, Asp86 and Ser180; and six disulfide bridges Cys7-Cys139, Cys26-Cys42, Cys74-Cys232, Cys118-Cys186, Cys150-Cys165 and Cys176-Cys201. Conclusion: The capacity of the venom gland cDNA library is above 2.3 × 10^6, exceeding the 10^5 level. The constructed cDNA library of the G. ussuriensis venom gland should be a helpful platform for detecting new target genes and for further gene manipulation. The cloned serine

  15. cDNA cloning and expression of a human platelet-derived growth factor (PDGF) receptor specific for B-chain-containing PDGF molecules

    International Nuclear Information System (INIS)

    Claesson-Welsh, L.; Eriksson, A.; Moren, A.; Severinsson, L.; Ek, B.; Ostman, A.; Betsholtz, C.; Heldin, C.H.

    1988-01-01

    The structure of the human receptor for platelet-derived growth factor (PDGF) has been deduced through cDNA cloning. A 5.45-kilobase-pair cDNA clone predicts a 1,106-amino-acid polypeptide, including the cleavable signal sequence. The overall amino acid sequence similarity with the murine PDGF receptor is 85%. After transcription of the cDNA and translation in vitro, a PDGF receptor antiserum was used to immunoprecipitate a product of predicted size, which also could be phosphorylated in vitro. Stable introduction of the cDNA into Chinese hamster ovary (CHO) cells led to the expression of a 190-kilodalton component, which was immunoprecipitated by the PDGF receptor antiserum; this most probably represents the mature PDGF receptor. Binding assays with different 125I-labeled dimeric forms of PDGF A and B chains showed that the PDGF receptor expressed in CHO cells bound PDGF-BB and, to a lesser extent, PDGF-AB, but not PDGF-AA.

  16. Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning

    KAUST Repository

    Teng, Haotian; Cao, Minh Duc; Hall, Michael B; Duarte, Tania; Wang, Sheng; Coin, Lachlan J M

    2018-01-01

    Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,000 reads, we show that our model provides state-of-the-art basecalling accuracy, even on previously unseen species. Chiron achieves basecalling speeds of more than 2,000 bases per second using desktop computer graphics processing units.

  17. Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning

    KAUST Repository

    Teng, Haotian

    2018-04-10

    Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,000 reads, we show that our model provides state-of-the-art basecalling accuracy, even on previously unseen species. Chiron achieves basecalling speeds of more than 2,000 bases per second using desktop computer graphics processing units.

  18. De Novo Deep Transcriptome Analysis of Medicinal Plants for Gene Discovery in Biosynthesis of Plant Natural Products.

    Science.gov (United States)

    Han, R; Rai, A; Nakamura, M; Suzuki, H; Takahashi, H; Yamazaki, M; Saito, K

    2016-01-01

    The study of the transcriptome, the entire pool of transcripts in an organism or in single cells at a given physiological or pathological stage, is indispensable for unraveling the connection and regulation between DNA and protein. Before the advent of deep sequencing, microarrays were the main approach for profiling transcripts. Despite obvious shortcomings, including limited dynamic range and the difficulty of comparing results from distinct experiments, microarrays were widely applied. During the past decade, next-generation sequencing (NGS) has revolutionized our understanding of genomics in a fast, high-throughput, cost-effective, and tractable manner. The adoption of NGS has profoundly improved the efficiency and outcomes of efforts to elucidate the genes responsible for producing active compounds in medicinal plants. The whole process involves several steps, from plant material sampling, to cDNA library preparation, to deep sequencing, after which bioinformatics takes over to assemble the enormous yet fragmentary data and extract information from them. The unprecedentedly rapid development of such technologies provides so many choices that selecting a suitable methodology for a specific purpose can be confusing. Here, we review the general approaches for deep transcriptome analysis and then focus on their application in discovering biosynthetic pathways of medicinal plants that produce important secondary metabolites. © 2016 Elsevier Inc. All rights reserved.

  19. Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

    Energy Technology Data Exchange (ETDEWEB)

    Shi, CY; Yang, H; Wei, CL; Yu, O; Zhang, ZZ; Sun, J; Wan, XC

    2011-01-01

    Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro and to transform, and has a large genome, so little genomic information is available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with unsequenced genomes. Using high-throughput Illumina RNA-seq, the transcriptome from poly(A)+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximately 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than the number of existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximately 20-fold increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real
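
    The abstract above reports an N50 of 506 bp for the assembled unigenes. As a minimal sketch of how such a statistic is usually computed, the function below uses the common definition (the length L such that sequences of length L or longer contain at least half of all assembled bases). The function name and the toy lengths are illustrative assumptions, not data or code from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Common definition (assumed here): sort lengths from longest to shortest
    and return the length at which the running total first reaches half of
    the summed length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy example with made-up lengths (not the tea unigenes).
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))

# Consistency check on counts quoted in the abstract:
# 788 contig clusters + 126,306 singletons = 127,094 unigenes.
assert 788 + 126_306 == 127_094
```

    Because N50 weights long sequences more heavily than the mean does, an N50 (506 bp) above the average length (355 bp) is the expected pattern for an assembly dominated by short singletons.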

  20. Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

    Directory of Open Access Journals (Sweden)

    Chen Qi

    2011-02-01

    Full Text Available. Abstract. Background: Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro and to transform, and has a large genome, so little genomic information is available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with unsequenced genomes. Results: Using high-throughput Illumina RNA-seq, the transcriptome from poly(A)+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximately 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than the number of existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximately 20-fold increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were

  1. Horse cDNA clones encoding two MHC class I genes

    Energy Technology Data Exchange (ETDEWEB)

    Barbis, D.P.; Maher, J.K.; Stanek, J.; Klaunberg, B.A.; Antczak, D.F.

    1994-12-31

    Two full-length clones encoding MHC class I genes were isolated by screening a horse cDNA library, using a probe encoding the human HLA-A2.2Y allele. The library was made in the pcDNA1 vector (Invitrogen, San Diego, CA), using mRNA from peripheral blood lymphocytes obtained from a Thoroughbred stallion (No. 0834) homozygous for a common horse MHC haplotype (ELA-A2, -B2, -D2; Antczak et al. 1984; Donaldson et al. 1988). The clones were sequenced, using SP6 and T7 universal primers and horse-specific oligonucleotides designed to extend previously determined sequences.

  2. Purification of MUC1 from Bovine Milk-Fat Globules and Characterization of a Corresponding Full-Length cDNA Clone

    DEFF Research Database (Denmark)

    Pallesen, Lone Tjener; Andersen, Mikkel Holmen; Nielsen, Rune

    2001-01-01

    acid sequences obtained by peptide mapping. The complete amino acid sequence of MUC1 was determined by cloning and sequencing the corresponding bovine mammary gland cDNA, which was shown to encode a protein of 580 amino acid residues comprising a cleavable signal peptide of 22 residues. The deduced...

  3. Cloning of the cDNA for U1 small nuclear ribonucleoprotein particle 70K protein from Arabidopsis thaliana

    Science.gov (United States)

    Reddy, A. S.; Czernik, A. J.; An, G.; Poovaiah, B. W.

    1992-01-01

    We cloned and sequenced a plant cDNA that encodes U1 small nuclear ribonucleoprotein (snRNP) 70K protein. The plant U1 snRNP 70K protein cDNA is not full length and lacks the coding region for 68 amino acids in the amino-terminal region as compared to human U1 snRNP 70K protein. Comparison of the deduced amino acid sequence of the plant U1 snRNP 70K protein with the amino acid sequences of animal and yeast U1 snRNP 70K proteins showed a high degree of homology. The plant U1 snRNP 70K protein is more closely related to the human counterpart than to the yeast 70K protein. The carboxy-terminal half is less well conserved but, like the vertebrate 70K proteins, is rich in charged amino acids. Northern analysis with RNA isolated from different parts of the plant indicates that the snRNP 70K gene is expressed in all of the parts tested. Southern blotting of genomic DNA using the cDNA indicates that the U1 snRNP 70K protein is encoded by a single gene.

  4. Ferritin from the Pacific abalone Haliotis discus hannai: Analysis of cDNA sequence, expression, and activity.

    Science.gov (United States)

    Qiu, Reng; Kan, Yunchao; Li, Dandan

    2016-02-01

    Ferritin plays an important role in iron homeostasis due to its ability to bind and sequester large amounts of iron. In this study, the gene encoding a ferritin (HdhFer2) was cloned from Pacific abalone (Haliotis discus hannai). The full-length cDNA of HdhFer2 contains a 5'-UTR of 121 bp, an ORF of 516 bp, and a 3'-UTR of 252 bp with a polyadenylation signal sequence of AATAAA and a poly(A) tail. It also contains a 31 bp iron-responsive element (IRE) in the 5'-UTR, which is conserved in many ferritins. HdhFer2 consists of 171 amino acid residues with a predicted molecular weight (MW) of ∼19.8 kDa and a theoretical isoelectric point (pI) of 4.84. The deduced amino acid sequence of HdhFer2 contains two ferritin iron-binding region signatures (IBRSs). HdhFer2 mRNA was detected in a wide range of tissues and was dominantly expressed in the gill. Infection with the bacterial pathogen Vibrio anguillarum significantly upregulated HdhFer2 expression in a time-dependent manner. Recombinant HdhFer2 (rHdhFer2) purified from Escherichia coli was able to bind ferrous iron in a concentration-dependent manner. In summary, these results suggest that HdhFer2 is a crucial protein in the iron-withholding defense system, and plays an important role in the innate immune response of abalone. Copyright © 2016 Elsevier Ltd. All rights reserved.
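
    The lengths quoted in this abstract can be cross-checked with simple codon arithmetic. The sketch below assumes that the reported 516 bp ORF includes the stop codon, which is the usual convention but is not stated in the abstract; the function name is hypothetical.

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of amino acid residues encoded by an ORF of orf_bp base pairs.

    Assumption: if includes_stop is True, the final codon is a stop codon
    and does not contribute a residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# Values quoted in the abstract for HdhFer2.
utr5_bp, orf_bp, utr3_bp = 121, 516, 252

print(protein_length_from_orf(orf_bp))   # 171, matching the reported residue count
print(utr5_bp + orf_bp + utr3_bp)        # 889, the sum of the three reported segments
```

    Under that assumption, 516 bp corresponds to 172 codons, of which 171 encode residues, in agreement with the 171 amino acids reported.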

  5. Characterization of cDNA for PMT: a Partial Nicotine Biosynthesis-Related Gene Isolated from Indonesian Local Tobacco (Nicotiana tabacum cv. Sindoro1)

    Directory of Open Access Journals (Sweden)

    SESANTI BASUKI

    2013-12-01

    Full Text Available Nicotine is the major alkaloid compound in cultivated tobacco (Nicotiana tabacum) that could potentially be converted into a carcinogenic compound (nor-nicotine). The PMT gene encoding putrescine N-methyltransferase (PMT) is one of the two key genes that play a prominent role in nicotine biosynthesis. The aim of this study was to isolate and characterize the cDNA sequence originating from the Indonesian local tobacco cv. Sindoro1 (Ntpmt_Sindoro1). The results showed that Ntpmt_Sindoro1 was 1124 bp in length. This cDNA fragment encodes 374 amino acid residues. The predicted polypeptide from the cDNA is a hydrophilic protein with a predicted molecular weight of 40.95 kDa. The predicted amino acid sequence also showed high similarity to the PMT gene products of Nicotiana sp. available in the GenBank database. The amino acid sequence also contains conserved residues exhibited only by PMT genes originating from N. tabacum. Clustering analysis revealed that Ntpmt_Sindoro1 belongs to the same clade as the PMT3 gene, a member of the N. tabacum PMT gene family. The Ntpmt_Sindoro1 cDNA sequence covering exon1-exon8 of the PMT gene fragment has been registered in the GenBank database under the accession number JX978277.

  6. Microaspiration of esophageal gland cells and cDNA library construction for identifying parasitism genes of plant-parasitic nematodes.

    Science.gov (United States)

    Hussey, Richard S; Huang, Guozhong; Allen, Rex

    2011-01-01

    Identifying parasitism genes encoding proteins secreted from a plant-parasitic nematode's esophageal gland cells and injected through its stylet into plant tissue is the key to understanding the molecular basis of nematode parasitism of plants. Parasitism genes have been cloned by directly microaspirating the cytoplasm from the esophageal gland cells of different parasitic stages of cyst or root-knot nematodes to provide mRNA to create a gland cell-specific cDNA library by long-distance reverse-transcriptase polymerase chain reaction. cDNA clones are sequenced and deduced protein sequences with a signal peptide for secretion are identified for high-throughput in situ hybridization to confirm gland-specific expression.

  7. Cloning and characterization of a cDNA encoding topoisomerase II in pea and analysis of its expression in relation to cell proliferation.

    Science.gov (United States)

    Reddy, M K; Nair, S; Tewari, K K; Mudgil, Y; Yadav, B S; Sopory, S K

    1999-09-01

    We have isolated and sequenced four overlapping cDNA clones to identify the full-length cDNA for topoisomerase II (PsTopII) from pea. Using degenerate primers, based on the conserved amino acid sequences of other eukaryotic type II topoisomerases, a 680 bp fragment was PCR-amplified with pea cDNA as template. This fragment was used as a probe to screen an oligo-dT-primed pea cDNA library. A partial cDNA clone was isolated that was truncated at the 3' end. RACE-PCR was employed to isolate the remaining portion of the gene. The total size of PsTopII is 4639 bp with an open reading frame of 4392 bp. The deduced amino acid sequence shows strong homology to other eukaryotic topoisomerase II (topo II) proteins at the N-terminal end. The topo II transcript was abundant in proliferative tissues. We also show that the level of topo II transcripts could be stimulated by exogenous application of growth factors that induced proliferation in in vitro cultures. Light irradiation of etiolated tissue strongly stimulated the expression of topo II. These results suggest that topo II gene expression is up-regulated in response to light and hormones and correlates with cell proliferation. In addition, we have isolated and analysed the 5'-flanking region of the pea TopII gene. This is the first report on the isolation of a putative promoter for topoisomerase II from plants.

  8. cDNA sequence and tissue distribution of the mRNA for bovine and murine p11, the S100-related light chain of the protein-tyrosine kinase substrate p36 (calpactin I)

    DEFF Research Database (Denmark)

    Saris, Chris J M; Kristensen, Torsten; D’Eustachio, Peter

    1987-01-01

    We have isolated and sequenced cDNA clones of bovine and murine p11 mRNAs. The nonpolyadenylated mRNAs are predicted to be 614 and 600 nucleotides, respectively. The p11 mRNAs both contain a 291-nucleotide open reading frame, preceded by a 5'-untranslated region of 73 nucleotides in bovine p11 m...

  9. Molecular characterization of a Leishmania donovani cDNA clone with similarity to human 20S proteasome a-type subunit

    DEFF Research Database (Denmark)

    Christensen, C B; Jørgensen, L; Jensen, A T

    2000-01-01

    Using plasma from patients infected or previously infected with Leishmania donovani, we isolated an L. donovani cDNA clone with similarity to the proteasome a-type subunit from humans and other eukaryotes. The cDNA clone, designated LePa, was DNA sequenced and Northern blot analysis of L....... donovani poly(A)+ mRNA indicated the isolation of a full length cDNA clone with a transcript size of 1.9 kb. The expressed recombinant LePa fusion protein induced proliferation of peripheral blood mononuclear cells in one out of seven patients who had suffered from visceral leishmaniasis. Plasma from 16...

  10. Isolation, cDNA cloning and gene expression of an antibacterial protein from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros.

    Science.gov (United States)

    Yang, J; Yamamoto, M; Ishibashi, J; Taniai, K; Yamakawa, M

    1998-08-01

    An antibacterial protein, designated rhinocerosin, was purified to homogeneity from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros, immunized with Escherichia coli. Based on the amino acid sequence of the N-terminal region, a degenerate primer was synthesized and reverse-transcriptase PCR was performed to clone rhinocerosin cDNA. As a result, a 279-bp fragment was obtained. The complete nucleotide sequence was determined by sequencing the extended rhinocerosin cDNA clone obtained by 5' rapid amplification of cDNA ends. The deduced amino acid sequence of the mature portion of rhinocerosin was composed of 72 amino acids without cysteine residues and was shown to be rich in glycine (11.1%) and proline (11.1%) residues. Comparison of the deduced amino acid sequence of rhinocerosin with those of other antibacterial proteins indicated that it has 77.8% and 44.6% identity with holotricin 2 and coleoptericin, respectively. Rhinocerosin had strong antibacterial activity against E. coli, Streptococcus pyogenes, and Staphylococcus aureus but not against Pseudomonas aeruginosa. Results of reverse-transcriptase PCR analysis of gene expression in different tissues indicated that the rhinocerosin gene is strongly expressed in the fat body and the Malpighian tubule, and weakly expressed in hemocytes and midgut. In addition, gene expression was inducible by bacteria in the fat body, the Malpighian tubule and hemocytes, but constitutive expression was observed in the midgut.

  11. Workup of Human Blood Samples for Deep Sequencing of HIV-1 Genomes

    NARCIS (Netherlands)

    Cornelissen, Marion; Gall, Astrid; van der Kuyl, Antoinette; Wymant, Chris; Blanquart, François; Fraser, Christophe; Berkhout, Ben

    2018-01-01

    We describe a detailed protocol for the manual workup of blood (plasma/serum) samples from individuals infected with the human immunodeficiency virus type 1 (HIV-1) for deep sequence analysis of the viral genome. The study optimizing the assay was performed in the context of the BEEHIVE (Bridging

  12. Isolation and characterization of cDNA clones for carrot extensin and a proline-rich 33-kDa protein

    International Nuclear Information System (INIS)

    Chen, J.; Varner, J.E.

    1985-01-01

    Extensins are hydroxyproline-rich glycoproteins associated with most dicotyledonous plant cell walls. To isolate cDNA clones encoding extensin, the authors started by isolating poly(A)+ RNA from carrot root tissue, and then translating the RNA in vitro in the presence of tritiated leucine or proline. A 33-kDa peptide was identified in the translation products as a putative extensin precursor. From a cDNA library constructed with poly(A)+ RNA from wounded carrots, one cDNA clone (pDC5) was identified that specifically hybridized to poly(A)+ RNA encoding this 33-kDa peptide. They isolated three cDNA clones (pDC11, pDC12, and pDC16) from another cDNA library using pDC5 as a probe. DNA sequence data, RNA hybridization analysis, and hybrid-released in vitro translation indicate that the cDNA clone pDC11 encodes extensin and that cDNA clones pDC12 and pDC16 encode the 33-kDa peptide, which as yet has an unknown identity and function. The assumption that the 33-kDa peptide was an extensin precursor was invalid. RNA hybridization analysis showed that RNA encoded by both clone types is accumulated upon wounding.

  13. A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology.

    Science.gov (United States)

    Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai

    2017-11-23

    The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
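
    The abstract mentions statistical criteria for a threshold of reliable mutation detection but does not state them. As a generic illustration only, and explicitly not the VirMut criteria, the sketch below assumes a simple binomial model in which each read shows a given mismatch by sequencing error with probability error_rate, and finds the smallest read count that such errors alone would rarely reach. All names and parameter values are hypothetical.

```python
def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p), with 0 <= p < 1.

    Terms are built iteratively from P(X = 0) to avoid large binomial
    coefficients.
    """
    term = (1.0 - p) ** n          # P(X = 0)
    below = 0.0                    # accumulates P(X < k)
    for i in range(k):
        below += term
        term *= (n - i) / (i + 1) * p / (1.0 - p)   # P(X = i + 1)
    return max(0.0, 1.0 - below)


def min_reliable_count(depth, error_rate, alpha):
    """Smallest variant read count k with P(X >= k) < alpha under errors alone.

    Returns None if no count up to depth satisfies the criterion.
    """
    for k in range(depth + 1):
        if binomial_tail(depth, k, error_rate) < alpha:
            return k
    return None


# Hypothetical settings: 1,000 reads at a site, 1% per-read error, alpha = 0.001.
threshold = min_reliable_count(depth=1000, error_rate=0.01, alpha=1e-3)
print(threshold, threshold / 1000)
```

    A variant seen in fewer reads than this threshold would not be distinguishable from sequencing error under the assumed model. Real pipelines typically also account for base quality, strand bias, and multiple testing across sites.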

  14. Efficient forward propagation of time-sequences in convolutional neural networks using Deep Shifting

    NARCIS (Netherlands)

    K.L. Groenland (Koen); S.M. Bohte (Sander)

    2016-01-01

    When a Convolutional Neural Network is used for on-the-fly evaluation of continuously updating time-sequences, many redundant convolution operations are performed. We propose the method of Deep Shifting, which remembers previously calculated results of convolution operations in order
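
    The abstract is truncated, so the details of Deep Shifting are not available here. The sketch below illustrates only the general idea it names, reusing previously computed convolution outputs when a time window slides forward by one sample, for a single 1D layer. The class and function names are hypothetical and this is not the authors' implementation.

```python
from collections import deque


def full_conv(kernel, seq):
    """Valid-mode sliding dot product (cross-correlation, as in CNN layers)."""
    k = len(kernel)
    return [sum(w * x for w, x in zip(kernel, seq[i:i + k]))
            for i in range(len(seq) - k + 1)]


class StreamingConv1D:
    """Keeps the outputs for the most recent window and adds one per new sample."""

    def __init__(self, kernel, window):
        if window < len(kernel):
            raise ValueError("window must be at least as long as the kernel")
        self.kernel = list(kernel)
        # Only the last len(kernel) inputs are needed for the newest output.
        self.inputs = deque(maxlen=len(kernel))
        # Cached outputs; the oldest one is shifted out automatically.
        self.outputs = deque(maxlen=window - len(kernel) + 1)

    def push(self, sample):
        """Add one sample, compute only the newest output, return cached outputs."""
        self.inputs.append(sample)
        if len(self.inputs) == len(self.kernel):
            self.outputs.append(
                sum(w * x for w, x in zip(self.kernel, self.inputs)))
        return list(self.outputs)


# Compare against recomputing the whole window at every step.
kernel, window = [1, 0, -1], 5
stream = [3, 1, 4, 1, 5, 9, 2, 6]
layer = StreamingConv1D(kernel, window)
for t, sample in enumerate(stream):
    cached = layer.push(sample)
    if t + 1 >= window:
        assert cached == full_conv(kernel, stream[t + 1 - window:t + 1])
```

    Each push costs one kernel-length dot product instead of recomputing every position in the window, which is the kind of redundancy the abstract describes.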

  15. Construction and characterization of a full-length cDNA library for the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici)

    Directory of Open Access Journals (Sweden)

    Chen Xianming

    2007-06-01

    Full Text Available. Abstract. Background: Puccinia striiformis is a plant pathogenic fungus causing stripe rust, one of the most important diseases of cereal crops and grasses worldwide. However, little is known about its genome and the genes involved in the biology and pathogenicity of the pathogen. We initiated functional genomic research on the fungus by constructing a full-length cDNA library and determining functions of the first group of genes by sequence comparison of cDNA clones to genes reported in other fungi. Results: A full-length cDNA library, consisting of 42,240 clones with an average cDNA insert of 1.9 kb, was constructed using urediniospores of race PST-78 of P. striiformis f. sp. tritici. From 196 sequenced cDNA clones, we determined functions of 73 clones (37.2%). In addition, 36 clones (18.4%) had significant homology to hypothetical proteins, 37 clones (18.9%) had some homology to genes in other fungi, and the remaining 50 clones (25.5%) did not produce any hits. From the 73 clones with functions, we identified 51 different genes encoding protein products that are involved in amino acid metabolism, cell defense, cell cycle, cell signaling, cell structure and growth, energy cycle, lipid and nucleotide metabolism, protein modification, ribosomal protein complex, sugar metabolism, transcription factor, transport metabolism, and virulence/infection. Conclusion: The full-length cDNA library is useful in identifying functional genes of P. striiformis.
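
    The clone counts and percentages in this abstract can be checked against each other with a short tally. The category labels below are paraphrases chosen for illustration; the counts are those quoted in the abstract.

```python
def percent_breakdown(counts):
    """Return each category's share of the total, as a percentage to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


# Counts quoted in the abstract (196 sequenced cDNA clones in total).
clone_counts = {
    "assigned function": 73,
    "homology to hypothetical proteins": 36,
    "some homology to genes in other fungi": 37,
    "no hits": 50,
}

print(sum(clone_counts.values()))        # 196
print(percent_breakdown(clone_counts))   # 37.2, 18.4, 18.9 and 25.5 percent
```

    The four categories sum to the 196 sequenced clones, and the rounded shares reproduce the percentages given in the abstract.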

  16. Cloning and expression of a cDNA coding for the human platelet-derived growth factor receptor: Evidence for more than one receptor class

    International Nuclear Information System (INIS)

    Gronwald, R.G.K.; Grant, F.J.; Haldeman, B.A.; Hart, C.E.; O'Hara, P.J.; Hagen, F.S.; Ross, R.; Bowen-Pope, D.F.; Murray, M.J.

    1988-01-01

    The complete nucleotide sequence of a cDNA encoding the human platelet-derived growth factor (PDGF) receptor is presented. The cDNA contains an open reading frame that codes for a protein of 1106 amino acids. Comparison to the mouse PDGF receptor reveals an overall amino acid sequence identity of 86%. This sequence identity rises to 98% in the cytoplasmic split tyrosine kinase domain. RNA blot hybridization analysis of poly(A)+ RNA from human dermal fibroblasts detects a major and a minor transcript using the cDNA as a probe. Baby hamster kidney cells, transfected with an expression vector containing the receptor cDNA, express an ∼190-kDa cell surface protein that is recognized by an anti-human PDGF receptor antibody. The recombinant PDGF receptor is functional in the transfected baby hamster kidney cells as demonstrated by ligand-induced phosphorylation of the receptor. Binding properties of the recombinant PDGF receptor were also assessed with pure preparations of BB and AB isoforms of PDGF. Unlike human dermal fibroblasts, which bind both isoforms with high affinity, the transfected baby hamster kidney cells bind only the BB isoform of PDGF with high affinity. This observation is consistent with the existence of more than one PDGF receptor class.

  17. Prognostic value of deep sequencing method for minimal residual disease detection in multiple myeloma

    Science.gov (United States)

    Lahuerta, Juan J.; Pepin, François; González, Marcos; Barrio, Santiago; Ayala, Rosa; Puig, Noemí; Montalban, María A.; Paiva, Bruno; Weng, Li; Jiménez, Cristina; Sopena, María; Moorhead, Martin; Cedena, Teresa; Rapado, Immaculada; Mateos, María Victoria; Rosiñol, Laura; Oriol, Albert; Blanchard, María J.; Martínez, Rafael; Bladé, Joan; San Miguel, Jesús; Faham, Malek; García-Sanz, Ramón

    2014-01-01

    We assessed the prognostic value of minimal residual disease (MRD) detection in multiple myeloma (MM) patients using a sequencing-based platform in bone marrow samples from 133 MM patients in at least very good partial response (VGPR) after front-line therapy. Deep sequencing was carried out in patients in whom a high-frequency myeloma clone was identified and MRD was assessed using the IGH-VDJH, IGH-DJH, and IGK assays. The results were contrasted with those of multiparametric flow cytometry (MFC) and allele-specific oligonucleotide polymerase chain reaction (ASO-PCR). The applicability of deep sequencing was 91%. Concordance between sequencing and MFC and ASO-PCR was 83% and 85%, respectively. Patients who were MRD– by sequencing had a significantly longer time to tumor progression (TTP) (median 80 vs 31 months; P < .0001) and overall survival (median not reached vs 81 months; P = .02), compared with patients who were MRD+. When stratifying patients by different levels of MRD, the respective TTP medians were: MRD ≥10⁻³, 27 months; MRD 10⁻³ to 10⁻⁵, 48 months; and MRD <10⁻⁵, 80 months (P = .003 to .0001). Ninety-two percent of VGPR patients were MRD+. In complete response patients, the TTP remained significantly longer for MRD– compared with MRD+ patients (131 vs 35 months; P = .0009). PMID:24646471

  18. Analysis of xylem formation in pine by cDNA sequencing

    Science.gov (United States)

    Allona, I.; Quinn, M.; Shoop, E.; Swope, K.; St Cyr, S.; Carlis, J.; Riedl, J.; Retzel, E.; Campbell, M. M.; Sederoff, R.

    1998-01-01

    Secondary xylem (wood) formation is likely to involve some genes expressed rarely or not at all in herbaceous plants. Moreover, environmental and developmental stimuli influence secondary xylem differentiation, producing morphological and chemical changes in wood. To increase our understanding of xylem formation, and to provide material for comparative analysis of gymnosperm and angiosperm sequences, ESTs were obtained from immature xylem of loblolly pine (Pinus taeda L.). A total of 1,097 single-pass sequences were obtained from 5' ends of cDNAs made from gravistimulated tissue from bent trees. Cluster analysis detected 107 groups of similar sequences, ranging in size from 2 to 20 sequences. A total of 361 sequences fell into these groups, whereas 736 sequences were unique. About 55% of the pine EST sequences show similarity to previously described sequences in public databases. About 10% of the recognized genes encode factors involved in cell wall formation. Sequences similar to cell wall proteins, most known lignin biosynthetic enzymes, and several enzymes of carbohydrate metabolism were found. A number of putative regulatory proteins also are represented. Expression patterns of several of these genes were studied in various tissues and organs of pine. Sequencing novel genes expressed during xylem formation will provide a powerful means of identifying mechanisms controlling this important differentiation pathway.

  19. Construction of a cDNA library and preliminary analysis of expressed sequence tags in Piper hainanense.

    Science.gov (United States)

    Fan, R; Ling, P; Hao, C Y; Li, F P; Huang, L F; Wu, B D; Wu, H S

    2015-10-19

    Black pepper is a perennial climbing vine. It is widely cultivated because its berries can be used not only as a spice in food but also for medicinal purposes. This study aimed to construct a standardized, high-quality cDNA library to facilitate identification of new Piper hainanense transcripts. For this, 262 unigenes were used to generate raw reads. The average length of these 262 unigenes was 774.8 bp. Of these, 94 genes (35.9%) were newly identified, according to the NCBI protein database. Thus, identification of new genes may broaden the molecular knowledge of P. hainanense on the basis of Clusters of Orthologous Groups and Gene Ontology categories. In addition, certain basic genes linked to physiological processes were identified, which can contribute to disease resistance and thereby to the breeding of black pepper. A total of 26 unigenes were found to contain SSR markers. Dinucleotide SSRs were the main repeat motif, accounting for 61.54%, followed by trinucleotide SSRs (23.07%). Eight primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among twenty-one Piper germplasms. These results present novel sequence information for P. hainanense, which can serve as the foundation for further genetic research on this species.
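
    The abstract reports counts of dinucleotide and trinucleotide SSRs but not how they were detected. As a minimal sketch of one common approach, the function below scans a sequence for perfect tandem repeats with a regular expression. The minimum repeat numbers (6 for dinucleotides, 5 for trinucleotides), the function name, and the test sequence are assumptions for illustration, not the study's settings or data.

```python
import re


def find_ssrs(seq, min_repeats=None):
    """Find perfect tandem repeats in a DNA sequence.

    min_repeats maps unit length to the minimum number of repeats; the
    defaults are illustrative thresholds only.
    Returns sorted (start, unit, repeat_count) tuples with 0-based starts.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in min_repeats.items():
        # One unit followed by at least (min_rep - 1) more copies of itself.
        pattern = re.compile(
            r"(([ACGT]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            unit = match.group(2)
            if len(set(unit)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((match.start(), unit, len(match.group(1)) // unit_len))
    return sorted(hits)


# Made-up sequence: an (AG)7 repeat, a (GAT)5 repeat, and a poly(A) run.
toy = "TTC" + "AG" * 7 + "CCC" + "GAT" * 5 + "A" * 12
print(find_ssrs(toy))
```

    Dedicated SSR tools additionally handle compound and imperfect repeats, which this sketch ignores.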

  20. A Full-Length Infectious cDNA Clone of Zika Virus from the 2015 Epidemic in Brazil as a Genetic Platform for Studies of Virus-Host Interactions and Vaccine Development

    Directory of Open Access Journals (Sweden)

    Konstantin A. Tsetsarkin

    2016-08-01

    Full Text Available An arthropod-borne virus, Zika virus (ZIKV), has recently emerged as a major human pathogen. Associated with complications during perinatal development and Guillain-Barré syndrome in adults, ZIKV raises new challenges for understanding the molecular determinants of flavivirus pathogenesis. This underscores the necessity for the development of a reverse genetic system based on an epidemic ZIKV strain. Here, we describe the generation and characterization in cell cultures of an infectious cDNA clone of ZIKV isolated from the 2015 epidemic in Brazil. The cDNA-derived ZIKV replicated efficiently in a variety of cell lines, including those of both neuronal and placental origin. We observed that the growth of cDNA-derived virus was attenuated compared to the growth of the parental isolate in most cell lines, which correlates with substantial differences in sequence heterogeneity between these viruses that were determined by deep-sequencing analysis. Our findings support the role of genetic diversity in maintaining the replicative fitness of viral populations under changing conditions. Moreover, these results indicate that caution should be exercised when interpreting the results of reverse-genetics experiments in attempts to accurately predict the biology of natural viruses. Finally, a Vero cell-adapted cDNA clone of ZIKV was generated that can be used as a convenient platform for studies aimed at the development of ZIKV vaccines and therapeutics.

  1. Isolation and characterisation of the cDNA encoding a glycosylated accessory protein of pea chloroplast DNA polymerase.

    OpenAIRE

    Gaikwad, A; Tewari, K K; Kumar, D; Chen, W; Mukherjee, S K

    1999-01-01

    The cDNA encoding p43, a DNA binding protein from pea chloroplasts (ct) that binds to cognate DNA polymerase and stimulates the polymerase activity, has been cloned and characterised. The characteristic sequence motifs of hydroxyproline-rich glycoproteins (HRGP) are present in the cDNA corresponding to the N-terminal domain of the mature p43. The protein was found to be highly O-arabinosylated. Chemically deglycosylated p43 (i.e. p29) retains its binding to both DNA and pea ct-DNA polymeras...

  2. A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology

    Directory of Open Access Journals (Sweden)

    Yuri Kravatsky

    2017-11-01

    The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.

  3. Sequence recombination and conservation of Varroa destructor virus-1 and deformed wing virus in field collected honey bees (Apis mellifera).

    Directory of Open Access Journals (Sweden)

    Hui Wang

    We sequenced small (s) RNAs from field collected honeybees (Apis mellifera) and bumblebees (Bombus pascuorum) using the Illumina technology. The sRNA reads were assembled and resulting contigs were used to search for virus homologues in GenBank. Matches with Varroa destructor virus-1 (VDV1) and Deformed wing virus (DWV) genomic sequences were obtained for A. mellifera but not B. pascuorum. Further analyses suggested that the prevalent virus population was composed of VDV-1 and a chimera of 5'-DWV-VDV1-DWV-3'. The recombination junctions in the chimera genomes were confirmed by using RT-PCR, cDNA cloning and Sanger sequencing. We then focused on conserved short fragments (CSF, size > 25 nt) in the virus genomes by using GenBank sequences and the deep sequencing data obtained in this study. The majority of CSF sites confirmed conservation at both between-species (GenBank sequences) and within-population (dataset of this study) levels. However, conserved nucleotide positions in the GenBank sequences might be variable at the within-population level. High mutation rates (Pi > 10%) were observed at a number of sites using the deep sequencing data, suggesting that sequence conservation might not always be maintained at the population level. Virus-host interactions and strategies for developing RNAi treatments against VDV1/DWV infections are discussed.

  4. Identification of a cDNA encoding a parathyroid hormone-like peptide from a human tumor associated with humoral hypercalcemia of malignancy

    International Nuclear Information System (INIS)

    Mangin, M.; Webb, A.C.; Dreyer, B.E.

    1988-01-01

    Humoral hypercalcemia of malignancy is a common paraneoplastic syndrome that appears to be mediated in many instances by a parathyroid hormone-like peptide. Poly(A)+ RNA from a human renal carcinoma associated with this syndrome was enriched by preparative electrophoresis and used to construct an enriched cDNA library in phage λgt10. The library was screened with a codon-preference oligonucleotide synthesized on the basis of a partial N-terminal amino acid sequence from a human tumor-derived peptide, and a 2.0-kilobase cDNA was identified. The cDNA encodes a 177 amino acid protein consisting of a 36 amino acid leader sequence and a 141 amino acid mature peptide. The first 13 amino acids of the deduced sequence of the mature peptide display strong homology to human PTH, with complete divergence thereafter. RNA blot-hybridization analysis revealed multiple transcripts in mRNA from tumors associated with the humoral syndrome and also in mRNA from normal human keratinocytes. Southern blot analysis of genomic DNA from humans and rodents revealed a simple pattern compatible with a single-copy gene. The gene has been mapped to chromosome 12.

  5. Radioactive cDNA microarray in neuropsychiatry

    International Nuclear Information System (INIS)

    Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon

    2003-01-01

    Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high quality rat, mouse, and Drosophila clones, among others, are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is the cost of commercial arrays. As academic microarray facilities become more commonplace, custom made arrays will become more widely available at a lower cost, allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most

  6. Radioactive cDNA microarray in neuropsychiatry

    Energy Technology Data Exchange (ETDEWEB)

    Choe, Jae Gol; Shin, Kyung Ho; Lee, Min Soo; Kim, Meyoung Kon [Korea University Medical School, Seoul (Korea, Republic of)

    2003-02-01

    Microarray technology allows the simultaneous analysis of gene expression patterns of thousands of genes, in a systematic fashion, under a similar set of experimental conditions, thus making the data highly comparable. In some cases arrays are used simply as a primary screen leading to downstream molecular characterization of individual gene candidates. In other cases, the goal of expression profiling is to begin to identify complex regulatory networks underlying developmental processes and disease states. Microarrays were originally used with cell lines or other simple model systems. More recently, microarrays have been used in the analysis of more complex biological tissues including neural systems and the brain. The application of cDNA arrays in neuropsychiatry has lagged behind other fields for a number of reasons. These include a requirement for a large amount of input probe RNA in fluorescent-glass based array systems and the cellular complexity introduced by multicellular brain and neural tissues. An additional factor that impacts the general use of microarrays in neuropsychiatry is the lack of availability of sequenced clone sets from model systems. While human cDNA clones have been widely available, high quality rat, mouse, and Drosophila clones, among others, are just becoming widely available. A final factor in the application of cDNA microarrays in neuropsychiatry is the cost of commercial arrays. As academic microarray facilities become more commonplace, custom made arrays will become more widely available at a lower cost, allowing more widespread applications. In summary, microarray technology is rapidly having an impact on many areas of biomedical research. Radioisotope-nylon based microarrays offer alternatives that may in some cases be more sensitive, flexible, inexpensive, and universal as compared to other array formats, such as fluorescent-glass arrays. In some situations of limited RNA or exotic species, radioactive membrane microarrays may be the most

  7. A new approach for cloning hLIF cDNA from genomic DNA isolated from the oral mucous membrane.

    Science.gov (United States)

    Cui, Y H; Zhu, G Q; Chen, Q J; Wang, Y F; Yang, M M; Song, Y X; Wang, J G; Cao, B Y

    2011-11-25

    Complementary DNA (cDNA) is valuable for investigating protein structure and function in the study of life science, but it is difficult to obtain by traditional reverse transcription. We employed a novel strategy to clone human leukemia inhibitory factor (hLIF) gene cDNA from genomic DNA, which was directly isolated from the mucous membrane of the mouth. The hLIF sequence, which is 609 bp long and is composed of three exons, can be acquired within a few hours by amplifying each exon and splicing all of them using overlap-PCR. This new approach is simple, time- and cost-effective, requires no RNA preparation or cDNA synthesis, and is not limited by the tissue specificity or the expression level of a particular gene.
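
    The exon-splicing idea in this abstract can be sketched at the string level. This is only a model of overlap joining, not a PCR simulation and not the authors' protocol: the function names, the 8-base overlap and the three toy exons are hypothetical, and the real hLIF exon sequences are not given in the record. Each amplified fragment is assumed to carry a short tail copied from the start of the next exon, so adjacent fragments can be merged where their ends match.

```python
def overlap_join(left, right, min_overlap=8):
    """Merge two fragments at the longest suffix/prefix overlap they share."""
    max_len = min(len(left), len(right))
    for k in range(max_len, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("fragments share no overlap of at least %d bases" % min_overlap)

def splice_exons(fragments, min_overlap=8):
    """Join fragments left to right into one contiguous coding sequence."""
    spliced = fragments[0]
    for fragment in fragments[1:]:
        spliced = overlap_join(spliced, fragment, min_overlap)
    return spliced

# Hypothetical toy exons (not hLIF); introns are simply absent from the fragments.
exon1 = "ATGAAGGTCTTGGCC"
exon2 = "GGAGTTGTGCCCCTG"
exon3 = "AACGTTCATCCGTACTAG"

# Each fragment ends with the first 8 bases of the following exon.
fragments = [exon1 + exon2[:8], exon2 + exon3[:8], exon3]
cds = splice_exons(fragments)
print(cds, len(cds))
```

    In the laboratory the overlapping tails come from the primer design; here they are built directly into the fragment strings to keep the sketch short.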

  8. Technology development for gene discovery and full-length sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Marcelo Bento Soares

    2004-07-19

    In previous years, with support from the U.S. Department of Energy, we developed methods for construction of normalized and subtracted cDNA libraries, and constructed hundreds of high-quality libraries for production of Expressed Sequence Tags (ESTs). Our clones were made widely available to the scientific community through the IMAGE Consortium, and millions of ESTs were produced from our libraries either by collaborators or by our own sequencing laboratory at the University of Iowa. During this grant period, we focused on (1) the development of a method for preferential cloning of tissue-specific and/or rare transcripts, (2) its utilization to expedite EST-based gene discovery for the NIH Mouse Brain Molecular Anatomy Project, (3) further development and optimization of a method for construction of full-length-enriched cDNA libraries, and (4) modification of a plasmid vector to maximize efficiency of full-length cDNA sequencing by the transposon-mediated approach. It is noteworthy that the technology developed for preferential cloning of rare mRNAs enabled identification of over 2,000 mouse transcripts differentially expressed in the hippocampus. In addition, the method that we optimized for construction of full-length-enriched cDNA libraries was successfully utilized for the production of approximately fifty libraries from the developing mouse nervous system, from which over 2,500 full-ORF-containing cDNAs have been identified and accurately sequenced in their entirety either by our group or by the NIH-Mammalian Gene Collection Program Sequencing Team.

  9. Deep sequencing analysis of the developing mouse brain reveals a novel microRNA

    Directory of Open Access Journals (Sweden)

    Piltz Sandra

    2011-04-01

    Background: MicroRNAs (miRNAs) are small non-coding RNAs that can exert multilevel inhibition/repression at a post-transcriptional or protein synthesis level during disease or development. Characterisation of miRNAs in adult mammalian brains by deep sequencing has been reported previously. However, to date, no small RNA profiling of the developing brain has been undertaken using this method. We have performed deep sequencing and small RNA analysis of a developing (E15.5) mouse brain. Results: We identified the expression of 294 known miRNAs in the E15.5 developing mouse brain, which were mostly represented by let-7 family and other brain-specific miRNAs such as miR-9 and miR-124. We also discovered 4 putative 22-23 nt miRNAs: mm_br_e15_1181, mm_br_e15_279920, mm_br_e15_96719 and mm_br_e15_294354, each with a 70-76 nt predicted pre-miRNA. We validated the 4 putative miRNAs and further characterised one of them, mm_br_e15_1181, throughout embryogenesis. Mm_br_e15_1181 biogenesis was Dicer1-dependent and was expressed in E3.5 blastocysts and E7 whole embryos. Embryo-wide expression patterns were observed at E9.5 and E11.5 followed by a near complete loss of expression by E13.5, with expression restricted to a specialised layer of cells within the developing and early postnatal brain. Mm_br_e15_1181 was upregulated during neurodifferentiation of P19 teratocarcinoma cells. This novel miRNA has been identified as miR-3099. Conclusions: We have generated and analysed the first deep sequencing dataset of small RNA sequences of the developing mouse brain. The analysis revealed a novel miRNA, miR-3099, with potential regulatory effects on early embryogenesis, and involvement in neuronal cell differentiation/function in the brain during late embryonic and early neonatal development.

  10. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification.

    Science.gov (United States)

    Yildirim, Özal

    2018-05-01

    Long short-term memory networks (LSTMs), which have recently emerged in sequential data analysis, are the most widely used type of recurrent neural network (RNN) architecture. Progress on the topic of deep learning includes successful adaptations of deep versions of these architectures. In this study, a new model for deep bidirectional LSTM network-based wavelet sequences called DBLSTM-WS was proposed for classifying electrocardiogram (ECG) signals. For this purpose, a new wavelet-based layer is implemented to generate ECG signal sequences. The ECG signals were decomposed into frequency sub-bands at different scales in this layer. These sub-bands are used as sequences for the input of LSTM networks. New network models that include unidirectional (ULSTM) and bidirectional (BLSTM) structures are designed for performance comparisons. Experimental studies have been performed for five different types of heartbeats obtained from the MIT-BIH arrhythmia database. These five types are Normal Sinus Rhythm (NSR), Ventricular Premature Contraction (VPC), Paced Beat (PB), Left Bundle Branch Block (LBBB), and Right Bundle Branch Block (RBBB). The results show that the DBLSTM-WS model gives a high recognition performance of 99.39%. It has been observed that the wavelet-based layer proposed in the study significantly improves the recognition performance of conventional networks. This proposed network structure is an important approach that can be applied to similar signal processing problems.
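
    The decomposition of a signal into frequency sub-bands at different scales, as described in this abstract, can be illustrated with a multi-level discrete wavelet transform. The sketch below uses the Haar wavelet purely as an assumption, since the record does not state which wavelet family the DBLSTM-WS layer uses; the function names and the toy signal are likewise hypothetical.

```python
import math

def haar_step(signal):
    """One level of the Haar DWT: returns (approximation, detail)."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    root2 = math.sqrt(2.0)
    # Scaled pairwise sums keep the low-frequency content, differences the high.
    approx = [(signal[i] + signal[i + 1]) / root2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / root2 for i in range(0, len(signal), 2)]
    return approx, detail

def haar_subbands(signal, levels):
    """Return [detail_1, ..., detail_levels, approx_levels], finest scale first."""
    bands = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        bands.append(detail)
    bands.append(approx)
    return bands

# Hypothetical toy samples standing in for a short ECG segment.
toy_signal = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
for band in haar_subbands(toy_signal, levels=2):
    print([round(value, 3) for value in band])
```

    Each returned list is one sub-band; in a setup like the one described, such sequences would be passed on as inputs to the recurrent layers.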

  11. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing

    OpenAIRE

    Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O'Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis

    2012-01-01

    Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance. Here we describe methods for the large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short-term culture. Analysis of 86,158 exonic single nucleotide polymorphisms that passed genotyping quality c...

  12. Cloning of the cDNA and gene for a human D2 dopamine receptor

    International Nuclear Information System (INIS)

    Grady, D.K.; Makam, H.; Stofko, R.E.; Bunzow, J.R.; Civelli, O.; Marchionni, M.A.; Alfano, M.; Frothingham, L.; Fischer, J.B.; Burke-Howie, K.J.; Server, A.C.

    1989-01-01

    A clone encoding a human D2 dopamine receptor was isolated from a pituitary cDNA library and sequenced. The deduced protein sequence is 96% identical with that of the cloned rat receptor with one major difference: the human receptor contains an additional 29 amino acids in its putative third cytoplasmic loop. Southern blotting demonstrated the presence of only one human D2 receptor gene. Two overlapping phage containing the gene were isolated and characterized. DNA sequence analysis of these clones showed that the coding sequence is interrupted by six introns and that the additional amino acids present in the human pituitary receptor are encoded by a single exon of 87 base pairs. The involvement of this sequence in alternative splicing and its biological significance are discussed.

  13. Sequence of interleukin-2 isolated from human placental poly A+ RNA: possible role in maintenance of fetal allograft.

    Science.gov (United States)

    Chernicky, C L; Tan, H; Burfeind, P; Ilan, J; Ilan, J

    1996-02-01

    There are several cell types within the placenta that produce cytokines which can contribute to the regulatory mechanisms that ensure normal pregnancy. The immunological milieu at the maternofetal interface is considered to be crucial for survival of the fetus. Interleukin-2 (IL-2) is expressed by the syncytiotrophoblast, the cell layer between the mother and the fetus. IL-2 appears to be a key factor in maintenance of pregnancy. Therefore, it was important to determine the sequence of human placental interleukin-2. Direct sequencing of human placental IL-2 cDNA was determined for the coding region. Subclone sequencing was carried out for the 5'- and 3'-untranslated regions (5'-UTR and 3'-UTR). The 5'-UTR for human placental IL-2 cDNA is 294 bp, which is 247 nucleotides longer than that reported for cDNA IL-2 derived from T cells. The sequence of the coding region is identical to that reported for T cell IL-2, while sequence analysis of the polymerase chain reaction (PCR) product showed that the cDNA from the 3' end was the same as that reported for cDNA from T cells. Human placental IL-2 cDNA is 1,028 base pairs (excluding the poly A tail), which is 247 bp longer at the 5' end than that reported for IL-2 T cell cDNA. Therefore, the extended 5'-UTR of the placental IL-2 cDNA may be a consequence of alternative promoter utilization in the placenta.

  14. Exploring the Mechanisms of Gastrointestinal Cancer Development Using Deep Sequencing Analysis

    International Nuclear Information System (INIS)

    Matsumoto, Tomonori; Shimizu, Takahiro; Takai, Atsushi; Marusawa, Hiroyuki

    2015-01-01

    Next-generation sequencing (NGS) technologies have revolutionized cancer genomics due to their high throughput sequencing capacity. Reports of the gene mutation profiles of various cancers by many researchers, including international cancer genome research consortia, have increased over recent years. In addition to detecting somatic mutations in tumor cells, NGS technologies enable us to approach the subject of carcinogenic mechanisms from new perspectives. Deep sequencing, a method of optimizing the high throughput capacity of NGS technologies, allows for the detection of genetic aberrations in small subsets of premalignant and/or tumor cells in noncancerous chronically inflamed tissues. Genome-wide NGS data also make it possible to clarify the mutational signatures of each cancer tissue by identifying the precise pattern of nucleotide alterations in the cancer genome, providing new information regarding the mechanisms of tumorigenesis. In this review, we highlight these new methods taking advantage of NGS technologies, and discuss our current understanding of carcinogenic mechanisms elucidated from such approaches.

  15. Exploring the Mechanisms of Gastrointestinal Cancer Development Using Deep Sequencing Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Matsumoto, Tomonori; Shimizu, Takahiro; Takai, Atsushi; Marusawa, Hiroyuki, E-mail: maru@kuhp.kyoto-u.ac.jp [Department of Gastroenterology and Hepatology, Graduate School of Medicine, Kyoto University, 54 Shogoin-Kawahara-cho, Sakyo-ku, Kyoto 606-8507 (Japan)

    2015-06-15

    Next-generation sequencing (NGS) technologies have revolutionized cancer genomics due to their high throughput sequencing capacity. Reports of the gene mutation profiles of various cancers by many researchers, including international cancer genome research consortia, have increased over recent years. In addition to detecting somatic mutations in tumor cells, NGS technologies enable us to approach the subject of carcinogenic mechanisms from new perspectives. Deep sequencing, a method of optimizing the high throughput capacity of NGS technologies, allows for the detection of genetic aberrations in small subsets of premalignant and/or tumor cells in noncancerous chronically inflamed tissues. Genome-wide NGS data also make it possible to clarify the mutational signatures of each cancer tissue by identifying the precise pattern of nucleotide alterations in the cancer genome, providing new information regarding the mechanisms of tumorigenesis. In this review, we highlight these new methods taking advantage of NGS technologies, and discuss our current understanding of carcinogenic mechanisms elucidated from such approaches.

  16. Cloning of the human androgen receptor cDNA

    International Nuclear Information System (INIS)

    Govindan, M.V.; Burelle, M.; Cantin, C.; Kabrie, C.; Labrie, F.; Lachance, Y.; Leblanc, G.; Lefebvre, C.; Patel, P.; Simard, J.

    1988-01-01

    The authors discuss how, in order to define the functional domains of the human androgen receptor, complementary DNA (cDNA) clones encoding the human androgen receptor (hAR) were isolated from a human testis λgt11 cDNA library using synthetic oligonucleotide probes homologous to segments of the human glucocorticoid, estradiol and progesterone receptors. The cDNA clones corresponding to the human glucocorticoid, estradiol and progesterone receptors were eliminated after cross-hybridization with their respective cDNA probes and/or after restriction mapping of the cDNA clones. The remaining cDNA clones were classified into different groups after analysis by restriction digestion and cross-hybridization. Two of the largest cDNA clones from each group were inserted into an expression vector in both orientations. The linearized plasmids were used as templates for in vitro transcription with T7 RNA polymerase. Subsequent in vitro translation of the purified transcripts in rabbit reticulocyte lysate followed by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) permitted the characterization of the encoded polypeptides. The expressed proteins larger than 30,000 Da were analyzed for their ability to bind tritium-labelled dihydrotestosterone ([3H]DHT) with high affinity and specificity.

  17. PCR amplification and sequences of cDNA clones for the small and large subunits of ADP-glucose pyrophosphorylase from barley tissues.

    Science.gov (United States)

    Villand, P; Aalen, R; Olsen, O A; Lüthi, E; Lönneborg, A; Kleczkowski, L A

    1992-06-01

    Several cDNAs encoding the small and large subunits of ADP-glucose pyrophosphorylase (AGP) were isolated from total RNA of the starchy endosperm, roots and leaves of barley by polymerase chain reaction (PCR). Sets of degenerate oligonucleotide primers, based on previously published conserved amino acid sequences of plant AGP, were used for synthesis and amplification of the cDNAs. For each of the endosperm, roots and leaves, restriction analysis of the PCR products (ca. 550 nucleotides each) revealed heterogeneity, suggesting the presence of three transcripts for AGP in the endosperm and roots, and up to two AGP transcripts in the leaf tissue. Based on the derived amino acid sequences, two clones from the endosperm, beps and bepl, were identified as coding for the small and large subunit of AGP, respectively, while a leaf transcript (blpl) encoded the putative large subunit of AGP. There was about 50% identity between the endosperm clones, and both of them were about 60% identical to the leaf cDNA. Northern blot analysis indicated that beps and bepl are expressed in both the endosperm and roots, while blpl is detectable only in leaves. Application of the PCR technique in studies on gene structure and gene expression of plant AGP is discussed.

  18. Cloning of cDNA sequences of a progestin-regulated mRNA from MCF7 human breast cancer cells

    Energy Technology Data Exchange (ETDEWEB)

    Chalbos, D; Westley, B; Alibert, C; Rochefort, H

    1986-01-24

    A cDNA clone corresponding to an mRNA regulated by the progestin R5020 has been isolated by differential screening of a cDNA library from the MCF7 breast cancer cell line, which contains estrogen and progesterone receptors. This probe hybridized with a single poly(A)+ RNA species of 8 kb as shown by Northern blot analysis and could also be used with total RNA preparations. This recombinant clone hybridized specifically to an mRNA coding for a 250,000-dalton protein when translated in vitro. This protein was identical to the 250-kDa progestin-regulated protein that the authors previously described, as shown by immunoprecipitation with specific rabbit polyclonal antibodies. Dose-response curve and specificity studies show that the accumulation of the Pg8 mRNA and that of the 250-kDa protein was increased by 5 to 30-fold following progestin treatment and that this effect was mediated by the progesterone receptor. Time course of induction indicated that the accumulation of mRNA was rapid and preceded that of the protein. This is the first report on a cloned cDNA probe of progestin-regulated mRNA in human cell lines.

  19. Molecular cloning and characterization of a cDNA encoding the gibberellin biosynthetic enzyme ent-kaurene synthase B from pumpkin (Cucurbita maxima L.).

    Science.gov (United States)

    Yamaguchi, S; Saito, T; Abe, H; Yamane, H; Murofushi, N; Kamiya, Y

    1996-08-01

    The first committed step in the formation of diterpenoids leading to gibberellin (GA) biosynthesis is the conversion of geranylgeranyl diphosphate (GGDP) to ent-kaurene. ent-Kaurene synthase A (KSA) catalyzes the conversion of GGDP to copalyl diphosphate (CDP), which is subsequently converted to ent-kaurene by ent-kaurene synthase B (KSB). A full-length KSB cDNA was isolated from developing cotyledons in immature seeds of pumpkin (Cucurbita maxima L.). Degenerate oligonucleotide primers were designed from the amino acid sequences obtained from the purified protein to amplify a cDNA fragment, which was used for library screening. The isolated full-length cDNA was expressed in Escherichia coli as a fusion protein, which demonstrated the KSB activity to cyclize [3H]CDP to [3H]ent-kaurene. The KSB transcript was most abundant in growing tissues, but was detected in every organ in pumpkin seedlings. The deduced amino acid sequence shares significant homology with other terpene cyclases, including the conserved DDXXD motif, a putative divalent metal ion-diphosphate complex binding site. A putative transit peptide sequence that may target the translated product into the plastids is present in the N-terminal region.

  20. Cloning, molecular characterization and expression of a cDNA encoding a functional NADH-cytochrome b5 reductase from Mucor racemosus PTCC 5305 in E. coli

    Directory of Open Access Journals (Sweden)

    NED A SETAYESH

    2009-01-01

    The present work aims to study a new NADH-cytochrome b5 reductase (cb5r) from Mucor racemosus PTCC 5305. A cDNA coding for cb5r was isolated from a Mucor racemosus PTCC 5305 cDNA library. The nucleotide sequence of the cDNA, including coding and flanking regions, was determined. The open reading frame starting from ATG and ending with a TAG stop codon encoded 228 amino acids and displayed the closest similarity (73%) with Mortierella alpina cb5r. Lack of hydrophobic residues in the N-terminal sequence was apparent, suggesting that the enzyme is a soluble isoform. The coding sequence was then cloned in the pET16b transcription vector carrying an N-terminal-linked His-Tag® sequence and expressed in Escherichia coli BL21 (DE3). The enzyme was then homogeneously purified by a metal affinity column. The recombinant Mucor enzyme was shown to have its optimal activity at pH and temperature of about 7.5 and 40 °C, respectively. The apparent Km value was calculated to be 13 μM for ferricyanide. To our knowledge, this is the first report on cloning and expression of a native fungal soluble isoform of NADH-cytochrome b5 reductase in E. coli.

  1. Deep sequencing discovery of novel and conserved microRNAs in trifoliate orange (Citrus trifoliata)

    Directory of Open Access Journals (Sweden)

    Yu Huaping

    2010-07-01

    Background: MicroRNAs (miRNAs) play a critical role in post-transcriptional gene regulation and have been shown to control many genes involved in various biological and metabolic processes. There have been extensive studies to discover miRNAs and analyze their functions in model plant species, such as Arabidopsis and rice. Deep sequencing technologies have facilitated identification of species-specific or lowly expressed as well as conserved or highly expressed miRNAs in plants. Results: In this research, we used Solexa sequencing to discover new microRNAs in trifoliate orange (Citrus trifoliata), which is an important rootstock of citrus. A total of 13,106,753 reads representing 4,876,395 distinct sequences were obtained from a short RNA library generated from small RNA extracted from C. trifoliata flower and fruit tissues. Based on sequence similarity and hairpin structure prediction, we found that 156,639 reads representing 63 sequences from 42 highly conserved miRNA families have perfect matches to known miRNAs. We also identified 10 novel miRNA candidates whose precursors were all potentially generated from citrus ESTs. In addition, five miRNA* sequences were also sequenced. These sequences had not been earlier described in other plant species, and accumulation of the 10 novel miRNAs was confirmed by qRT-PCR analysis. Potential target genes were predicted for most conserved and novel miRNAs. Moreover, four target genes, including one encoding IRX12 copper ion binding/oxidoreductase and three genes encoding NB-LRR disease resistance proteins, have been experimentally verified by detection of the miRNA-mediated mRNA cleavage in C. trifoliata. Conclusion: Deep sequencing of short RNAs from C. trifoliata flowers and fruits identified 10 new potential miRNAs and 42 highly conserved miRNA families, indicating that specific miRNAs exist in C. trifoliata. These results show that regulatory miRNAs exist in agronomically important trifoliate orange

  2. Molecular cloning and characterization of an acetylcholinesterase cDNA in the brown planthopper, Nilaparvata lugens.

    Science.gov (United States)

    Yang, Zhifan; Chen, Jun; Chen, Yongqin; Jiang, Sijing

    2010-01-01

    A full cDNA encoding an acetylcholinesterase (AChE, EC 3.1.1.7) was cloned and characterized from the brown planthopper, Nilaparvata lugens Stål (Hemiptera: Delphacidae). The complete cDNA (2467 bp) contains a 1938-bp open reading frame encoding 646 amino acid residues. The amino acid sequence of the AChE deduced from the cDNA consists of 30 residues for a putative signal peptide and 616 residues for the mature protein with a predicted molecular weight of 69,418. The three residues (Ser242, Glu371, and His485) that putatively form the catalytic triad and the six Cys that form intra-subunit disulfide bonds are completely conserved, and 10 out of the 14 aromatic residues lining the active site gorge of the AChE are also conserved. Northern blot analysis of poly(A)+ RNA showed an approximately 2.6-kb transcript, and Southern blot analysis revealed there likely was just a single copy of this gene in N. lugens. The deduced protein sequence is most similar to AChE of Nephotettix cincticeps with 83% amino acid identity. Phylogenetic analysis constructed with 45 AChEs from 30 species showed that the deduced N. lugens AChE formed a cluster with the other 8 insect AChE2s. Additionally, the hypervariable region and amino acids specific to insect AChE2 also existed in the AChE of N. lugens. The results revealed that the AChE cDNA cloned in this work belongs to the insect AChE2 subgroup, which is orthologous to Drosophila AChE. Comparison of the AChEs between the susceptible and resistant strains revealed that a point mutation, Gly185Ser, is likely responsible for the insensitivity of the AChE to methamidophos in the resistant strain.
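
    The lengths quoted in this record can be cross-checked with simple codon arithmetic. The sketch below is only an illustration: the function name `residues_from_orf` and the `includes_stop` flag are hypothetical, and it assumes the stated 1938-bp ORF length does not count the stop codon (an assumption suggested by 1938 / 3 = 646, not stated in the abstract).

```python
def residues_from_orf(orf_bp: int, includes_stop: bool = False) -> int:
    """Return the number of encoded residues for an ORF of orf_bp base pairs.

    includes_stop is a hypothetical switch: set it when the quoted ORF
    length also counts the terminating stop codon.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the abstract above.
total = residues_from_orf(1938)
signal_peptide, mature = 30, 616

print(total)                             # residues encoded by the ORF
print(signal_peptide + mature == total)  # signal peptide + mature protein
```

    Under that assumption the ORF length, the residue count, and the split into signal peptide and mature protein are mutually consistent.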

  3. Methods to determine the transcriptomes of trypanosomes in mixtures with mammalian cells: the effects of parasite purification and selective cDNA amplification.

    Directory of Open Access Journals (Sweden)

    Julius Mulindwa

    2014-04-01

    Full Text Available Patterns of gene expression in cultured Trypanosoma brucei bloodstream and procyclic forms have been extensively characterized, and some comparisons have been made with trypanosomes grown to high parasitaemias in laboratory rodents. We do not know, however, to what extent these transcriptomes resemble those in infected Tsetse flies - or in humans or cattle, where parasitaemias are substantially lower. For clinical and field samples it is difficult to characterize parasite gene expression because of the large excess of host cell RNA. We have here examined two potential solutions to this problem for bloodstream form trypanosomes, assaying transcriptomes by high throughput cDNA sequencing (RNASeq). We first purified the parasites from blood of infected rats. We found that a red blood cell lysis procedure affected the transcriptome substantially more than purification using a DEAE cellulose column, but that too introduced significant distortions and variability. As an alternative, we specifically amplified parasite sequences from a mixture containing a 1000-fold excess of human RNA. We first purified polyadenylated RNA, then made trypanosome-specific cDNA by priming with a spliced leader primer. Finally, the cDNA was amplified using nested primers. The amplification procedure was able to produce samples in which 20% of sequence reads mapped to the trypanosome genome. Synthesis of the second cDNA strand with a spliced leader primer, followed by amplification, is sufficiently reproducible to allow comparison of different samples so long as they are all treated in the same way. However, SL priming distorted the abundances of the cDNA products and definitely cannot be used, by itself, to measure absolute mRNA levels. The amplification method might be suitable for clinical samples with low parasitaemias, and could also be adapted for other Kinetoplastids and to samples from infected vectors.

  4. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data

    DEFF Research Database (Denmark)

    Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne Vibeke

    2016-01-01

    a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2...

  5. Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries.

    Science.gov (United States)

    Otsuki, Tetsuji; Ota, Toshio; Nishikawa, Tetsuo; Hayashi, Koji; Suzuki, Yutaka; Yamamoto, Jun-ichi; Wakamatsu, Ai; Kimura, Kouichi; Sakamoto, Katsuhiko; Hatano, Naoto; Kawai, Yuri; Ishii, Shizuko; Saito, Kaoru; Kojima, Shin-ichi; Sugiyama, Tomoyasu; Ono, Tetsuyoshi; Okano, Kazunori; Yoshikawa, Yoko; Aotsuka, Satoshi; Sasaki, Naokazu; Hattori, Atsushi; Okumura, Koji; Nagai, Keiichi; Sugano, Sumio; Isogai, Takao

    2005-01-01

    We have developed an in silico method of selection of human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combination of the oligo-capping method and ATGpr, software for prediction of translation start point and the coding potential. Then, using 5'-end single-pass sequences, cDNAs having the signal sequence were selected by PSORT ('signal sequence trap'). We also applied 'secretion or membrane protein-related keyword trap' based on the result of BLAST search against the SWISS-PROT database for the cDNAs which could not be selected by PSORT. Using the above procedures, 789 cDNAs were primarily selected and subjected to full-length sequencing, and 334 of these cDNAs were finally selected as novel. Most of the cDNAs (295 cDNAs: 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165 (80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by 'keyword trap' preserved the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including transporters, receptors, and ligands, involved in significant cellular functions. Thus, an efficient method of selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.

  6. scsB, a cDNA encoding the hydrogenosomal beta subunit of succinyl-CoA synthetase from the anaerobic fungus Neocallimastix frontalis

    NARCIS (Netherlands)

    Brondijk, THC; Durand, R; vanderGiezen, M; Gottschal, JC; Prins, RA; Fevre, M

    1996-01-01

    A clone containing a Neocallimastix frontalis cDNA assumed to encode the beta subunit of succinyl-CoA synthetase (SCSB) was identified by sequence homology with prokaryotic and eukaryotic counterparts. An open reading frame of 1311 bp was found. The deduced 437 amino acid sequence showed a high

  7. Rhodopsin in the Dark Hot Sea: Molecular Analysis of Rhodopsin in a Snailfish, Careproctus rhodomelas, Living near the Deep-Sea Hydrothermal Vent.

    Directory of Open Access Journals (Sweden)

    Rie Sakata

    Full Text Available Visual systems in deep-sea fishes have been previously studied from a photobiological aspect; however, those of deep-sea fish inhabiting the hydrothermal vents are far less understood due to sampling difficulties. In this study, we analyzed the visual pigment of a deep-sea snailfish, Careproctus rhodomelas, discovered and collected only near the hydrothermal vents of oceans around Japan. Proteins were solubilized from the C. rhodomelas eyeball and subjected to spectroscopic analysis, which revealed the presence of a pigment characterized by an absorption maximum (λmax) at 480 nm. Immunoblot analysis of the ocular protein showed a rhodopsin-like immunoreactivity. We also isolated a retinal cDNA encoding the entire coding sequence of putative C. rhodomelas rhodopsin (CrRh). HEK293EBNA cells were transfected with the CrRh cDNA and the proteins extracted from the cells were subjected to spectroscopic analysis. The recombinant CrRh showed the absorption maximum at 480 nm in the presence of 11-cis retinal. Comparison of the results from the eyeball extract and the recombinant CrRh strongly suggests that CrRh has an A1-based 11-cis-retinal chromophore and works as a photoreceptor in the C. rhodomelas retina, and hence that C. rhodomelas responds to dim blue light much the same as other deep-sea fishes. Because hydrothermal vents are a large supply of viable food, C. rhodomelas likely does not need to participate in diel vertical migration and may recognize the bioluminescence produced by aquatic animals living near the hydrothermal vents.

  8. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

    Science.gov (United States)

    Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A; Larsen, Martin Jakob

    2016-01-01

    Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.
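
    The "highly variable agreement" between callers described above is the kind of quantity that can be summarised with a pairwise overlap score. The following is a minimal sketch using the Jaccard index on sets of variant calls; the abstract does not say which agreement measure the authors used, so the metric, the function names, the caller labels, and the toy variants are all assumptions made for illustration.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard index of two call sets: |intersection| / |union|."""
    if not a and not b:
        return 1.0  # two empty call sets agree trivially
    return len(a & b) / len(a | b)


def pairwise_agreement(calls: dict) -> dict:
    """Map each unordered pair of caller names to its Jaccard index."""
    return {
        (x, y): jaccard(calls[x], calls[y])
        for x, y in combinations(sorted(calls), 2)
    }


# Hypothetical toy data: each variant is (chromosome, position, ref, alt).
calls = {
    "caller_A": {("chr1", 100, "A", "T"), ("chr2", 200, "G", "C"), ("chr3", 300, "C", "T")},
    "caller_B": {("chr1", 100, "A", "T"), ("chr2", 200, "G", "C")},
    "caller_C": {("chr1", 100, "A", "T"), ("chr4", 400, "T", "G")},
}

agreement = pairwise_agreement(calls)
for pair, score in agreement.items():
    print(pair, round(score, 3))
```

    Representing each call as a hashable tuple keeps the comparison exact; real call sets would first need consistent normalisation of indel representation, which this sketch does not attempt.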

  9. (+)-(10R)-Germacrene A synthase from goldenrod, Solidago canadensis; cDNA isolation, bacterial expression and functional analysis.

    Science.gov (United States)

    Prosser, Ian; Phillips, Andy L; Gittings, Simon; Lewis, Mervyn J; Hooper, Antony M; Pickett, John A; Beale, Michael H

    2002-08-01

    Profiling of sesquiterpene hydrocarbons in extracts of goldenrod, Solidago canadensis, by GC-MS revealed the presence of both enantiomers of germacrene D and lesser amounts of germacrene A, alpha-humulene, and beta-caryophyllene. A similarity-based cloning strategy using degenerate oligonucleotide primers, based on conserved amino acid sequences in known plant sesquiterpene synthases and RT-PCR, resulted in the isolation of a full length sesquiterpene synthase cDNA. Functional expression of the cDNA in E. coli, as an N-terminal thioredoxin fusion protein using the pET32b vector, yielded an enzyme that was readily purified by nickel-chelate affinity chromatography. Chiral GC-MS analysis of products from (3)H- and (2)H-labelled farnesyl diphosphate identified the enzyme as (+)-(10R)-germacrene A synthase. Sequence analysis and molecular modelling were used to compare this enzyme with the mechanistically related epi-aristolochene synthase from tobacco.

  10. Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1).

    Science.gov (United States)

    Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

    2017-12-01

    A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3' end of the reporter gene and the VP2 start sequence to allow co-translational 'cleavage' of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Restriction enzyme analysis indicated that the NanoLuc™ Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication.

  11. Second-strand cDNA synthesis: classical method

    International Nuclear Information System (INIS)

    Gubler, U.

    1987-01-01

    The classical scheme for the synthesis of double-stranded cDNA as it was reported in 1976 is described. Reverse transcription of mRNA with oligo(dT) as the primer generates first strands with a small loop at the 3' end of the cDNA (the end that corresponds to the 5' end of the mRNA). Subsequent removal of the mRNA by alkaline hydrolysis leaves single-stranded cDNA molecules again with a small 3' loop. This loop can be used by either reverse transcriptase or Klenow fragment of DNA polymerase I as a primer for second-strand synthesis. The resulting products are double-stranded cDNA molecules that are covalently closed at the end corresponding to the 5' end of the original mRNA. Subsequent cleavage of the short piece of single-stranded cDNA within the loop with the single-strand-specific S1 nuclease generates open double-stranded molecules that can be used for molecular cloning in plasmids or in phage. Useful variations of this scheme have been described.
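
    The strand relationships in this scheme can be illustrated at the sequence level. The sketch below is a deliberately simplified model: it treats both synthesis steps as plain reverse complementation and ignores the 3' hairpin loop, the oligo(dT) primer, and the S1 nuclease step. The function names and the toy mRNA are hypothetical.

```python
# Translation table for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


def first_strand(mrna: str) -> str:
    """First-strand cDNA (5'->3') complementary to an mRNA given 5'->3'."""
    return reverse_complement(mrna.upper().replace("U", "T"))


def second_strand(first: str) -> str:
    """Second-strand cDNA (5'->3') copied from the first strand."""
    return reverse_complement(first)


mrna = "AUGGCCUAA"  # hypothetical toy transcript
fs = first_strand(mrna)
ss = second_strand(fs)

print(fs)  # antisense strand
print(ss)  # sense strand: the mRNA sequence with T in place of U
```

    The second strand reproduces the mRNA sequence in DNA form, which is why the double-stranded product can stand in for the original transcript in cloning.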

  12. cDNA and deduced primary structure of basic phospholipase A2 with neurotoxic activity from the venom secretion of the Crotalus durissus collilineatus rattlesnake

    Directory of Open Access Journals (Sweden)

    F.H.R. Fagundes

    2010-03-01

    Full Text Available To illustrate the construction of precursor complementary DNAs, we isolated mRNAs from whole venom samples. After reverse transcription polymerase chain reaction (RT-PCR), we amplified the cDNA coding for a neurotoxic protein, phospholipase A2 D49 (PLA2 D49), from the venom of Crotalus durissus collilineatus (Cdc PLA2). The cDNA encoding Cdc PLA2 from whole venom was sequenced. The deduced amino acid sequence of this cDNA has high overall sequence identity with the group II PLA2 protein family. Cdc PLA2 has 14 cysteine residues capable of forming seven disulfide bonds that characterize this group of PLA2 enzymes. Cdc PLA2 was isolated using conventional Sephadex G75 column chromatography and reverse-phase high performance liquid chromatography (RP-HPLC). The molecular mass was estimated using matrix-assisted laser desorption ionization-time-of-flight (MALDI-TOF) mass spectrometry. We tested the neuromuscular blocking activities on chick biventer cervicis neuromuscular tissue. Phylogenetic analysis of Cdc PLA2 showed the existence of two lines of N6-PLA2, denominated F24 and S24. Apparently, the sequences of the New World’s N6-F24-PLA2 are similar to those of the agkistrodotoxin from the Asian genus Gloydius. The sequences of N6-S24-PLA2 are similar to the sequence of trimucrotoxin from the genus Protobothrops, found in the Old World.

  13. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jacob

    2007-01-01

    public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. RESULTS: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which...... with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression...

  14. Preparation of a differentially expressed, full-length cDNA expression library by RecA-mediated triple-strand formation with subtractively enriched cDNA fragments

    NARCIS (Netherlands)

    Hakvoort, T. B.; Spijkers, J. A.; Vermeulen, J. L.; Lamers, W. H.

    1996-01-01

    We have developed a fast and general method to obtain an enriched, full-length cDNA expression library with subtractively enriched cDNA fragments. The procedure relies on RecA-mediated triple-helix formation of single-stranded cDNA fragments with a double-stranded cDNA plasmid library. The complexes

  15. Monoterpene biosynthesis in lemon (Citrus limon) cDNA isolation and functional analysis of four monoterpene synthases

    NARCIS (Netherlands)

    Lücker, J.; Tamer, El M.K.; Schwab, W.; Verstappen, F.W.A.; Plas, van der L.H.W.; Bouwmeester, H.J.; Verhoeven, H.A.

    2002-01-01

    Citrus limon possesses a high content and large variety of monoterpenoids, especially in the glands of the fruit flavedo. The genes responsible for the production of these monoterpenes have never been isolated. By applying a random sequencing approach to a cDNA library from mRNA isolated from the

  16. Cloning and expression of a human kidney cDNA for an α2-adrenergic receptor subtype

    International Nuclear Information System (INIS)

    Regan, J.W.; Kobilka, T.S.; Yang-Feng, T.L.; Caron, M.G.; Lefkowitz, R.J.; Kobilka, B.K.

    1988-01-01

    An α2-adrenergic receptor subtype has been cloned from a human kidney cDNA library using the gene for the human platelet α2-adrenergic receptor as a probe. The deduced amino acid sequence resembles the human platelet α2-adrenergic receptor and is consistent with the structure of other members of the family of guanine nucleotide-binding protein-coupled receptors. The cDNA was expressed in a mammalian cell line (COS-7), and the α2-adrenergic ligand [3H]rauwolscine was bound. Competition curve analysis with a variety of adrenergic ligands suggests that this cDNA clone represents the α2B-adrenergic receptor. The gene for this receptor is on human chromosome 4, whereas the gene for the human platelet α2-adrenergic receptor (α2A) lies on chromosome 10. This ability to express the receptor in mammalian cells, free of other adrenergic receptor subtypes, should help in developing more selective α-adrenergic ligands.

  17. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Directory of Open Access Journals (Sweden)

    Alamar Santiago

    2009-09-01

    Full Text Available Abstract Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new

  18. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Science.gov (United States)

    Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

    2009-01-01

    Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new EST collection denotes an

  19. Isolation and characterization of two cDNA clones encoding for glutamate dehydrogenase in Nicotiana plumbaginifolia.

    Science.gov (United States)

    Ficarelli, A; Tassi, F; Restivo, F M

    1999-03-01

    We have isolated two full length cDNA clones encoding Nicotiana plumbaginifolia NADH-glutamate dehydrogenase. Both clones share amino acid boxes of homology corresponding to conserved GDH catalytic domains and putative mitochondrial targeting sequence. One clone shows a putative EF-hand loop. The level of the two transcripts is affected differently by carbon source.

  20. Single-Cell RNA Sequencing of Glioblastoma Cells.

    Science.gov (United States)

    Sen, Rajeev; Dolgalev, Igor; Bayin, N Sumru; Heguy, Adriana; Tsirigos, Aris; Placantonakis, Dimitris G

    2018-01-01

    Single-cell RNA sequencing (sc-RNASeq) is a recently developed technique used to evaluate the transcriptome of individual cells. As opposed to conventional RNASeq in which entire populations are sequenced in bulk, sc-RNASeq can be beneficial when trying to better understand gene expression patterns in markedly heterogeneous populations of cells or when trying to identify transcriptional signatures of rare cells that may be underrepresented when using conventional bulk RNASeq. In this method, we describe the generation and analysis of cDNA libraries from single patient-derived glioblastoma cells using the C1 Fluidigm system. The protocol details the use of the C1 integrated fluidics circuit (IFC) for capturing, imaging and lysing cells; performing reverse transcription; and generating cDNA libraries that are ready for sequencing and analysis.

  1. Deep Ion Torrent sequencing identifies soil fungal community shifts after frequent prescribed fires in a southeastern US forest ecosystem.

    Science.gov (United States)

    Brown, Shawn P; Callaham, Mac A; Oliver, Alena K; Jumpponen, Ari

    2013-12-01

    Prescribed burning is a common management tool to control fuel loads, ground vegetation, and facilitate desirable game species. We evaluated soil fungal community responses to long-term prescribed fire treatments in a loblolly pine forest on the Piedmont of Georgia and utilized deep Internal Transcribed Spacer Region 1 (ITS1) amplicon sequencing afforded by the recent Ion Torrent Personal Genome Machine (PGM). These deep sequence data (19,000 + reads per sample after subsampling) indicate that frequent fires (3-year fire interval) shift soil fungus communities, whereas infrequent fires (6-year fire interval) permit system resetting to a state similar to that without prescribed fire. Furthermore, in nonmetric multidimensional scaling analyses, primarily ectomycorrhizal taxa were correlated with axes associated with long fire intervals, whereas soil saprobes tended to be correlated with the frequent fire recurrence. We conclude that (1) multiplexed Ion Torrent PGM analyses allow deep cost effective sequencing of fungal communities but may suffer from short read lengths and inconsistent sequence quality adjacent to the sequencing adaptor; (2) frequent prescribed fires elicit a shift in soil fungal communities; and (3) such shifts do not occur when fire intervals are longer. Our results emphasize the general responsiveness of these forests to management, and the importance of fire return intervals in meeting management objectives. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
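
    The phrase "after subsampling" refers to bringing every sample down to a common read depth before communities are compared. A minimal sketch of that idea follows, using random sampling without replacement; the function name `subsample_reads`, the seed handling, and the toy read identifiers are assumptions, and the abstract does not describe the authors' actual subsampling tool or settings.

```python
import random


def subsample_reads(reads: list, depth: int, seed: int = 0) -> list:
    """Draw `depth` reads without replacement so samples share one depth.

    Raises ValueError when a sample holds fewer reads than requested;
    such samples would normally be dropped rather than padded.
    """
    if len(reads) < depth:
        raise ValueError("sample has fewer reads than the target depth")
    rng = random.Random(seed)  # fixed seed keeps the draw reproducible
    return rng.sample(reads, depth)


# Hypothetical toy samples of unequal size (read identifiers only).
samples = {
    "plot_1": [f"read_{i}" for i in range(50)],
    "plot_2": [f"read_{i}" for i in range(80)],
}
target = min(len(r) for r in samples.values())
even = {name: subsample_reads(r, target) for name, r in samples.items()}

print({name: len(r) for name, r in even.items()})
```

    Choosing the smallest sample as the target keeps every sample in the comparison; a higher fixed depth would instead require discarding shallow samples.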

  2. Cloning and sequencing of the casein kinase 2 alpha subunit from Zea mays

    DEFF Research Database (Denmark)

    Dobrowolska, G; Boldyreff, B; Issinger, O G

    1991-01-01

    The nucleotide sequence of the cDNA coding for the alpha subunit of casein kinase 2 of Zea mays has been determined. The cDNA clone contains an open reading frame of 996 nucleotides encoding a polypeptide comprising 332 amino acids. The primary amino acid sequence exhibits 75% identity to the alpha...... subunit and 71% identity to the alpha' subunit of human casein kinase 2....

  3. Complementation of radiation-sensitive Ataxia telangiectasia cells after transfection of cDNA expression libraries and cosmid clones from wildtype cells

    International Nuclear Information System (INIS)

    Fritz, E.

    1994-06-01

    In this Ph.D.-thesis, phenotypic complementation of AT-cells (AT5BIVA) by transfection of cDNA-expression-libraries was addressed: After stable transfection of cDNA-expression-libraries G418 resistant clones were selected for enhanced radioresistance by a fractionated X-ray selection. One surviving transfectant clone (clone 514) exhibited enhanced radiation resistance in dose-response experiments and further X-ray selections. Cell cycle analysis revealed complementation of untreated and irradiated 514-cells in cell cycle progression. The rate of DNA synthesis, however, is not diminished after irradiation but shows the reverse effect. A transfected cDNA-fragment (AT500-cDNA) was isolated from the genomic DNA of 514-cells and proved to be an unknown DNA sequence. A homologous sequence could be detected in genomic DNA from human cell lines, but not in DNA from other species. The cDNA-sequence could be localized to human chromosome 11. In human cells the cDNA sequence is part of two large mRNAs. 4 different cosmid clones containing high molecular genomic DNA from normal human cells could be isolated from a library, each hybridizing to the AT500-cDNA. After stable transfection into AT-cells, one cosmid-clone was able to confer enhanced radiation resistance both in X-ray selections and dose-response experiments. The results indicate that the cloned cDNA-fragment is based on an unknown gene from human chromosome 11 which partially complements the radiosensitivity and the defective cell cycle progression in AT5BIVA cells. (orig.) [de

  4. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

    Directory of Open Access Journals (Sweden)

    Anne Bruun Krøigård

    Full Text Available Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.

  5. Purification, reactivity with IgE and cDNA cloning of parvalbumin as the major allergen of mackerels.

    Science.gov (United States)

    Hamada, Y; Tanaka, H; Ishizaki, S; Ishida, M; Nagashima, Y; Shiomi, K

    2003-08-01

    Three species of mackerels (Scomber japonicus, S. australasicus and S. scombrus) are widely consumed and considered to be most frequently involved in incidents of IgE-mediated fish allergy in Japan. In this study, parvalbumin, a possible candidate for the major allergen, was purified from the white muscle of three species of mackerels by gel filtration on Sephadex G-75 and reverse-phase HPLC on TSKgel ODS-120T. All the purified preparations from three species gave a single band of about 11 kDa and were clearly identified as parvalbumins by analyses of their partial amino acid sequences. In ELISA experiments, four of five sera from fish-allergic patients reacted to all the purified parvalbumins, demonstrating that parvalbumin is the major allergen in common with the mackerels. Antigenic cross-reactivity among the mackerel parvalbumins was also established by ELISA inhibition experiments. A cDNA library was constructed from the white muscle of S. japonicus and the cDNA encoding parvalbumin was cloned. The amino acid sequence translated from the nucleotide sequence revealed that the S. japonicus parvalbumin is composed of 108 residues, being a member of beta-type parvalbumins.

  6. High-Quality Draft Single-Cell Genome Sequence Belonging to the Archaeal Candidate Division SA1, Isolated from Nereus Deep in the Red Sea

    KAUST Repository

    Ngugi, David; Stingl, Ulrich

    2018-01-01

    Candidate division SA1 encompasses a phylogenetically coherent archaeal group ubiquitous in deep hypersaline anoxic brines around the globe. Recently, the genome sequences of two cultivated representatives from hypersaline soda lake sediments were published. Here, we present a single-cell genome sequence from Nereus Deep in the Red Sea that represents a putatively novel family within SA1.

  7. High-Quality Draft Single-Cell Genome Sequence Belonging to the Archaeal Candidate Division SA1, Isolated from Nereus Deep in the Red Sea

    KAUST Repository

    Ngugi, David

    2018-05-09

    Candidate division SA1 encompasses a phylogenetically coherent archaeal group ubiquitous in deep hypersaline anoxic brines around the globe. Recently, the genome sequences of two cultivated representatives from hypersaline soda lake sediments were published. Here, we present a single-cell genome sequence from Nereus Deep in the Red Sea that represents a putatively novel family within SA1.

  8. Whitefly (Bemisia tabaci) genome project: analysis of sequenced clones from egg, instar, and adult (viruliferous and non-viruliferous) cDNA libraries

    Directory of Open Access Journals (Sweden)

    Czosnek Henryk

    2006-04-01

    Full Text Available Abstract Background: The past three decades have witnessed a dramatic increase in interest in the whitefly Bemisia tabaci, owing to its nature as a taxonomically cryptic species, the damage it causes to a large number of herbaceous plants because of its specialized feeding in the phloem, and to its ability to serve as a vector of plant viruses. Among the most important plant viruses to be transmitted by B. tabaci are those in the genus Begomovirus (family Geminiviridae). Surprisingly, little is known about the genome of this whitefly. The haploid genome size for male B. tabaci has been estimated to be approximately one billion bp by flow cytometry analysis, about five times the size of the fruitfly Drosophila melanogaster. The genes involved in whitefly development, in host range plasticity, and in begomovirus vector specificity and competency, are unknown. Results: To address this general shortage of genomic sequence information, we have constructed three cDNA libraries from non-viruliferous whiteflies (eggs, immature instars, and adults) and two from adult insects that fed on tomato plants infected by two geminiviruses: Tomato yellow leaf curl virus (TYLCV) and Tomato mottle virus (ToMoV). In total, the sequence of 18,976 clones was determined. After quality control and removal of 5,542 clones of mitochondrial origin, 9,110 sequences remained, which included 3,843 singletons and 1,017 contigs. Comparisons with public databases indicated that the libraries contained genes involved in cellular and developmental processes. In addition, approximately 1,000 bases aligned with the genome of the B. tabaci endosymbiotic bacterium Candidatus Portiera aleyrodidarum, originating primarily from the egg and instar libraries. Apart from the mitochondrial sequences, the longest and most abundant sequence encodes vitellogenin, which originated from whitefly adult libraries, indicating that much of the gene expression in this insect is directed toward the production

  9. An introduction to deep learning on biological sequence data: examples and solutions.

    Science.gov (United States)

    Jurtz, Vanessa Isabell; Johansen, Alexander Rosenberg; Nielsen, Morten; Almagro Armenteros, Jose Juan; Nielsen, Henrik; Sønderby, Casper Kaae; Winther, Ole; Sønderby, Søren Kaae

    2017-11-15

    Deep neural network architectures such as convolutional and long short-term memory networks have become increasingly popular as machine learning tools during the recent years. The availability of greater computational resources, more data, new algorithms for training deep models and easy to use libraries for implementation and training of neural networks are the drivers of this development. The use of deep learning has been especially successful in image recognition; and the development of tools, applications and code examples are in most cases centered within this field rather than within biology. Here, we aim to further the development of deep learning methods within biology by providing application examples and ready to apply and adapt code templates. Given such examples, we illustrate how architectures consisting of convolutional and long short-term memory neural networks can relatively easily be designed and trained to state-of-the-art performance on three biological sequence problems: prediction of subcellular localization, protein secondary structure and the binding of peptides to MHC Class II molecules. All implementations and datasets are available online to the scientific community at https://github.com/vanessajurtz/lasagne4bio. skaaesonderby@gmail.com. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  10. Production of a full-length infectious GFP-tagged cDNA clone of Beet mild yellowing virus for the study of plant-polerovirus interactions.

    Science.gov (United States)

    Stevens, Mark; Viganó, Felicita

    2007-04-01

    The full-length cDNA of Beet mild yellowing virus (Broom's Barn isolate) was sequenced and cloned into the vector pLitmus 29 (pBMYV-BBfl). The sequence of BMYV-BBfl (5721 bases) shared 96% and 98% nucleotide identity with the other complete sequences of BMYV (BMYV-2ITB, France and BMYV-IPP, Germany respectively). Full-length capped RNA transcripts of pBMYV-BBfl were synthesised and found to be biologically active in Arabidopsis thaliana protoplasts following electroporation or PEG inoculation when the protoplasts were subsequently analysed using serological and molecular methods. The BMYV sequence was modified by inserting DNA that encoded the jellyfish green fluorescent protein (GFP) into the P5 gene close to its 3' end. A. thaliana protoplasts electroporated with these RNA transcripts were biologically active and up to 2% of transfected protoplasts showed GFP-specific fluorescence. The exploitation of these cDNA clones for the study of the biology of beet poleroviruses is discussed.

  11. Construction of a cDNA library from female adult of Toxocara canis, and analysis of EST and immune-related genes expressions.

    Science.gov (United States)

    Zhou, Rongqiong; Xia, Qingyou; Huang, Hancheng; Lai, Min; Wang, Zhenxin

    2011-10-01

    Toxocara canis is a widespread intestinal nematode parasite of dogs, which can also cause disease in humans. We employed an expressed sequence tag (EST) strategy in order to study gene-expression including development, digestion and reproduction of T. canis. ESTs provided a rapid way to identify genes, particularly in organisms for which we have very little molecular information. In this study, a cDNA library was constructed from a female adult of T. canis and 215 high-quality ESTs from 5'-ends of the cDNA clones representing 79 unigenes were obtained. The titer of the primary cDNA library was 1.83×10(6)pfu/mL with a recombination rate of 99.33%. Most of the sequences ranged from 300 to 900bp with an average length of 656bp. Cluster analysis of these ESTs allowed identification of 79 unique sequences containing 28 contigs and 51 singletons. BLASTX searches revealed that 18 unigenes (22.78% of the total) or 70 ESTs (32.56% of the total) were novel genes that had no significant matches to any protein sequences in the public databases. The rest of the 61 unigenes (77.22% of the total) or 145 ESTs (67.44% of the total) were closely matched to the known genes or sequences deposited in the public databases. These genes were classified into seven groups based on their known or putative biological functions. We also confirmed the gene expression patterns of several immune-related genes using RT-PCR examination. This work will provide a valuable resource for the further investigations in the stage-, sex- and tissue-specific gene transcription or expression. Copyright © 2011. Published by Elsevier Inc.
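    The percentages quoted in this abstract follow directly from the reported counts. The short check below recomputes them; the variable and function names are illustrative only.

```python
def percent(part, total):
    """Percentage of total represented by part, rounded to two decimals."""
    return round(100 * part / total, 2)


# Counts reported in the abstract.
total_unigenes, total_ests = 79, 215
contigs, singletons = 28, 51
novel_unigenes, novel_ests = 18, 70
known_unigenes, known_ests = 61, 145

# Contigs plus singletons should account for every unigene.
print(contigs + singletons == total_unigenes)
print(percent(novel_unigenes, total_unigenes), percent(novel_ests, total_ests))
print(percent(known_unigenes, total_unigenes), percent(known_ests, total_ests))
```

    The recomputed values match the stated 22.78% / 32.56% for novel sequences and 77.22% / 67.44% for sequences with database matches.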

  12. Virus pathotype and deep sequencing of the HA gene of a low pathogenicity H7N1 avian influenza virus causing mortality in Turkeys.

    Directory of Open Access Journals (Sweden)

    Munir Iqbal

    Full Text Available Low pathogenicity avian influenza (LPAI) viruses of the H7 subtype generally cause mild disease in poultry. However the evolution of a LPAI virus into highly pathogenic avian influenza (HPAI) virus results in the generation of a virus that can cause severe disease and death. The classification of these two pathotypes is based, in part, on disease signs and death in chickens, as assessed in an intravenous pathogenicity test, but the effect of LPAI viruses in turkeys is less well understood. During an investigation of LPAI virus infection of turkeys, groups of three-week-old birds inoculated with A/chicken/Italy/1279/99 (H7N1) showed severe disease signs and died or were euthanised within seven days of infection. Virus was detected in many internal tissues and organs from culled birds. To examine the possible evolution of the infecting virus to a highly pathogenic form in these turkeys, sequence analysis of the haemagglutinin (HA) gene cleavage site was carried out by analysing multiple cDNA amplicons made from swabs and tissue sample extracts employing Sanger and Next Generation Sequencing. In addition, an RT-PCR assay to detect HPAI virus was developed. There was no evidence of the presence of HPAI virus in either the virus used as inoculum or from swabs taken from infected birds. However, a small proportion (<0.5%) of virus carried in individual tracheal or liver samples did contain a molecular signature typical of a HPAI virus at the HA cleavage site. All the signature sequences were identical and were similar to HPAI viruses collected during the Italian epizootic in 1999/2000. We assume that the detection of HPAI virus in tissue samples following infection with A/chicken/Italy/1279/99 reflected amplification of a virus present at very low levels within the mixed inoculum but, strikingly, we observed no new HPAI virus signatures in the amplified DNA analysed by deep-sequencing.

  13. Characterization and sequence analysis of cysteine and glycine-rich ...

    African Journals Online (AJOL)

    Primers specific for CSRP3 were designed using known cDNA sequences of Bos taurus published in database with different accession numbers. Polymerase chain reaction (PCR) was performed and products were purified and sequenced. Sequence analysis and alignment were carried out using CLUSTAL W (1.83).

  14. Cloning of the human carnitine-acylcarnitine carrier cDNA and identification of the molecular defect in a patient

    NARCIS (Netherlands)

    Huizing, M.; Iacobazzi, V.; IJlst, L.; Savelkoul, P.; Ruitenbeek, W.; van den Heuvel, L.; Indiveri, C.; Smeitink, J.; Trijbels, F.; Wanders, R.; Palmieri, F.

    1997-01-01

    The carnitine-acylcarnitine carrier (CAC) catalyzes the translocation of long-chain fatty acids across the inner mitochondrial membrane. We cloned and sequenced the human CAC cDNA, which has an open reading frame of 903 nucleotides. Northern blot studies revealed different expression levels of CAC

  15. cDNA - ASTRA | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ontents List of cDNA in locus Data file File name: astra_cdna.zip File URL: ftp://ftp.biosciencedbc.jp/archive/astra/LATEST/astra_cdn...a.zip File size: 3.3 MB Simple search URL http://togodb.biosciencedbc.jp/togodb/view/astra_cdna...n, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna

  16. Efficient generation of recombinant RNA viruses using targeted recombination-mediated mutagenesis of bacterial artificial chromosomes containing full-length cDNA

    DEFF Research Database (Denmark)

    Rasmussen, Thomas Bruun; Risager, Peter Christian; Fahnøe, Ulrik

    2013-01-01

    Background Infectious cDNA clones are a prerequisite for directed genetic manipulation of RNA viruses. Here, a strategy to facilitate manipulation and rescue of classical swine fever viruses (CSFVs) from full-length cDNAs present within bacterial artificial chromosomes (BACs) is described....... This strategy allows manipulation of viral cDNA by targeted recombination-mediated mutagenesis within bacteria. Results A new CSFV-BAC (pBeloR26) derived from the Riems vaccine strain has been constructed and subsequently modified in the E2 coding sequence, using the targeted recombination strategy to enable...

  17. Poly(A)-tag deep sequencing data processing to extract poly(A) sites.

    Science.gov (United States)

    Wu, Xiaohui; Ji, Guoli; Li, Qingshun Quinn

    2015-01-01

    Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.
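    The grouping and clustering steps described above can be sketched in a few lines. This is a minimal Python illustration, not the authors' Perl pipeline: the function name, the tuple layout and the rule that a site joins a cluster when it lies within 24 nt of the cluster's last site are assumptions, and internal-priming removal and annotation parsing are left out.

```python
from collections import Counter


def cluster_cleavage_sites(tags, max_gap=24):
    """Collapse poly(A) tags into cleavage sites, then cluster nearby sites.

    tags: iterable of (chromosome, strand, position) tuples, one per tag.
    Returns a list of dicts with chrom, strand, start, end and tag count.
    """
    # Step 1: tags at the same genome coordinate form one cleavage site.
    site_counts = Counter(tags)

    clusters = []
    # Step 2: visit sites in sorted order and extend the current cluster
    # while the next site is on the same chromosome and strand and lies
    # within max_gap nucleotides of the cluster's last site.
    for (chrom, strand, pos), count in sorted(site_counts.items()):
        last = clusters[-1] if clusters else None
        if (last is not None and last["chrom"] == chrom
                and last["strand"] == strand
                and pos - last["end"] <= max_gap):
            last["end"] = pos
            last["tags"] += count
        else:
            clusters.append({"chrom": chrom, "strand": strand,
                             "start": pos, "end": pos, "tags": count})
    return clusters


# Toy tags (hypothetical coordinates).
toy_tags = [("chr1", "+", 100), ("chr1", "+", 100), ("chr1", "+", 110),
            ("chr1", "+", 200), ("chr1", "-", 105)]
for cluster in cluster_cleavage_sites(toy_tags):
    print(cluster)
```

    Tags from several samples can simply be concatenated before the call, which corresponds to clustering sites "from different samples" into shared poly(A) clusters.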

  18. Next generation sequencing (NGS) technologies and applications

    Energy Technology Data Exchange (ETDEWEB)

    Vuyisich, Momchilo [Los Alamos National Laboratory

    2012-09-11

    NGS technology overview: (1) NGS library preparation - Nucleic acids extraction, Sample quality control, RNA conversion to cDNA, Addition of sequencing adapters, Quality control of library; (2) Sequencing - Clonal amplification of library fragments (except PacBio), Sequencing by synthesis, Data output (reads and quality); and (3) Data analysis - Read mapping, Genome assembly, Gene expression, Operon structure, sRNA discovery, and Epigenetic analyses.

  19. Identification and Removal of Contaminant Sequences From Ribosomal Gene Databases: Lessons From the Census of Deep Life.

    Science.gov (United States)

    Sheik, Cody S; Reese, Brandi Kiel; Twing, Katrina I; Sylvan, Jason B; Grim, Sharon L; Schrenk, Matthew O; Sogin, Mitchell L; Colwell, Frederick S

    2018-01-01

    Earth's subsurface environment is one of the largest, yet least studied, biomes on Earth, and many questions remain regarding what microorganisms are indigenous to the subsurface. Through the activity of the Census of Deep Life (CoDL) and the Deep Carbon Observatory, an open access 16S ribosomal RNA gene sequence database from diverse subsurface environments has been compiled. However, due to low quantities of biomass in the deep subsurface, the potential for incorporation of contaminants from reagents used during sample collection, processing, and/or sequencing is high. Thus, to understand the ecology of subsurface microorganisms (i.e., the distribution, richness, or survival), it is necessary to minimize, identify, and remove contaminant sequences that will skew the relative abundances of all taxa in the sample. In this meta-analysis, we identify putative contaminants associated with the CoDL dataset, recommend best practices for removing contaminants from samples, and propose a series of best practices for subsurface microbiology sampling. The most abundant putative contaminant genera observed, independent of evenness across samples, were Propionibacterium, Aquabacterium, Ralstonia, and Acinetobacter, while the top five most frequently observed genera were Pseudomonas, Propionibacterium, Acinetobacter, Ralstonia, and Sphingomonas. The majority of the most frequently observed genera (high evenness) were associated with reagent or potential human contamination. Additionally, in DNA extraction blanks, we observed potential archaeal contaminants, including methanogens, which have not been discussed in previous contamination studies. Such contaminants would directly affect the interpretation of subsurface molecular studies, as methanogenesis is an important subsurface biogeochemical process. Utilizing previously identified contaminant genera, we found that ∼27% of the total dataset were identified as contaminant sequences that likely originate from DNA
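    Removing sequences assigned to a list of known contaminant genera, and reporting what fraction of the dataset that removes, can be sketched as follows. The genus list is taken from the abstract; the function name, the per-genus count table and the example counts are hypothetical, and the study's actual filtering procedure may differ.

```python
# Putative contaminant genera named in the abstract.
PUTATIVE_CONTAMINANTS = {
    "Propionibacterium", "Aquabacterium", "Ralstonia",
    "Acinetobacter", "Pseudomonas", "Sphingomonas",
}


def remove_contaminants(genus_counts, contaminants=PUTATIVE_CONTAMINANTS):
    """Drop contaminant genera from a {genus: read_count} table.

    Returns the filtered table and the fraction of reads removed.
    """
    kept = {g: n for g, n in genus_counts.items() if g not in contaminants}
    total = sum(genus_counts.values())
    removed = total - sum(kept.values())
    fraction_removed = removed / total if total else 0.0
    return kept, fraction_removed


# Hypothetical sample: read counts per genus.
sample = {"Ralstonia": 27, "Desulfovibrio": 73}
filtered, fraction = remove_contaminants(sample)
print(filtered, fraction)
```

    Relative abundances should be recomputed from the filtered table, since removing contaminant reads changes the denominator for every remaining taxon.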

  20. Deep Sequencing Insights in Therapeutic shRNA Processing and siRNA Target Cleavage Precision.

    Science.gov (United States)

    Denise, Hubert; Moschos, Sterghios A; Sidders, Benjamin; Burden, Frances; Perkins, Hannah; Carter, Nikki; Stroud, Tim; Kennedy, Michael; Fancy, Sally-Ann; Lapthorn, Cris; Lavender, Helen; Kinloch, Ross; Suhy, David; Corbau, Romu

    2014-02-04

    TT-034 (PF-05095808) is a recombinant adeno-associated virus serotype 8 (AAV8) agent expressing three short hairpin RNA (shRNA) pro-drugs that target the hepatitis C virus (HCV) RNA genome. The cytosolic enzyme Dicer cleaves each shRNA into multiple, potentially active small interfering RNA (siRNA) drugs. Using next-generation sequencing (NGS) to identify and characterize active shRNA maturation products, we observed that each TT-034-encoded shRNA could be processed into as many as 95 separate siRNA strands. Few of these appeared active as determined by Sanger 5' RNA Ligase-Mediated Rapid Amplification of cDNA Ends (5-RACE) and through synthetic shRNA and siRNA analogue studies. Moreover, NGS scrutiny applied on 5-RACE products (RACE-seq) suggested that synthetic siRNAs could direct cleavage in not one, but up to five separate positions on targeted RNA, in a sequence-dependent manner. These data support an on-target mechanism of action for TT-034 without cytotoxicity and question the accepted precision of substrate processing by the key RNA interference (RNAi) enzymes Dicer and siRNA-induced silencing complex (siRISC). Molecular Therapy-Nucleic Acids (2014) 3, e145; doi:10.1038/mtna.2013.73; published online 4 February 2014.

  1. cDNA, genomic sequence cloning, and overexpression of EIF1 from the giant panda (Ailuropoda Melanoleuca) and the black bear (Ursus Thibetanus Mupinensis).

    Science.gov (United States)

    Hou, Wan-ru; Tang, Yun; Hou, Yi-ling; Song, Yan; Zhang, Tian; Wu, Guang-fu

    2010-07-01

    Eukaryotic initiation factor (eIF) EIF1 is a universally conserved translation factor that is involved in translation initiation site selection. The cDNA and the genomic sequences of EIF1 were cloned successfully from the giant panda (Ailuropoda melanoleuca) and the black bear (Ursus thibetanus mupinensis) using reverse transcription polymerase chain reaction (RT-PCR) technology and touchdown-polymerase chain reaction, respectively. The cDNAs of the EIF1 cloned from the giant panda and the black bear are 418 bp in size, containing an open reading frame (ORF) of 342 bp encoding 113 amino acids. The length of the genomic sequence of the giant panda is 1909 bp, which contains four exons and three introns. The length of the genomic sequence of the black bear is 1897 bp, which also contains four exons and three introns. Sequence alignment indicates a high degree of homology to those of Homo sapiens, Mus musculus, Rattus norvegicus, and Bos taurus at both amino acid and DNA levels. Topology prediction shows that there are one N-glycosylation site, two casein kinase II phosphorylation sites, and an amidation site in the EIF1 protein of the giant panda and black bear. In addition, there is a protein kinase C phosphorylation site in EIF1 of the giant panda. The giant panda and the black bear EIF1 genes were overexpressed in E. coli BL21. The results indicated that both EIF1 fusion proteins, in N-terminally His-tagged form, accumulated as the expected 19 kDa polypeptides. The expression products obtained could be used to purify the proteins and study their function further.
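    The stated ORF length and protein length are consistent if the 342 bp include the stop codon: 342 / 3 = 114 codons, of which 113 encode amino acids. The small helper below makes that check explicit; its name and the assumption that the reported ORF length counts the stop codon are illustrative, not taken from the paper.

```python
def encoded_residues(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in bp.

    Assumes the ORF length is a whole number of codons; when includes_stop
    is True, the final codon is a stop codon and encodes no residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


print(encoded_residues(342))
```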

  2. Generation and analysis of a large-scale expressed sequence Tag database from a full-length enriched cDNA library of developing leaves of Gossypium hirsutum L.

    Directory of Open Access Journals (Sweden)

    Min Lin

    Full Text Available BACKGROUND: Cotton (Gossypium hirsutum L.) is one of the world's most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. METHODOLOGY/PRINCIPAL FINDINGS: In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representing 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR), which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes are expressed preferentially in senescent leaves. CONCLUSIONS/SIGNIFICANCE: These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence

  3. Protein model discrimination using mutational sensitivity derived from deep sequencing.

    Science.gov (United States)

    Adkar, Bharat V; Tripathi, Arti; Sahoo, Anusmita; Bajaj, Kanika; Goswami, Devrishi; Chakrabarti, Purbani; Swarnkar, Mohit K; Gokhale, Rajesh S; Varadarajan, Raghavan

    2012-02-08

    A major bottleneck in protein structure prediction is the selection of correct models from a pool of decoys. Relative activities of ∼1,200 individual single-site mutants in a saturation library of the bacterial toxin CcdB were estimated by determining their relative populations using deep sequencing. This phenotypic information was used to define an empirical score for each residue (RankScore), which correlated with the residue depth, and identify active-site residues. Using these correlations, ∼98% of correct models of CcdB (RMSD ≤ 4Å) were identified from a large set of decoys. The model-discrimination methodology was further validated on eleven different monomeric proteins using simulated RankScore values. The methodology is also a rapid, accurate way to obtain relative activities of each mutant in a large pool and derive sequence-structure-function relationships without protein isolation or characterization. It can be applied to any system in which mutational effects can be monitored by a phenotypic readout. Copyright © 2012 Elsevier Ltd. All rights reserved.

  4. Deep-Sea, Deep-Sequencing: Metabarcoding Extracellular DNA from Sediments of Marine Canyons.

    Directory of Open Access Journals (Sweden)

    Magdalena Guardiola

    Full Text Available Marine sediments are home to one of the richest species pools on Earth, but logistics and a dearth of taxonomic work-force hinder knowledge of their biodiversity. We characterized α- and β-diversity of deep-sea assemblages from submarine canyons in the western Mediterranean using environmental DNA metabarcoding. We used a new primer set targeting a short eukaryotic 18S sequence (ca. 110 bp). We applied a protocol designed to obtain extractions enriched in extracellular DNA from replicated sediment corers. With this strategy we captured information from DNA (local or deposited from the water column) that persists adsorbed to inorganic particles and buffered short-term spatial and temporal heterogeneity. We analysed replicated samples from 20 localities including 2 deep-sea canyons, 1 shallower canal, and two open slopes (depth range 100-2,250 m). We identified 1,629 MOTUs, among which the dominant groups were Metazoa (with representatives of 19 phyla), Alveolata, Stramenopiles, and Rhizaria. There was a marked small-scale heterogeneity as shown by differences in replicates within corers and within localities. The spatial variability between canyons was significant, as was the depth component in one of the canyons where it was tested. Likewise, the composition of the first layer (1 cm) of sediment was significantly different from deeper layers. We found that qualitative (presence-absence) and quantitative (relative number of reads) data showed consistent trends of differentiation between samples and geographic areas. The subset of exclusively benthic MOTUs showed similar patterns of β-diversity and community structure as the whole dataset. Separate analyses of the main metazoan phyla (in number of MOTUs) showed some differences in distribution attributable to different lifestyles. Our results highlight the differentiation that can be found even between geographically close assemblages, and set the ground for future monitoring and conservation

  5. Hybridization-based reconstruction of small non-coding RNA transcripts from deep sequencing data.

    Science.gov (United States)

    Ragan, Chikako; Mowry, Bryan J; Bauer, Denis C

    2012-09-01

    Recent advances in RNA sequencing technology (RNA-Seq) enable comprehensive profiling of RNAs by producing millions of short sequence reads from size-fractionated RNA libraries. Although conventional tools for detecting and distinguishing non-coding RNAs (ncRNAs) from reference-genome data can be applied to sequence data, ncRNA detection can be improved by harnessing the full information content provided by this new technology. Here we present NorahDesk, the first unbiased and universally applicable method for small ncRNA detection from RNA-Seq data. NorahDesk utilizes the coverage-distribution of small RNA sequence data as well as thermodynamic assessments of secondary structure to reliably predict and annotate ncRNA classes. Using publicly available mouse sequence data from brain, skeletal muscle, testis and ovary, we evaluated our method with an emphasis on the performance for microRNAs (miRNAs) and piwi-interacting small RNA (piRNA). We compared our method with Dario and mirDeep2 and found that NorahDesk produces longer transcripts with higher read coverage. This feature makes it the first method particularly suitable for the prediction of both known and novel piRNAs.

  6. Non PCR-amplified Transcripts and AFLP fragments as reduced representations of the quail genome for 454 Titanium sequencing

    Directory of Open Access Journals (Sweden)

    Leterrier Christine

    2010-07-01

    Full Text Available Abstract Background: SNP (Single Nucleotide Polymorphism) discovery is now routinely performed using high-throughput sequencing of reduced representation libraries. Our objective was to adapt 454 GS FLX based sequencing methodologies in order to obtain the largest possible dataset from two reduced representation libraries, produced by AFLP (Amplified Fragment Length Polymorphism) for genomic DNA, and EST (Expressed Sequence Tag) for the transcribed fraction of the genome. Findings: The expressed fraction was obtained by preparing cDNA libraries without PCR amplification from quail embryo and brain. To optimize the information content for SNP analyses, libraries were prepared from individuals selected in three quail lines and each individual in the AFLP library was tagged. Sequencing runs produced 399,189 sequence reads from cDNA and 373,484 from genomic fragments, covering close to 250 Mb of sequence in total. Conclusions: Both methods used to obtain reduced representations for high-throughput sequencing were successful after several improvements. The protocols may be used for several sequencing applications, such as de novo sequencing, tagged PCR fragments or long fragment sequencing of cDNA.

  7. Draft Genome Sequence of Deep-Sea Alteromonas sp. Strain V450 Isolated from the Marine Sponge Leiodermatium sp.

    Science.gov (United States)

    Wang, Guojun; Barrett, Nolan H; McCarthy, Peter J

    2017-02-02

    The proteobacterium Alteromonas sp. strain V450 was isolated from the Atlantic deep-sea sponge Leiodermatium sp. Here, we report the draft genome sequence of this strain, with a genome size of approx. 4.39 Mb and a G+C content of 44.01%. The results will aid deep-sea microbial ecology, evolution, and sponge-microbe association studies. Copyright © 2017 Wang et al.

  8. Subtractive cloning of cDNA from Aspergillus oryzae differentially regulated between solid-state culture and liquid (submerged) culture.

    Science.gov (United States)

    Akao, Takeshi; Gomi, Katsuya; Goto, Kuniyasu; Okazaki, Naoto; Akita, Osamu

    2002-07-01

    In solid-state cultures (SC), Aspergillus oryzae shows characteristics such as high-level production and secretion of enzymes and hyphal differentiation with asexual development which are absent in liquid (submerged) culture (LC). It was predicted that many of the genes involved in the characteristics of A. oryzae in SC are differentially expressed between SC and LC. We generated two subtracted cDNA libraries with bi-directional cDNA subtractive hybridizations to isolate and identify such genes. Among them, we identified genes upregulated in or specific to SC, such as the AOS (A. oryzae SC-specific gene) series, and those downregulated or not expressed in SC, such as the AOL (A. oryzae LC-specific) series. Sequencing analyses revealed that the AOS series and the AOL series contain genes encoding extra- and intracellular enzymes and transport proteins. However, half were functionally unclassified by nucleotide sequences. Also, by expression profile, the AOS series comprised two groups. These gene products' molecular functions and physiological roles in SC await further investigation.

  9. Procedure for normalization of cDNA libraries

    Science.gov (United States)

    Bonaldo, Maria DeFatima; Soares, Marcelo Bento

    1997-01-01

    This invention provides a method to normalize a cDNA library constructed in a vector capable of being converted to single-stranded circles and capable of producing complementary nucleic acid molecules to the single-stranded circles comprising: (a) converting the cDNA library in single-stranded circles; (b) generating complementary nucleic acid molecules to the single-stranded circles; (c) hybridizing the single-stranded circles converted in step (a) with complementary nucleic acid molecules of step (b) to produce partial duplexes to an appropriate Cot; (e) separating the unhybridized single-stranded circles from the hybridized single-stranded circles, thereby generating a normalized cDNA library.

  10. Hibiscus latent Fort Pierce virus in Brazil and synthesis of its biologically active full-length cDNA clone.

    Science.gov (United States)

    Gao, Ruimin; Niu, Shengniao; Dai, Weifang; Kitajima, Elliot; Wong, Sek-Man

    2016-10-01

    A Brazilian isolate of Hibiscus latent Fort Pierce virus (HLFPV-BR) was first found in a hibiscus plant in Limeira, SP, Brazil. RACE PCR was carried out to obtain the full-length sequence of HLFPV-BR, which is 6453 nucleotides long and shares more than 99.15% complete genomic RNA nucleotide sequence identity with the HLFPV Japanese isolate. The genomic structure of HLFPV-BR is similar to that of other tobamoviruses. It includes a 5' untranslated region (UTR), followed by open reading frames encoding a 128-kDa protein and a 188-kDa readthrough protein, a 38-kDa movement protein, an 18-kDa coat protein, and a 3' UTR. Interestingly, the unique feature of a poly(A) tract is also found within its 3' UTR. Furthermore, from the total RNA extracted from the local lesions of HLFPV-BR-infected Chenopodium quinoa leaves, a biologically active, full-length cDNA clone encompassing the genome of HLFPV-BR was amplified and placed adjacent to a T7 RNA polymerase promoter. The capped in vitro transcripts from the cloned cDNA were infectious when mechanically inoculated into C. quinoa and Nicotiana benthamiana plants. This is the first report of the presence of an isolate of HLFPV in Brazil and the successful synthesis of a biologically active HLFPV-BR full-length cDNA clone.

  11. Deep sequence characterisation of a divergent HPIV-4a from an adult with prolonged influenza-like illness

    Directory of Open Access Journals (Sweden)

    Katherine E. Arden

    2015-12-01

    Deep sequencing allowed identification and genomic characterisation of a possible pathogen from an ILI as well as being an important tool to aid future understanding of the linkages between viral genetic variation, transmission and disease prognosis.

  12. Lectin cDNA and transgenic plants derived therefrom

    Science.gov (United States)

    Raikhel, Natasha V.

    2000-10-03

    Transgenic plants containing cDNA encoding Gramineae lectin are described. The plants preferably contain cDNA coding for barley lectin and store the lectin in the leaves. The transgenic plants, particularly the leaves, exhibit insecticidal and fungicidal properties.

  13. cDNA table - RPD | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available of data contents Results of homology search to cDNA clones in the KOME. Data file File name: rpd_cdna.zip F...ile URL: ftp://ftp.biosciencedbc.jp/archive/rpd/LATEST/rpd_cdna.zip File size: 15 KB Simple search URL http:...//togodb.biosciencedbc.jp/togodb/view/rpd_cdna#en Data acquisition method - Data

  14. Enhanced arbovirus surveillance with deep sequencing: Identification of novel rhabdoviruses and bunyaviruses in Australian mosquitoes.

    Science.gov (United States)

    Coffey, Lark L; Page, Brady L; Greninger, Alexander L; Herring, Belinda L; Russell, Richard C; Doggett, Stephen L; Haniotis, John; Wang, Chunlin; Deng, Xutao; Delwart, Eric L

    2014-01-05

    Viral metagenomics characterizes known and identifies unknown viruses based on sequence similarities to any previously sequenced viral genomes. A metagenomics approach was used to identify virus sequences in Australian mosquitoes causing cytopathic effects in inoculated mammalian cell cultures. Sequence comparisons revealed strains of Liao Ning virus (Reovirus, Seadornavirus), previously detected only in China, livestock-infecting Stretch Lagoon virus (Reovirus, Orbivirus), two novel dimarhabdoviruses, named Beaumont and North Creek viruses, and two novel orthobunyaviruses, named Murrumbidgee and Salt Ash viruses. The novel virus proteomes diverged by ≥ 50% relative to their closest previously genetically characterized viral relatives. Deep sequencing also generated genomes of Warrego and Wallal viruses, orbiviruses linked to kangaroo blindness, whose genomes had not been fully characterized. This study highlights viral metagenomics in concert with traditional arbovirus surveillance to characterize known and new arboviruses in field-collected mosquitoes. Follow-up epidemiological studies are required to determine whether the novel viruses infect humans. © 2013 Elsevier Inc. All rights reserved.

  15. 3' terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing.

    Science.gov (United States)

    Goldfarb, Katherine C; Cech, Thomas R

    2013-09-21

    Post-transcriptional 3' end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3' RACE coupled with high-throughput sequencing to characterize the 3' terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. The 3' terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3' terminus of an in vitro transcribed MRP RNA control and the differing 3' terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). 3' RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3' terminal sequences of noncoding RNAs.

  16. Human α2-HS-glycoprotein: the A and B chains with a connecting sequence are encoded by a single mRNA transcript

    International Nuclear Information System (INIS)

    Lee, C.C.; Bowman, B.H.; Yang, F.

    1987-01-01

    The α2-HS-glycoprotein (AHSG) is a plasma protein reported to play roles in bone mineralization and in the immune response. It is composed of two subunits, the A and B chains. Recombinant plasmids containing human cDNA AHSG have been isolated by screening an adult human liver library with a mixed oligonucleotide probe. The cDNA clones containing AHSG inserts span approximately 1.5 kilobase pairs and include the entire AHSG coding sequence, demonstrating that the A and B chains are encoded by a single mRNA transcript. The cDNA sequence predicts an 18-amino-acid signal peptide, followed by the A-chain sequence of AHSG. A heretofore unseen connecting sequence of 40 amino acids was deduced between the A- and B-chain sequences. The connecting sequence demonstrates the unique amino acid doublets and collagen triplets found in the A and B chains; it is not homologous with other reported amino acid sequences. The connecting sequence may be cleaved in a posttranslational step by limited proteolysis before mature AHSG is released into the circulation or may vary in its presence because of alternative processing. The AHSG cDNA was utilized for mapping the AHSG gene to the 3q21→qter region of human chromosome 3. The availability of the AHSG cDNA clone will facilitate the analysis of its genetic control and gene expression during development and bone formation.

  17. cDNA cloning of porcine brain prolyl endopeptidase and identification of the active-site seryl residue

    Energy Technology Data Exchange (ETDEWEB)

    Rennex, D.; Hemmings, B.A.; Hofsteenge, J.; Stone, S.R. (Friedrich Miescher-Institut, Basel (Switzerland))

    1991-02-26

    Prolyl endopeptidase is a cytoplasmic serine protease. The enzyme was purified from porcine kidney, and oligonucleotides based on peptide sequences from this protein were used to isolate a cDNA clone from a porcine brain library. This clone contained the complete coding sequence of prolyl endopeptidase and encoded a polypeptide with a molecular mass of 80751 Da. The deduced amino acid sequence of prolyl endopeptidase showed no sequence homology with other known serine proteases. [3H]Diisopropyl fluorophosphate was used to identify the active-site serine of prolyl endopeptidase. One labeled peptide was isolated and sequenced. The sequence surrounding the active-site serine was Asn-Gly-Gly-Ser-Asn-Gly-Gly. This sequence is different from the active-site sequences of other known serine proteases. This difference and the lack of overall homology with the known families of serine proteases suggest that prolyl endopeptidase represents a new type of serine protease.

  18. Paramyosin from the parasitic mite Sarcoptes scabiei: cDNA cloning and heterologous expression.

    Science.gov (United States)

    Mattsson, J G; Ljunggren, E L; Bergström, K

    2001-05-01

    The burrowing mite Sarcoptes scabiei is the causative agent of the highly contagious disease sarcoptic mange or scabies. So far, there is no in vitro propagation system for S. scabiei available, and mites used for various purposes must be isolated from infected hosts. Lack of parasite-derived material has limited the possibilities to study several aspects of scabies, including pathogenesis and immunity. It has also hampered the development of high performance serological assays. We have now constructed an S. scabiei cDNA expression library with mRNA purified from mites isolated from red foxes. Immunoscreening of the library enabled us to clone a full-length cDNA coding for a 102.5 kDa protein. Sequence similarity searches identified the protein as a paramyosin. Recombinant S. scabiei paramyosin expressed in Escherichia coli was recognized by sera from dogs and swine infected with S. scabiei. We also designed a small paramyosin construct of about 17 kDa that included the N-terminal part, an evolutionary variable part of the helical core, and the C-terminal part of the molecule. The miniaturized protein was efficiently expressed in E. coli and was recognized by sera from immunized rabbits. These data demonstrate that the cDNA library can assist in the isolation of important S. scabiei antigens and that recombinant proteins can be useful for the study of scabies.

  19. Preparation of fluorescent-dye-labeled cDNA from RNA for microarray hybridization.

    Science.gov (United States)

    Ares, Manuel

    2014-01-01

    This protocol describes how to prepare fluorescently labeled cDNA for hybridization to microarrays. It consists of two steps: first, a mixture of anchored oligo(dT) and random hexamers is used to prime amine-modified cDNA synthesis by reverse transcriptase using a modified deoxynucleotide with a reactive amine group (aminoallyl-dUTP) and an RNA sample as a template. Second, the cDNA is purified and exchanged into bicarbonate buffer so that the amine groups in the cDNA react with the dye N-hydroxysuccinimide (NHS) esters, covalently joining the dye to the cDNA. The dye-coupled cDNA is purified again, and the amount of dye incorporated per microgram of cDNA is determined.

  20. Complete coding sequence of the human raf oncogene and the corresponding structure of the c-raf-1 gene

    Energy Technology Data Exchange (ETDEWEB)

    Bonner, T I; Oppermann, H; Seeburg, P; Kerby, S B; Gunnell, M A; Young, A C; Rapp, U R

    1986-01-24

    The complete 648 amino acid sequence of the human raf oncogene was deduced from the 2977 nucleotide sequence of a fetal liver cDNA. The cDNA has been used to obtain clones which extend the human c-raf-1 locus by an additional 18.9 kb at the 5' end and contain all the remaining coding exons.

  1. Molecular cloning of a cDNA encoding the precursor of adenoregulin from frog skin. Relationships with the vertebrate defensive peptides, dermaseptins.

    Science.gov (United States)

    Amiche, M; Ducancel, F; Lajeunesse, E; Boulain, J C; Ménez, A; Nicolas, P

    1993-03-31

    Adenoregulin has recently been isolated from Phyllomedusa skin as a 33-amino-acid peptide which enhanced binding of agonists to the A1 adenosine receptor. In order to study the structure of the precursor of adenoregulin we constructed a cDNA library from mRNAs extracted from the skin of Phyllomedusa bicolor. We detected the complete nucleotide sequence of a cDNA encoding the adenoregulin biosynthetic precursor. The deduced sequence of the precursor is 81 amino acids long, exhibits a putative signal sequence at the NH2 terminus and contains a single copy of the biologically active peptide at the COOH terminus. Structural and conformational homologies that are observed between adenoregulin and the dermaseptins, antimicrobial peptides exhibiting strong membranolytic activities against various pathogenic agents, suggest that adenoregulin is an additional member of the growing family of cytotropic antimicrobial peptides that allow vertebrate animals to defend themselves against microorganisms. As such, the adenosine receptor regulating activity of adenoregulin could be due to its ability to interact with and disrupt membrane lipid bilayers.

  2. Functional cloning using pFB retroviral cDNA expression libraries.

    Science.gov (United States)

    Felts, Katherine A; Chen, Keith; Zaharee, Kim; Sundar, Latha; Limjoco, Jamie; Miller, Anna; Vaillancourt, Peter

    2002-09-01

    Retroviral cDNA expression libraries allow the efficient introduction of complex cDNA libraries into virtually any mitotic cell type for screening based on gene function. The cDNA copy number per cell can be easily controlled by adjusting the multiplicity of infection, thus cell populations may be generated in which >90% of infected cells contain one to three cDNAs. We describe the isolation of two known oncogenes and one cell-surface receptor from a human Burkitt's lymphoma (Daudi) cDNA library inserted into the high-titer retroviral vector pFB.
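
    The relationship between multiplicity of infection (MOI) and cDNA copy number per cell can be illustrated with a small calculation. The sketch below assumes that integrations per cell follow a Poisson distribution with mean equal to the MOI; that model and the function names are illustrative assumptions, not something stated in the record.

```python
import math


def poisson_pmf(k: int, moi: float) -> float:
    """Probability that a cell receives exactly k cDNA copies (Poisson assumption)."""
    return math.exp(-moi) * moi ** k / math.factorial(k)


def fraction_one_to_three(moi: float) -> float:
    """Among infected cells (k >= 1), the fraction carrying one to three cDNAs."""
    infected = 1.0 - poisson_pmf(0, moi)
    return sum(poisson_pmf(k, moi) for k in (1, 2, 3)) / infected


if __name__ == "__main__":
    # Hypothetical MOI values, chosen only to show the trend.
    for moi in (0.5, 1.0, 2.0):
        print(f"MOI {moi}: {fraction_one_to_three(moi):.3f} of infected cells carry 1-3 cDNAs")
```

    Under this assumption, lower MOI values keep most infected cells within the one-to-three copy range, at the cost of leaving more cells uninfected.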

  3. Cloning and analysis of the mouse Fanconi anemia group a cDNA and an overlapping penta zinc finger cDNA

    NARCIS (Netherlands)

    Wong, JCY; Alon, N; Norga, K; Kruyt, FAE; Youssoufian, H; Buchwald, M

    2000-01-01

    Despite the cloning of four disease-associated genes for Fanconi anemia (FA), the molecular pathogenesis of FA remains largely unknown. To study FA complementation group A using the mouse as a model system, we cloned and characterized the mouse homolog of the human FANCA cDNA. The mouse cDNA

  4. Cloning and sequencing of growth hormone gene of Iranian Lori Bakhtiari sheep

    Directory of Open Access Journals (Sweden)

    M Dayani-Nia

    2010-05-01

    Full Text Available Growth hormone (GH) is a peptide hormone that stimulates growth and cell reproduction in humans and animals. It is a 191-amino acid, single-chain polypeptide hormone which is synthesized, stored, and secreted by the somatotroph cells within the lateral wings of the anterior pituitary gland. The goal of this research was to clone and sequence the growth hormone of the Lori Bakhtiari sheep breed in Iran. For this purpose, RNA was extracted from the pituitary gland of freshly slaughtered sheep and growth hormone cDNA was produced. The T/A cloning technique was used to clone the growth hormone cDNA, and the synthesized construct was then transferred into E. coli as the host. Once the correct recombinants had been confirmed by colony PCR or restriction enzyme digestion, sequencing was done. The sequencing results showed that the length of the sheep growth hormone cDNA was 690 bp. Comparison of the growth hormone sequence inside the synthesized construct with those recorded in GenBank (NCBI, BLAST) indicated high degrees of similarity between Iranian native sheep and other sheep breeds of the world.

  5. Deep sequencing analysis of HBV genotype shift and correlation with antiviral efficiency during adefovir dipivoxil therapy.

    Directory of Open Access Journals (Sweden)

    Yuwei Wang

    Full Text Available Viral genotype shift in chronic hepatitis B (CHB) patients during antiviral therapy has been reported, but the underlying mechanism remains elusive. 38 CHB patients treated with ADV for one year were selected for studying genotype shift by both deep sequencing and Sanger sequencing methods. The Sanger sequencing method found that 7.9% of patients showed mixed genotype before ADV therapy. In contrast, all 38 patients showed mixed genotype before ADV treatment by deep sequencing. A 95.5% mixed genotype rate was also obtained from an additional 200 treatment-naïve CHB patients. Of the 13 patients with genotype shift, the fraction of the minor genotype in 5 patients (38%) increased gradually during the course of ADV treatment. Furthermore, responses to ADV and HBeAg seroconversion were associated with the high rate of genotype shift, suggesting drug and immune pressure may be key factors to induce genotype shift. Interestingly, patients with genotype C had a significantly higher rate of genotype shift than genotype B. In the genotype shift group, ADV treatment induced a marked enhancement of genotype B ratio accompanied by a reduction of genotype C ratio, suggesting genotype C may be more sensitive to ADV than genotype B. Moreover, patients with dominant genotype C may have a better therapeutic effect. Finally, genotype shift was correlated with clinical improvement in terms of ALT. Our findings provided a rational explanation for genotype shift among ADV-treated CHB patients. The genotype and genotype shift might be associated with antiviral efficiency.

  6. Lactation transcriptomics in the Australian marsupial, Macropus eugenii: transcript sequencing and quantification

    Directory of Open Access Journals (Sweden)

    Whitley Jane C

    2007-11-01

    Full Text Available Abstract Background Lactation is an important aspect of mammalian biology and, amongst mammals, marsupials show one of the most complex lactation cycles. Marsupials, such as the tammar wallaby (Macropus eugenii), give birth to a relatively immature newborn, and progressive changes in milk composition and milk production regulate early stage development of the young. Results In order to investigate gene expression in the marsupial mammary gland during lactation, a comprehensive set of cDNA libraries was derived from lactating tissues throughout the lactation cycle of the tammar wallaby. A total of 14,837 expressed sequence tags were produced by cDNA sequencing. Sequence analysis and sequence assembly were used to construct a comprehensive catalogue of mammary transcripts. Sequence data from pregnant and early or late lactating specific cDNA libraries and data from early or late lactation massively parallel sequencing strategies were combined to analyse the variation of milk protein gene expression during the lactation cycle. Conclusion Results show a steady increase in expression of genes coding for secreted protein during the lactation cycle that is associated with a high proportion of transcripts coding for milk proteins. In addition, genes involved in immune function, translation and energy or anabolic metabolism are expressed across the lactation cycle. A number of potential new milk proteins or mammary gland remodelling markers, including noncoding RNAs, have been identified.

  7. Comparative analysis of transcriptomes in aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing

    Directory of Open Access Journals (Sweden)

    Taketo Okada

    2016-12-01

    Full Text Available Ephedra plants are taxonomically classified as gymnosperms, and are medicinally important as the botanical origin of crude drugs and as bioresources that contain pharmacologically active chemicals. Here we show a comparative analysis of the transcriptomes of aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing by RNA-Seq. De novo assembly of short cDNA sequence reads generated 23,358, 13,373, and 28,579 contigs longer than 200 bases from aerial stems, roots, or both aerial stems and roots, respectively. The presumed functions encoded by these contig sequences were annotated by BLAST (blastx). Subsequently, these contigs were classified based on gene ontology slims, Enzyme Commission numbers, and the InterPro database. Furthermore, comparative gene expression analysis was performed between aerial stems and roots. These transcriptome analyses revealed differences and similarities between the transcriptomes of aerial stems and roots in E. sinica. Deep transcriptome sequencing of Ephedra should open the door to molecular biological studies based on the entire transcriptome, tissue- or organ-specific transcriptomes, or targeted genes of interest.

  8. The subclonal structure and genomic evolution of oral squamous cell carcinoma revealed by ultra-deep sequencing

    DEFF Research Database (Denmark)

    Tabatabaeifar, Siavosh; Thomassen, Mads; Larsen, Martin J

    2017-01-01

    Recent studies suggest that head and neck squamous cell carcinomas are very heterogeneous between patients; however the subclonal structure remains unexplored mainly due to studies using only a single biopsy per patient. To deconvolute the clonal structure and describe the genomic cancer evolution......, we applied whole-exome sequencing combined with ultra-deep targeted sequencing on oral squamous cell carcinomas (OSCC). From each patient, a set of biopsies was sampled from distinct geographical sites in primary tumor and lymph node metastasis. We demonstrate that the included OSCCs show a high...

  9. Improved detection of CXCR4-using HIV by V3 genotyping: application of population-based and "deep" sequencing to plasma RNA and proviral DNA.

    Science.gov (United States)

    Swenson, Luke C; Moores, Andrew; Low, Andrew J; Thielen, Alexander; Dong, Winnie; Woods, Conan; Jensen, Mark A; Wynhoven, Brian; Chan, Dennison; Glascock, Christopher; Harrigan, P Richard

    2010-08-01

    Tropism testing should rule out CXCR4-using HIV before treatment with CCR5 antagonists. Currently, the recombinant phenotypic Trofile assay (Monogram) is most widely utilized; however, genotypic tests may represent alternative methods. Independent triplicate amplifications of the HIV gp120 V3 region were made from either plasma HIV RNA or proviral DNA. These underwent standard, population-based sequencing with an ABI3730 (RNA n = 63; DNA n = 40), or "deep" sequencing with a Roche/454 Genome Sequencer-FLX (RNA n = 12; DNA n = 12). Position-specific scoring matrices (PSSMX4/R5) (-6.96 cutoff) and geno2pheno[coreceptor] (5% false-positive rate) inferred tropism from V3 sequence. These methods were then independently validated with a separate, blinded dataset (n = 278) of screening samples from the maraviroc MOTIVATE trials. Standard sequencing of HIV RNA with PSSM yielded 69% sensitivity and 91% specificity, relative to Trofile. The validation dataset gave 75% sensitivity and 83% specificity. Proviral DNA plus PSSM gave 77% sensitivity and 71% specificity. "Deep" sequencing of HIV RNA detected >2% inferred-CXCR4-using virus in 8/8 samples called non-R5 by Trofile, and <2% in 4/4 samples called R5. Triplicate analyses of V3 standard sequence data detect greater proportions of CXCR4-using samples than previously achieved. Sequencing proviral DNA and "deep" V3 sequencing may also be useful tools for assessing tropism.
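
    The sensitivity and specificity figures quoted above follow from simple counts against a reference assay. As a rough sketch, the snippet below applies the standard definitions to the deep-sequencing counts reported in the abstract (8/8 Trofile non-R5 samples above the 2% threshold, 4/4 Trofile R5 samples below it). The function names are hypothetical, and treating Trofile as the reference standard is the abstract's framing rather than an independent claim.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of reference-positive (non-R5) samples that the test also calls non-R5."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of reference-negative (R5) samples that the test also calls R5."""
    return true_neg / (true_neg + false_pos)


def call_non_r5(percent_x4: float, cutoff_percent: float = 2.0) -> bool:
    """Call a sample non-R5 when inferred CXCR4-using virus exceeds the cutoff (2% in the abstract)."""
    return percent_x4 > cutoff_percent


# Counts taken from the abstract's deep-sequencing comparison with Trofile.
deep_sensitivity = sensitivity(true_pos=8, false_neg=0)
deep_specificity = specificity(true_neg=4, false_pos=0)

if __name__ == "__main__":
    print(f"deep sequencing: sensitivity={deep_sensitivity:.2f}, specificity={deep_specificity:.2f}")
```

    With only 12 samples in the deep-sequencing comparison, these point estimates carry wide uncertainty, which is consistent with the abstract's cautious "may also be useful" wording.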

  10. cDNA library information - Dicty_cDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Dicty_cDB cDNA library information Data detail Data name cDNA library information DOI 10.189...s Data item Description cDNA library name Names of cDNA libraries (AF, AH, CF, CH, FC, FC-IC, FCL, SF, SH, S...(C) 5) sexually fusion-competent KAX3 cells (Gamete phase) (F) cDNA library construction method How to construct cDNA library...dir) 2) Full-length cDNA libraries (oligocapped method)(fl) 3) Gamete-specific subtraction library (sub) cDNA library... construction protocol Link to the webpage describing the protocol for generating cDNA library Size

  11. cDNA cloning and expression analyses of the isoflavone reductase-like gene of Dendrobium officinale

    International Nuclear Information System (INIS)

    Qian, X.; Xu, S.Z.

    2015-01-01

    The full-length cDNA of the isoflavone reductase-like gene (IRL) of Dendrobium officinale was cloned by reverse transcription (RT) PCR combined with a cDNA library, the IRL function was characterized by bioinformatics and prokaryotic expression analyses, and IRL expression levels in the organs and tissues of D. officinale plants of different ages were determined by real-time quantitative PCR (RT-qPCR). The results indicated that the full-length cDNA of D. officinale IRL, DoIRL, was 1238 bp (accession no. KJ661023). Its open reading frame (ORF) was 930 bp, which encoded 309 amino acids with a predicted molecular mass of 34 kDa; the 5' untranslated region (UTR) was 61 bp and the 3' UTR, containing a poly(A) tail, was 247 bp. The deduced amino acid sequence of DoIRL was predicted to contain a NAD(P)H-binding motif (GGTGYIG) in the N-terminal region, two conserved N-glycosylation sites, a conserved nitrogen metabolite repression regulator (NmrA) domain and a phenylcoumaran benzylic ether reductase (PCBER) domain, to have its nearest phylogenetic relationship with the PCBER of Striga asiatica, and to share 73% identity with the isoflavone reductase-like proteins (IRLs) of both Cucumis sativus and Striga asiatica. In Escherichia coli 'BL21' cells, expression of the DoIRL cDNA produced a protein band of the predicted molecular mass of 34 kDa. DoIRL was expressed in all organs and tissues of D. officinale plants of different ages at comparatively low levels, and the expression level in the leaves of the two-year-old plants was the highest. (author)
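
    The lengths reported for DoIRL can be cross-checked with a few lines of arithmetic. The sketch below assumes that the 930-bp ORF figure includes the stop codon and uses the common rule of thumb of roughly 110 Da per amino acid residue; both are assumptions made for illustration, not statements from the record.

```python
# Figures taken from the abstract.
UTR5_BP = 61
ORF_BP = 930
UTR3_BP = 247

# Rule-of-thumb average residue mass; an approximation, not a measured value.
AVG_RESIDUE_MASS_DA = 110

total_bp = UTR5_BP + ORF_BP + UTR3_BP   # full-length cDNA
codons = ORF_BP // 3                    # codons in the ORF
residues = codons - 1                   # assumes the ORF length counts the stop codon
approx_mass_kda = residues * AVG_RESIDUE_MASS_DA / 1000

if __name__ == "__main__":
    print(f"total length: {total_bp} bp")
    print(f"encoded residues: {residues}")
    print(f"approximate mass: {approx_mass_kda:.1f} kDa")
```

    Under these assumptions the three segments add up to the reported 1238 bp, the ORF yields 309 residues, and the estimated mass lands close to the 34 kDa band seen in E. coli.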

  12. Complete cDNA sequence of human complement C1s and close physical linkage of the homologous genes C1s and C1r

    International Nuclear Information System (INIS)

    Tosi, M.; Duponchel, C.; Meo, T.; Julier, C.

    1987-01-01

    Overlapping molecular clones encoding the complement subcomponent C1s were isolated from a human liver cDNA library. The nucleotide sequence reconstructed from these clones spans about 85% of the length of the liver C1s messenger RNAs, which occur in three distinct size classes around 3 kilobases in length. Comparisons with the sequence of C1r, the other enzymatic subcomponent of C1, reveal 40% amino acid identity and conservation of all the cysteine residues. Besides the serine protease domain, the following sequence motifs, previously described in C1r, were also found in C1s: (a) two repeats of the type found in the Ba fragment of complement factor B and in several other complement but also noncomplement proteins, (b) a cysteine-rich segment homologous to the repeats of epidermal growth factor precursor, and (c) a duplicated segment found only in C1r and C1s. Differences in each of these structural motifs provide significant clues for the interpretation of the functional divergence of these interacting serine protease zymogens. Hybridizations of C1r and C1s probes to restriction endonuclease fragments of genomic DNA demonstrate close physical linkage of the corresponding genes. The implications of this finding are discussed with respect to the evolution of C1r and C1s after their origin by tandem gene duplication and to the previously observed combined hereditary deficiencies of C1r and C1s.

  13. cDNA library from the latex of Hevea brasiliensis

    Directory of Open Access Journals (Sweden)

    Wilaiwan Chotigeat

    2010-12-01

    Full Text Available Latex from Hevea brasiliensis contains 30-50% (w/w) of natural rubber (cis-1,4-polyisoprene), the important raw material for many rubber industries. We have constructed a cDNA library from the latex of H. brasiliensis to investigate the expressed genes and molecular events in the latex. We analyzed 412 expressed sequence tags (ESTs). More than 90% of the EST clones showed homology to previously described sequences in public databases. Functional classification of the ESTs showed that the largest category was proteins of unknown function (30.1%); 11.4% of ESTs encoded rubber synthesis-related proteins (RS) and 8.5% encoded defense- or stress-related proteins (DS). Those with no significant homology to known sequences (NSH) accounted for 8.7%; primary metabolism (PM) and gene expression and RNA metabolism accounted for 7.8% and 6.6%, respectively. Other categories included protein synthesis-related proteins (6.6%), chromatin and DNA metabolism (CDM, 3.9%), energy metabolism (EM, 3.4%), cellular transport (CT, 3.2%), cell structure (CS, 3.2%), signal transduction (ST, 2.2%), secondary metabolism (SM, 1.7%), protein fate (PF, 2.2%), and reproductive proteins (RP, 0.7%).

  14. cDNA library Table - KAIKOcDNA | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available c00951-005 Description of data contents List of Bombyx mori cDNA libraries. Data file File name: kaiko_cdna_...library.zip File URL: ftp://ftp.biosciencedbc.jp/archive/kaiko-cdna/LATEST/kaiko_cdna_library.zip File size:... 4.8 KB Simple search URL http://togodb.biosciencedbc.jp/togodb/view/kaiko_cdna_l

  15. The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database

    Energy Technology Data Exchange (ETDEWEB)

    Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika; Tanaka, Yoshihiro; Teranishi, Kristen S.; Sunagawa, Shinichi; Wong, Mike; Stillman, Jonathon H.

    2010-01-27

    Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences is important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~30K unique sequences (UniSeqs) representing ~19K clusters were generated from ~98K high quality ESTs from a set of tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66% of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases. Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is as diverse as the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in

  16. Isolation of sucrose transporter (SUT) cDNA from the stem of sugarcane (Saccharum officinarum L.)

    Directory of Open Access Journals (Sweden)

    - Slameto

    2010-09-01

    Full Text Available Sucrose transporter (SUT) is a transporter protein that controls sucrose translocation. It mediates the movement of sucrose from the apoplast to the symplast and facilitates sucrose transport from vascular tissues to parenchyma cells in the nodes of the sugarcane stem. This research aimed to isolate SUT cDNA from sugarcane stem and clone it in Escherichia coli strain DH5α. Total RNA was isolated from sugarcane stem by the single-step method, and oligo(dT) was added to obtain first-strand SUT cDNA, which was then used as the template for PCR. The PCR primers were 5'-ggg ctg att gtg gcc atg tc-3' (SUT-F) and 5'-tgc cct ttg tct ccg gaa cc-3' (SUT-R). The PCR program was as follows: denaturation at 94°C for 2 min and 30 s, annealing at 54°C for 30 s, extension at 72°C for 2 min and 7 min, and an indefinite hold at 4°C, run for 30 cycles. The SUT cDNA obtained by PCR was ligated into the pTOPO blunt-end vector and then cloned into E. coli strain DH5α. The clones were sequenced so that homology with nucleotide sequences from other plants could be examined using the BLASTn program against GenBank (NCBI); the level of homology was determined with the Genetyx program. The concentration of the isolated total RNA was 5.024 μg/μl, with a purity of 1.85. PCR with the two primers yielded a SUT cDNA fragment of 2037 bp. The SUT cDNA fragment was confirmed by analysis with several restriction enzymes, e.g. EcoRI, PstI and BamHI. The SUT cDNA fragment was 100% homologous to SoSUT 2A of sugarcane stem and 84% homologous to OsSUT of rice (Casu et al., 2003).
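
    As a quick sanity check on the two primers quoted above, the following minimal Python sketch computes their length, GC fraction and a rough melting temperature. It is not taken from the record: the Wallace rule (2 °C per A/T, 4 °C per G/C) is a swapped-in rule of thumb that is only approximate for oligonucleotides of this length, and the function names are hypothetical.

```python
# Primer sequences as quoted in the abstract, with the spaces removed.
PRIMERS = {
    "SUT-F": "gggctgattgtggccatgtc",
    "SUT-R": "tgccctttgtctccggaacc",
}


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = seq.lower()
    return (seq.count("g") + seq.count("c")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule.

    Assumption: this rule of thumb is used only for illustration; the
    abstract does not say how the 54 degrees C annealing step was chosen.
    """
    seq = seq.lower()
    at = seq.count("a") + seq.count("t")
    gc = seq.count("g") + seq.count("c")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "nt",
              f"GC={gc_fraction(seq):.0%}",
              f"Tm~{wallace_tm(seq)} C")
```

    Both primers are 20 nt long with the same base composition class (12 G/C and 8 A/T), so this crude estimate gives the same value for each; a nearest-neighbour model would be the more usual choice for real primer design.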

  17. Identification of miRNAs and their target genes in developing soybean seeds by deep sequencing

    Directory of Open Access Journals (Sweden)

    Chen Shou-Yi

    2011-01-01

    Full Text Available Abstract Background MicroRNAs (miRNAs) regulate gene expression by mediating gene silencing at transcriptional and post-transcriptional levels in higher plants. miRNAs and related target genes have been widely studied in model plants such as Arabidopsis and rice; however, the number of identified miRNAs in soybean (Glycine max) is limited, and global identification of the related miRNA targets has not been reported in previous research. Results In our study, a small RNA library and a degradome library were constructed from developing soybean seeds for deep sequencing. We identified 26 new miRNAs in soybean by bioinformatic analysis and further confirmed their expression by stem-loop RT-PCR. The miRNA star sequences of 38 known miRNAs and 8 new miRNAs were also discovered, providing additional evidence for the existence of miRNAs. Through degradome sequencing, 145 and 25 genes were identified as targets of annotated miRNAs and new miRNAs, respectively. GO analysis indicated that many of the identified miRNA targets may function in soybean seed development. Additionally, a soybean homolog of Arabidopsis SUPPRESSOR OF GENE SILENCING 3 (AtSGS3) was detected as a target of the newly identified miRNA Soy_25, suggesting the presence of feedback control of miRNA biogenesis. Conclusions We have identified large numbers of miRNAs and their related target genes through deep sequencing of a small RNA library and a degradome library. Our study provides more information about the regulatory network of miRNAs in soybean and advances our understanding of miRNA functions during seed development.

  18. Deep RNA sequencing of the skeletal muscle transcriptome in swimming fish.

    Directory of Open Access Journals (Sweden)

    Arjan P Palstra

    Full Text Available Deep RNA sequencing (RNA-seq) was performed to provide an in-depth view of the transcriptome of red and white skeletal muscle of exercised and non-exercised rainbow trout (Oncorhynchus mykiss) with the specific objective to identify expressed genes and quantify the transcriptomic effects of swimming-induced exercise. Pubertal autumn-spawning seawater-raised female rainbow trout were rested (n = 10) or swum (n = 10) for 1176 km at 0.75 body-lengths per second in a 6,000-L swim-flume under reproductive conditions for 40 days. Red and white muscle RNA of exercised and non-exercised fish (4 lanes) was sequenced and resulted in 15-17 million reads per lane that, after de novo assembly, yielded 149,159 red and 118,572 white muscle contigs. Most contigs were annotated using an iterative homology search strategy against salmonid ESTs, the zebrafish Danio rerio genome and general Metazoan genes. When selecting for large contigs (>500 nucleotides), a number of novel rainbow trout gene sequences were identified in this study: 1,085 and 1,228 novel gene sequences for red and white muscle, respectively, which included a number of important molecules for skeletal muscle function. Transcriptomic analysis revealed that sustained swimming increased transcriptional activity in skeletal muscle and specifically an up-regulation of genes involved in muscle growth and developmental processes in white muscle. The unique collection of transcripts will contribute to our understanding of red and white muscle physiology, specifically during the long-term reproductive migration of salmonids.

  19. Construction and application of a bovine immune-endocrine cDNA microarray.

    Science.gov (United States)

    Tao, Wenjing; Mallard, Bonnie; Karrow, Niel; Bridle, Byram

    2004-09-01

    A variety of commercial DNA arrays specific for humans and rodents are widely available; however, microarrays containing well-characterized genes to study pathway-specific gene expression are not as accessible for domestic animals, such as cattle, sheep and pigs. Therefore, a small-scale application-targeted bovine immune-endocrine cDNA array was developed to evaluate genetic pathways involved in the immune-endocrine axis of cattle during periods of altered homeostasis provoked by physiological or environmental stressors, such as infection, vaccination or disease. For this purpose, 167 cDNA sequences corresponding to immune, endocrine and inflammatory response genes were collected and categorized. Positive controls included 5 housekeeping genes (glyceraldehyde-3-phosphate dehydrogenase, hypoxanthine phosphoribosyltransferase, ribosomal protein L19, beta-actin, beta2-microglobulin) and bovine genomic DNA. Negative controls were a bacterial gene (Rhodococcus equi 17-kDa virulence-associated protein) and a partial sequence of the plasmid pACYC177. In addition, RNA extracted from un-stimulated, as well as superantigen (Staphylococcus aureus enterotoxin-A, S. aureus Cowan Pansorbin Cells) and mitogen-stimulated (LPS, ConA) bovine blood leukocytes was mixed, reverse transcribed and PCR amplified using gene-specific primers. The endocrine-associated genes were amplified from cDNA derived from un-stimulated bovine hypothalamus, pituitary, adrenal and thyroid gland tissues. The array was constructed in 4 repeating grids of 180 duplicated spots by coupling the PCR amplified 213-630 bp gene fragments onto poly-L-lysine coated glass slides. The bovine immune-endocrine arrays were standardized and preliminary gene expression profiles generated using Cy3 and Cy5 labelled cDNA from un-stimulated and ConA (5 μg/ml) stimulated PBMC of 4 healthy Holstein cows (2-4 replicate arrays/cow) in a time course study. Mononuclear cell-derived cytokine and chemokine (IL-2, IL-1alpha

  20. Comparison of next generation sequencing technologies for transcriptome characterization

    Directory of Open Access Journals (Sweden)

    Soltis Douglas E

    2009-08-01

    Full Text Available Abstract Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc (http://fgp.huck.psu.edu/NG_Sims/ngsim.pl), an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance

  1. A secreted aspartic proteinase from Glomerella cingulata: purification of the enzyme and molecular cloning of the cDNA.

    Science.gov (United States)

    Clark, S J; Templeton, M D; Sullivan, P A

    1997-04-01

    A secreted aspartic proteinase from Glomerella cingulata (GcSAP) was purified to homogeneity by ion exchange chromatography. The enzyme has an Mr of 36000 as estimated by SDS-PAGE, optimal activity from pH 3.5 to pH 4.0 and is inhibited by pepstatin. The N-terminal sequence, 23 residues long, was used to design a gene-specific primer. This was used in 3' RACE (rapid amplification of cDNA ends) PCR to amplify a 1.2 kb fragment of the gcsap cDNA. A second gene-specific primer was designed and used in 5' RACE PCR to clone the 5' region. This yielded a 600 bp DNA fragment and completed the open reading frame. The gcsap open reading frame encodes a protein with a 78 residue prepro-sequence typical of other fungal secreted aspartic proteinases. Based on the deduced sequence, the mature enzyme contains 329 amino acids and shows approximately 40% identity to other fungal aspartic proteinases. Subsequent cloning and sequencing of gcsap fragments obtained from PCR with genomic DNA revealed a 73 bp intron beginning at nt 728. Southern analyses at medium and high stringency indicated that G. cingulata possesses one gene for the secreted aspartic proteinase, and Northern blots indicated that gene expression was induced by exogenous protein and repressed by ammonium salts. GcSAP is a putative pathogenicity factor of G. cingulata, and it will now be possible to create SAP-mutants and assess the role GcSAP plays in pathogenicity.

  2. Microbial Dark Matter: Unusual intervening sequences in 16S rRNA genes of candidate phyla from the deep subsurface

    Energy Technology Data Exchange (ETDEWEB)

    Jarett, Jessica; Stepanauskas, Ramunas; Kieft, Thomas; Onstott, Tullis; Woyke, Tanja

    2014-03-17

    The Microbial Dark Matter project has sequenced genomes from over 200 single cells from candidate phyla, greatly expanding our knowledge of the ecology, inferred metabolism, and evolution of these widely distributed, yet poorly understood lineages. The second phase of this project aims to sequence an additional 800 single cells from known as well as potentially novel candidate phyla derived from a variety of environments. In order to identify whole genome amplified single cells, screening based on phylogenetic placement of 16S rRNA gene sequences is being conducted. Briefly, derived 16S rRNA gene sequences are aligned to a custom version of the Greengenes reference database and added to a reference tree in ARB using parsimony. In multiple samples from deep subsurface habitats but not from other habitats, a large number of sequences proved difficult to align and therefore to place in the tree. Based on comparisons to reference sequences and structural alignments using SSU-ALIGN, many of these "difficult" sequences appear to originate from candidate phyla, and contain intervening sequences (IVSs) within the 16S rRNA genes. These IVSs are short (39-79 nt) and do not appear to be self-splicing or to contain open reading frames. IVSs were found in the loop regions of stem-loop structures in several different taxonomic groups. Phylogenetic placement of sequences is strongly affected by IVSs; two out of three groups investigated were classified as different phyla after their removal. Based on data from samples screened in this project, IVSs appear to be more common in microbes occurring in deep subsurface habitats, although the reasons for this remain elusive.

  3. Cloning, sequencing and expression of a novel xylanase cDNA from ...

    African Journals Online (AJOL)

    A strain SH 2016, capable of producing xylanase, was isolated and identified as Aspergillus awamori, based on its physiological and biochemical characteristics as well as its ITS rDNA gene sequence analysis. A xylanase gene of 591 bp was cloned from this newly isolated A. awamori and the ORF sequence predicted a ...

  4. Molecular cloning of a cDNA and chromosomal localization of a human theta-class glutathione S-transferase gene (GSTT2) to chromosome 22

    Energy Technology Data Exchange (ETDEWEB)

    Tan, K.L.; Baker, R.T.; Board, P.G. [Australian National Univ., Canberra (Australia)] [and others]

    1995-01-20

    Until recently the Theta-class glutathione S-transferases (GSTs) were largely overlooked due to their low activity with the model substrate 1-chloro-2,4-dinitrobenzene (CDNB) and their failure to bind to immobilized glutathione affinity matrices. Little is known about the number of genes in this class. Recently, Pemble et al. reported the cDNA cloning of a human Theta-class GST, termed GSTT1. In this study, we describe the molecular cloning of a cDNA encoding a second human Theta-class GST (GSTT2) from a λgt11 human liver 5'-stretch cDNA library. The encoded protein contains 244 amino acids and has 78.3% sequence identity with the rat subunit 12 and only 55.0% identity with human GSTT1. GSTT2 has been mapped to chromosome 22 by somatic cell hybrid analysis. The precise position of the gene was localized to subband 22q11.2 by in situ hybridization. The absence of other regions of hybridization suggests that there are no closely related sequences (e.g., reverse transcribed pseudogenes) scattered throughout the genome and that if there are closely related genes, they must be clustered near GSTT2. Southern blot analysis of human DNA digested with BamHI shows that the size of the GSTT2 gene is relatively small, as the coding sequence falls within a 3.6-kb BamHI fragment. 35 refs., 6 figs.

  5. Construction of C35 gene bait recombinants and T47D cell cDNA library.

    Science.gov (United States)

    Yin, Kun; Xu, Chao; Zhao, Gui-Hua; Liu, Ye; Xiao, Ting; Zhu, Song; Yan, Ge

    2017-11-20

    C35 is a novel tumor biomarker associated with metastasis progression. To investigate the interaction factors of C35 in breast cancer cell lines that highly express it, we constructed bait recombinant plasmids of the C35 gene and a T47D cell cDNA library for yeast two-hybrid screening. Full-length C35 sequences were subcloned using RT-PCR from cDNA template extracted from T47D cells. Based on functional domain analysis, the full-length C35 (1-348 bp) was also truncated into two fragments, C35 (1-153 bp) and C35 (154-348 bp), to avoid auto-activation. The three kinds of C35 genes were successfully amplified and inserted into pGBKT7 to construct the bait recombinant plasmids pGBKT7-C35 (1-348 bp), pGBKT7-C35 (1-153 bp) and pGBKT7-C35 (154-348 bp), then transformed into Y187 yeast cells by the lithium acetate method. Auto-activation and toxicity of the C35 baits were detected using nutritionally deficient medium and X-α-Gal assays. The T47D cell ds cDNA was generated by SMART™ technology and the library was constructed using in vivo recombination-mediated cloning in the AH109 yeast strain using a pGADT7-Rec plasmid. The transformed Y187/pGBKT7-C35 (1-348 bp) line was intensively inhibited, while the truncated Y187/pGBKT7-C35 lines showed no auto-activation or toxicity in yeast cells. The titer of the established cDNA library was 2 × 10^7 pfu/mL with a high transformation efficiency of 1.4 × 10^6, and the insert size of the ds cDNA was distributed homogeneously between 0.5 and 2.0 kb. Our research generated a T47D cell cDNA library with high titer, and the two constructed C35 "baits" contained, respectively, a functional immunoreceptor tyrosine-based activation motif (ITAM) and the conserved last four amino acids Cys-Ile-Leu-Val (CILV) motif, and therefore laid a foundation for screening the C35 interaction factors in a BC cell line.
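
    The truncation coordinates quoted above can be checked with a short Python sketch. It assumes, as the abstract does not state explicitly, that the coordinates are 1-based and inclusive and that the reading frame starts at position 1; the function names are hypothetical and the placeholder sequence stands in for the real C35 coding sequence, which is not given in this record.

```python
# Fragment coordinates as quoted in the abstract (assumed 1-based, inclusive).
FRAGMENTS = {
    "full-length": (1, 348),
    "N-terminal": (1, 153),
    "C-terminal": (154, 348),
}


def fragment_slice(start: int, end: int) -> slice:
    """Convert 1-based inclusive bp coordinates to a 0-based Python slice."""
    return slice(start - 1, end)


def describe(fragments):
    """Return {name: (length_bp, whole_codons, starts_in_frame)}.

    whole_codons    -> fragment length is a multiple of three
    starts_in_frame -> fragment begins on a codon boundary, assuming the
                       reading frame starts at position 1 (an assumption)
    """
    rows = {}
    for name, (start, end) in fragments.items():
        length = end - start + 1
        rows[name] = (length, length % 3 == 0, (start - 1) % 3 == 0)
    return rows


if __name__ == "__main__":
    placeholder = "N" * 348  # stand-in for the real sequence
    for name, (start, end) in FRAGMENTS.items():
        piece = placeholder[fragment_slice(start, end)]
        print(name, len(piece), "bp", describe(FRAGMENTS)[name][1:])
```

    Under those assumptions the two truncated baits are 153 bp and 195 bp long, together cover the full 348 bp without overlap, and both begin on a codon boundary, which is what an in-frame fusion to the pGBKT7 DNA-binding domain would require.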

  6. Deep sequencing reveals persistence of cell-associated mumps vaccine virus in chronic encephalitis.

    Science.gov (United States)

    Morfopoulou, Sofia; Mee, Edward T; Connaughton, Sarah M; Brown, Julianne R; Gilmour, Kimberly; Chong, W K 'Kling'; Duprex, W Paul; Ferguson, Deborah; Hubank, Mike; Hutchinson, Ciaran; Kaliakatsos, Marios; McQuaid, Stephen; Paine, Simon; Plagnol, Vincent; Ruis, Christopher; Virasami, Alex; Zhan, Hong; Jacques, Thomas S; Schepelmann, Silke; Qasim, Waseem; Breuer, Judith

    2017-01-01

    Routine childhood vaccination against measles, mumps and rubella has virtually abolished virus-related morbidity and mortality. Notwithstanding this, we describe here devastating neurological complications associated with the detection of live-attenuated mumps virus Jeryl Lynn (MuV JL5) in the brain of a child who had undergone successful allogeneic transplantation for severe combined immunodeficiency (SCID). This is the first confirmed report of MuV JL5 associated with chronic encephalitis and highlights the need to exclude immunodeficient individuals from immunisation with live-attenuated vaccines. The diagnosis was only possible by deep sequencing of the brain biopsy. Sequence comparison of the vaccine batch to the MuV JL5 isolated from brain identified biased hypermutation, particularly in the matrix gene, similar to those found in measles from cases of SSPE. The findings provide unique insights into the pathogenesis of paramyxovirus brain infections.

  7. Sequencing Infrastructure Investments under Deep Uncertainty Using Real Options Analysis

    Directory of Open Access Journals (Sweden)

    Nishtha Manocha

    2018-02-01

    Full Text Available The adaptation tipping point and adaptation pathway approaches developed to make decisions under deep uncertainty do not shed light on which among the multiple available pathways should be chosen as the preferred pathway. This creates the need to extend these approaches by means of suitable tools that can help sequence actions and subsequently enable the outlining of relevant policies. This paper presents two sequencing approaches, namely, the “Build to Target” and “Build Up” approach, to aid in sub-selecting a set of preferred pathways. The two approaches differ in the levels of flexibility they offer. They are exemplified by means of two case studies wherein the Net Present Valuation and the Real Options Analysis are employed as selection criteria. The results demonstrate the benefit of these two approaches when used in conjunction with the adaptation pathways and show how the pathways selected by means of a Build to Target approach generally have a value greater than, or at least the same as, the pathways selected by the Build Up approach. Further, this paper also demonstrates the capacity of Real Options to quantify and capture the economic value of flexibility, which cannot be done by traditional valuation approaches such as Net Present Valuation.

  8. Characterization of gonadotrophin-releasing hormone precursor cDNA in the Old World mole-rat Cryptomys hottentotus pretoriae: high degree of identity with the New World guinea pig sequence.

    Science.gov (United States)

    Kalamatianos, T; du Toit, L; Hrabovszky, E; Kalló, I; Marsh, P J; Bennett, N C; Coen, C W

    2005-05-01

    Regulation of pituitary gonadotrophins by the decapeptide gonadotrophin-releasing hormone 1 (GnRH1) is crucial for the development and maintenance of reproductive functions. A common amino acid sequence for this decapeptide, designated as 'mammalian' GnRH, has been identified in all mammals thus far investigated with the exception of the guinea pig, in which there are two amino acid substitutions. Among hystricognath rodents, the members of the family Bathyergidae regulate reproduction in response to diverse cues. Thus, highveld mole-rats (Cryptomys hottentotus pretoriae) are social bathyergids in which breeding is restricted to a particular season in the dominant female, but continuously suppressed in subordinate colony members. Elucidation of reproductive control in these animals will be facilitated by characterization of their GnRH1 gene. A partial sequence of GnRH1 precursor cDNA was isolated and characterized. Comparative analysis revealed the highest degree of identity (86%) to guinea pig GnRH1 precursor mRNA. Nevertheless, the deduced amino acid sequence of the mole-rat decapeptide is identical to the 'mammalian' sequence rather than that of guinea pigs. Successful detection of GnRH1-synthesizing neurones using either a guinea pig GnRH1 riboprobe or an antibody against the 'mammalian' decapeptide is consistent with the guinea pig-like sequence for the precursor and the classic 'mammalian' form for the decapeptide. The high degree of identity in the GnRH1 precursor sequence between this Old World mole-rat and the New World guinea pig is consistent with the theory that caviomorphs and phiomorphs originated from a common ancestral line in the Palaeocene to mid Eocene, some 63-45 million years ago.

  9. Assignment of casein kinase 2 alpha sequences to two different human chromosomes

    DEFF Research Database (Denmark)

    Boldyreff, B; Klett, C; Göttert, E

    1992-01-01

    Human casein kinase 2 alpha gene (CK-2-alpha) sequences have been localized within the human genome by in situ hybridization and somatic cell hybrid analysis using a CK-2 alpha cDNA as a probe. By in situ hybridization, the CK-2 alpha cDNA could be assigned to two different loci, one on 11p15.1-ter...

  10. Anchoring a Defined Sequence to the 5' Ends of mRNAs: The Bolt to Clone Rare Full Length mRNAs and Generate cDNA Libraries from a Few Cells.

    Science.gov (United States)

    Baptiste, J; Milne Edwards, D; Delort, J; Mallet, J

    1993-01-01

    Among numerous applications, the polymerase chain reaction (PCR) (1,2) provides a convenient means to clone 5' ends of rare mRNAs and to generate cDNA libraries from tissue available in amounts too low to be processed by conventional methods. Basically, the amplification of cDNAs by the PCR requires the availability of the sequences of two stretches of the molecule to be amplified. A sequence can easily be imposed at the 5' end of the first-strand cDNAs (corresponding to the 3' end of the mRNAs) by priming the reverse transcription with a specific primer (for cloning the 5' end of rare messenger) or with an oligonucleotide tailored with a poly (dT) stretch (for cDNA library construction), taking advantage of the poly (A) sequence that is located at the 3' end of mRNAs. Several strategies have been devised to tag the 3' end of the ss-cDNAs (corresponding to the 5' end of the mRNAs). We (3) and others have described strategies based on the addition of a homopolymeric dG (4,5) or dA (6,7) tail using terminal deoxyribonucleotide transferase (TdT) ("anchor-PCR" [4]). However, this strategy has important limitations. The TdT reaction is difficult to control and has a low efficiency (unpublished observations). But most importantly, the return primers containing a homopolymeric (dC or dT) tail generate nonspecific amplifications, a phenomenon that prevents the isolation of low abundance mRNA species and/or interferes with the relative abundance of primary clones in the library. To circumvent these drawbacks, we have used two approaches. First, we devised a strategy based on a cRNA enrichment procedure, which has been useful to eliminate nonspecific-PCR products and to allow detection and cloning of cDNAs of low abundance (3). More recently, to avoid the nonspecific amplification resulting from the annealing of the homopolymeric tail oligonucleotide, we have developed a novel anchoring strategy that is based on the ligation of an oligonucleotide to the 3' end of ss

  11. Molecular cloning of a catalase cDNA from Nicotiana glutinosa L. and its repression by tobacco mosaic virus infection.

    Science.gov (United States)

    Yi, S Y; Yu, S H; Choi, D

    1999-06-30

    Recent reports revealed that catalase has a role in the plant defense mechanism against a broad range of pathogens through being inhibited by salicylic acid (SA). During an effort to clone disease resistance-responsive genes, a cDNA encoding catalase (Ngcat1; Nicotiana glutinosa cat1) was isolated from a tobacco cDNA library. In N. glutinosa, catalase is encoded by a small gene family. The deduced amino acid sequence of the Ngcat1 cDNA has 98% homology with the cat1 gene of N. plumbaginifolia. The Ngcat1 expression is controlled by the circadian clock, and its mRNA level is the most abundant in leaves. Both the expression of Ngcat1 mRNA and its enzyme activity in the tobacco plant undergoing a hypersensitive response (HR) to TMV infection were repressed. The repression of the mRNA level was also observed following treatment with SA. These results imply that SA may act as an inhibitor of catalase transcription during the HR of tobacco. Cloning and expression of the Ngcat1 in tobacco following pathogen infection and SA treatment are presented.

  12. Yoctomole electrochemical genosensing of Ebola virus cDNA by rolling circle and circle to circle amplification.

    Science.gov (United States)

    Carinelli, S; Kühnemund, M; Nilsson, M; Pividori, M I

    2017-07-15

    This work addresses the design of an Ebola diagnostic test involving a simple, rapid, specific and highly sensitive procedure based on isothermal amplification on magnetic particles with electrochemical readout. Ebola padlock probes were designed to detect a specific L-gene sequence present in the five most common Ebola species. Ebola cDNA was amplified by rolling circle amplification (RCA) on magnetic particles. Further re-amplification was performed by circle-to-circle amplification (C2CA) and the products were detected in a double-tagging approach using a biotinylated capture probe for immobilization on magnetic particles and a readout probe for electrochemical detection by square-wave voltammetry on commercial screen-printed electrodes. The electrochemical genosensor was able to detect as low as 200 ymol, corresponding to 120 cDNA molecules of L-gene Ebola virus with a limit of detection of 33 cDNA molecules. The isothermal double-amplification procedure by C2CA combined with the electrochemical readout and the magnetic actuation enables the high sensitivity, resulting in a rapid, inexpensive, robust and user-friendly sensing strategy that offers a promising approach for the primary care in low resource settings, especially in less developed countries.
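
    The correspondence between 200 ymol and about 120 molecules follows directly from the Avogadro constant, as the minimal Python sketch below shows. The function names are hypothetical; the only inputs are the figures quoted in the abstract and the SI value of the Avogadro constant.

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI definition)
YOCTO = 1e-24             # SI prefix "yocto"


def yoctomoles_to_molecules(ymol: float) -> float:
    """Number of molecules in the given amount of substance in yoctomoles."""
    return ymol * YOCTO * AVOGADRO


def molecules_to_yoctomoles(count: float) -> float:
    """Amount of substance, in yoctomoles, for a given number of molecules."""
    return count / AVOGADRO / YOCTO


if __name__ == "__main__":
    # Lowest amount detected, as quoted in the abstract.
    print(f"200 ymol ~ {yoctomoles_to_molecules(200):.1f} molecules")
    # Limit of detection quoted in the abstract, converted back to ymol.
    print(f"33 molecules ~ {molecules_to_yoctomoles(33):.1f} ymol")
```

    One yoctomole is roughly 0.6 of a molecule, so 200 ymol rounds to the 120 molecules stated in the abstract, and the quoted limit of detection of 33 molecules corresponds to roughly 55 ymol.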

  13. Characterization of a pollen-specific cDNA clone from Nicotiana tabacum expressed during microgametogenesis and germination.

    Science.gov (United States)

    Weterings, K; Reijnen, W; van Aarssen, R; Kortstee, A; Spijkers, J; van Herpen, M; Schrauwen, J; Wullems, G

    1992-04-01

    This report describes the isolation and characterization of a cDNA clone representing a gene specifically expressed in pollen. A cDNA library was constructed against mRNA from mature pollen of Nicotiana tabacum. It was screened differentially against cDNA from mRNA of leaf and of pollen. One clone, NTPc303, was further characterized. On northern blot this clone hybridizes to a transcript 2100 nucleotides in length. NTPc303 is abundant in pollen. Expression of the corresponding gene is restricted to pollen, because no other generative or vegetative tissue contains transcripts hybridizing to NTPc303. Expression of NTP303 is evolutionarily conserved: homologous transcripts are present in pollen from various plant species. The first NTP303 transcripts are detectable on northern blot at the early bi-nucleate stage and accumulate until the pollen has reached maturity. During germination and pollen tube growth in vitro new NTP303 transcripts appear. This transcription has been proved by northern blots as well as by pulse labelling experiments. Nucleotide sequence analysis revealed that NTPc303 has an open reading frame coding for a predicted protein of 62 kDa. This protein shares homology to ascorbate oxidase and other members of the blue copper oxidase family. A possible function for this clone during pollen germination is discussed.

  14. Cloning of the γ-aminobutyric acid (GABA) ρ1 cDNA: A GABA receptor subunit highly expressed in the retina

    International Nuclear Information System (INIS)

    Cutting, G.R.; Lu, Luo; Kasch, L.M.; Montrose-Rafizadeh, C.; Antonarakis, S.E.; Guggino, W.B.; Kazazian, H.H. Jr.; O'Hara, B.F.; Donovan, D.M.; Shimada, Shoichi; Uhl, G.R.

    1991-01-01

    Type A γ-aminobutyric acid (GABA-A) receptors are a family of ligand-gated chloride channels that are the major inhibitory neurotransmitter receptors in the nervous system. Molecular cloning has revealed diversity in the subunits that compose this heterooligomeric receptor, but each previously elucidated subunit displays amino acid similarity in conserved structural elements. The authors have used these highly conserved regions to identify additional members of this family by using the polymerase chain reaction (PCR). One PCR product was used to isolate a full-length cDNA from a human retina cDNA library. The mature protein predicted from this cDNA sequence is 458 amino acids long and displays between 30 and 38% amino acid similarity to the previously identified GABA-A subunits. This gene is expressed primarily in the retina but transcripts are also detected in the brain, lung, and thymus. Injection of Xenopus oocytes with RNA transcribed in vitro produces a GABA-responsive chloride conductance and expression of the cDNA in COS cells yields GABA-displaceable muscimol binding. These features are consistent with our identification of a GABA subunit, GABA ρ1, with prominent retinal expression that increases the diversity and tissue specificity of this ligand-gated ion-channel receptor family.

  15. Molecular cloning of a cDNA encoding human calumenin, expression in Escherichia coli and analysis of its Ca2+-binding activity

    DEFF Research Database (Denmark)

    Vorum, H; Liu, X; Madsen, Peder

    1998-01-01

    By microsequencing and cDNA cloning we have identified the transformation-sensitive protein No. IEF SSP 9302 as the human homologue of calumenin. The nucleotide sequence predicts a 315 amino acid protein with high identity to murine and rat calumenin. The deduced protein contains a 19 amino acid N...

  16. Molecular cloning, sequence analysis and phylogeny of first caudata g-type lysozyme in axolotl (Ambystoma mexicanum).

    Science.gov (United States)

    Yu, Haining; Gao, Jiuxiang; Lu, Yiling; Guang, Huijuan; Cai, Shasha; Zhang, Songyan; Wang, Yipeng

    2013-11-01

    Lysozymes are key proteins that play important roles in innate immune defense in many animal phyla by breaking down bacterial cell walls. In this study, we report the molecular cloning, sequence analysis and phylogeny of the first caudate amphibian g-lysozyme: a full-length spleen cDNA library from axolotl (Ambystoma mexicanum). A goose-type (g-lysozyme) EST was identified and the full-length cDNA was obtained using RACE-PCR. The axolotl g-lysozyme sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 184 amino acids. The calculated molecular mass and the theoretical isoelectric point (pI) of this mature protein are 21523.0 Da and 4.37, respectively. Expression of g-lysozyme mRNA is predominantly found in skin, with lower levels in spleen, liver, muscle, and lung. Phylogenetic analysis revealed that caudate amphibian g-lysozyme has a distinct evolutionary pattern, being juxtaposed not only with anuran amphibians but also with fish, birds and mammals. Although the first complete cDNA sequence for caudate amphibian g-lysozyme is reported in the present study, clones encoding axolotl's other functional immune molecules in the full-length cDNA library will have to be further sequenced to gain insight into the fundamental aspects of antibacterial mechanisms in caudates.

  17. An improved method for RNA isolation and cDNA library construction from immature seeds of Jatropha curcas L

    Directory of Open Access Journals (Sweden)

    Kaur Jatinder

    2010-05-01

    Abstract Background RNA quality and quantity are sometimes unsuitable for cDNA library construction from plant seeds rich in oil, polysaccharides and other secondary metabolites. Seeds of jatropha (Jatropha curcas L.) are rich in fatty acids/lipids, storage proteins, polysaccharides, and a number of other secondary metabolites that could bind and/or co-precipitate with RNA, making it unsuitable for downstream applications. Existing RNA isolation methods and commercial kits often fail to deliver high-quality total RNA from immature jatropha seeds for poly(A)+ RNA purification and cDNA synthesis. Findings A protocol has been developed for isolating good quality total RNA from immature jatropha seeds, whereby a combination of the CTAB based RNA extraction method and a silica column of a commercial plant RNA extraction kit is used. The extraction time was reduced from two days to about 3 hours and the RNA was suitable for poly(A)+ RNA purification, cDNA synthesis, cDNA library construction, RT-PCR, and Northern hybridization. Based on sequence information from selected clones and amplified PCR product, the cDNA library seems to be a good source of full-length jatropha genes. The method was equally effective for isolating RNA from mustard and rice seeds. Conclusions This is a simple CTAB + silica column method to extract high quality RNA from oil rich immature jatropha seeds that is suitable for several downstream applications. This method takes less time for RNA extraction and is equally effective for other tissues where RNA quality and quantity are strongly affected by the presence of fatty acids, polysaccharides and polyphenols.

  18. Deep Sequencing Reveals the Complete Genome and Evidence for Transcriptional Activity of the First Virus-Like Sequences Identified in Aristotelia chilensis (Maqui Berry)

    Directory of Open Access Journals (Sweden)

    Javier Villacreses

    2015-04-01

    Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs): ORFs 1 and 2 share 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV, Petuvirus genus). ORF1 encodes a movement protein (MP); ORF2 a Reverse Transcriptase (RT) and a Ribonuclease H (RNase H) domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs), AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq). Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggest that AcV1 could be a new member of the Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant.

  19. cDNA microarray screening in food safety

    International Nuclear Information System (INIS)

    Roy, Sashwati; Sen, Chandan K.

    2006-01-01

    The cDNA microarray technology and related bioinformatics tools present a wide range of novel application opportunities. The technology may be productively applied to address food safety. In this mini-review article, we present an update highlighting the late breaking discoveries that demonstrate the vitality of cDNA microarray technology as a tool to analyze food safety with reference to microbial pathogens and genetically modified foods. In order to bring the microarray technology to mainstream food safety, it is important to develop robust user-friendly tools that may be applied in a field setting. In addition, there needs to be a standardized process for regulatory agencies to interpret and act upon microarray-based data. The cDNA microarray approach is an emergent technology in diagnostics. Its value lies in being able to provide complementary molecular insight when employed in addition to traditional tests for food safety, as part of a more comprehensive battery of tests.

  20. Genomic variation in macrophage-cultured European porcine reproductive and respiratory syndrome virus Olot/91 revealed using ultra-deep next generation sequencing.

    Science.gov (United States)

    Lu, Zen H; Brown, Alexander; Wilson, Alison D; Calvert, Jay G; Balasch, Monica; Fuentes-Utrilla, Pablo; Loecherbach, Julia; Turner, Frances; Talbot, Richard; Archibald, Alan L; Ait-Ali, Tahar

    2014-03-04

    Porcine Reproductive and Respiratory Syndrome (PRRS) is a disease of major economic impact worldwide. The etiologic agent of this disease is the PRRS virus (PRRSV). Increasing evidence suggests that microevolution within a coexisting quasispecies population can give rise to high sequence heterogeneity in PRRSV. We developed a pipeline based on the ultra-deep next generation sequencing approach to first construct the complete genome of a European PRRSV, strain Olot/91, cultured on macrophages and then capture the rare variants representative of the mixed quasispecies population. Olot/91 differs from the reference Lelystad strain by about 5%, and a total of 88 variants, with frequencies as low as 1%, were detected in the mixed population. These variants included 16 non-synonymous variants concentrated in the genes encoding structural and nonstructural proteins, including Glycoprotein 2a and 5. Using an ultra-deep sequencing methodology, the complete genome of Olot/91 was constructed without any prior knowledge of the sequence. Rare variants that constitute minor fractions of the heterogeneous PRRSV population could successfully be detected to allow further exploration of microevolutionary events.
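
    As a rough illustration of how a variant present in about 1% of reads might be flagged from per-position base counts, the sketch below computes non-reference allele frequencies and applies a frequency cutoff. The function name, the example counts, and the use of a bare 1% threshold are assumptions made for illustration; this is not the authors' pipeline, which would also have to distinguish true variants from sequencing error.

```python
def minor_variants(base_counts, reference_base, min_freq=0.01):
    """Return {base: frequency} for non-reference bases at or above min_freq.

    base_counts: mapping of base -> number of reads supporting it at one position.
    min_freq: illustrative cutoff (1%); real pipelines also model error rates.
    """
    depth = sum(base_counts.values())
    if depth == 0:
        return {}
    return {
        base: count / depth
        for base, count in base_counts.items()
        if base != reference_base and count / depth >= min_freq
    }


# Hypothetical pileup at a single genome position, 10,000 reads deep.
counts = {"A": 9850, "G": 120, "T": 30}
print(minor_variants(counts, reference_base="A"))
```

    With these made-up counts, G (1.2% of reads) passes the cutoff while T (0.3%) does not, which shows why very high read depth is needed before a 1% variant is supported by more than a handful of reads.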

  1. Chromosomal Localization of DNA Amplifications in Neuroblastoma Tumors Using cDNA Microarray Comparative Genomic Hybridization

    Directory of Open Access Journals (Sweden)

    Ben Beheshti

    2003-01-01

    Conventional comparative genomic hybridization (CGH) profiling of neuroblastomas has identified many genomic aberrations, although the limited resolution has precluded a precise localization of sequences of interest within amplicons. To map high copy number genomic gains in clinically matched stage IV neuroblastomas, CGH analysis using a 19,200-feature cDNA microarray was used. A dedicated (freely available) algorithm was developed for rapid in silico determination of chromosomal localizations of microarray cDNA targets, and for generation of an ideogram-type profile of copy number changes. Using these methodologies, novel gene amplifications undetectable by chromosome CGH were identified, and larger MYCN amplicon sizes (in one tumor up to 6 Mb) than those previously reported in neuroblastoma were identified. The genes HPCAL1, LPIN1/KIAA0188, NAG, and NSE1/LOC151354 were found to be coamplified with MYCN. To determine whether stage IV primary tumors could be further subclassified based on their genomic copy number profiles, hierarchical clustering was performed. Cluster analysis of microarray CGH data identified three groups: (1) no amplifications evident, (2) a small MYCN amplicon as the only detectable imbalance, and (3) a large MYCN amplicon with additional gene amplifications. Application of CGH to cDNA microarray targets will help both to determine the variation in amplicon size and to better define amplification-dependent and -independent pathways of progression in neuroblastoma.

  2. Isolation and sequence of complementary DNA encoding human extracellular superoxide dismutase

    International Nuclear Information System (INIS)

    Hjalmarsson, K.; Marklund, S.L.; Engstroem, A.; Edlund, T.

    1987-01-01

    A complementary DNA (cDNA) clone from a human placenta cDNA library encoding extracellular superoxide dismutase has been isolated and the nucleotide sequence determined. The cDNA has a very high G + C content. EC-SOD is synthesized with a putative 18-amino acid signal peptide, preceding the 222 amino acids in the mature enzyme, indicating that the enzyme is a secretory protein. The first 95 amino acids of the mature enzyme show no sequence homology with other sequenced proteins and there is one possible N-glycosylation site (Asn-89). The amino acid sequence from residues 96-193 shows strong homology (~50%) with the final two-thirds of the sequences of all known eukaryotic CuZn SODs, whereas the homology with the P. leiognathi CuZn SOD is clearly lower. The ligands to Cu and Zn, the cysteines forming the intrasubunit disulfide bridge in the CuZn SODs, and the arginine found in all CuZn SODs in the entrance to the active site can all be identified in EC-SOD. A comparison with bovine CuZn SOD, the three-dimensional structure of which is known, reveals that the homologies occur in the active site and the divergencies are in the part constituting the subunit contact area in CuZn SOD. Amino acid sequence 194-222 in the carboxyl-terminal end of EC-SOD is strongly hydrophilic and contains nine amino acids with a positive charge. This sequence probably confers the affinity of EC-SOD for heparin and heparan sulfate. An analysis of the amino acid sequence homologies with CuZn SODs from various species indicates that the EC-SODs may have evolved from the CuZn SODs before the evolution of fungi and plants.

  3. 3′ terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing

    Science.gov (United States)

    2013-01-01

    Background Post-transcriptional 3′ end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3′ RACE coupled with high-throughput sequencing to characterize the 3′ terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. Results The 3′ terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3′ terminus of an in vitro transcribed MRP RNA control and the differing 3′ terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). Conclusions 3′ RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3′ terminal sequences of noncoding RNAs. PMID:24053768

  4. A cDNA microarray, UniShrimpChip, for identification of genes relevant to testicular development in the black tiger shrimp (Penaeus monodon)

    Directory of Open Access Journals (Sweden)

    Klinbunga Sirawut

    2011-04-01

    Abstract Background Poor reproductive maturation in captive male broodstock of the black tiger shrimp (Penaeus monodon) is one of the serious problems facing the farming industry. In the absence of a genome sequence, EST libraries of P. monodon were previously constructed to identify transcripts with important biological functions. In this study, a new version of cDNA microarray, UniShrimpChip, was constructed from the Penaeus monodon EST libraries of 12 tissues, containing 5,568 non-redundant cDNA clones from 10,536 unique cDNA in the P. monodon EST database. UniShrimpChip was used to study testicular development by comparing gene expression levels of wild brooders from the West and East coasts of Thailand and domesticated brooders of different ages (10-, 14-, 18-month-old). Results The overall gene expression patterns from the microarray experiments revealed distinct transcriptomic patterns between the wild and domesticated groups. Moreover, differentially expressed genes from the microarray comparisons were identified, and the expression patterns of eight selected transcripts were subsequently confirmed by reverse-transcriptase quantitative PCR (RT-qPCR). Among these, expression levels of six subunits (CSN2, 4, 5, 6, 7a, and 8) of the COP9 signalosome (CSN) gene family in wild brooders and domesticated brooders of different ages were examined by RT-qPCR. Among the six subunits, CSN5 and CSN6 were most highly expressed in wild brooders and least expressed in the 18-month-old domesticated group; therefore, their full-length cDNA sequences were characterized. Conclusions This study is the first report to employ cDNA microarray to study testicular development in the black tiger shrimp. We show that there are obvious differences between the wild and domesticated shrimp at the transcriptomic level. Furthermore, our study is the first to investigate the feasibility that the CSN gene family might be involved in reproduction and development of this economically important

  5. cDNA structure, genomic organization and expression patterns of ...

    African Journals Online (AJOL)

    Visfatin is a newly identified adipocytokine that is involved in various physiological and pathological processes of organisms. The cDNA structure, genomic organization and expression patterns of silver Prussian carp visfatin are described in this report. The silver Prussian carp visfatin cDNA cloned from the liver was ...

  6. Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy.

    Science.gov (United States)

    Matkovich, Scot J; Dorn, Gerald W

    2015-01-01

    MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicate purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses.
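
    The abstract does not say which expression measure the authors derive from raw reads. As one common possibility, the sketch below scales per-gene read counts to counts per million (CPM) so that libraries sequenced to different depths can be compared; the function name and the example counts are hypothetical.

```python
def counts_per_million(read_counts):
    """Scale raw per-gene read counts to counts per million mapped reads.

    read_counts: mapping of gene name -> raw read count for one library.
    CPM is assumed here for illustration; the source does not name a measure.
    """
    total = sum(read_counts.values())
    if total == 0:
        raise ValueError("library contains no mapped reads")
    return {gene: count * 1_000_000 / total for gene, count in read_counts.items()}


# Hypothetical counts from one biological replicate.
library = {"geneA": 250, "geneB": 750}
print(counts_per_million(library))
```

    Differential expression tools generally take the raw counts plus library sizes as input rather than pre-scaled values, so a measure like this is mainly useful for inspection and plotting.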

  7. Infectious Maize rayado fino virus from cloned cDNA

    Science.gov (United States)

    Maize rayado fino virus (MRFV) is the type member of the marafiviruses within the family Tymoviridae. A cDNA clone from which infectious RNA can be transcribed was produced from a US isolate of MRFV (MRFV-US). Infectivity of transcripts derived from cDNA clones was demonstrated by infection of mai...

  8. DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data.

    Science.gov (United States)

    Arango-Argoty, Gustavo; Garner, Emily; Pruden, Amy; Heath, Lenwood S; Vikesland, Peter; Zhang, Liqing

    2018-02-01

    Growing concerns about increasing rates of antibiotic resistance call for expanded and comprehensive global monitoring. Advancing methods for monitoring of environmental media (e.g., wastewater, agricultural waste, food, and water) is especially needed for identifying potential resources of novel antibiotic resistance genes (ARGs), hot spots for gene exchange, and as pathways for the spread of ARGs and human exposure. Next-generation sequencing now enables direct access and profiling of the total metagenomic DNA pool, where ARGs are typically identified or predicted based on the "best hits" of sequence searches against existing databases. Unfortunately, this approach produces a high rate of false negatives. To address such limitations, we propose here a deep learning approach, taking into account a dissimilarity matrix created using all known categories of ARGs. Two deep learning models, DeepARG-SS and DeepARG-LS, were constructed for short read sequences and full gene length sequences, respectively. Evaluation of the deep learning models over 30 antibiotic resistance categories demonstrates that the DeepARG models can predict ARGs with both high precision (> 0.97) and recall (> 0.90). The models displayed an advantage over the typical best hit approach, yielding consistently lower false negative rates and thus higher overall recall (> 0.9). As more data become available for under-represented ARG categories, the DeepARG models' performance can be expected to be further enhanced due to the nature of the underlying neural networks. Our newly developed ARG database, DeepARG-DB, encompasses ARGs predicted with a high degree of confidence and extensive manual inspection, greatly expanding current ARG repositories. The deep learning models developed here offer more accurate antimicrobial resistance annotation relative to current bioinformatics practice. DeepARG does not require strict cutoffs, which enables identification of a much broader diversity of ARGs. The
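
    For readers less familiar with the metrics quoted above, the following sketch shows how precision, recall, and the false negative rate relate to one another. The counts are invented for illustration and are not taken from the DeepARG evaluation.

```python
def precision_recall(true_pos, false_pos, false_neg):
    """Return (precision, recall) from confusion-matrix counts.

    precision = TP / (TP + FP): fraction of predicted ARGs that are real.
    recall    = TP / (TP + FN): fraction of real ARGs that were predicted.
    """
    predicted = true_pos + false_pos
    actual = true_pos + false_neg
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / actual if actual else 0.0
    return precision, recall


# Hypothetical counts: 90 ARGs found, 10 wrong calls, 10 ARGs missed.
precision, recall = precision_recall(true_pos=90, false_pos=10, false_neg=10)
false_negative_rate = 1.0 - recall  # a lower FNR is the same thing as higher recall
print(precision, recall, false_negative_rate)
```

    Because the false negative rate is simply one minus recall, the lower false negative rate reported for the models and their higher recall are two views of the same improvement.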

  9. Sequence analysis and over-expression of ribosomal protein S28 ...

    African Journals Online (AJOL)

    RPS28 is a component of the 40S small ribosomal subunit encoded by RPS28 gene, which is specific to eukaryotes. The cDNA and the genomic sequence of RPS28 were cloned successfully from the Giant Panda using RT-PCR technology and Touchdown-PCR, respectively. Both sequences were analyzed preliminarily ...

  10. Purification, cDNA cloning and modification of a defensin from the coconut rhinoceros beetle, Oryctes rhinoceros.

    Science.gov (United States)

    Ishibashi, J; Saido-Sakanaka, H; Yang, J; Sagisaka, A; Yamakawa, M

    1999-12-01

    A novel member of the insect defensins, a family of antibacterial peptides, was purified from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros, immunized with Escherichia coli. A full-size cDNA was cloned by combining reverse-transcription PCR (RT-PCR), and 5'- and 3'-rapid amplification of cDNA ends (RACE). Analysis of the O. rhinoceros defensin gene expression showed it to be expressed in the fat body and hemocyte, midgut and Malpighian tubules. O. rhinoceros defensin showed strong antibacterial activity against Staphylococcus aureus. A 9-mer peptide amidated at its C-terminus, AHCLAICRK-NH2 (Ala22-Lys30-NH2), was synthesized based on the deduced amino-acid sequence, assumed to be an active site sequence by analogy with the sequence of a defensin isolated from larvae of the beetle Allomyrina dichotoma. This peptide showed antibacterial activity against S. aureus, methicillin-resistant S. aureus, E. coli and Pseudomonas aeruginosa. We further modified this oligopeptide and synthesized five 9-mer peptides, ALRLAIRKR-NH2, ALLLAIRKR-NH2, AWLLAIRKR-NH2, ALYLAIRKR-NH2 and ALWLAIRKR-NH2. These oligopeptides showed strong antibacterial activity against Gram-negative and Gram-positive bacteria. The antibacterial effect of Ala22-Lys30-NH2 analogues was due to its interaction with bacterial membranes, judging from the leakage of liposome-entrapped glucose. These Ala22-Lys30-NH2 analogues did not show haemolytic activity and did not inhibit the growth of murine fibroblast cells or macrophages, except for AWLLAIRKR-NH2.

  11. α/sub i/-3 cDNA encodes the α subunit of G/sub k/, the stimulatory G protein of receptor-regulated K+ channels

    International Nuclear Information System (INIS)

    Codina, J.; Olate, J.; Abramowitz, J.; Mattera, R.; Cook, R.G.; Birnbaumer, L.

    1988-01-01

    cDNA cloning has identified the presence in the human genome of three genes encoding α subunits of pertussis toxin substrates, generically called G/sub i/. They are named α/sub i/-1, α/sub i/-2 and α/sub i/-3. However, none of these genes has been functionally identified with any of the α subunits of several possible G proteins, including pertussis toxin-sensitive G/sub p/'s, stimulatory to phospholipase C or A2, G/sub i/, inhibitory to adenylyl cyclase, or G/sub k/, stimulatory to a type of K+ channels. The authors now report the nucleotide sequence and the complete predicted amino acid sequence of human liver α/sub i/-3 and the partial amino acid sequence of proteolytic fragments of the α subunit of human erythrocyte G/sub k/. The amino acid sequence of the proteolytic fragment is uniquely encoded by the cDNA of α/sub i/-3, thus identifying it as α/sub k/. The probable identity of α/sub i/-1 with α/sub p/ and possible roles for α/sub i/-2, as well as additional roles for α/sub i/-1 and α/sub i/-3 (α/sub k/) are discussed.

  12. Identification and Molecular Characterization of the cDNA Encoding Cucumis melo Allergen, Cuc m 3, a Plant Pathogenesis-Related Protein

    Directory of Open Access Journals (Sweden)

    Mojtaba Sankian

    2014-05-01

    Background: Melon (Cucumis melo) allergy is one of the most common food allergies, characterized by oral allergy syndrome. To date, two allergen molecules, Cuc m 1 and Cuc m 2, have been fully characterized in melon pulp, but there are few reports about the molecular characteristics of Cuc m 3. Methods: The Cuc m 3 cDNA has been characterized by rapid amplification of cDNA ends (RACE), which revealed a 456 base-pair (bp) fragment encoding a 151-amino acid polypeptide with a predicted molecular mass of 16.97 kDa, and identified 79 and 178 bp untranslated sequences at the 5′ and 3′ ends, respectively. Results: In silico analysis showed strong similarities between Cuc m 3 and other plant pathogen-related protein 1s from cucumber, grape, bell pepper, and tomato. Conclusion: Here we report the identification and characterization of the Cuc m 3 cDNA, which will be utilized for further analyses of structural and allergenic features of this allergen.
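
    The relationship between the 456 bp fragment and the 151-residue polypeptide can be checked with simple arithmetic, assuming (the abstract does not say so explicitly) that the 456 bp span is the complete open reading frame including its stop codon. The helper below is a hypothetical illustration of that check.

```python
def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumes a complete reading frame; the stop codon, if counted in orf_bp,
    does not contribute a residue.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


residues = orf_protein_length(456)   # 152 codons, one of which is the stop
mean_residue_mass = 16970 / residues  # Da per residue from the reported 16.97 kDa
print(residues, round(mean_residue_mass, 1))
```

    Under that assumption, 456 bp gives 152 codons and 151 residues, matching the reported length, and the implied mean residue mass of roughly 112 Da is in the usual range for proteins.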

  13. A rapid method for screening arrayed plasmid cDNA library by PCR

    International Nuclear Information System (INIS)

    Hu Yingchun; Zhang Kaitai; Wu Dechang; Li Gang; Xiang Xiaoqiong

    1999-01-01

    Objective: To develop a PCR-based method for rapid and effective screening of an arrayed plasmid cDNA library. Methods: The plasmid cDNA library was arrayed and screened by PCR with a particular set of primers. Results: Four positive clones were obtained in about one week. Conclusion: This method can be applied to screening not only normal cDNA clones, but also cDNA clones containing small fragments. This method offers significant advantages over traditional screening methods in terms of sensitivity, specificity and efficiency.

  14. Identification of ribonucleotide reductase mutation causing temperature-sensitivity of herpes simplex virus isolates from whitlow by deep sequencing.

    Science.gov (United States)

    Daikoku, Tohru; Oyama, Yukari; Yajima, Misako; Sekizuka, Tsuyoshi; Kuroda, Makoto; Shimada, Yuka; Takehara, Kazuhiko; Miwa, Naoko; Okuda, Tomoko; Sata, Tetsutaro; Shiraki, Kimiyasu

    2015-06-01

    Herpes simplex virus 2 caused a genital ulcer, and a secondary herpetic whitlow appeared during acyclovir therapy. The secondary and recurrent whitlow isolates were acyclovir-resistant and temperature-sensitive in contrast to a genital isolate. We identified the ribonucleotide reductase mutation responsible for temperature-sensitivity by deep-sequencing analysis.

  15. Purification of a jojoba embryo fatty acyl-coenzyme A reductase and expression of its cDNA in high erucic acid rapeseed.

    Science.gov (United States)

    Metz, J G; Pollard, M R; Anderson, L; Hayes, T R; Lassner, M W

    2000-03-01

    The jojoba (Simmondsia chinensis) plant produces esters of long-chain alcohols and fatty acids (waxes) as a seed lipid energy reserve. This is in contrast to the triglycerides found in seeds of other plants. We purified an alcohol-forming fatty acyl-coenzyme A reductase (FAR) from developing embryos and cloned the cDNA encoding the enzyme. Expression of a cDNA in Escherichia coli confers FAR activity upon those cells and results in the accumulation of fatty alcohols. The FAR sequence shows significant homology to an Arabidopsis protein of unknown function that is essential for pollen development. When the jojoba FAR cDNA is expressed in embryos of Brassica napus, long-chain alcohols can be detected in transmethylated seed oils. Resynthesis of the gene to reduce its A plus T content resulted in increased levels of alcohol production. In addition to free alcohols, novel wax esters were detected in the transgenic seed oils. In vitro assays revealed that B. napus embryos have an endogenous fatty acyl-coenzyme A: fatty alcohol acyl-transferase activity that could account for this wax synthesis. Thus, introduction of a single cDNA into B. napus results in a redirection of a portion of seed oil synthesis from triglycerides to waxes.

  16. Rapid and Deep Proteomes by Faster Sequencing on a Benchtop Quadrupole Ultra-High-Field Orbitrap Mass Spectrometer

    DEFF Research Database (Denmark)

    Kelstrup, Christian D; Jersie-Christensen, Rosa R; Batth, Tanveer Singh

    2014-01-01

    per second or up to 600 new peptides sequenced per gradient minute. We identify 4400 proteins from one microgram of HeLa digest using a one hour gradient, which is an approximately 30% improvement compared to previous instrumentation. In addition, we show very deep proteome coverage can be achieved...... in less than 24 hours of analysis time by offline high pH reversed-phase peptide fractionation from which we identify more than 140,000 unique peptide sequences. This is comparable to state-of-the-art multi-day, multi-enzyme efforts. Finally the acquisition methods are evaluated for single...

  17. DeepRT: deep learning for peptide retention time prediction in proteomics

    OpenAIRE

    Ma, Chunwei; Zhu, Zhiyong; Ye, Jun; Yang, Jiarui; Pei, Jianguo; Xu, Shaohang; Zhou, Ruo; Yu, Chang; Mo, Fan; Wen, Bo; Liu, Siqi

    2017-01-01

    Accurate predictions of peptide retention times (RT) in liquid chromatography have many applications in mass spectrometry-based proteomics. Herein, we present DeepRT, a deep learning based software for peptide retention time prediction. DeepRT automatically learns features directly from the peptide sequences using the deep convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) model, which eliminates the need to use hand-crafted features or rules. After the feature learning, pr...

  18. Comparison of cDNA-derived protein sequences of the human fibronectin and vitronectin receptor α-subunits and platelet glycoprotein IIb

    International Nuclear Information System (INIS)

    Fitzgerald, L.A.; Poncz, M.; Steiner, B.; Rall, S.C. Jr.; Bennett, J.S.; Phillips, D.R.

    1987-01-01

    The fibronectin receptor (FnR), the vitronectin receptor (VnR), and the platelet membrane glycoprotein (GP) IIb-IIIa complex are members of a family of cell adhesion receptors, which consist of noncovalently associated α- and β-subunits. The present study was designed to compare the cDNA-derived protein sequences of the α-subunits of human FnR, VnR, and platelet GP IIb. cDNA clones for the α-subunit of the FnR (FnR/sub α/) were obtained from a human umbilical vein endothelial (HUVE) cell library by using an oligonucleotide probe designed from a peptide sequence of platelet GP IIb. cDNA clones for platelet GP IIb were isolated from a cDNA expression library of human erythroleukemia cells by using antibodies. cDNA clones of the VnR α-subunit (VnR/sub α/) were obtained from the HUVE cell library by using an oligonucleotide probe from the partial cDNA sequence for the VnR/sub α/. Translation of these sequences showed that the FnR/sub α/, the VnR/sub α/, and GP IIb are composed of disulfide-linked large (858-871 amino acids) and small (137-158 amino acids) chains that are posttranslationally processed from a single mRNA. A single hydrophobic segment located near the carboxyl terminus of each small chain appears to be a transmembrane domain. The large chains appear to be entirely extracellular, and each contains four repeated putative Ca2+-binding domains of about 30 amino acids that have sequence similarities to other Ca2+-binding proteins. The identity among the protein sequences of the three receptor α-subunits ranges from 36.1% to 44.5%, with the Ca2+-binding domains having the greatest homology. These proteins apparently evolved by a process of gene duplication.
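
    Pairwise identity figures such as those quoted above are derived from sequence alignments. The sketch below shows one simple convention, counting identical residues over aligned columns in which neither sequence has a gap. The abstract does not state which convention the authors used, and the toy sequences are invented.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned, gap-free columns of two sequences.

    Both inputs must already be aligned to the same length. Columns in which
    either sequence has a gap are skipped (one of several conventions in use).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Toy alignment: 4 identical residues out of 5 gap-free columns.
print(percent_identity("ACDE-G", "ACDFHG"))
```

    Other conventions divide by the full alignment length or by the length of the shorter sequence, so identity values are only comparable when computed the same way.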

  19. Transcriptomic identification of candidate genes involved in sunflower responses to chilling and salt stresses based on cDNA microarray analysis

    Directory of Open Access Journals (Sweden)

    Paniego Norma

    2008-01-01

    Abstract Background Considering that sunflower production is expanding to arid regions, tolerance to abiotic stresses such as drought, low temperatures and salinity arises as one of the main constraints nowadays. Differential organ-specific sunflower ESTs (expressed sequence tags) were previously generated by a subtractive hybridization method that included a considerable number of putative abiotic stress associated sequences. The objective of this work is to analyze concerted gene expression profiles of organ-specific ESTs by fluorescence microarray assay, in response to high sodium chloride concentration and chilling treatments, with the aim of identifying and following up candidate genes for early responses to abiotic stress in sunflower. Results Abiotic-related expressed genes were the target of this characterization through a gene expression analysis using an organ-specific cDNA fluorescence microarray approach in response to high salinity and low temperatures. The experiment included three independent replicates from leaf samples. We analyzed 317 unigenes previously isolated from differential organ-specific cDNA libraries from leaf, stem and flower at R1 and R4 developmental stages. A statistical analysis based on mean comparison by ANOVA and ordination by Principal Component Analysis allowed the detection of 80 candidate genes for salinity and/or chilling stress. Out of them, 50 genes were up or down regulated under both stresses, supporting common regulatory mechanisms and general responses to chilling and salinity. Interestingly, 15 and 12 sequences were up regulated or down regulated specifically in one stress but not in the other, respectively. These genes are potentially involved in different regulatory mechanisms including transcription/translation/protein degradation/protein folding/ROS production or ROS-scavenging. Differential gene expression patterns were confirmed by qRT-PCR for 12.5% of the microarray candidate sequences. Conclusion

  20. Norrie disease: linkage analysis using a 4.2-kb RFLP detected by a human ornithine aminotransferase cDNA probe.

    Science.gov (United States)

    Ngo, J T; Bateman, J B; Cortessis, V; Sparkes, R S; Mohandas, T; Inana, G; Spence, M A

    1989-05-01

    A previous study has shown that the usual DNA marker for Norrie disease, the L1.28 probe which identifies the DXS7 locus, can recombine with the disease locus. In this study, we used a human ornithine aminotransferase (OAT) cDNA, which detects OAT-related DNA sequences mapped to the same region on the X chromosome as that of the L1.28 probe, to investigate the family with Norrie disease who exhibited the recombinational event. When genomic DNA from this family was digested with the PvuII restriction endonuclease, we found a restriction fragment length polymorphism (RFLP) of 4.2 kb in size. This fragment was absent in the affected males and cosegregated with the disease locus; we calculated a lod score of 0.602, at theta = 0.00. No deletion could be detected by chromosomal analysis or on Southern blots with other enzymes. These results suggest that one of the OAT-related sequences on the X chromosome may be in close proximity to the Norrie disease locus and represent the first report which indicates that the OAT cDNA may be useful for the identification of carrier status and/or prenatal diagnosis.

  1. Alternative splicing enriched cDNA libraries identify breast cancer-associated transcripts

    Science.gov (United States)

    2010-01-01

    Background Alternative splicing (AS) is a central mechanism in the generation of genomic complexity and is a major contributor to transcriptome and proteome diversity. Alterations of the splicing process can lead to deregulation of crucial cellular processes and have been associated with a large spectrum of human diseases. Cancer-associated transcripts are potential molecular markers and may contribute to the development of more accurate diagnostic and prognostic methods and also serve as therapeutic targets. Alternative splicing-enriched cDNA libraries have been used to explore the variability generated by alternative splicing. In this study, by combining the use of trapping heteroduplexes and RNA amplification, we developed a powerful approach that enables transcriptome-wide exploration of the AS repertoire for identifying AS variants associated with breast tumor cells modulated by ERBB2 (HER-2/neu) oncogene expression. Results The human breast cell line (C5.2) and a pool of 5 ERBB2 over-expressing breast tumor samples were used independently for the construction of two AS-enriched libraries. In total, 2,048 partial cDNA sequences were obtained, revealing 214 alternative splicing sequence-enriched tags (ASSETs). A subset with 79 multiple exon ASSETs was compared to public databases and reported 138 different AS events. A high success rate of RT-PCR validation (94.5%) was obtained, and 2 novel AS events were identified. The influence of ERBB2-mediated expression on AS regulation was evaluated by capillary electrophoresis and probe-ligation approaches in two mammary cell lines (Hb4a and C5.2) expressing different levels of ERBB2. The relative expression balance between AS variants from 3 genes was differentially modulated by ERBB2 in this model system. Conclusions In this study, we presented a method for exploring AS from any RNA source in a transcriptome-wide format, which can be directly and easily adapted to next generation sequencers.
We identified AS transcripts

  2. Ultra Deep Sequencing of a Baculovirus Population Reveals Widespread Genomic Variations

    Directory of Open Access Journals (Sweden)

    Aurélien Chateigner

    2015-07-01

    Full Text Available Viruses rely on widespread genetic variation and large population size for adaptation. Large DNA virus populations are thought to harbor little variation though natural populations may be polymorphic. To measure the genetic variation present in a dsDNA virus population, we deep sequenced a natural strain of the baculovirus Autographa californica multiple nucleopolyhedrovirus. With 124,221X average genome coverage of our 133,926 bp long consensus, we could detect low frequency mutations (0.025%). K-means clustering was used to classify the mutations in four categories according to their frequency in the population. We found 60 high frequency non-synonymous mutations under balancing selection distributed in all functional classes. These mutants could alter viral adaptation dynamics, either through competitive or synergistic processes. Lastly, we developed a technique for the delimitation of large deletions in next generation sequencing data. We found that large deletions occur along the entire viral genome, with hotspots located in homologous repeat regions (hrs). Present in 25.4% of the genomes, these deletion mutants presumably require functional complementation to complete their infection cycle. They might thus have a large impact on the fitness of the baculovirus population. Altogether, we found a wide breadth of genomic variation in the baculovirus population, suggesting it has high adaptive potential.

  3. Deep-sequencing to resolve complex diversity of apicomplexan parasites in platypuses and echidnas: Proof of principle for wildlife disease investigation.

    Science.gov (United States)

    Šlapeta, Jan; Saverimuttu, Stefan; Vogelnest, Larry; Sangster, Cheryl; Hulst, Frances; Rose, Karrie; Thompson, Paul; Whittington, Richard

    2017-11-01

    The short-beaked echidna (Tachyglossus aculeatus) and the platypus (Ornithorhynchus anatinus) are iconic egg-laying monotremes (Mammalia: Monotremata) from Australasia. The aim of this study was to demonstrate the utility of diversity profiles in disease investigations of monotremes. Using small subunit (18S) rDNA amplicon deep-sequencing we demonstrated the presence of apicomplexan parasites and confirmed by direct and cloned amplicon gene sequencing Theileria ornithorhynchi, Theileria tachyglossi, Eimeria echidnae and Cryptosporidium fayeri. Using a combination of samples from healthy and diseased animals, we show a close evolutionary relationship between species of coccidia (Eimeria) and piroplasms (Theileria) from the echidna and platypus. The presence of E. echidnae was demonstrated in faeces and tissues affected by disseminated coccidiosis. Moreover, the presence of E. echidnae DNA in the blood of echidnas was associated with atoxoplasma-like stages in white blood cells, suggesting Hepatozoon tachyglossi blood stages are disseminated E. echidnae stages. These next-generation DNA sequencing technologies are suited to material and organisms that have not been previously characterised and for which the material is scarce. The deep sequencing approach supports traditional diagnostic methods, including microscopy, clinical pathology and histopathology, to better define the status quo. This approach is particularly suitable for wildlife disease investigation. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations.

    Science.gov (United States)

    Reid-Bayliss, Kate S; Loeb, Lawrence A

    2017-08-29

    Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.
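
    The barcode-and-consensus idea in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions, not the ARC-seq pipeline: the function name `consensus_by_barcode`, the `min_copies` and `min_agreement` thresholds, and the toy reads are all hypothetical. Reads that share a barcode are treated as copies of one RNA molecule, and a base is called only where enough copies agree; positions without sufficient agreement are reported as N.

```python
from collections import Counter, defaultdict

def consensus_by_barcode(reads, min_copies=3, min_agreement=0.9):
    """Group (barcode, sequence) reads and call a per-position consensus.

    Hypothetical helper: the thresholds are illustrative, not from the paper.
    Assumes reads within a barcode family are already aligned and equal length.
    """
    families = defaultdict(list)
    for barcode, seq in reads:
        families[barcode].append(seq)

    consensus = {}
    for barcode, seqs in families.items():
        if len(seqs) < min_copies:
            continue  # too few copies to tell an error from a true variant
        called = []
        for column in zip(*seqs):
            base, count = Counter(column).most_common(1)[0]
            # Keep the majority base only if enough copies support it.
            called.append(base if count / len(seqs) >= min_agreement else "N")
        consensus[barcode] = "".join(called)
    return consensus

# Toy data: three copies of one molecule (one copy carries an error)
# plus a singleton family that is discarded.
reads = [
    ("AAGT", "ACGTAC"),
    ("AAGT", "ACGTAC"),
    ("AAGT", "ACGAAC"),
    ("CCTA", "ACGTAC"),
]
lenient = consensus_by_barcode(reads, min_copies=3, min_agreement=0.6)
strict = consensus_by_barcode(reads)
print(lenient, strict)
```

    Raising `min_agreement` trades yield for stringency, which is one simple way to picture the scalable stringency the abstract mentions.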

  5. Deep-sequencing protocols influence the results obtained in small-RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Joern Toedling

    Full Text Available Second-generation sequencing is a powerful method for identifying and quantifying small-RNA components of cells. However, little attention has been paid to the effects of the choice of sequencing platform and library preparation protocol on the results obtained. We present a thorough comparison of small-RNA sequencing libraries generated from the same embryonic stem cell lines, using different sequencing platforms, which represent the three major second-generation sequencing technologies, and protocols. We have analysed and compared the expression of microRNAs, as well as populations of small RNAs derived from repetitive elements. Despite the fact that different libraries display a good correlation between sequencing platforms, qualitative and quantitative variations in the results were found, depending on the protocol used. Thus, when comparing libraries from different biological samples, it is strongly recommended to use the same sequencing platform and protocol in order to ensure the biological relevance of the comparisons.

  6. Molecular Cloning and Characterization of a Rac1 Homologue cDNA from Trichomonas vaginalis

    Institute of Scientific and Technical Information of China (English)

    傅玉才; 章家新; 郑晓虹; 刘红

    2004-01-01

    Objective To clone and characterize a Rac1 homologue from Trichomonas vaginalis for studying the cell cycle of the organism. Methods A cDNA library derived from T. vaginalis mRNA was constructed in the λ TriplEx2 phage vector. An expressed sequence tag program was launched. Sequences of cDNA clones were analyzed using NCBI BLAST algorithms, and ClustalW and Treeview programs. Results A cDNA clone with a length of 714 base pairs was isolated. The sequence analysis showed that the cDNA clone has an open reading frame of 600 bp. The deduced amino acid sequence from the open reading frame contains 200 residues and is most homologous to the Rac1 subfamily of Rho GTPases with > 60% identity. The conserved sequence elements of Rho GTPases, such as GTP-binding sites, GTPase-activating protein (GAP) interaction motifs, GTP-dissociation inhibitors (GDI) interaction motifs, guanine nucleotide exchange factor (GEF) interaction elements, etc., were detected in the amino acid sequence. The phylogenetic analysis showed that the cDNA clone is grouped in the Rac subfamily and is more closely related to Rac1 proteins of protozoa. Conclusion The cDNA clone isolated belongs to the Rac subfamily of Rho GTPases and is probably a Rac1 protein of T. vaginalis.

  7. Isolation and sequence analysis of a cDNA clone encoding the fifth complement component

    DEFF Research Database (Denmark)

    Lundwall, Åke B; Wetsel, Rick A; Kristensen, Torsten

    1985-01-01

    DNA clone of 1.85 kilobase pairs was isolated. Hybridization of the mixed-sequence probe to the complementary strand of the plasmid insert and sequence analysis by the dideoxy method predicted the expected protein sequence of C5a (positions 1-12), amino-terminal to the anticipated priming site. The sequence......, subcloned into M13 mp8, and sequenced at random by the dideoxy technique, thereby generating a contiguous sequence of 1703 base pairs. This clone contained coding sequence for the C-terminal 262 amino acid residues of the beta-chain, the entire C5a fragment, and the N-terminal 98 residues of the alpha......'-chain. The 3' end of the clone had a polyadenylated tail preceded by a polyadenylation recognition site, a 3'-untranslated region, and base pairs homologous to the human Alu consensus sequence. Comparison of the derived partial human C5 protein sequence with that previously determined for murine C3 and human...

  8. Cloning and sequencing of phenol oxidase 1 (pox1) gene from ...

    African Journals Online (AJOL)

    The gene (pox1) encoding a phenol oxidase 1 from Pleurotus ostreatus was sequenced and the corresponding pox1-cDNA was also synthesized, cloned and sequenced. The isolated gene is flanked by an upstream region called the promoter (399 bp) prior to the start codon (ATG). The putative metal-responsive elements ...

  9. The genomic sequence of cowpea aphid-borne mosaic virus and its similarities with other potyviruses

    NARCIS (Netherlands)

    Mlotshwa, S.; Verver, J.; Sithole-Niang, I.; Kampen, van T.; Kammen, van A.; Wellink, J.

    2002-01-01

    The genomic sequence of a Zimbabwe isolate of Cowpea aphid-borne mosaic virus (CABMV-Z) was determined by sequencing overlapping viral cDNA clones generated by RT-PCR using degenerate and/or specific primers. The sequence is 9465 nucleotides in length excluding the 3' terminal poly (A) tail and

  10. Mapping vaccinia virus DNA replication origins at nucleotide level by deep sequencing.

    Science.gov (United States)

    Senkevich, Tatiana G; Bruno, Daniel; Martens, Craig; Porcella, Stephen F; Wolf, Yuri I; Moss, Bernard

    2015-09-01

    Poxviruses reproduce in the host cytoplasm and encode most or all of the enzymes and factors needed for expression and synthesis of their double-stranded DNA genomes. Nevertheless, the mode of poxvirus DNA replication and the nature and location of the replication origins remain unknown. A current but unsubstantiated model posits only leading strand synthesis starting at a nick near one covalently closed end of the genome and continuing around the other end to generate a concatemer that is subsequently resolved into unit genomes. The existence of specific origins has been questioned because any plasmid can replicate in cells infected by vaccinia virus (VACV), the prototype poxvirus. We applied directional deep sequencing of short single-stranded DNA fragments enriched for RNA-primed nascent strands isolated from the cytoplasm of VACV-infected cells to pinpoint replication origins. The origins were identified as the switching points of the fragment directions, which correspond to the transition from continuous to discontinuous DNA synthesis. Origins containing a prominent initiation point mapped to a sequence within the hairpin loop at one end of the VACV genome and to the same sequence within the concatemeric junction of replication intermediates. These findings support a model for poxvirus genome replication that involves leading and lagging strand synthesis and is consistent with the requirements for primase and ligase activities as well as earlier electron microscopic and biochemical studies implicating a replication origin at the end of the VACV genome.
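
    The idea of locating origins at the switching points of fragment direction can be illustrated with a small sketch. It rests on assumptions that are not in the abstract: fragments are taken to be pre-counted per strand in fixed genomic windows, and the function names and toy counts are hypothetical. It is not the authors' analysis, only a picture of the principle.

```python
def strand_bias(forward, reverse):
    """Per-window bias in [-1, 1]: +1 means all forward, -1 all reverse."""
    bias = []
    for f, r in zip(forward, reverse):
        total = f + r
        bias.append((f - r) / total if total else 0.0)
    return bias

def find_switch_points(forward, reverse):
    """Return window indices where the dominant fragment direction flips."""
    switches = []
    previous = None  # sign of the last window that had a clear direction
    for i, b in enumerate(strand_bias(forward, reverse)):
        if b == 0:
            continue  # empty or perfectly balanced window carries no signal
        sign = 1 if b > 0 else -1
        if previous is not None and sign != previous:
            switches.append(i)
        previous = sign
    return switches

# Toy counts per window (hypothetical): reverse-strand fragments dominate
# in windows 0-2, forward-strand fragments dominate in windows 3-5.
forward = [2, 3, 1, 40, 38, 45]
reverse = [41, 39, 44, 2, 1, 3]
switch_windows = find_switch_points(forward, reverse)
print(switch_windows)
```

    Real data would need smoothing and a minimum-coverage filter before a sign change could be read as a candidate origin.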

  11. Development of three full-length infectious cDNA clones of distinct brassica yellows virus genotypes for agrobacterium-mediated inoculation.

    Science.gov (United States)

    Zhang, Xiao-Yan; Dong, Shu-Wei; Xiang, Hai-Ying; Chen, Xiang-Ru; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

    2015-02-02

    Brassica yellows virus (BrYV) is a newly identified species in the genus Polerovirus within the family Luteoviridae. BrYV, which is prevalently distributed throughout Mainland China and South Korea, is an important virus infecting cruciferous crops. Based on six BrYV genomic sequences of isolates from oilseed rape, rutabaga, radish, and cabbage, three genotypes, BrYV-A, BrYV-B, and BrYV-C, exist, which mainly differ in the 5' terminal half of the genome. BrYV is an aphid-transmitted and phloem-limited virus. The use of infectious cDNA clones is an alternative means of infecting plants that allows reverse genetic studies to be performed. In this study, full-length cDNA clones of BrYV-A, recombinant BrYV-5B3A, and BrYV-C were constructed under control of the cauliflower mosaic virus 35S promoter. An agrobacterium-mediated inoculation system of Nicotiana benthamiana was developed using these cDNA clones. Three days after infiltration with full-length BrYV cDNA clones, necrotic symptoms were observed in the inoculated leaves of N. benthamiana; however, no obvious symptoms appeared in the upper leaves. Reverse transcription-PCR (RT-PCR) and western blot detection of samples from the upper leaves showed that the maximum infection efficiency of BrYVs could reach 100%. The infectivity of the BrYV-A, BrYV-5B3A, and BrYV-C cDNA clones was further confirmed by northern hybridization. The system developed here will be useful for further studies of BrYV, such as host range, pathogenicity, viral gene functions, and plant-virus-vector interactions, and especially for discerning the differences among the three genotypes. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Isolation and characterisation of cDNA clones representing the genes encoding the major tuber storage protein (dioscorin) of yam (Dioscorea cayenensis Lam.).

    Science.gov (United States)

    Conlan, R S; Griffiths, L A; Napier, J A; Shewry, P R; Mantell, S; Ainsworth, C

    1995-06-01

    cDNA clones encoding dioscorins, the major tuber storage proteins (M(r) 32,000) of yam (Dioscorea cayenensis), have been isolated. Two classes of clone (A and B, based on hybrid release translation product sizes and nucleotide sequence differences), which are 84.1% similar in their protein coding regions, were identified. The protein encoded by the open reading frame of the class A cDNA insert is of M(r) 30,015. The difference in observed and calculated molecular mass might be attributed to glycosylation. Nucleotide sequencing and in vitro transcription/translation suggest that the class A dioscorin proteins are synthesised with signal peptides of 18 amino acid residues which are cleaved from the mature peptide. The class A and class B proteins are 69.6% similar with respect to each other, but show no sequence identity with other plant proteins or with the major tuber storage proteins of potato (patatin) or sweet potato (sporamin). Storage protein gene expression was restricted to developing tubers and was not induced by growth conditions known to induce expression of tuber storage protein genes in other plant species. The codon usage of the dioscorin genes suggests that the Dioscoreaceae are more closely related to dicotyledonous than to monocotyledonous plants.

  13. Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA

    International Nuclear Information System (INIS)

    Indik, Z.; Yeh, H.; Ornstein-goldstein, N.; Sheppard, P.; Anderson, N.; Rosenbloom, J.C.; Peltonen, L.; Rosenbloom, J.

    1987-01-01

    Poly(A)+ RNA, isolated from a single 7-mo fetal human aorta, was used to synthesize cDNA by the RNase H method, and the cDNA was inserted into λgt10. Recombinant phage containing elastin sequences were identified by hybridization with cloned, exon-containing fragments of the human elastin gene. Three clones containing inserts of 3.3, 2.7, and 2.3 kilobases were selected for further analysis. Three overlapping clones containing 17.8 kilobases of the human elastin gene were also isolated from genomic libraries. Complete sequence analysis of the six clones demonstrated that: (i) the cDNA encompassed the entire translated portion of the mRNA encoding 786 amino acids, including several unusual hydrophilic amino acid sequences not previously identified in porcine tropoelastin, (ii) exons encoding either hydrophobic or crosslinking domains in the protein alternated in the gene, and (iii) a great abundance of Alu repetitive sequences occurred throughout the introns. The data also indicated substantial alternative splicing of the mRNA. These results suggest the potential for significant variation in the precise molecular structure of the elastic fiber in the human population.

  14. A method for diagnosis of plant environmental stresses by gene expression profiling using a cDNA macroarray

    International Nuclear Information System (INIS)

    Tamaoki, Masanori; Matsuyama, Takashi; Nakajima, Nobuyoshi; Aono, Mitsuko; Kubo, Akihiro; Saji, Hikaru

    2004-01-01

    Plants in the field are subjected to numerous environmental stresses. Lengthy continuation of such environmental stresses or a rapid increase in their intensity is harmful to vegetation. Assessments of the phytotoxicity of various stresses have been performed in many countries, although they have largely been based on estimates of leaf injury. We developed a novel method of detecting plant stresses that is more sensitive and specific than those previously available. This method is based on the detection of mRNA expression changes in 205 ozone-responsive Arabidopsis expressed sequence tags (ESTs) by cDNA macroarray analysis. By using this method, we illustrated shifts in gene expression in response to stressors such as drought, salinity, UV-B, low temperature, high temperature, and acid rain, as distinct from those in response to ozone. We also made a mini-scale macroarray with 12 ESTs for diagnosis of the above environmental stresses in plants. These results illustrate the potential of our cDNA macroarray for diagnosis of various stresses in plants

  15. BRICHOS domain-containing leukocyte cell-derived chemotaxin 1-like cDNA from disk abalone Haliotis discus discus.

    Science.gov (United States)

    Kim, Yucheol; De Zoysa, Mahanama; Lee, Youngdeuk; Whang, Ilson; Lee, Jehee

    2010-11-01

    A BRICHOS domain-containing leukocyte cell-derived chemotaxin 1-like cDNA was cloned from the disk abalone (Haliotis discus discus) and designated as AbLECT-1. The full-length (705 bp) AbLECT-1 cDNA contained a 576 bp open reading frame that translates into a putative peptide of 192 amino acids. The deduced amino acid sequence of AbLECT-1 had 15.5% identity and 27.8% similarity to human LECT-1. Quantitative real-time PCR analysis showed that the mRNA of AbLECT-1 was constitutively expressed in abalone hemocytes, gills, mantle, muscle, digestive tract and hepatopancreas in a tissue-specific manner. Moreover, the AbLECT-1 transcription level was induced in hemocytes after challenge with Vibrio alginolyticus, Vibrio parahaemolyticus, and Listeria monocytogenes, suggesting that it may be involved in immune response reactions in abalone. Copyright 2010 Elsevier Ltd. All rights reserved.

  16. Molecular cloning of cDNA for the human tumor-associated antigen CO-029 and identification of related transmembrane antigens

    International Nuclear Information System (INIS)

    Szala, S.; Kasai, Yasushi; Steplewski, Z.; Rodeck, U.; Koprowski, H.; Linnenbach, A.J.

    1990-01-01

    The human tumor-associated antigen CO-029 is a monoclonal antibody-defined cell surface glycoprotein of 27-34 kDa. By using the high-efficiency COS cell expression system, a full-length cDNA clone for CO-029 was isolated. When transiently expressed in COS cells, the cDNA clone directed the synthesis of an antigen reactive to monoclonal antibody CO-029 in mixed hemadsorption and immunoblot assays. Sequence analysis revealed that CO-029 belongs to a family of cell surface antigens that includes the melanoma-associated antigen ME491, the leukocyte cell surface antigen CD37, and the Sm23 antigen of the parasitic helminth Schistosoma mansoni. CO-029 and ME491 antigen expression and the effect of their corresponding monoclonal antibodies on cell growth were compared in human tumor cell lines of various histologic origins

  17. Draft Genome Sequences of Two Thiomicrospira Strains Isolated from the Brine-Seawater Interface of Kebrit Deep in the Red Sea

    KAUST Repository

    Zhang, Guishan

    2016-03-11

    Two Thiomicrospira strains, WB1 and XS5, were isolated from the Kebrit Deep brine-seawater interface in the Red Sea, Saudi Arabia. Here, we present the draft genome sequences of these gammaproteobacteria, which both produce sulfuric acid from thiosulfate in culture.

  18. Draft Genome Sequences of Two Thiomicrospira Strains Isolated from the Brine-Seawater Interface of Kebrit Deep in the Red Sea

    KAUST Repository

    Zhang, Guishan; Haroon, Mohamed; Zhang, Ruifu; Hikmawan, Tyas I.; Stingl, Ulrich

    2016-01-01

    Two Thiomicrospira strains, WB1 and XS5, were isolated from the Kebrit Deep brine-seawater interface in the Red Sea, Saudi Arabia. Here, we present the draft genome sequences of these gammaproteobacteria, which both produce sulfuric acid from thiosulfate in culture.

  19. The nucleotide sequence of parsnip yellow fleck virus: a plant picorna-like virus.

    Science.gov (United States)

    Turnbull-Ross, A D; Reavy, B; Mayo, M A; Murant, A F

    1992-12-01

    The complete sequence of 9871 nucleotides (nts) of parsnip yellow fleck virus (PYFV; isolate P-121) was determined from cDNA clones and by direct sequencing of viral RNA. The RNA contains a large open reading frame between nts 279 and 9362 which encodes a polyprotein of 3027 amino acids with a calculated M(r) of 336212 (336K). A PYFV polyclonal antiserum reacted with the proteins expressed from phage carrying cDNA clones from the 5' half of the PYFV genome. Comparison of the polyprotein sequence of PYFV with other viral polyprotein sequences reveals similarities to the putative NTP-binding and RNA polymerase domains of cowpea mosaic comovirus, tomato black ring nepovirus and several animal picornaviruses. The 3' untranslated region of PYFV RNA is 509 nts long and does not have a poly(A) tail. The 3'-terminal 121 nts may form a stem-loop structure which resembles that formed in the genomic RNA of mosquito-borne flaviviruses.
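
    The coordinates quoted in this abstract can be checked against each other with simple arithmetic. The sketch below assumes, since the abstract does not say so, that the ORF coordinates are 1-based and inclusive and that the final codon of the ORF is the stop codon; the variable names are illustrative.

```python
genome_length = 9871             # nucleotides, from the abstract
orf_start, orf_end = 279, 9362   # assumed 1-based, inclusive coordinates

orf_length = orf_end - orf_start + 1        # nucleotides spanned by the ORF
codons = orf_length // 3                    # complete codons in the ORF
amino_acids = codons - 1                    # assumes the last codon is a stop
five_prime_utr = orf_start - 1              # nucleotides before the ORF
three_prime_utr = genome_length - orf_end   # nucleotides after the ORF

print(orf_length, codons, amino_acids, five_prime_utr, three_prime_utr)
```

    Under these assumptions the ORF length is a multiple of three and the derived polyprotein length and 3' untranslated region length agree with the 3027 amino acids and 509 nts stated in the abstract.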

  20. Salmo salar and Esox lucius full-length cDNA sequences reveal changes in evolutionary pressures on a post-tetraploidization genome

    Directory of Open Access Journals (Sweden)

    Holt Robert A

    2010-04-01

    Full Text Available Abstract Background Salmonids are one of the most intensely studied fish, in part due to their economic and environmental importance, and in part due to a recent whole genome duplication in the common ancestor of salmonids. This duplication greatly impacts species diversification, functional specialization, and adaptation. Extensive new genomic resources have recently become available for Atlantic salmon (Salmo salar), but documentation of allelic versus duplicate reference genes remains a major uncertainty in the complete characterization of its genome and its evolution. Results From existing expressed sequence tag (EST) resources and three new full-length cDNA libraries, 9,057 reference quality full-length gene insert clones were identified for Atlantic salmon. A further 1,365 reference full-length clones were annotated from 29,221 northern pike (Esox lucius) ESTs. Pairwise dN/dS comparisons within each of 408 sets of duplicated salmon genes using northern pike as a diploid out-group show asymmetric relaxation of selection on salmon duplicates. Conclusions 9,057 full-length reference genes were characterized in S. salar and can be used to identify alleles and gene family members. Comparisons of duplicated genes show that while purifying selection is the predominant force acting on both duplicates, consistent with retention of functionality in both copies, some relaxation of pressure on gene duplicates can be identified. In addition, there is evidence that evolution has acted asymmetrically on paralogs, allowing one of the pair to diverge at a faster rate.

  1. (α,α-dimethyl)glycyl (dmg) PNAs: achiral PNA analogs that form stronger hybrids with cDNA relative to isosequential RNA.

    Science.gov (United States)

    Gourishankar, Aland; Ganesh, Krishna N

    2012-01-01

    The design and facile synthesis of sterically constrained new analogs of PNA having gem-dimethyl substitutions on glycine (dmg-PNA-T) is presented. The PNA oligomers [aminoethyl dimethylglycyl (aedmg) and aminopropyl dimethylglycyl (apdmg)], synthesized from the monomers 6 and 12, effected remarkable stabilization of homothymine PNA(2):homoadenine DNA/RNA triplexes and mixed base sequence duplexes with target cDNA or RNA. They show a higher binding to DNA relative to that with isosequential RNA. This may be a structural consequence of the sterically rigid gem-dimethyl group, imposing a pre-organized conformation favorable for complex formation with cDNA. The results complement our previous work that had demonstrated that cyclohexanyl-PNAs favor binding with cRNA compared with cDNA and imply that the biophysical and structural properties of PNAs can be directed by introduction of the right rigidity in a PNA backbone devoid of chirality. This approach of tweaking selectivity in binding of PNA constructs by installing gem-dimethyl substitution in the PNA backbone can be extended to further fine-tuning by similar substitution in the aminoethyl segment as well, either individually or in conjunction with the present substitution.

  2. Construction of a T7 Human Lung Cancer cDNA Library

    Directory of Open Access Journals (Sweden)

    Wentao YUE

    2008-10-01

    Full Text Available Background and objective Currently, only a limited number of tumor markers are available for non-small cell lung cancer (NSCLC) diagnosis; new biomarkers, such as serum autoantibodies, may improve the early detection of lung cancer. Our objective was to construct human lung squamous carcinoma and adenocarcinoma T7 phage display cDNA libraries from the tissues of NSCLC patients. Methods mRNA was isolated from a pool of total RNA extracted from NSCLC tissues obtained from 5 adenocarcinomas and 5 squamous carcinomas, and the mRNA was then reverse transcribed into double-stranded cDNA. After digestion, the cDNA was inserted into the T7Select 10-3 vector. The phage display cDNA libraries were constructed by in vitro packaging reaction and plate proliferation. Plaque assay and PCR were used to evaluate the libraries. Results Two T7 phage display cDNA libraries were established. Plaque assay showed that the titer of the lung squamous carcinoma library was 1.8×10^6 pfu and that of the adenocarcinoma library was 5×10^6 pfu. The phage titers of the amplified libraries were 3.2×10^10 pfu/mL and 2.5×10^10 pfu/mL. PCR amplification of random plaques showed that the insert ratio was 100% (24/24) in the adenocarcinoma library and 95.8% (23/24) in the human lung squamous carcinoma library. Inserts ranged from 300 bp to 1,500 bp. Conclusion Two phage display cDNA libraries from NSCLC were constructed.

  3. Genomic organization, sequence characterization and expression analysis of Tenebrio molitor apolipophorin-III in response to an intracellular pathogen, Listeria monocytogenes.

    Science.gov (United States)

    Noh, Ju Young; Patnaik, Bharat Bhusan; Tindwa, Hamisi; Seo, Gi Won; Kim, Dong Hyun; Patnaik, Hongray Howrelia; Jo, Yong Hun; Lee, Yong Seok; Lee, Bok Luel; Kim, Nam Jung; Han, Yeon Soo

    2014-01-25

    Apolipophorin III (apoLp-III) is a well-known hemolymph protein having a functional role in lipid transport and immune response of insects. We cloned full-length cDNA encoding putative apoLp-III from larvae of the coleopteran beetle, Tenebrio molitor (TmapoLp-III), by identification of clones corresponding to the partial sequence of TmapoLp-III, followed by full-length sequencing by a clone-by-clone primer walking method. The complete cDNA consists of 890 nucleotides, including an ORF encoding 196 amino acid residues. Excluding a putative signal peptide of the first 20 amino acid residues, the 176-residue mature apoLp-III has a calculated molecular mass of 19,146 Da. Genomic sequence analysis with respect to its cDNA showed that TmapoLp-III was organized into four exons interrupted by three introns. Several immune-related transcription factor binding sites were discovered in the putative 5'-flanking region. BLAST and phylogenetic analyses reveal that TmapoLp-III has high sequence identity (88%) with Tribolium castaneum apoLp-III but shares little sequence homologies (molitor. Copyright © 2013 Elsevier B.V. All rights reserved.

  4. MytiBase: a knowledgebase of mussel (M. galloprovincialis) transcribed sequences

    Directory of Open Access Journals (Sweden)

    Roch Philippe

    2009-02-01

    Full Text Available Abstract Background: Although bivalves are among the most studied marine organisms due to their ecological role, economic importance and use in pollution biomonitoring, very little information is available on the genome sequences of mussels. This study reports the functional analysis of large-scale Expressed Sequence Tag (EST) sequencing from different tissues of Mytilus galloprovincialis (the Mediterranean mussel) challenged with toxic pollutants, temperature and potentially pathogenic bacteria. Results: We have constructed and sequenced seventeen cDNA libraries from different Mediterranean mussel tissues: gills, digestive gland, foot, anterior and posterior adductor muscle, mantle and haemocytes. A total of 24,939 clones were sequenced from these libraries, generating 18,788 high-quality ESTs, which were assembled into 2,446 overlapping clusters and 4,666 singletons, resulting in a total of 7,112 non-redundant sequences. In particular, a high-quality normalized cDNA library (Nor01) was constructed, as determined by the high rate of gene discovery (65.6%). Bioinformatic screening of the non-redundant M. galloprovincialis sequences identified 159 microsatellite-containing ESTs. Clusters, consensuses, related similarities and gene ontology searches have been organized in a dedicated, searchable database http://mussel.cribi.unipd.it. Conclusion: We defined the first species-specific catalogue of M. galloprovincialis ESTs, including 7,112 unique transcribed sequences. Putative microsatellite markers were identified. This annotated catalogue represents a valuable platform for expression studies, marker validation and genetic linkage analysis for investigations in the biology of Mediterranean mussels.
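
    The counts quoted in this record can be cross-checked with a few lines of arithmetic. The sketch below uses only the figures stated in the abstract; the variable names and the "pass rate" ratio are illustrative and are not taken from the MytiBase paper.

```python
# Figures quoted in the abstract above.
clones_sequenced = 24_939
high_quality_ests = 18_788
clusters = 2_446
singletons = 4_666

# Non-redundant sequences = assembled clusters + unassembled singletons.
non_redundant = clusters + singletons

# Illustrative ratio: share of sequenced clones that yielded a high-quality EST.
pass_rate = high_quality_ests / clones_sequenced

print(non_redundant)        # 7112, matching the total reported in the abstract
print(f"{pass_rate:.1%}")
```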

  5. Rapid in silico cloning of genes using expressed sequence tags (ESTs).

    Science.gov (United States)

    Gill, R W; Sanseau, P

    2000-01-01

    Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs make up the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathological states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we review the current status of EST databases and the pros and cons of EST-type data, and describe possible strategies to retrieve meaningful information.

  6. Inconsistencies of genome annotations in apicomplexan parasites revealed by 5'-end-one-pass and full-length sequences of oligo-capped cDNAs

    Directory of Open Access Journals (Sweden)

    Sugano Sumio

    2009-07-01

    Full Text Available Abstract Background: Apicomplexan parasites are causative agents of various diseases including malaria and have been targets of extensive genomic sequencing. We generated 5'-EST collections for six apicomplexan parasites using our full-length oligo-capping cDNA library method. To improve upon the current genome annotations, as well as to validate the importance of physical cDNA clone resources, we generated a large-scale collection of full-length cDNAs for several apicomplexan parasites. Results: In this study, we used a total of 61,056 5'-end-single-pass cDNA sequences from Plasmodium falciparum, P. vivax, P. yoelii, P. berghei, Cryptosporidium parvum, and Toxoplasma gondii. We compared these partially sequenced cDNAs with the currently annotated gene models and observed significant inconsistencies between the two datasets. In particular, we found that on average 14% of the exons in the current gene models were not supported by any cDNA evidence, and that 16% of the current gene models may contain at least one mis-annotation and should be re-evaluated. We also identified a large number of previously unidentified transcripts. For 732 cDNAs in T. gondii, the entire sequences were determined in order to evaluate the annotated gene models at the complete full-length transcript level. We found that 41% of the T. gondii gene models contained at least one inconsistency. We also identified and confirmed by RT-PCR 140 previously unidentified transcripts found in the intergenic regions of the current gene annotations. We show that the majority of these discrepancies are due to questionable predictions of one or two extra exons in the upstream or downstream regions of the genes. Conclusion: Our data indicate that the current gene models are likely to still be incomplete and have much room for improvement. Our unique full-length cDNA information is especially useful for further refinement of the annotations for the genomes of
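
    The comparison described in this record (annotated exons checked against cDNA evidence) can be illustrated with a simple interval-overlap test. This is a minimal sketch under stated assumptions, not the authors' pipeline: coordinates are half-open intervals on a single chromosome and strand, any overlap counts as support, and the function names and toy coordinates are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end) on one chromosome/strand

def overlaps(a: Interval, b: Interval) -> bool:
    """True when two half-open intervals share at least one base."""
    return a[0] < b[1] and b[0] < a[1]

def unsupported_exons(exons: List[Interval],
                      cdna_blocks: List[Interval]) -> List[Interval]:
    """Return annotated exons that no aligned cDNA block overlaps."""
    return [e for e in exons if not any(overlaps(e, c) for c in cdna_blocks)]

# Toy gene model with four exons and three aligned cDNA blocks (made-up values).
exons = [(100, 200), (300, 400), (500, 650), (800, 900)]
cdna_blocks = [(120, 200), (300, 390), (520, 640)]

missing = unsupported_exons(exons, cdna_blocks)
fraction = len(missing) / len(exons)
print(missing, fraction)  # [(800, 900)] 0.25
```

    A real analysis would also need to handle strand, multiple chromosomes and partial support at exon boundaries; the quadratic scan here is only suitable for small examples.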

  7. Deep sequencing of the Camellia chekiangoleosa transcriptome revealed candidate genes for anthocyanin biosynthesis.

    Science.gov (United States)

    Wang, Zhong-Wei; Jiang, Cong; Wen, Qiang; Wang, Na; Tao, Yuan-Yuan; Xu, Li-An

    2014-03-15

    Camellia chekiangoleosa is an important species of the genus Camellia. It provides high-quality edible oil and has great ornamental value. Its large red flowers bloom between February and March. Flower pigmentation is closely related to the accumulation of anthocyanin. Although anthocyanin biosynthesis has been studied extensively in herbaceous plants, little molecular information on the anthocyanin biosynthesis pathway of C. chekiangoleosa is known. In the present study, a cDNA library was constructed to obtain detailed and general data from the flowers of C. chekiangoleosa. To explore the transcriptome of C. chekiangoleosa and investigate genes involved in anthocyanin biosynthesis, a 454 GS FLX Titanium platform was used to generate an EST dataset. About 46,279 sequences were obtained, and 24,593 (53.1%) were annotated. Using a BLAST search against AGRIS, 1740 unigenes were found to be homologous to 599 Arabidopsis transcription factor genes. Based on the transcriptome dataset, nine anthocyanin biosynthesis pathway genes (PAL, CHS1, CHS2, CHS3, CHI, F3H, DFR, ANS, and UFGT) were identified and cloned. The spatio-temporal expression patterns of these genes were also analyzed using quantitative real-time polymerase chain reaction. The results not only enrich the gene resource but also provide valuable information for further studies of anthocyanin biosynthesis. Copyright © 2014 Elsevier B.V. All rights reserved.

  8. Isolation of a cDNA for a Growth Factor of Vascular Endothelial Cells from Human Lung Cancer Cells: Its Identity with Insulin‐like Growth Factor II

    Science.gov (United States)

    Hagiwara, Koichi; Kobayashi, Tatsuo; Tobita, Masato; Kikyo, Nobuaki; Yazaki, Yoshio

    1995-01-01

    We have found growth‐promoting activity for vascular endothelial cells in the conditioned medium of a human lung cancer cell line, T3M‐11. Purification and characterization of the growth‐promoting activity have been carried out using ammonium sulfate precipitation and gel‐exclusion chromatography. The activity migrated as a single peak just after ribonuclease. It did not bind to a heparin affinity column. These results suggest that the activity is not a heparin‐binding growth factor (including fibroblast growth factors) or a vascular endothelial growth factor. To identify the molecule exhibiting the growth‐promoting activity, a cDNA encoding the growth factor was isolated through functional expression cloning in COS‐1 cells from a cDNA library prepared from T3M‐11 cells. The nucleotide sequence encoded by the cDNA proved to be identical with that of insulin‐like growth factor II. PMID:7730145

  9. High-resolution deep sequencing reveals biodiversity, population structure, and persistence of HIV-1 quasispecies within host ecosystems

    Directory of Open Access Journals (Sweden)

    Yin Li

    2012-12-01

    Full Text Available Abstract Background: Deep sequencing provides the basis for analysis of biodiversity of taxonomically similar organisms in an environment. While extensively applied to microbiome studies, population genetics studies of viruses are limited. To define the scope of HIV-1 population biodiversity within infected individuals, a suite of phylogenetic and population genetic algorithms was applied to HIV-1 envelope hypervariable domain 3 (Env V3) within peripheral blood mononuclear cells from a group of perinatally HIV-1 subtype B infected, therapy-naïve children. Results: Biodiversity of HIV-1 Env V3 quasispecies ranged from about 70 to 270 unique sequence clusters across individuals. Viral population structure was organized into a limited number of clusters that included the dominant variants combined with multiple clusters of low frequency variants. Next generation viral quasispecies evolved from low frequency variants at earlier time points through multiple non-synonymous changes in lineages within the evolutionary landscape. Minor V3 variants detected as long as four years after infection co-localized in phylogenetic reconstructions with early transmitting viruses or with subsequent plasma virus circulating two years later. Conclusions: Deep sequencing defines HIV-1 population complexity and structure, reveals the ebb and flow of dominant and rare viral variants in the host ecosystem, and identifies an evolutionary record of low-frequency cell-associated viral V3 variants that persist for years. The bioinformatics pipeline developed for HIV-1 can be applied for biodiversity studies of virome populations in human, animal, or plant ecosystems.
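
    The distinction drawn in this record between dominant and low-frequency variants can be illustrated by tallying identical reads. The sketch below is a simplification and not the authors' pipeline: it groups reads by exact sequence identity rather than by the clustering used in the study, the toy sequences are invented, and the 50% and 20% thresholds are hypothetical.

```python
from collections import Counter

def variant_frequencies(reads):
    """Map each distinct read sequence to its share of all reads, most common first."""
    counts = Counter(reads)
    total = sum(counts.values())
    return {seq: n / total for seq, n in counts.most_common()}

# Toy amino-acid reads (made up): one dominant variant and two rarer ones.
reads = ["CTRPNNNTRK"] * 6 + ["CTRPNNNTRR"] * 3 + ["CTRPGNNTRK"] * 1

freqs = variant_frequencies(reads)
dominant = [s for s, f in freqs.items() if f >= 0.5]  # hypothetical cutoff
minor = [s for s, f in freqs.items() if f < 0.2]      # hypothetical cutoff

print(len(freqs), dominant, minor)  # 3 ['CTRPNNNTRK'] ['CTRPGNNTRK']
```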

  10. Recurrent chimeric RNAs enriched in human prostate cancer identified by deep sequencing

    Science.gov (United States)

    Kannan, Kalpana; Wang, Liguo; Wang, Jianghua; Ittmann, Michael M.; Li, Wei; Yen, Laising

    2011-01-01

    Transcription-induced chimeric RNAs, possessing sequences from different genes, are expected to increase the proteomic diversity through chimeric proteins or altered regulation. Despite their importance, few studies have focused on chimeric RNAs, especially regarding their presence and roles in human cancers. By deep sequencing the transcriptome of 20 human prostate cancer and 10 matched benign prostate tissues, we obtained 1.3 billion sequence reads, which led to the identification of 2,369 chimeric RNA candidates. Chimeric RNAs occurred at significantly higher frequency in cancer than in matched benign samples. Experimental investigation of a selected set of 46 candidates led to the confirmation of 32 chimeric RNAs, of which 27 were highly recurrent and previously undescribed in prostate cancer. Importantly, a subset of these chimeras was present in prostate cancer cell lines, but not detectable in primary human prostate epithelium cells, implying their association with cancer. These chimeras contain discernible 5′ and 3′ splice sites at the RNA junction, indicating that their formation is mediated by splicing. Their presence is also largely independent of the expression of parental genes, suggesting that other factors are involved in their production and regulation. One chimera, TMEM79-SMG5, is highly differentially expressed in human cancer samples and is therefore a potential biomarker. The prevalence of chimeric RNAs may allow the limited number of human genes to encode a substantially larger number of RNAs and proteins, forming an additional layer of cellular complexity. Together, our results suggest that chimeric RNAs are widespread, and increased chimeric RNA events could represent a unique class of molecular alteration in cancer. PMID:21571633

  11. Ultra-deep sequencing of mouse mitochondrial DNA: mutational patterns and their origins.

    Directory of Open Access Journals (Sweden)

    Adam Ameur

    2011-03-01

    Full Text Available Somatic mutations of mtDNA are implicated in the aging process, but there is no universally accepted method for their accurate quantification. We have used ultra-deep sequencing to study genome-wide mtDNA mutation load in the liver of normally- and prematurely-aging mice. Mice that are homozygous for an allele expressing a proof-reading-deficient mtDNA polymerase (mtDNA mutator mice) have 10-times-higher point mutation loads than their wildtype siblings. In addition, the mtDNA mutator mice have increased levels of a truncated linear mtDNA molecule, resulting in decreased sequence coverage in the deleted region. In contrast, circular mtDNA molecules with large deletions occur at extremely low frequencies in mtDNA mutator mice and therefore cannot drive the premature aging phenotype. Sequence analysis shows that the main proportion of the mutation load in heterozygous mtDNA mutator mice and their wildtype siblings is inherited from their heterozygous mothers, consistent with germline transmission. We found no increase in levels of point mutations or deletions in wildtype C57Bl/6N mice with increasing age, thus questioning the causative role of these changes in aging. In addition, there was no increased frequency of transversion mutations with time in any of the studied genotypes, arguing against oxidative damage as a major cause of mtDNA mutations. Our results from studies of mice thus indicate that most somatic mtDNA mutations occur as replication errors during development and do not result from damage accumulation in adult life.

  12. Human beta 2 chain of laminin (formerly S chain): cDNA cloning, chromosomal localization, and expression in carcinomas

    DEFF Research Database (Denmark)

    Wewer, U M; Gerecke, D R; Durkin, M E

    1994-01-01

    or other known laminin genes. Immunostaining showed that the beta 2 chain is localized to the smooth muscle basement membranes of the arteries, while the homologous beta 1 chain is confined to the subendothelial basement membranes. The beta 2 chain was found in the basement membranes of ovarian carcinomas......Overlapping cDNA clones that encode the full-length human laminin beta 2 chain, formerly called the S chain, were isolated. The cDNA of 5680 nt contains a 5391-nt open reading frame encoding 1797 amino acids. At the amino terminus is a 32-amino-acid signal peptide that is followed by the mature...... beta 2 chain polypeptide of 1765 amino acids with a calculated molecular mass of 192,389 Da. The human beta 2 chain is predicted to have all of the seven structural domains typical of the beta chains of laminin, including the short cysteine-rich alpha region. The amino acid sequence of human beta 2...

  13. Characterization of cDNA encoding molt-inhibiting hormone of the crab, Cancer pagurus; expression of MIH in non-X-organ tissues.

    Science.gov (United States)

    Lu, W; Wainwright, G; Olohan, L A; Webster, S G; Rees, H H; Turner, P C

    2001-10-31

    Synthesis of ecdysteroids (molting hormones) by crustacean Y-organs is regulated by a neuropeptide, molt-inhibiting hormone (MIH), produced in eyestalk neural ganglia. We report here the molecular cloning of a cDNA encoding MIH of the edible crab, Cancer pagurus. Full-length MIH cDNA was obtained by using reverse transcription-polymerase chain reaction (RT-PCR) with degenerate oligonucleotides based upon the amino acid sequence of MIH, in conjunction with 5'- and 3'-RACE. Full-length clones of MIH cDNA were obtained that encoded a 35 amino acid putative signal peptide and the mature 78 amino acid peptide. Of various tissues examined by Northern blot analysis, the X-organ was the sole major site of expression of the MIH gene. However, a nested-PCR approach using non-degenerate MIH-specific primers indicated the presence of MIH transcripts in other tissues. Southern blot analysis indicated a simple gene arrangement with at least two copies of the MIH gene in the genome of C. pagurus. Additional Southern blotting experiments detected MIH-hybridizing bands in another Cancer species, Cancer antennarius and another crab species, Carcinus maenas.

  14. Quantification of differential gene expression by multiplexed targeted resequencing of cDNA

    Science.gov (United States)

    Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.

    2017-01-01

    Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
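
    The idea of estimating differential expression from counts of individual molecules can be sketched as follows. This is an illustration under stated assumptions rather than the published cDNA-smMIP workflow: each read is assumed to carry a probe identifier and a molecular tag, reads sharing both are treated as copies of one molecule, no library-size normalization is applied, and the probe names, tags and pseudocount are hypothetical.

```python
import math
from collections import defaultdict

def molecule_counts(reads):
    """Count distinct molecular tags per probe; repeated reads of a tag count once."""
    tags = defaultdict(set)
    for probe, tag in reads:
        tags[probe].add(tag)
    return {probe: len(seen) for probe, seen in tags.items()}

def log2_fold_change(case, control, pseudocount=1.0):
    """Per-probe log2 ratio of molecule counts, with a pseudocount to avoid log(0)."""
    probes = set(case) | set(control)
    return {
        p: math.log2((case.get(p, 0) + pseudocount) / (control.get(p, 0) + pseudocount))
        for p in probes
    }

# Toy reads as (probe, molecular tag) pairs; duplicates mimic PCR copies.
control_reads = [("geneA_p1", "AAC"), ("geneA_p1", "AAC"),
                 ("geneA_p1", "GTT"), ("geneB_p1", "CCA")]
case_reads = [("geneA_p1", "AAC"), ("geneA_p1", "GTT"), ("geneA_p1", "TGA"),
              ("geneA_p1", "CAG"), ("geneA_p1", "ACG"), ("geneB_p1", "CCA")]

control = molecule_counts(control_reads)   # geneA_p1: 2 molecules, geneB_p1: 1
case = molecule_counts(case_reads)         # geneA_p1: 5 molecules, geneB_p1: 1
lfc = log2_fold_change(case, control)
print(lfc["geneA_p1"], lfc["geneB_p1"])    # 1.0 0.0
```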

  15. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing

    Science.gov (United States)

    Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O’Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis; Borrmann, Steffen; Kiara, Steven M.; Marsh, Kevin; Jiang, Hongying; Su, Xin-Zhuan; Amaratunga, Chanaki; Fairhurst, Rick; Socheat, Duong; Nosten, Francois; Imwong, Mallika; White, Nicholas J.; Sanders, Mandy; Anastasi, Elisa; Alcock, Dan; Drury, Eleanor; Oyola, Samuel; Quail, Michael A.; Turner, Daniel J.; Rubio, Valentin Ruano; Jyothi, Dushyanth; Amenga-Etego, Lucas; Hubbart, Christina; Jeffreys, Anna; Rowlands, Kate; Sutherland, Colin; Roper, Cally; Mangano, Valentina; Modiano, David; Tan, John C.; Ferdig, Michael T.; Amambua-Ngwa, Alfred; Conway, David J.; Takala-Harrison, Shannon; Plowe, Christopher V.; Rayner, Julian C.; Rockett, Kirk A.; Clark, Taane G.; Newbold, Chris I.; Berriman, Matthew; MacInnis, Bronwyn; Kwiatkowski, Dominic P.

    2013-01-01

    Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance [1,2]. Here we describe methods for large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short term culture. Analysis of 86,158 exonic SNPs that passed genotyping quality control in 227 samples from Africa, Asia and Oceania provides genome-wide estimates of allele frequency distribution, population structure and linkage disequilibrium. By comparing the genetic diversity of individual infections with that of the local parasite population, we derive a metric of within-host diversity that is related to the level of inbreeding in the population. An open-access web application has been established for exploration of regional differences in allele frequency and of highly differentiated loci in the P. falciparum genome. PMID:22722859

  16. Genomic region operation kit for flexible processing of deep sequencing data.

    Science.gov (United States)

    Ovaska, Kristian; Lyly, Lauri; Sahu, Biswajyoti; Jänne, Olli A; Hautaniemi, Sampsa

    2013-01-01

    Computational analysis of data produced in deep sequencing (DS) experiments is challenging due to large data volumes and requirements for flexible analysis approaches. Here, we present a mathematical formalism based on set algebra for frequently performed operations in DS data analysis to facilitate translation of biomedical research questions to language amenable for computational analysis. With the help of this formalism, we implemented the Genomic Region Operation Kit (GROK), which supports various DS-related operations such as preprocessing, filtering, file conversion, and sample comparison. GROK provides high-level interfaces for R, Python, Lua, and command line, as well as an extension C++ API. It supports major genomic file formats and allows storing custom genomic regions in efficient data structures such as red-black trees and SQL databases. To demonstrate the utility of GROK, we have characterized the roles of two major transcription factors (TFs) in prostate cancer using data from 10 DS experiments. GROK is freely available with a user guide from http://csbi.ltdk.helsinki.fi/grok/.
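
    The set-algebra view of genomic regions mentioned in this record can be illustrated with plain interval lists. The sketch below does not use or reproduce the GROK API; the function names are hypothetical, regions are half-open intervals on a single chromosome, and the two toy region sets stand in for binding sites of two transcription factors.

```python
def normalize(regions):
    """Sort half-open intervals and merge any that overlap or touch."""
    merged = []
    for start, end in sorted(regions):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

def union(a, b):
    """Regions covered by a or b."""
    return normalize(list(a) + list(b))

def intersection(a, b):
    """Regions covered by both a and b (two-pointer sweep)."""
    a, b = normalize(a), normalize(b)
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        start, end = max(a[i][0], b[j][0]), min(a[i][1], b[j][1])
        if start < end:
            out.append((start, end))
        # Advance whichever interval finishes first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return out

def difference(a, b):
    """Regions covered by a but not by b."""
    b = normalize(b)
    out = []
    for start, end in normalize(a):
        cur = start
        for bs, be in b:
            if be <= cur:
                continue          # b interval lies entirely before the cursor
            if bs >= end:
                break             # b interval lies entirely after this region
            if bs > cur:
                out.append((cur, bs))
            cur = max(cur, be)
            if cur >= end:
                break
        if cur < end:
            out.append((cur, end))
    return out

tf1 = [(100, 200), (150, 300), (500, 600)]
tf2 = [(250, 550)]
print(union(tf1, tf2))         # [(100, 600)]
print(intersection(tf1, tf2))  # [(250, 300), (500, 550)]
print(difference(tf1, tf2))    # [(100, 250), (550, 600)]
```

    Expressing questions such as "sites bound by one factor but not the other" as a difference of region sets is the kind of translation the formalism is meant to support; a production tool would add chromosome and strand handling and indexed storage.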

  17. Geochemical features and effects on deep-seated fluids during the May-June 2012 southern Po Valley seismic sequence

    Directory of Open Access Journals (Sweden)

    Francesco Italiano

    2012-10-01

    Full Text Available Periodic sampling of the groundwaters and of the dissolved and free gases in selected deep wells located in the area affected by the May-June 2012 southern Po Valley seismic sequence has provided insight into seismogenic-induced changes of the local aquifer systems. The results show progressive changes in the fluid geochemistry, establishing that deep-seated fluids were mobilized during the seismic sequence and reached surface layers along faults and fractures, which generated significant geochemical anomalies. The May-June 2012 seismic swarm (mainshock on May 29, 2012, M 5.8; 7 shocks M > 5; about 200 events 3 < M < 5) induced several modifications in the circulating fluids. This study reports the preliminary results obtained for the geochemical features of the waters and gases collected over the epicentral area from boreholes drilled at different depths, thus intercepting water and gases with different origins and circulation. The aim of the investigations was to improve our knowledge of the fluids circulating over the seismic area (e.g. origin, provenance, interactions, mixing of different components, temporal changes). This was achieved by collecting samples from both shallow and deep-drilled boreholes, and then, after the selection of the relevant sites, we looked for temporal changes with mid-to-long-term monitoring activity following a constant sampling rate. This allowed us to gain better insight into the relationships between the fluid circulation and the faulting activity. The sampling sites are listed in Table 1, along with the analytical results of the gas phase. […

  18. Cloning of cDNA sequences encoding cowpea (Vigna unguiculata) vicilins: Computational simulations suggest a binding mode of cowpea vicilins to chitin oligomers.

    Science.gov (United States)

    Rocha, Antônio J; Sousa, Bruno L; Girão, Matheus S; Barroso-Neto, Ito L; Monteiro-Júnior, José E; Oliveira, José T A; Nagano, Celso S; Carneiro, Rômulo F; Monteiro-Moreira, Ana C O; Rocha, Bruno A M; Freire, Valder N; Grangeiro, Thalles B

    2018-05-27

    Vicilins are 7S globulins which constitute the major seed storage proteins in leguminous species. Variant vicilins showing differential binding affinities for chitin have been implicated in the resistance and susceptibility of cowpea to the bruchid Callosobruchus maculatus. These proteins are members of the cupin superfamily, which includes a wide variety of enzymes and non-catalytic seed storage proteins. The cupin fold does not share similarity with any known chitin-binding domain. Therefore, it is poorly understood how these storage proteins bind to chitin. In this work, partial cDNA sequences encoding β-vignin, the major component of cowpea vicilins, were obtained from developing seeds. Three-dimensional molecular models of β-vignin showed the characteristic cupin fold, and computational simulations revealed that each vicilin trimer contained 3 chitin-binding sites. Interaction models showed that chito-oligosaccharides bound to β-vignin were stabilized mainly by hydrogen bonds, a common structural feature of typical carbohydrate-binding proteins. Furthermore, many of the residues involved in the chitin-binding sites of β-vignin are conserved in other 7S globulins. These results support previous experimental evidence on the ability of vicilin-like proteins from cowpea and other leguminous species to bind in vitro to chitin as well as in vivo to chitinous structures of the larval C. maculatus midgut. Copyright © 2018. Published by Elsevier B.V.

  19. Molecular cloning of the cDNA encoding follicle-stimulating hormone beta subunit of the Chinese soft-shell turtle Pelodiscus sinensis, and its gene expression.

    Science.gov (United States)

    Chien, Jung-Tsun; Shen, San-Tai; Lin, Yao-Sung; Yu, John Yuh-Lin

    2005-04-01

    Follicle-stimulating hormone (FSH) is a member of the pituitary glycoprotein hormone family. These hormones are composed of two dissimilar subunits, alpha and beta. Very little information is available regarding the nucleotide and amino acid sequence of FSHbeta in reptilian species. For better understanding of the phylogenetic diversity and evolution of the FSH molecule, we have isolated and sequenced the complementary DNA (cDNA) encoding the Chinese soft-shell turtle (Pelodiscus sinensis, Family Trionychidae) FSHbeta precursor molecule by reverse transcription-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE) methods. The cloned Chinese soft-shell turtle FSHbeta cDNA consists of 602 bp, including 34 bp of the 5'-untranslated region (UTR), 396 bp of the open reading frame, and a 3'-UTR of 206 bp. It encodes a 131-amino acid precursor molecule of the FSHbeta subunit with a signal peptide of 20 amino acids followed by a mature protein of 111 amino acids. Twelve cysteine residues, forming six disulfide bonds within the beta-subunit, and two putative asparagine-linked glycosylation sites are also conserved in the Chinese soft-shell turtle FSHbeta subunit. The deduced amino acid sequence of the Chinese soft-shell turtle FSHbeta shares identities of 97% with Reeves's turtle (Family Bataguridae), 83-89% with birds, 61-70% with mammals, 63-66% with amphibians and 40-58% with fish. By contrast, when comparing the FSHbeta with the beta-subunits of the Chinese soft-shell turtle luteinizing hormone and thyroid stimulating hormone, the homologies are as low as 38 and 39%, respectively. A phylogenetic tree of FSHbeta subunits that includes reptilian species is presented for the first time. Of the various tissues examined, FSHbeta mRNA was expressed only in the pituitary gland and can be up-regulated by gonadotropin-releasing hormone in pituitary tissue culture, as estimated by fluorescence real-time PCR analysis.

  20. Nucleotide sequences of cDNAs for human papillomavirus type 18 transcripts in HeLa cells

    International Nuclear Information System (INIS)

    Inagaki, Yutaka; Tsunokawa, Youko; Takebe, Naoko; Terada, Masaaki; Sugimura, Takashi; Nawa, Hiroyuki; Nakanishi, Shigetada

    1988-01-01

    HeLa cells expressed 3.4- and 1.6-kilobase (kb) transcripts of the integrated human papillomavirus (HPV) type 18 genome. Two types of cDNA clones representing each size of HPV type 18 transcript were isolated. Sequence analysis of these two types of cDNA clones revealed that the 3.4-kb transcript contained E6, E7, the 5' portion of E1, and human sequence and that the 1.6-kb transcript contained spliced and frameshifted E6 (E6*), E7, and human sequence. There was a common human sequence containing a poly(A) addition signal in the 3' end portions of both transcripts, indicating that they were transcribed from the HPV genome at the same integration site with different splicing. Furthermore, the 1.6-kb transcript contained both of the two viral TATA boxes upstream of E6, strongly indicating that a cellular promoter was used for its transcription.

  1. Creation of Functional Viruses from Non-Functional cDNA Clones Obtained from an RNA Virus Population by the Use of Ancestral Reconstruction

    DEFF Research Database (Denmark)

    Fahnøe, Ulrik; Pedersen, Anders Gorm; Dräger, Carolin

    2015-01-01

    necessarily be the descendant of a functional ancestor, we hypothesized that it should be possible to produce functional clones by reconstructing ancestral sequences. To test this we used phylogenetic methods to infer two ancestral sequences, which were then reconstructed as cDNA clones. Viruses rescued from...... the reconstructed cDNAs were tested in cell culture and pigs. Both reconstructed ancestral genomes proved functional, and displayed distinct phenotypes in vitro and in vivo. We suggest that reconstruction of ancestral viruses is a useful tool for experimental and computational investigations of virulence and viral...... evolution. Importantly, ancestral reconstruction can be done even on the basis of a set of sequences that all correspond to non-functional variants....

  2. Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence.

    Science.gov (United States)

    Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L

    2009-07-01

    Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1-3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3, which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent.

  3. cDNA Cloning, expression and characterization of an allergenic 60s ribosomal protein of almond (Prunus dulcis).

    Science.gov (United States)

    Abolhassani, Mohsen; Roux, Kenneth H

    2009-06-01

    Tree nuts, including almond (Prunus dulcis), are a source of food allergens often associated with life-threatening allergic reactions in susceptible individuals. Although the proteins in almonds have been biochemically characterized, relatively little has been reported regarding the identity of the allergens involved in almond sensitivity. The present study was undertaken to identify the allergens of the almond by a cDNA library approach. A cDNA library of almond seeds was constructed in the Uni-Zap XR lambda vector and expressed in E. coli XL-1 blue. Plaques were immunoscreened with pooled sera of allergic patients. The cDNA clone reacting significantly with specific IgE antibodies was selected, subcloned and subsequently expressed in E. coli. The amino acid sequence deduced from the PCR product of the clone showed homology to the 60s acidic ribosomal protein of almond. The expressed protein was 11,450 Dalton without the leader sequence. Immunoreactivity of the recombinant 60s ribosomal protein (r60sRP) was evaluated by dot blot analysis using pooled and individual sera of allergic patients. The data showed that r60sRP and almond extract (as positive control) are able to bind the IgE antibodies. The results showed that the expressed protein is an almond allergen. Whether this r60sRP represents a major allergen of almond needs further study, which will require a large number of sera from almond-atopic patients and determination of the IgE-reactive frequency of each individual allergen.

  4. Identification of immune protective genes of Eimeria maxima through cDNA expression library screening.

    Science.gov (United States)

    Yang, XinChao; Li, MengHui; Liu, JianHua; Ji, YiHong; Li, XiangRui; Xu, LiXin; Yan, RuoFeng; Song, XiaoKai

    2017-02-16

    Eimeria maxima is one of the most prevalent Eimeria species causing avian coccidiosis, and results in huge economic loss to the global poultry industry. Current control strategies, such as anti-coccidial medication and live vaccines, have been limited because of their drawbacks. The third generation anticoccidial vaccines, including recombinant vaccines as well as DNA vaccines, have been suggested as a promising alternative strategy. To date, only a few protective antigens of E. maxima have been reported. Hence, there is an urgent need to identify novel protective antigens of E. maxima for the development of neotype anticoccidial vaccines. With the aim of identifying novel protective genes of E. maxima, a cDNA expression library of E. maxima sporozoites was constructed using Gateway technology. Subsequently, the cDNA expression library was divided into 15 sub-libraries for cDNA expression library immunization (cDELI) using a parasite challenge model in chickens. Protective sub-libraries were selected for the next round of screening until individual protective clones were obtained, which were further sequenced and analyzed. Adopting the Gateway technology, a high-quality entry library was constructed, containing 9.2 × 10^6 clones with an average insert length of 1.63 kb. The expression library capacity was 2.32 × 10^7 colony-forming units (cfu) with an average insert length of 1.64 kb. The expression library was screened using a parasite challenge model in chickens. The screening yielded 6 immune protective genes, including four novel protective genes (EmJS-1, EmRP, EmHP-1 and EmHP-2) and two known protective genes (EmSAG and EmCKRS). EmJS-1 is the selR domain-containing protein of E. maxima whose function is unknown. EmHP-1 and EmHP-2 are hypothetical proteins of E. maxima. EmRP and EmSAG are a rhomboid-like protein and a surface antigen glycoprotein of E. maxima, respectively, and are involved in invasion of the parasite. Our

  5. Cloning of soluble alkaline phosphatase cDNA and molecular basis of the polymorphic nature in alkaline phosphatase isozymes of Bombyx mori midgut.

    Science.gov (United States)

    Itoh, M; Kanamori, Y; Takao, M; Eguchi, M

    1999-02-01

    A cDNA coding for soluble type alkaline phosphatase (sALP) of Bombyx mori was isolated. Deduced amino acid sequence showed high identities to various ALPs and partial similarities to ATPase of Manduca sexta. Using this cDNA sequence as a probe, the molecular basis of electrophoretic polymorphism in sALP and membrane-bound type ALP (mALP) was studied. As for mALP, the result suggested that post-translational modification was important for the proteins to express activity and to represent their extensive polymorphic nature, whereas the magnitude of activities was mainly regulated by transcription. On the other hand, sALP zymogram showed poor polymorphism, but one exception was the null mutant, in which the sALP gene was largely lost. Interestingly, the sALP gene was shown to be transcribed into two mRNAs of different sizes, 2.0 and 2.4 Kb. In addition to the null mutant of sALP, we found a null mutant for mALP. Both of these mutants seem phenotypically silent, suggesting that the functional differentiation between these isozymes is not perfect, so that they can still work mutually and complement each other as an indispensable enzyme for B. mori.

  6. Purification of Single-Stranded cDNA Based on RNA Degradation Treatment and Adsorption Chromatography.

    Science.gov (United States)

    Trujillo-Esquivel, Elías; Franco, Bernardo; Flores-Martínez, Alberto; Ponce-Noyola, Patricia; Mora-Montes, Héctor M

    2016-08-02

    Analysis of gene expression is a common research tool to study networks controlling gene expression, the role of genes with unknown function, and environmentally induced responses of organisms. Most of the analytical tools used to analyze gene expression rely on accurate cDNA synthesis and quantification to obtain reproducible and quantifiable results. Thus far, most commercial kits for isolation and purification of cDNA target double-stranded molecules, which do not accurately represent the abundance of transcripts. In the present report, we provide a simple and fast method to purify single-stranded cDNA, exhibiting high purity and yield. This method is based on the treatment with RNase H and RNase A after cDNA synthesis, followed by separation in silica spin-columns and ethanol precipitation. In addition, our method avoids the use of DNase I to eliminate genomic DNA from RNA preparations, which improves cDNA yield. As a case report, our method proved to be useful in the purification of single-stranded cDNA from the pathogenic fungus Sporothrix schenckii.

  7. Molecular cloning of a cysteine proteinase cDNA from the cotton boll weevil Anthonomus grandis (Coleoptera: Curculionidae).

    Science.gov (United States)

    De Oliveira Neto, Osmundo Brilhante; Batista, João Aguiar Nogueira; Rigden, Daniel John; Franco, Octávio Luiz; Fragoso, Rodrigo Rocha; Monteiro, Ana Carolina Santos; Monnerat, Rose Gomes; Grossi-De-Sa, Maria Fátima

    2004-06-01

    The cotton boll weevil (Anthonomus grandis) causes severe cotton crop losses in North and South America. This report describes the presence of cysteine proteinase activity in the cotton boll weevil. Cysteine proteinase inhibitors from different sources were assayed against total A. grandis proteinases but, unexpectedly, no inhibitor tested was particularly effective. In order to screen for active inhibitors against the boll weevil, a cysteine proteinase cDNA (Agcys1) was isolated from A. grandis larvae using degenerate primers and rapid amplification of cDNA ends (RACE) techniques. Sequence analysis showed significant homologies with other insect cysteine proteinases. Northern blot analysis indicated that the mRNA encoding the proteinase was transcribed mainly in the gut of larvae. No mRNA was detected in neonatal larvae, pupae, or in the gut of the adult insect, suggesting that Agcys1 is an important cysteine proteinase for larvae digestion. The isolated gene will facilitate the search for highly active inhibitors towards boll weevil larvae that may provide a new opportunity to control this important insect pest.

  8. Construction of full-length cDNA library of white flower Salvia ...

    African Journals Online (AJOL)

    In order to screen and isolate genes related to secondary metabolite biosynthesis, we constructed a cDNA library of white flower Salvia miltiorrhiza Bge. f. alba. High-quality total RNA was successfully isolated from roots of white flower S. miltiorrhiza using a modified CTAB method. Double-stranded cDNA was cloned into pDNR-LIB ...

  9. Sequencing and analysis of full-length cDNAs, 5'-ESTs and 3'-ESTs from a cartilaginous fish, the elephant shark (Callorhinchus milii).

    KAUST Repository

    Brenner, Sydney

    2012-10-08

    Cartilaginous fishes are the most ancient group of living jawed vertebrates (gnathostomes) and are, therefore, an important reference group for understanding the evolution of vertebrates. The elephant shark (Callorhinchus milii), a holocephalan cartilaginous fish, has been identified as a model cartilaginous fish genome because of its compact genome (∼910 Mb) and a genome project has been initiated to obtain its whole genome sequence. In this study, we have generated and sequenced full-length enriched cDNA libraries of the elephant shark using the 'oligo-capping' method and Sanger sequencing. A total of 6,778 full-length protein-coding cDNA and 10,701 full-length noncoding cDNA were sequenced from six tissues (gills, intestine, kidney, liver, spleen, and testis) of the elephant shark. Analysis of their polyadenylation signals showed that polyadenylation usage in elephant shark is similar to that in mammals. Furthermore, both coding and noncoding transcripts of the elephant shark use the same proportion of canonical polyadenylation sites. Besides BLASTX searches, protein-coding transcripts were annotated by Gene Ontology, InterPro domain, and KEGG pathway analyses. By comparing elephant shark genes to bony vertebrate genes, we identified several ancient genes present in elephant shark but differentially lost in tetrapods or teleosts. Only ∼6% of elephant shark noncoding cDNA showed similarity to known noncoding RNAs (ncRNAs). The rest are either highly divergent ncRNAs or novel ncRNAs. In addition to full-length transcripts, 30,375 5'-ESTs and 41,317 3'-ESTs were sequenced and annotated. The clones and transcripts generated in this study are valuable resources for annotating transcription start sites, exon-intron boundaries, and UTRs of genes in the elephant shark genome, and for the functional characterization of protein sequences. These resources will also be useful for annotating genes in other cartilaginous fishes whose genomes have been targeted for

  10. Sequencing and analysis of full-length cDNAs, 5'-ESTs and 3'-ESTs from a cartilaginous fish, the elephant shark (Callorhinchus milii).

    KAUST Repository

    Brenner, Sydney; Kodzius, Rimantas; Tan, Yue Ying; Tay, Alice; Tay, Boon-Hui; Venkatesh, Byrappa

    2012-01-01

    Cartilaginous fishes are the most ancient group of living jawed vertebrates (gnathostomes) and are, therefore, an important reference group for understanding the evolution of vertebrates. The elephant shark (Callorhinchus milii), a holocephalan cartilaginous fish, has been identified as a model cartilaginous fish genome because of its compact genome (∼910 Mb) and a genome project has been initiated to obtain its whole genome sequence. In this study, we have generated and sequenced full-length enriched cDNA libraries of the elephant shark using the 'oligo-capping' method and Sanger sequencing. A total of 6,778 full-length protein-coding cDNA and 10,701 full-length noncoding cDNA were sequenced from six tissues (gills, intestine, kidney, liver, spleen, and testis) of the elephant shark. Analysis of their polyadenylation signals showed that polyadenylation usage in elephant shark is similar to that in mammals. Furthermore, both coding and noncoding transcripts of the elephant shark use the same proportion of canonical polyadenylation sites. Besides BLASTX searches, protein-coding transcripts were annotated by Gene Ontology, InterPro domain, and KEGG pathway analyses. By comparing elephant shark genes to bony vertebrate genes, we identified several ancient genes present in elephant shark but differentially lost in tetrapods or teleosts. Only ∼6% of elephant shark noncoding cDNA showed similarity to known noncoding RNAs (ncRNAs). The rest are either highly divergent ncRNAs or novel ncRNAs. In addition to full-length transcripts, 30,375 5'-ESTs and 41,317 3'-ESTs were sequenced and annotated. The clones and transcripts generated in this study are valuable resources for annotating transcription start sites, exon-intron boundaries, and UTRs of genes in the elephant shark genome, and for the functional characterization of protein sequences. These resources will also be useful for annotating genes in other cartilaginous fishes whose genomes have been targeted for whole

  11. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    Science.gov (United States)

    de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 (30.4%) of the 148 EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full-length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus, despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by GENSCAN (http://genes.mit.edu/GENSCAN.html). PMID:11070084
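
    The proportions quoted in this abstract can be rechecked from the counts it gives. The short Python sketch below does only that; the dictionary labels and the helper name `percent` are illustrative and not taken from the paper. Note that 67 of 150 works out to about 44.67%, so the reported 44.6% looks truncated rather than rounded.

```python
# Counts as quoted in the abstract; labels and helper names are illustrative only.
counts = {
    "contigs matching chr22": (1181, 81429),
    "known genes": (162, 247),
    "related genes": (67, 150),
    "EST-predicted genes": (45, 148),
}


def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


percentages = {label: percent(p, w) for label, (p, w) in counts.items()}
for label, value in percentages.items():
    print(f"{label}: {value:.2f}%")

# 219 newly identified transcribed sequences, 171 of which were also
# supported by other GenBank ESTs or full-length cDNAs.
orestes_only = 219 - 171
print("defined by ORESTES alone:", orestes_only)
```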

  12. Deep learning

    CERN Document Server

    Goodfellow, Ian; Courville, Aaron

    2016-01-01

    Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts. Because the computer gathers knowledge from experience, there is no need for a human computer operator to formally specify all the knowledge that the computer needs. The hierarchy of concepts allows the computer to learn complicated concepts by building them out of simpler ones; a graph of these hierarchies would be many layers deep. This book introduces a broad range of topics in deep learning. The text offers mathematical and conceptual background, covering relevant concepts in linear algebra, probability theory and information theory, numerical computation, and machine learning. It describes deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology; and it surveys such applications as natural language proces...

  13. Deep Learning and Its Applications in Biomedicine.

    Science.gov (United States)

    Cao, Chensi; Liu, Feng; Tan, Hai; Song, Deshou; Shu, Wenjie; Li, Weizhong; Zhou, Yiming; Bo, Xiaochen; Xie, Zhi

    2018-02-01

    Advances in biological and medical technologies have been providing us with explosive volumes of biological and physiological data, such as medical images, electroencephalography, genomic and protein sequences. Learning from these data facilitates the understanding of human health and disease. Developed from artificial neural networks, deep learning-based algorithms show great promise in extracting features and learning patterns from complex data. The aim of this paper is to provide an overview of deep learning techniques and some of the state-of-the-art applications in the biomedical field. We first introduce the development of artificial neural networks and deep learning. We then describe two main components of deep learning, i.e., deep learning architectures and model optimization. Subsequently, some examples are demonstrated for deep learning applications, including medical image classification, genomic sequence analysis, as well as protein structure classification and prediction. Finally, we offer our perspectives for the future directions in the field of deep learning. Copyright © 2018. Production and hosting by Elsevier B.V.

  14. Sequences of 12 monoclonal anti-dinitrophenyl spin-label antibodies for NMR studies

    International Nuclear Information System (INIS)

    Leahy, D.J.; Rule, G.S.; Whittaker, M.M.; McConnell, H.M.

    1988-01-01

    Eleven monoclonal antibodies specific for a spin-labeled dinitrophenyl hapten (DNP-SL) have been produced for use in NMR studies. They have been named AN01 and AN03-AN12. The stability constants for the association of these antibodies with DNP-SL and related haptens were measured by fluorescence quenching. cDNA clones coding for the heavy and light chains of each antibody and of an additional anti-DNP-SL monoclonal antibody, AN02, have been isolated. The nucleic acid sequence of the 5' end of each clone has been determined, and the amino acid sequence of the variable regions of each antibody has been deduced from the cDNA sequence. The sequences are relatively heterogeneous, but both the heavy and the light chains of AN01 and AN03 are derived from the same variable-region gene families as those of the AN02 antibody. AN07 has a heavy chain that is related to that of AN02, and AN09 has a related light chain. AN05 and AN06 are unrelated to AN02 but share virtually identical heavy and light chains. Preliminary NMR difference spectra comparing related antibodies show that sequence-specific assignment of resonances is possible. Such spectra also provide a measure of structural relatedness.

  15. Revealing stable processing products from ribosome-associated small RNAs by deep-sequencing data analysis.

    Science.gov (United States)

    Zywicki, Marek; Bakowska-Zywicka, Kamilla; Polacek, Norbert

    2012-05-01

    The exploration of the non-protein-coding RNA (ncRNA) transcriptome is currently focused on profiling of microRNA expression and detection of novel ncRNA transcription units. However, recent studies suggest that RNA processing can be a multi-layer process leading to the generation of ncRNAs of diverse functions from a single primary transcript. To date, no methodology has been presented to distinguish stable functional RNA species from rapidly degraded side products of nucleases. Thus, the correct assessment of widespread RNA processing events is one of the major obstacles in transcriptome research. Here, we present a novel automated computational pipeline, named APART, providing a complete workflow for the reliable detection of RNA processing products from next-generation-sequencing data. The major features include efficient handling of non-unique reads, detection of novel stable ncRNA transcripts and processing products, and annotation of known transcripts based on multiple sources of information. To demonstrate the potential of APART, we have analyzed a cDNA library derived from small ribosome-associated RNAs in Saccharomyces cerevisiae. By employing the APART pipeline, we were able to detect and confirm by independent experimental methods multiple novel stable RNA molecules differentially processed from well-known ncRNAs, like rRNAs, tRNAs or snoRNAs, in a stress-dependent manner.

  16. Phylogenetic and genome-wide deep-sequencing analyses of canine parvovirus reveal co-infection with field variants and emergence of a recent recombinant strain.

    Directory of Open Access Journals (Sweden)

    Ruben Pérez

    Full Text Available Canine parvovirus (CPV), a fast-evolving single-stranded DNA virus, comprises three antigenic variants (2a, 2b, and 2c) with different frequencies and genetic variability among countries. The contribution of co-infection and recombination to the genetic variability of CPV is far from being fully elucidated. Here we took advantage of a natural CPV population, recently formed by the convergence of divergent CPV-2c and CPV-2a strains, to study co-infection and recombination. Complete sequences of the viral coding region of CPV-2a and CPV-2c strains from 40 samples were generated and analyzed using phylogenetic tools. Two samples showed co-infection and were further analyzed by deep sequencing. The sequence profile of one of the samples revealed the presence of CPV-2c and CPV-2a strains that differed at 29 nucleotides. The other sample included a minor CPV-2a strain (13.3% of the viral population) and a major recombinant strain (86.7%). The recombinant strain arose from inter-genotypic recombination between CPV-2c and CPV-2a strains within the VP1/VP2 gene boundary. Our findings highlight the importance of deep-sequencing analysis to provide a better understanding of CPV molecular diversity.

  17. Phylogenetic and Genome-Wide Deep-Sequencing Analyses of Canine Parvovirus Reveal Co-Infection with Field Variants and Emergence of a Recent Recombinant Strain

    Science.gov (United States)

    Pérez, Ruben; Calleros, Lucía; Marandino, Ana; Sarute, Nicolás; Iraola, Gregorio; Grecco, Sofia; Blanc, Hervé; Vignuzzi, Marco; Isakov, Ofer; Shomron, Noam; Carrau, Lucía; Hernández, Martín; Francia, Lourdes; Sosa, Katia; Tomás, Gonzalo; Panzera, Yanina

    2014-01-01

    Canine parvovirus (CPV), a fast-evolving single-stranded DNA virus, comprises three antigenic variants (2a, 2b, and 2c) with different frequencies and genetic variability among countries. The contribution of co-infection and recombination to the genetic variability of CPV is far from being fully elucidated. Here we took advantage of a natural CPV population, recently formed by the convergence of divergent CPV-2c and CPV-2a strains, to study co-infection and recombination. Complete sequences of the viral coding region of CPV-2a and CPV-2c strains from 40 samples were generated and analyzed using phylogenetic tools. Two samples showed co-infection and were further analyzed by deep sequencing. The sequence profile of one of the samples revealed the presence of CPV-2c and CPV-2a strains that differed at 29 nucleotides. The other sample included a minor CPV-2a strain (13.3% of the viral population) and a major recombinant strain (86.7%). The recombinant strain arose from inter-genotypic recombination between CPV-2c and CPV-2a strains within the VP1/VP2 gene boundary. Our findings highlight the importance of deep-sequencing analysis to provide a better understanding of CPV molecular diversity. PMID:25365348
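
    Strain proportions such as the 13.3% and 86.7% reported here are typically derived from read counts at positions where the co-infecting strains differ. The Python sketch below shows one simple way such a fraction could be estimated by pooling reads across diagnostic sites. It is not the authors' pipeline, and the read counts and the function name `pooled_minor_fraction` are made up purely for illustration.

```python
from typing import List, Tuple


def pooled_minor_fraction(site_counts: List[Tuple[int, int]]) -> float:
    """Estimate the minor-strain fraction from (minor_reads, major_reads)
    pairs observed at sites that distinguish the two strains.

    Reads are pooled across sites, so deeper sites carry more weight.
    """
    minor = sum(m for m, _ in site_counts)
    total = sum(m + M for m, M in site_counts)
    if total == 0:
        raise ValueError("no reads cover the diagnostic sites")
    return minor / total


# Hypothetical read counts at three diagnostic positions (not from the paper).
example_sites = [(120, 880), (150, 850), (90, 910)]
minor_fraction = pooled_minor_fraction(example_sites)
print(f"minor strain: {minor_fraction:.1%}, major strain: {1 - minor_fraction:.1%}")
```

    Pooling is only one possible design choice; averaging per-site fractions instead would weight every site equally regardless of coverage.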

  18. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    International Nuclear Information System (INIS)

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-01-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO4/PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  19. Novel infectious cDNA clones of hepatitis C virus genotype 3a (strain S52) and 4a (strain ED43): genetic analyses and in vivo pathogenesis studies

    DEFF Research Database (Denmark)

    Gottwein, Judith; Scheel, Troels; Callendret, Benoit

    2010-01-01

    Previously, RNA transcripts of cDNA clones of hepatitis C virus (HCV) genotypes 1a (strains H77, HCV-1, and HC-TN), 1b (HC-J4, Con1, and HCV-N), and 2a (HC-J6 and JFH1) were found to be infectious in chimpanzees. However, only JFH1 was infectious in human hepatoma Huh7 cells. We performed genetic...... analysis of HCV genotype 3a (strain S52) and 4a (strain ED43) prototype strains and generated full-length consensus cDNA clones (pS52 and pED43). Transfection of Huh7.5 cells with RNA transcripts of these clones did not yield cells expressing HCV Core. However, intrahepatic transfection of chimpanzees...... resulted in robust infection with peak HCV RNA titers of approximately 5.5 log(10) international units (IU)/ml. Genomic consensus sequences recovered from serum at the times of peak viral titers were identical to the sequences of the parental plasmids. Both chimpanzees developed acute hepatitis...

  20. Molecular cloning and nucleotide sequence of CYP6BF1 from the diamondback moth, Plutella xylostella

    Science.gov (United States)

    Li, Hongshan; Dai, Huaguo; Wei, Hui

    2005-01-01

    A novel cDNA clone encoding a cytochrome P450 was screened from the insecticide-susceptible strain of Plutella xylostella (L.) (Lepidoptera: Yponomeutidae). The nucleotide sequence of the clone, designated CYP6BF1, was determined. This is the first full-length sequence of the CYP6 family from Plutella xylostella (L.). The cDNA is 1661 bp in length and contains an open reading frame from base pairs 26 to 1570, encoding a protein of 514 amino acid residues. It is similar to the other insect P450s in gene family 6, including CYP6AE1 from Depressaria pastinacella (46%). The GenBank accession number is AY971374. PMID:17119627
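
    The coordinates given in this abstract are internally consistent, which a few lines of Python can confirm. The sketch assumes the positions are 1-based and inclusive and that the stated open reading frame includes the stop codon; the function name `orf_summary` is hypothetical.

```python
def orf_summary(start: int, end: int, cdna_length: int) -> dict:
    """Summarise an ORF from 1-based inclusive coordinates.

    Assumes the ORF span includes the terminal stop codon, so the
    number of encoded residues is one less than the number of codons.
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length_nt // 3
    return {
        "orf_length_nt": length_nt,
        "codons": codons,
        "residues": codons - 1,           # stop codon is not translated
        "five_prime_utr_nt": start - 1,
        "three_prime_utr_nt": cdna_length - end,
    }


# Values quoted in the abstract: ORF from 26 to 1570 within a 1661-bp cDNA.
summary = orf_summary(26, 1570, 1661)
print(summary)
```

    Under these assumptions the 1545-nt ORF corresponds to 515 codons, i.e. 514 residues plus a stop codon, matching the reported protein length.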

  1. Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

    Directory of Open Access Journals (Sweden)

    Yang Jie

    2017-01-01

    Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2) of five species: horse (89%), human (90%), chimpanzee (89%), rhesus monkey (89%) and mouse (85%); thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.

  2. Construction of an adult barnacle (Balanus amphitrite) cDNA library and selection of reference genes for quantitative RT-PCR studies

    Directory of Open Access Journals (Sweden)

    Burgess J Grant

    2009-06-01

    Full Text Available Background: Balanus amphitrite is a barnacle commonly used in biofouling research. Although many aspects of its biology have been elucidated, the lack of genetic information is impeding a molecular understanding of its life cycle. As part of a wider multidisciplinary approach to reveal the biogenic cues influencing barnacle settlement and metamorphosis, we have sequenced and annotated the first cDNA library for B. amphitrite. We also present a systematic validation of potential reference genes for normalization of quantitative real-time PCR (qRT-PCR) data obtained from different developmental stages of this animal. Results: We generated a cDNA library containing expressed sequence tags (ESTs) from adult B. amphitrite. A total of 609 unique sequences (comprising 79 assembled clusters and 530 singlets) were derived from 905 reliable unidirectionally sequenced ESTs. Bioinformatics tools such as BLAST, HMMer and InterPro were employed to allow functional annotation of the ESTs. Based on these analyses, we selected 11 genes to study their ability to normalize qRT-PCR data. Total RNA extracted from 7 developmental stages was reverse transcribed and the expression stability of the selected genes was compared using geNorm, BestKeeper and NormFinder. These software programs produced highly comparable results, with the most stable gene being mt-cyb, while tuba, tubb and cp1 were clearly unsuitable for data normalization. Conclusion: The collection of B. amphitrite ESTs and their annotation has been made publicly available, representing an important resource for both basic and applied research on this species. We developed a qRT-PCR assay to determine the most reliable reference genes. Transcripts encoding cytochrome b and NADH dehydrogenase subunit 1 were expressed most stably, although other genes also performed well and could prove useful to normalize gene expression studies.
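
    For readers unfamiliar with how reference-gene stability is scored, the Python sketch below illustrates the pairwise-variation idea behind the geNorm stability measure M: for each candidate gene, take the standard deviation of its log2 expression ratio against every other candidate across samples, then average those values. Lower M means more stable expression. This is a simplified sketch written from the general description of the measure, not the geNorm software or the authors' analysis, and the gene labels and expression values are invented for illustration.

```python
import math
import statistics
from typing import Dict, List


def stability_m(expr: Dict[str, List[float]]) -> Dict[str, float]:
    """Compute a geNorm-style stability value M for each gene.

    expr maps a gene label to its relative expression quantities
    (linear scale, all > 0), one value per sample, in the same sample order.
    """
    genes = list(expr)
    if len(genes) < 2:
        raise ValueError("at least two candidate genes are required")
    m_values = {}
    for j in genes:
        pairwise_sd = []
        for k in genes:
            if k == j:
                continue
            # log2 ratio of gene j to gene k in every sample
            log_ratios = [math.log2(a / b) for a, b in zip(expr[j], expr[k])]
            pairwise_sd.append(statistics.stdev(log_ratios))
        m_values[j] = statistics.mean(pairwise_sd)
    return m_values


# Hypothetical expression values for three placeholder genes in four samples.
example = {
    "geneA": [1.0, 2.0, 4.0, 8.0],
    "geneB": [2.0, 4.0, 8.0, 16.0],   # constant ratio to geneA
    "geneC": [1.0, 4.0, 2.0, 16.0],   # fluctuates relative to both
}
m = stability_m(example)
for gene, value in sorted(m.items(), key=lambda item: item[1]):
    print(f"{gene}: M = {value:.3f}")
```

    In this toy data set geneA and geneB keep a constant ratio across samples and therefore share the lowest M, while geneC ranks as the least stable candidate.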

  3. Draft Genome Sequence of Pseudoalteromonas sp. Strain XI10 Isolated from the Brine-Seawater Interface of Erba Deep in the Red Sea

    KAUST Repository

    Zhang, Guishan; Haroon, Mohamed; Zhang, Ruifu; Hikmawan, Tyas I.; Stingl, Ulrich

    2016-01-01

    Pseudoalteromonas sp. strain XI10 was isolated from the brine-seawater interface of Erba Deep in the Red Sea, Saudi Arabia. Here, we present the draft genome sequence of strain XI10, a gammaproteobacterium that synthesizes polysaccharides for biofilm formation when grown in liquid culture.

  4. Draft Genome Sequence of Pseudoalteromonas sp. Strain XI10 Isolated from the Brine-Seawater Interface of Erba Deep in the Red Sea

    KAUST Repository

    Zhang, Guishan

    2016-03-10

    Pseudoalteromonas sp. strain XI10 was isolated from the brine-seawater interface of Erba Deep in the Red Sea, Saudi Arabia. Here, we present the draft genome sequence of strain XI10, a gammaproteobacterium that synthesizes polysaccharides for biofilm formation when grown in liquid culture.

  5. Full-Length Sequence of Mouse Acupuncture-Induced 1-L (Aig1l) Gene Including Its Transcriptional Start Site

    Directory of Open Access Journals (Sweden)

    Mika Ohta

    2011-01-01

    Full Text Available We have been investigating the molecular efficacy of electroacupuncture (EA), which is one type of acupuncture therapy. In our previous molecular biological study of acupuncture, we found an EA-induced gene, named acupuncture-induced 1-L (Aig1l), in mouse skeletal muscle. The aims of this study consisted of identification of the full-length cDNA sequence of Aig1l including the transcriptional start site, determination of the tissue distribution of Aig1l and analysis of the effect of EA on Aig1l gene expression. We determined the complete cDNA sequence including the transcriptional start site via cDNA cloning with the cap site hunting method. We then analyzed the tissue distribution of Aig1l by means of northern blot analysis and real-time quantitative polymerase chain reaction. We used the semiquantitative reverse transcriptase-polymerase chain reaction to examine the effect of EA on Aig1l gene expression. Our results showed that the complete cDNA sequence of Aig1l was 6073 bp long, and the putative protein consisted of 962 amino acids. All seven tissues that we analyzed expressed the Aig1l gene. In skeletal muscle, EA induced expression of the Aig1l gene, with high expression observed after 3 hours of EA. Our findings thus suggest that the Aig1l gene may play a key role in the molecular mechanisms of EA efficacy.

  6. Modulations of RNA sequences by cytokinin in pumpkin cotyledons

    International Nuclear Information System (INIS)

    Chang, C.; Ertl, J.; Chen, C.

    1987-01-01

    Polyadenylated mRNAs from excised pumpkin cotyledons treated with or without 10⁻⁴ M benzyladenine (BA) for various time periods in suspension culture were assayed by in vitro translation in the presence of [³⁵S]methionine. The radioactive polypeptides were analyzed by one- and two-dimensional polyacrylamide gel electrophoresis. Specific sequences of mRNAs were enhanced, reduced, induced, or suppressed by the hormone within 60 min of the application of BA to the cotyledons. Four independent cDNA clones of cytokinin-modulated mRNAs have been selected and characterized. RNA blot hybridization using the four cDNA probes also indicates that the levels of specific mRNAs are modulated upward or downward by the hormone.

  7. Foundations of Sequence-to-Sequence Modeling for Time Series

    OpenAIRE

    Kuznetsov, Vitaly; Mariet, Zelda

    2018-01-01

    The availability of large amounts of time series data, paired with the performance of deep-learning algorithms on a broad class of problems, has recently led to significant interest in the use of sequence-to-sequence models for time series forecasting. We provide the first theoretical analysis of this time series forecasting framework. We include a comparison of sequence-to-sequence modeling to classical time series models, and as such our theory can serve as a quantitative guide for practiti...

  8. Human cDNA mapping using fluorescence in situ hybridization. Progress report, April 1--December 31, 1992

    Energy Technology Data Exchange (ETDEWEB)

    Korenberg, J.R.

    1993-12-31

    The ultimate goal of this proposal is to create a cDNA map of the human genome. Mapping is approached using the techniques of high resolution fluorescence in situ hybridization (FISH). This technology and the results of its application are designed to rapidly generate whole genome as tool box of expressed sequence to speed the identification of human disease genes. The results of this study are intended to dovetail with and to link the results of existing technologies for creating backbone YAC and genetic maps. In the first eight months, this approach will generate 60--80% of the expressed sequence map, the remainder expected to be derived through more long-term, labor-intensive, regional chromosomal gene searches or sequencing. The laboratory has made significant progress in the set-up phase, in mapping fetal and adult brain and other cDNAs, in testing a model system for directly linking genetic and physical maps using FISH with small fragments, in setting up a database, and in establishing the validity and throughput of the system.

  9. Heterologous expression of a Rauvolfia cDNA encoding strictosidine glucosidase, a biosynthetic key to over 2000 monoterpenoid indole alkaloids.

    Science.gov (United States)

    Gerasimenko, Irina; Sheludko, Yuri; Ma, Xueyan; Stöckigt, Joachim

    2002-04-01

    Strictosidine glucosidase (SG) is an enzyme that catalyses the second step in the biosynthesis of various classes of monoterpenoid indole alkaloids. Based on the comparison of cDNA sequences of SG from Catharanthus roseus and raucaffricine glucosidase (RG) from Rauvolfia serpentina, primers for RT-PCR were designed and the cDNA encoding SG was cloned from R. serpentina cell suspension cultures. The active enzyme was expressed in Escherichia coli and purified to homogeneity. Analysis of its deduced amino-acid sequence assigned the SG from R. serpentina to family 1 of glycosyl hydrolases. In contrast to the SG from C. roseus, the enzyme from R. serpentina is predicted to lack an uncleavable N-terminal signal sequence, which is believed to direct proteins to the endoplasmic reticulum. The temperature and pH optimum, enzyme kinetic parameters and substrate specificity of the heterologously expressed SG were studied and compared to those of the C. roseus enzyme, revealing some differences between the two glucosidases. In vitro deglucosylation of strictosidine by R. serpentina SG proceeds by the same mechanism as has been shown for the C. roseus enzyme preparation. The reaction gives rise to the end product cathenamine and involves 4,21-dehydrocorynantheine aldehyde as an intermediate. The enzymatic hydrolysis of dolichantoside (Nbeta-methylstrictosidine) leads to several products. One of them was identified as a new compound, 3-isocorreantine A. From the data it can be concluded that the divergence of the biosynthetic pathways leading to different classes of indole alkaloids formed in R. serpentina and C. roseus cell suspension cultures occurs at a later stage than strictosidine deglucosylation.

  10. RTA, a candidate G protein-coupled receptor: Cloning, sequencing, and tissue distribution

    International Nuclear Information System (INIS)

    Ross, P.C.; Figler, R.A.; Corjay, M.H.; Barber, C.M.; Adam, N.; Harcus, D.R.; Lynch, K.R.

    1990-01-01

    Genomic and cDNA clones, encoding a protein that is a member of the guanine nucleotide-binding regulatory protein (G protein)-coupled receptor superfamily, were isolated by screening rat genomic and thoracic aorta cDNA libraries with an oligonucleotide encoding a highly conserved region of the M1 muscarinic acetylcholine receptor. Sequence analyses of these clones showed that they encode a 343-amino acid protein (named RTA). The RTA gene is single copy, as demonstrated by restriction mapping and Southern blotting of genomic clones and rat genomic DNA. RTA RNA sequences are relatively abundant throughout the gut, vas deferens, uterus, and aorta but are only barely detectable (on Northern blots) in liver, kidney, lung, and salivary gland. In the rat brain, RTA sequences are markedly abundant in the cerebellum. RTA is most closely related to the mas oncogene (34% identity), which has been suggested to be a forebrain angiotensin receptor. They conclude that RTA is not an angiotensin receptor; to date, they have been unable to identify its ligand.

  11. cDNA cloning and transcriptional controlling of a novel low dose radiation-induced gene and its function analysis

    International Nuclear Information System (INIS)

    Zhou Pingkun; Sui Jianli

    2002-01-01

    Objective: To clone a novel low dose radiation-induced gene (LRIGx) and study its function as well as its transcriptional changes after irradiation. Methods: Its cDNA was obtained by DDRT-PCR and RACE techniques. Northern blot hybridization was used to investigate the gene transcription. Bioinformatics was employed to analyze the structure and function of this gene. Results: LRIGx cDNA was cloned. The sequence of LRIGx was identical to a DNA clone located in human chromosome 20q11.2-12. Bioinformatics analysis predicted an encoded protein with a conserved helicase domain. Northern analysis revealed a ∼8.5 kb transcript which was induced after 0.2 Gy as well as 0.02 Gy irradiation, and the transcript level was increased 5 times at 4 h after 0.2 Gy irradiation. The level of LRIGx transcript induced by a high dose of 2.0 Gy was lower than that induced by 0.2 Gy. Conclusion: A novel low dose radiation-induced gene has been cloned. It encodes a protein with a conserved helicase domain that could be involved in DNA metabolism in the cellular process of radiation response.

  12. [Primary culture of cat intestinal epithelial cell and construction of its cDNA library].

    Science.gov (United States)

    Ye, L; Gui-Hua, Z; Kun, Y; Hong-Fa, W; Ting, X; Gong-Zhen, L; Wei-Xia, Z; Yong, C

    2017-04-12

    Objective To establish primary culture methods for cat intestinal epithelial cells (IECs) and construct a cDNA library for subsequent yeast two-hybrid experiments, so as to screen the virulence interaction factors among the final host. Methods The primary cat IECs were cultured by tissue cultivation and by combined digestion with collagenase XI and dispase I, separately. The cultured cat IECs were then identified by morphological observation and cyto-keratin detection using goat anti-cyto-keratin monoclonal antibodies. The mRNA of cat IECs was isolated and used as the template to synthesize the first strand cDNA by SMART™ technology, and then the double-strand cDNAs were acquired by LD-PCR, which were subsequently cloned into the plasmid PGADT7-Rec to construct a yeast two-hybrid cDNA library in the yeast strain Y187 by homologous recombination. Matchmaker™ Insert Check PCR was used to detect the size distribution of cDNA fragments after the capacity calculation of the cDNA library. Results The comparison of the two cultivation methods indicated that the combined digestion with collagenase XI and dispase I was more effective than the tissue cultivation. A continuous culture system for cat IECs was established and cat IECs of high purity were harvested for constructing the yeast two-hybrid cDNA library. The library contained 1.1×10⁶ independent clones. The titer was 2.8×10⁹ cfu/ml. The size of inserted fragments ranged from 0.5 to 2.0 kb. Conclusion The yeast two-hybrid cDNA library of cat IECs meets the requirements of further screening research, and this study lays the foundation for screening the Toxoplasma gondii virulence interaction factors among the cDNA libraries of its final hosts.

  13. cDNA cloning of a snake venom metalloproteinase from the eastern diamondback rattlesnake (Crotalus adamanteus), and the expression of its disintegrin domain with anti-platelet effects

    Science.gov (United States)

    Suntravat, Montamas; Jia, Ying; Lucena, Sara E.; Sánchez, Elda E.; Pérez, John C.

    2013-01-01

    A 5′ truncated snake venom metalloproteinase was identified from a cDNA library constructed from venom glands of an eastern diamondback rattlesnake (Crotalus adamanteus). The 5′-rapid amplification of cDNA ends (RACE) was used to obtain the 1865 bp full-length cDNA sequence of a snake venom metalloproteinase (CamVMPII). CamVMPII encodes an open reading frame of 488 amino acids, which includes a signal peptide, a pro-domain, a metalloproteinase domain, a spacer, and an RGD-disintegrin domain. The predicted amino acid sequence of CamVMPII showed a 91%, 90%, 83%, and 82% sequence homology to the P-II class enzymes of C. adamanteus metalloproteinase 2, C. atrox CaVMP-II, Gloydius halys agkistin, and Protobothrops jerdonii jerdonitin, respectively. Disintegrins are potent inhibitors of both platelet aggregation and integrin-dependent cell adhesion. Therefore, the disintegrin domain (Cam-dis) of CamVMPII was amplified by PCR, cloned into a pET-43.1a vector, and expressed in Escherichia coli BL21. Affinity purified recombinantly modified Cam-dis (r-Cam-dis) with a yield of 8.5 mg/L culture medium was cleaved from the fusion tags by enterokinase cleavage. r-Cam-dis was further purified by two-step chromatography consisting of HiTrap™ Benzamidine FF column, followed by Talon Metal affinity column with a final yield of 1 mg/L culture. r-Cam-dis was able to inhibit all three processes of platelet thrombus formation including platelet adhesion with an estimated IC50 of 1 nM, collagen- and ADP-induced platelet aggregation with the estimated IC50s of 18 and 6 nM, respectively, and platelet function on clot retraction. It is a potent anti-platelet inhibitor, which should be further investigated for drug discovery to treat stroke patients or patients with thrombotic disorders. PMID:23313448

  14. Sequence of structures in fine-grained turbidites: Comparison of recent deep-sea and ancient flysch sediments

    Science.gov (United States)

    Stow, Dorrik A. V.; Shanmugam, Ganapathy

    1980-01-01

    A comparative study of the sequence of sedimentary structures in ancient and modern fine-grained turbidites is made in three contrasting areas. They are (1) Holocene and Pleistocene deep-sea muds of the Nova Scotian Slope and Rise, (2) Middle Ordovician Sevier Shale of the Valley and Ridge Province of the Southern Appalachians, and (3) Cambro-Ordovician Halifax Slate of the Meguma Group in Nova Scotia. A standard sequence of structures is proposed for fine-grained turbidites. The complete sequence has nine sub-divisions that are here termed T0 to T8. The lower subdivision (T0) comprises a silt lamina which has a sharp, scoured and load-cast base, internal parallel-lamination and cross-lamination, and a sharp current-lineated or wavy surface with 'fading-ripples' (= Type C ripple-drift cross-lamination, Jopling and Walker, 1968). The overlying sequence shows textural and compositional grading through alternating silt and mud laminae. A convolute-laminated sub-division (T1) is overlain by low-amplitude climbing ripples (T2), thin regular laminae (T3), thin indistinct laminae (T4), and thin wispy or convolute laminae (T5). The topmost three divisions, graded mud (T6), ungraded mud (T7) and bioturbated mud (T8), do not have silt laminae but rare patchy silt lenses and silt pseudonodules and a thin zone of micro-burrowing near the upper surface. The proposed sequence is analogous to the Bouma (1962) structural scheme for sandy turbidites and is approximately equivalent to Bouma's (C)DE divisions. The repetition of partial sequences characterizes different parts of the slope/base-of-slope/basin plain environment, and represents deposition from different stages of evolution of a large, muddy, turbidity flow. Microstructural detail and sequence are well preserved in ancient and even slightly metamorphosed sediments. Their recognition is important for determining depositional processes and for palaeoenvironmental interpretation.

  15. Peptidomics combined with cDNA library unravel the diversity of centipede venom

    DEFF Research Database (Denmark)

    Rong, Mingqiang; Yang, Shilong; Wen, Bo

    2015-01-01

    of centipede venom. In the present study, we use peptidomics combined with cDNA library to uncover the diversity of centipede Scolopendra subspinipes mutilans L. Koch. 192 peptides were identified by LC-MS/MS and 79 precursors were deduced by cDNA library. Surprisingly, the signal peptides of centipede toxins...

  16. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    Directory of Open Access Journals (Sweden)

    Gao Zhihong

    2010-07-01

    Full Text Available Abstract Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons, which have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.
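
    The search for putative SSRs in a unigene set, as described in this record, can be sketched with a regular-expression scan. This is only an illustration and not the tool used in the study: the function name `find_ssrs`, the minimum repeat counts in `MIN_REPEATS` and the test sequence are assumptions.

```python
import re

# Assumed minimum number of tandem repeats per motif length (hypothetical thresholds).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit."""
    return motif not in (motif + motif)[1:-1]


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples, with 0-based starts."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if not is_primitive(motif):
                continue  # e.g. AGAG is already reported as the dinucleotide AG
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Hypothetical sequence containing an (AG)7 and an (ATC)5 repeat.
test_seq = "TTT" + "AG" * 7 + "CCGT" + "ATC" * 5 + "GG"
ssrs = find_ssrs(test_seq)
```

    Candidate loci found this way would still need primer design and laboratory testing, which is the step the record reports for 42 primer pairs.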

  17. Characterization and phylogenetic analysis of lectin gene cDNA isolated from sea cucumber (Apostichopus japonicus) body wall

    Science.gov (United States)

    Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli

    2017-12-01

    As a 'living fossil' of species origin and a 'rich treasure' for food and nutrition development, the sea cucumber has received considerable attention from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment which is highly homologous with the genes of the lectin family, named AjL (Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of a lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163-amino-acid secretory protein homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing conditions), and agglutinated rabbit red blood cells. AjL was similar to chain A of CEL-IV in spatial structure. We predicted that AjL may play the same role as CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most other species, which was fused by uncertain sequences during evolution and encoded different proteins with diverse functions. Our findings provide insights into the function and characteristics of lectin genes in invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.

  18. Application of a cDNA microarray for profiling the gene expression of Echinococcus granulosus protoscoleces treated with albendazole and artemisinin.

    Science.gov (United States)

    Lü, Guodong; Zhang, Wenbao; Wang, Jianhua; Xiao, Yunfeng; Zhao, Jun; Zhao, Jianqin; Sun, Yimin; Zhang, Chuanshan; Wang, Junhua; Lin, Renyong; Liu, Hui; Zhang, Fuchun; Wen, Hao

    2014-12-01

    Cystic echinococcosis (CE) is a neglected zoonosis that is caused by the dog-tapeworm Echinococcus granulosus. The disease is endemic worldwide. There is an urgent need to find effective drugs for the treatment of the disease. In this study, we sequenced a cDNA library constructed using RNA isolated from oncospheres, protoscoleces, cyst membrane and adult worms of E. granulosus. A total of 9065 non-redundant or unique sequences were obtained and spotted on chips as uniEST probes to profile the gene expression in protoscoleces of E. granulosus treated with the anthelmintic drugs albendazole and artemisinin, respectively. The results showed that 7 genes were up-regulated and 38 genes were down-regulated in the protoscoleces treated with albendazole. Gene analysis showed that these genes are responsible for energy metabolism, cell cycle and assembly of cell structure. We also identified 100 genes up-regulated and 6 genes down-regulated in the protoscoleces treated with artemisinin. These genes play roles in the transduction of environmental signals, and metabolism. Albendazole appeared to exert its efficacy by damaging cell structure, while artemisinin was observed to increase the formation of heterochromatin in protoscolex cells. Our results highlight the utility of using cDNA microarray methods to detect gene expression profiles of E. granulosus and, in particular, to understand the pharmacologic mechanism of anti-echinococcosis drugs.

  19. DeepLoc: prediction of protein subcellular localization using deep learning

    DEFF Research Database (Denmark)

    Almagro Armenteros, Jose Juan; Sønderby, Casper Kaae; Sønderby, Søren Kaae

    2017-01-01

    The prediction of eukaryotic protein subcellular localization is a well-studied topic in bioinformatics due to its relevance in proteomics research. Many machine learning methods have been successfully applied in this task, but in most of them, predictions rely on annotation of homologues from...... knowledge databases. For novel proteins where no annotated homologues exist, and for predicting the effects of sequence variants, it is desirable to have methods for predicting protein properties from sequence information only. Here, we present a prediction algorithm using deep neural networks to predict...... current state-of-the-art algorithms, including those relying on homology information. The method is available as a web server at http://www.cbs.dtu.dk/services/DeepLoc . Example code is available at https://github.com/JJAlmagro/subcellular_localization . The dataset is available at http...

  20. Deep sequencing of foot-and-mouth disease virus reveals RNA sequences involved in genome packaging.

    Science.gov (United States)

    Logan, Grace; Newman, Joseph; Wright, Caroline F; Lasecka-Dykes, Lidia; Haydon, Daniel T; Cottam, Eleanor M; Tuthill, Tobias J

    2017-10-18

    Non-enveloped viruses protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. Packaging and capsid assembly in RNA viruses can involve interactions between capsid proteins and secondary structures in the viral genome as exemplified by the RNA bacteriophage MS2 and as proposed for other RNA viruses of plants, animals and human. In the picornavirus family of non-enveloped RNA viruses, the requirements for genome packaging remain poorly understood. Here we show a novel and simple approach to identify predicted RNA secondary structures involved in genome packaging in the picornavirus foot-and-mouth disease virus (FMDV). By interrogating deep sequencing data generated from both packaged and unpackaged populations of RNA we have determined multiple regions of the genome with constrained variation in the packaged population. Predicted secondary structures of these regions revealed stem loops with conservation of structure and a common motif at the loop. Disruption of these features resulted in attenuation of virus growth in cell culture due to a reduction in assembly of mature virions. This study provides evidence for the involvement of predicted RNA structures in picornavirus packaging and offers a readily transferable methodology for identifying packaging requirements in many other viruses. Importance In order to transmit their genetic material to a new host, non-enveloped viruses must protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. For many non-enveloped RNA viruses the requirements for this critical part of the viral life cycle remain poorly understood. We have identified RNA sequences involved in genome packaging of the picornavirus foot-and-mouth disease virus. This virus causes an economically devastating disease of livestock affecting both the developed and developing world. The experimental methods developed to carry out this work are novel, simple and transferable to the

  1. RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location.

    Science.gov (United States)

    Serghini, M A; Fuchs, M; Pinck, M; Reinbolt, J; Walter, B; Pinck, L

    1990-07-01

    The nucleotide sequence of the genomic RNA2 (3774 nucleotides) of grapevine fanleaf virus strain F13 was determined from overlapping cDNA clones and its genetic organization was deduced. Two rapid and efficient methods were used for cDNA cloning of the 5' region of RNA2. The complete sequence contained only one long open reading frame of 3555 nucleotides (1184 codons, 131K product). The analysis of the N-terminal sequence of purified coat protein (CP) and identification of its C-terminal residue have allowed the CP cistron to be precisely positioned within the polyprotein. The CP produced by proteolytic cleavage at the Arg/Gly site between residues 680 and 681 contains 504 amino acids (Mr 56019) and has hydrophobic properties. The Arg/Gly cleavage site deduced by N-terminal amino acid sequence analysis is the first for a nepovirus coat protein and for plant viruses expressing their genomic RNAs by polyprotein synthesis. Comparison of GFLV RNA2 with M RNA of cowpea mosaic comovirus and with RNA2 of two closely related nepoviruses, tomato black ring virus and Hungarian grapevine chrome mosaic virus, showed strong similarities among the 3' non-coding regions but less similarity among the 5' end non-coding sequences than reported among other nepovirus RNAs.
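
    The lengths quoted in this record can be cross-checked with simple arithmetic. The sketch below assumes that the 3555-nucleotide open reading frame includes the stop codon and that the coat protein runs from the residue after the cleavage site to the end of the polyprotein; the function names are hypothetical.

```python
def sense_codons(orf_nt, includes_stop=True):
    """Number of amino-acid-coding codons in an ORF of orf_nt nucleotides."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nt // 3
    return codons - 1 if includes_stop else codons


def c_terminal_product_length(polyprotein_len, cleavage_after):
    """Length of the product starting at residue cleavage_after + 1."""
    return polyprotein_len - cleavage_after


polyprotein = sense_codons(3555)                        # 3555 / 3 = 1185, minus the stop codon
coat_protein = c_terminal_product_length(polyprotein, 680)
```

    Under these assumptions the values agree with the 1184 codons and the 504-residue coat protein reported in the abstract.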

  2. Transcriptome sequencing of the blind subterranean mole rat, Spalax galili: Utility and potential for the discovery of novel evolutionary patterns

    KAUST Repository

    Malik, Assaf; Korol, Abraham; Hübner, Sariel; Hernandez, Alvaro G.; Thimmapuram, Jyothi; Ali, Shahjahan; Glaser, Fabian; Paz, Arnon; Avivi, Aaron; Band, Mark

    2011-01-01

    sequencing of Spalax galili, a chromosomal type of S. ehrenbergi. cDNA pools from muscle and brain tissues isolated from animals exposed to hypoxic and normoxic conditions were sequenced using Sanger, GS FLX, and GS FLX Titanium technologies. Assembly

  3. Reverse transcription using random pentadecamer primers increases yield and quality of resulting cDNA

    DEFF Research Database (Denmark)

    Stangegaard, Michael; Dufva, I.H.; Dufva, Hans Martin

    2006-01-01

    oligonucleotides (pentadecamers) consistently yielded at least 2-fold as much cDNA as did random hexamers using either poly(A) RNA or an amplified version of messenger RNA (aRNA) as a template. The cDNA generated using pentadecamers did not differ in size distribution or the amount of incorporated label compared...... with cDNA generated with random hexamers. The increased efficiency of priming using random pentadecamers resulted in reverse transcription of > 80% of the template aRNA, while random hexamers induced reverse transcription of only 40% of the template aRNA. This suggests a better coverage...... that random pentadecamers can replace random hexamers in reverse transcription reactions on both poly(A) RNA and amplified RNA, resulting in higher cDNA yields and quality....

  4. Transcriptional Slippage and RNA Editing Increase the Diversity of Transcripts in Chloroplasts: Insight from Deep Sequencing of Vigna radiata Genome and Transcriptome.

    Directory of Open Access Journals (Sweden)

    Ching-Ping Lin

    Full Text Available We performed deep sequencing of the nuclear and organellar genomes of three mungbean genotypes: Vigna radiata ssp. sublobata TC1966, V. radiata var. radiata NM92 and the recombinant inbred line RIL59 derived from a cross between TC1966 and NM92. Moreover, we performed deep sequencing of the RIL59 transcriptome to investigate transcript variability. The mungbean chloroplast genome has a quadripartite structure including a pair of inverted repeats separated by two single copy regions. A total of 213 simple sequence repeats were identified in the chloroplast genomes of NM92 and RIL59; 78 single nucleotide variants and nine indels were discovered in comparing the chloroplast genomes of TC1966 and NM92. Analysis of the mungbean chloroplast transcriptome revealed mRNAs that were affected by transcriptional slippage and RNA editing. Transcriptional slippage frequency was positively correlated with the length of simple sequence repeats of the mungbean chloroplast genome (R² = 0.9911). In total, 41 C-to-U editing sites were found in 23 chloroplast genes and in one intergenic spacer. No editing site that swapped U to C was found. A combination of bioinformatics and experimental methods revealed that the plastid-encoded RNA polymerase-transcribed genes psbF and ndhA are affected by transcriptional slippage in mungbean and in main lineages of land plants, including three dicots (Glycine max, Brassica rapa, and Nicotiana tabacum), two monocots (Oryza sativa and Zea mays), two gymnosperms (Pinus taeda and Ginkgo biloba) and one moss (Physcomitrella patens). Transcript analysis of the rps2 gene showed that transcriptional slippage could affect transcripts at single sequence repeat regions with poly-A runs. It showed that transcriptional slippage together with incomplete RNA editing may cause sequence diversity of transcripts in chloroplasts of land plants.
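
    Candidate C-to-U editing sites of the kind counted in this record are typically found by comparing a genomic sequence with the matching transcript sequence, where an edited U is read as T in cDNA. The sketch below assumes two already aligned, gap-free, equal-length strings; the function name `c_to_u_sites` and the sequences are hypothetical, and a real analysis would work from read alignments with coverage and frequency thresholds.

```python
def c_to_u_sites(genomic, cdna):
    """Return 1-based positions where the genome has C and the cDNA has T."""
    if len(genomic) != len(cdna):
        raise ValueError("sequences must be aligned and of equal length")
    pairs = zip(genomic.upper(), cdna.upper())
    return [pos for pos, (g, c) in enumerate(pairs, start=1)
            if g == "C" and c == "T"]


# Hypothetical aligned fragments: positions 2 and 6 differ as C (genome) vs T (cDNA).
sites = c_to_u_sites("ACGTCCA", "ATGTCTA")
```

    Mismatches in the opposite direction (T in the genome, C in the cDNA) are ignored by this function, so a separate check would be needed to look for U-to-C changes.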

  5. Extending Immunological Profiling in the Gilthead Sea Bream, Sparus aurata, by Enriched cDNA Library Analysis, Microarray Design and Initial Studies upon the Inflammatory Response to PAMPs

    Directory of Open Access Journals (Sweden)

    Sebastian Boltaña

    2017-02-01

    Full Text Available This study describes the development and validation of an enriched oligonucleotide-microarray platform for Sparus aurata (SAQ) to provide a platform for transcriptomic studies in this species. A transcriptome database was constructed by assembly of gilthead sea bream sequences derived from public repositories of mRNA together with reads from a large collection of expressed sequence tags (EST) from two extensive targeted cDNA libraries characterizing mRNA transcripts regulated by both bacterial and viral challenge. The developed microarray was further validated by analysing monocyte/macrophage activation profiles after challenge with two Gram-negative bacterial pathogen-associated molecular patterns (PAMPs; lipopolysaccharide (LPS) and peptidoglycan (PGN)). Of the approximately 10,000 EST sequenced, we obtained a total of 6837 EST longer than 100 nt, with 3778 and 3059 EST obtained from the bacterial-primed and from the viral-primed cDNA libraries, respectively. Functional classification of contigs from the bacterial- and viral-primed cDNA libraries by Gene Ontology (GO) showed that the top five represented categories were equally represented in the two libraries: metabolism (approximately 24% of the total number of contigs), carrier proteins/membrane transport (approximately 15%), effectors/modulators and cell communication (approximately 11%), nucleoside, nucleotide and nucleic acid metabolism (approximately 7.5%) and intracellular transducers/signal transduction (approximately 5%). Transcriptome analyses using this enriched oligonucleotide platform identified differential shifts in the response to PGN and LPS in macrophage-like cells, highlighting responsive gene-cassettes tightly related to PAMP host recognition. As observed in other fish species, PGN is a powerful activator of the inflammatory response in S. aurata macrophage-like cells. We have developed and validated an oligonucleotide microarray (SAQ) that provides a platform enriched for the study...

  6. Extending Immunological Profiling in the Gilthead Sea Bream, Sparus aurata, by Enriched cDNA Library Analysis, Microarray Design and Initial Studies upon the Inflammatory Response to PAMPs.

    Science.gov (United States)

    Boltaña, Sebastian; Castellana, Barbara; Goetz, Giles; Tort, Lluis; Teles, Mariana; Mulero, Victor; Novoa, Beatriz; Figueras, Antonio; Goetz, Frederick W; Gallardo-Escarate, Cristian; Planas, Josep V; Mackenzie, Simon

    2017-02-03

    This study describes the development and validation of an enriched oligonucleotide-microarray platform for Sparus aurata (SAQ) to provide a platform for transcriptomic studies in this species. A transcriptome database was constructed by assembly of gilthead sea bream sequences derived from public repositories of mRNA together with reads from a large collection of expressed sequence tags (EST) from two extensive targeted cDNA libraries characterizing mRNA transcripts regulated by both bacterial and viral challenge. The developed microarray was further validated by analysing monocyte/macrophage activation profiles after challenge with two Gram-negative bacterial pathogen-associated molecular patterns (PAMPs; lipopolysaccharide (LPS) and peptidoglycan (PGN)). Of the approximately 10,000 EST sequenced, we obtained a total of 6837 EST longer than 100 nt, with 3778 and 3059 EST obtained from the bacterial-primed and from the viral-primed cDNA libraries, respectively. Functional classification of contigs from the bacterial- and viral-primed cDNA libraries by Gene Ontology (GO) showed that the top five represented categories were equally represented in the two libraries: metabolism (approximately 24% of the total number of contigs), carrier proteins/membrane transport (approximately 15%), effectors/modulators and cell communication (approximately 11%), nucleoside, nucleotide and nucleic acid metabolism (approximately 7.5%) and intracellular transducers/signal transduction (approximately 5%). Transcriptome analyses using this enriched oligonucleotide platform identified differential shifts in the response to PGN and LPS in macrophage-like cells, highlighting responsive gene-cassettes tightly related to PAMP host recognition. As observed in other fish species, PGN is a powerful activator of the inflammatory response in S. aurata macrophage-like cells. 
We have developed and validated an oligonucleotide microarray (SAQ) that provides a platform enriched for the study of gene

  7. Molecular cloning and sequence analysis of complementary DNA encoding rat mammary gland medium-chain S-acyl fatty acid synthetase thio ester hydrolase

    International Nuclear Information System (INIS)

    Safford, R.; de Silva, J.; Lucas, C.

    1987-01-01

    Poly(A)+ RNA from pregnant rat mammary glands was size-fractionated by sucrose gradient centrifugation, and fractions enriched in medium-chain S-acyl fatty acid synthetase thio ester hydrolase (MCH) were identified by in vitro translation and immunoprecipitation. A cDNA library was constructed, in pBR322, from enriched poly(A)+ RNA and screened with two oligonucleotide probes deduced from rat MCH amino acid sequence data. Cross-hybridizing clones were isolated and found to contain cDNA inserts ranging from ~1100 to 1550 base pairs (bp). A 1550-bp cDNA insert, from clone 43H09, was confirmed to encode MCH by hybrid-select translation/immunoprecipitation studies and by comparison of the amino acid sequence deduced from the DNA sequence of the clone to the amino acid sequence of the MCH peptides. Northern blot analysis revealed the size of the MCH mRNA to be 1500 nucleotides, and it is therefore concluded that the 1550-bp insert (including G·C tails) of clone 43H09 represents a full- or near-full-length copy of the MCH gene. The rat MCH sequence is the first reported sequence of a thioesterase from a mammalian source, but comparison of the deduced amino acid sequences of MCH and the recently published mallard duck medium-chain S-acyl fatty acid synthetase thioesterase reveals significant homology. In particular, a seven amino acid sequence containing the proposed active serine of the duck thioesterase is found to be perfectly conserved in rat MCH.

  8. Amplification of a transcriptionally active DNA sequence in the human brain

    International Nuclear Information System (INIS)

    Yakovlev, A.G.; Sazonov, A.E.; Spunde, A.Ya.; Gindilis, V.M.

    1986-01-01

    The authors present their findings of tissue-specific amplification of a DNA fragment actively transcribed in the human brain. This genome fragment was found in the library complement of cDNA of the human brain and evidently belongs to a new class of moderate repetitions of DNA with an unstable copying capacity in the human genome. The authors isolated total cell RNA from various human tissues (brain, placenta), and rat tissues (brain, liver), by the method of hot phenol extraction with guanidine thiocyanate. The poly(A)+ RNA fraction was isolated by chromatography. Synthesis of cDNA was done on a matrix of poly(A)+ RNA of human brain. The cDNA obtained was cloned in plasmid pBR322 for the PstI site using (dC/dG) sequences synthesized on the 3' ends of the vector molecule and cDNA respectively. In cloning 75 ng cDNA, the authors obtained approximately 10^5 recombinants. This library was analyzed by the hybridization method on columns with two radioactive (32P) probes: the total cDNA preparation and the total nuclear DNA from the human brain. The number of copies of the cloned DNA fragment in the genome was determined by dot hybridization. Restriction fragments of human and rat genomic DNA homologous to the cloned cDNA were identified on autoradiographs. In each case, 10 micrograms of EcoRI DNA hydrolyzate was fractionated in 1% agarose gel. The probe was also readied with RNA samples fractionated in agarose gel with formaldehyde and transferred to a nitrocellulose filter under weak vacuum. The filter was hybridized with 0.1 micrograms DNA pAG 02, labeled with (32P) to a specific activity of 0.5-1 x 10^9 counts/min per microgram. The autoradiograph was exposed with amplifying screens at -70 °C for 2 days.

  9. Identification of microRNAs from Amur grape (Vitis amurensis Rupr.) by deep sequencing and analysis of microRNA variations with bioinformatics.

    Science.gov (United States)

    Wang, Chen; Han, Jian; Liu, Chonghuai; Kibet, Korir Nicholas; Kayesh, Emrul; Shangguan, Lingfei; Li, Xiaoying; Fang, Jinggui

    2012-03-29

    MicroRNA (miRNA) is a class of functional non-coding small RNA with 19-25 nucleotides in length while Amur grape (Vitis amurensis Rupr.) is an important wild fruit crop with the strongest cold resistance among the Vitis species, is used as an excellent breeding parent for grapevine, and has elicited growing interest in wine production. To date, there is a relatively large number of grapevine miRNAs (vv-miRNAs) from cultivated grapevine varieties such as Vitis vinifera L. and hybrids of V. vinifera and V. labrusca, but there is no report on miRNAs from Vitis amurensis Rupr, a wild grapevine species. A small RNA library from Amur grape was constructed and Solexa technology used to perform deep sequencing of the library followed by subsequent bioinformatics analysis to identify new miRNAs. In total, 126 conserved miRNAs belonging to 27 miRNA families were identified, and 34 known but non-conserved miRNAs were also found. Significantly, 72 new potential Amur grape-specific miRNAs were discovered. The sequences of these new potential va-miRNAs were further validated through miR-RACE, and accumulation of 18 new va-miRNAs in seven tissues of grapevines confirmed by real time RT-PCR (qRT-PCR) analysis. The expression levels of va-miRNAs in flowers and berries were found to be basically consistent in identity to those from deep sequenced sRNAs libraries of combined corresponding tissues. We also describe the conservation and variation of va-miRNAs using miR-SNPs and miR-LDs during plant evolution based on comparison of orthologous sequences, and further reveal that the number and sites of miR-SNP in diverse miRNA families exhibit distinct divergence. Finally, 346 target genes for the new miRNAs were predicted and they include a number of Amur grape stress tolerance genes and many genes regulating anthocyanin synthesis and sugar metabolism. 
Deep sequencing of short RNAs from Amur grape flowers and berries identified 72 new potential miRNAs and 34 known but non-conserved mi

  10. Identification of microRNAs from Amur grape (Vitis amurensis Rupr.) by deep sequencing and analysis of microRNA variations with bioinformatics

    Directory of Open Access Journals (Sweden)

    Wang Chen

    2012-03-01

    Full Text Available Abstract Background: MicroRNA (miRNA) is a class of functional non-coding small RNA with 19-25 nucleotides in length while Amur grape (Vitis amurensis Rupr.) is an important wild fruit crop with the strongest cold resistance among the Vitis species, is used as an excellent breeding parent for grapevine, and has elicited growing interest in wine production. To date, there is a relatively large number of grapevine miRNAs (vv-miRNAs) from cultivated grapevine varieties such as Vitis vinifera L. and hybrids of V. vinifera and V. labrusca, but there is no report on miRNAs from Vitis amurensis Rupr, a wild grapevine species. Results: A small RNA library from Amur grape was constructed and Solexa technology used to perform deep sequencing of the library followed by subsequent bioinformatics analysis to identify new miRNAs. In total, 126 conserved miRNAs belonging to 27 miRNA families were identified, and 34 known but non-conserved miRNAs were also found. Significantly, 72 new potential Amur grape-specific miRNAs were discovered. The sequences of these new potential va-miRNAs were further validated through miR-RACE, and accumulation of 18 new va-miRNAs in seven tissues of grapevines confirmed by real time RT-PCR (qRT-PCR) analysis. The expression levels of va-miRNAs in flowers and berries were found to be basically consistent in identity to those from deep sequenced sRNAs libraries of combined corresponding tissues. We also describe the conservation and variation of va-miRNAs using miR-SNPs and miR-LDs during plant evolution based on comparison of orthologous sequences, and further reveal that the number and sites of miR-SNP in diverse miRNA families exhibit distinct divergence. Finally, 346 target genes for the new miRNAs were predicted and they include a number of Amur grape stress tolerance genes and many genes regulating anthocyanin synthesis and sugar metabolism. 
Conclusions: Deep sequencing of short RNAs from Amur grape flowers and berries identified 72

  11. Deep learning methods for protein torsion angle prediction.

    Science.gov (United States)

    Li, Haiou; Hou, Jie; Adhikari, Badri; Lyu, Qiang; Cheng, Jianlin

    2017-09-18

    Deep learning is one of the most powerful machine learning methods and has achieved state-of-the-art performance in many domains. Since deep learning was introduced to the field of bioinformatics in 2012, it has achieved success in a number of areas such as protein residue-residue contact prediction, secondary structure prediction, and fold recognition. In this work, we developed deep learning methods to improve the prediction of torsion (dihedral) angles of proteins. We design four different deep learning architectures to predict protein torsion angles. The architectures include a deep neural network (DNN), a deep restricted Boltzmann machine (DRBM), a deep recurrent neural network (DRNN) and a deep recurrent restricted Boltzmann machine (DReRBM), the recurrent variants being included because protein torsion angle prediction is a sequence-related problem. In addition to existing protein features, two new features (predicted residue contact number and the error distribution of torsion angles extracted from sequence fragments) are used as input to each of the four deep learning architectures to predict phi and psi angles of protein backbone. The mean absolute error (MAE) of phi and psi angles predicted by DRNN, DReRBM, DRBM and DNN is about 20-21° and 29-30° on an independent dataset. The MAE of phi angle is comparable to the existing methods, but the MAE of psi angle is 29°, 2° lower than the existing methods. On the latest CASP12 targets, our methods also achieved performance better than or comparable to a state-of-the-art method. Our experiment demonstrates that deep learning is a valuable method for predicting protein torsion angles. The deep recurrent network architecture performs slightly better than deep feed-forward architecture, and the predicted residue contact number and the error distribution of torsion angles extracted from sequence fragments are useful features for improving prediction accuracy.
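The mean absolute error quoted in this abstract can be illustrated with a minimal sketch. The abstract does not say how angle periodicity was handled, so the wrap-around step below (treating 170° and -170° as 20° apart) is an assumption, and the function name `angular_mae` and the sample values are hypothetical.

```python
def angular_mae(predicted_deg, observed_deg):
    """Mean absolute error between two equal-length lists of angles in degrees.

    Assumption (not stated in the abstract): differences are wrapped so that
    the error between two angles never exceeds 180 degrees.
    """
    if len(predicted_deg) != len(observed_deg):
        raise ValueError("angle lists must have the same length")
    if not predicted_deg:
        raise ValueError("angle lists must not be empty")

    total = 0.0
    for predicted, observed in zip(predicted_deg, observed_deg):
        diff = abs(predicted - observed) % 360.0
        # Take the shorter way around the circle.
        total += min(diff, 360.0 - diff)
    return total / len(predicted_deg)


# Hypothetical psi angles for two residues (illustrative values only).
example_error = angular_mae([170.0, -60.0], [-170.0, -50.0])
```

In this toy input the wrapped errors are 20° and 10°, so the mean is 15°; without wrapping the first pair would be counted as 340°.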

  12. Genome-wide analyses of long noncoding RNA expression profiles correlated with radioresistance in nasopharyngeal carcinoma via next-generation deep sequencing.

    Science.gov (United States)

    Li, Guo; Liu, Yong; Liu, Chao; Su, Zhongwu; Ren, Shuling; Wang, Yunyun; Deng, Tengbo; Huang, Donghai; Tian, Yongquan; Qiu, Yuanzheng

    2016-09-06

    Radioresistance is one of the major factors limiting the therapeutic efficacy and prognosis of patients with nasopharyngeal carcinoma (NPC). Accumulating evidence has suggested that aberrant expression of long noncoding RNAs (lncRNAs) contributes to cancer progression. Therefore, here we identified lncRNAs associated with radioresistance in NPC. The differential expression profiles of lncRNAs associated with NPC radioresistance were constructed by next-generation deep sequencing by comparing radioresistant NPC cells with their parental cells. LncRNA-related mRNAs were predicted and analyzed using bioinformatics algorithms compared with the mRNA profiles related to radioresistance obtained in our previous study. Several lncRNAs and associated mRNAs were validated in established NPC radioresistant cell models and NPC tissues. By comparison between radioresistant CNE-2-Rs and parental CNE-2 cells by next-generation deep sequencing, a total of 781 known lncRNAs and 2054 novel lncRNAs were annotated. The top five upregulated and downregulated known/novel lncRNAs were detected using quantitative real-time reverse transcription-polymerase chain reaction, and 7/10 known lncRNAs and 3/10 novel lncRNAs were demonstrated to have significant differential expression trends that were the same as those predicted by deep sequencing. From the prediction process, 13 pairs of lncRNAs and their associated genes were acquired, and the prediction trends of three pairs were validated in both radioresistant CNE-2-Rs and 6-10B-Rs cell lines, including lncRNA n373932 and SLITRK5, n409627 and PRSS12, and n386034 and RIMKLB. LncRNA n373932 and its related SLITRK5 showed dramatic expression changes in post-irradiation radioresistant cells and a negative expression correlation in NPC tissues (R = -0.595, p < 0.05). Our study provides an overview of the expression profiles of radioresistant lncRNAs and potentially related mRNAs, which will facilitate future investigations into the

  13. Expression of Kirsten murine sarcoma virus sequences in Beagle dog tissues

    International Nuclear Information System (INIS)

    Kerkof, P.R.; Kelly, G.

    1988-01-01

    Labeled cDNA synthesized from RNA extracted from 238PuO2-, 239PuO2-, and 90Sr-induced lung tumors in Beagle dogs, from nontumor tissue from 239PuO2-exposed dogs, and from unexposed dog lung and liver tissue produces strong hybridization signals with a plasmid (pKSma) that contains Kirsten murine sarcoma virus (KMSV) sequences. At least 90 percent of the KMSV sequences are expressed in these dog tissues, including sequences corresponding to p21 K-ras, gp70 envelope glycoprotein, and at least one other proviral sequence. The expression of Kirsten ras and other sarcoma virus sequences may have important implications for the interpretation of carcinogenesis studies in these dogs. (author)

  14. Transcriptome sequencing and metabolite analysis reveals the role of delphinidin metabolism in flower colour in grape hyacinth.

    Science.gov (United States)

    Lou, Qian; Liu, Yali; Qi, Yinyan; Jiao, Shuzhen; Tian, Feifei; Jiang, Ling; Wang, Yuejin

    2014-07-01

    Grape hyacinth (Muscari) is an important ornamental bulbous plant with an extraordinary blue colour. Muscari armeniacum, whose flowers can be naturally white, provides an opportunity to unravel the complex metabolic networks underlying certain biochemical traits, especially colour. A blue flower cDNA library of M. armeniacum and a white flower library of M. armeniacum f. album were used for transcriptome sequencing. A total of 89 926 uni-transcripts were isolated, 143 of which could be identified as putative homologues of colour-related genes in other species. Based on a comprehensive analysis relating colour compounds to gene expression profiles, the mechanism of colour biosynthesis was studied in M. armeniacum. Furthermore, a new hypothesis explaining the lack of colour phenotype of the grape hyacinth flower is proposed. Alteration of the substrate competition between flavonol synthase (FLS) and dihydroflavonol 4-reductase (DFR) may lead to elimination of blue pigmentation while the multishunt from the limited flux in the cyanidin (Cy) synthesis pathway seems to be the most likely reason for the colour change in the white flowers of M. armeniacum. Moreover, mass sequence data obtained by the deep sequencing of M. armeniacum and its white variant provided a platform for future function and molecular biological research on M. armeniacum. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  15. The Pekin duck programmed death-ligand 1: cDNA cloning, genomic structure, molecular characterization and mRNA expression analysis.

    Science.gov (United States)

    Yao, Q; Fischer, K P; Tyrrell, D L; Gutfreund, K S

    2015-04-01

    Programmed death ligand-1 (PD-L1) plays an important role in the attenuation of adaptive immune responses in higher vertebrates. Here, we describe the identification of the Pekin duck PD-L1 orthologue (duPD-L1) and its gene structure. The duPD-L1 cDNA encodes a 311-amino acid protein that has an amino acid identity of 78% and 42% with chicken and human PD-L1, respectively. Mapping of the duPD-L1 cDNA with duck genomic sequences revealed an exonic structure of its coding sequence similar to those of other vertebrates but lacked a noncoding exon 1. Homology modelling of the duPD-L1 extracellular domain was compatible with the tandem IgV-like and IgC-like IgSF domain structure of human PD-L1 (PDB ID: 3BIS). Residues known to be important for receptor binding of human PD-L1 were mostly conserved in duPD-L1 within the N-terminus and the G sheet, and partially conserved within the F sheet but not within sheets C and C'. DuPD-L1 mRNA was constitutively expressed in all tissues examined with highest expression levels in lung and spleen and very low levels of expression in muscle, kidney and brain. Mitogen stimulation of duck peripheral blood mononuclear cells transiently increased duPD-L1 mRNA expression. Our observations demonstrate evolutionary conservation of the exonic structure of its coding sequence, the extracellular domain structure and residues implicated in receptor binding, but the role of the longer cytoplasmic tail in avian PD-L1 proteins remains to be determined. © 2014 John Wiley & Sons Ltd.

  16. High-throughput screening of suppression subtractive hybridization cDNA libraries using DNA microarray analysis

    CSIR Research Space (South Africa)

    Van den Berg, N

    2004-11-01

    Full Text Available Efficient construction of cDNA libraries enriched for differentially expressed transcripts is an important first step in many biological investigations. We present a quantitative procedure for screening cDNA libraries constructed by suppression...

  17. Construction and identification of subtracted cDNA library in bone marrow cells of radon-exposed mice

    International Nuclear Information System (INIS)

    Li Jianxiang; Nie Jihua; Tong Jian; Fu Chunling; Zhou Jianwei

    2008-01-01

    Objective: To construct and identify a subtracted cDNA library in bone marrow cells of mice exposed to radon inhalation. Methods: Adult male BALB/c mice, weighing 18-22 g, were placed in a multifunctional radon chamber. One group of mice was exposed to radon up to a cumulative dose of 105 working level months (WLM). The control group of mice was housed in a room with a cumulative dose of 1 WLM. To construct a subtracted cDNA library enriched with differentially expressed genes, the SMART technique and suppression subtractive hybridization were performed. The obtained forward and reverse cDNA fragments were directly inserted into the pMD18-T vector and transformed into E. coli JM109. The inserted cDNA fragments were screened by blue-white screening and nested PCR of the bacterial culture. Results: Of 285 randomly picked white bacterial colonies, 244 were positive clones containing inserted cDNA fragments of 100-1100 bp. Conclusions: The forward and reverse subtracted cDNA libraries in bone marrow cells of mice exposed to radon inhalation were successfully constructed. (authors)

  18. Evolution of simeprevir-resistant variants over time by ultra-deep sequencing in HCV genotype 1b.

    Science.gov (United States)

    Akuta, Norio; Suzuki, Fumitaka; Sezaki, Hitomi; Suzuki, Yoshiyuki; Hosaka, Tetsuya; Kobayashi, Masahiro; Kobayashi, Mariko; Saitoh, Satoshi; Ikeda, Kenji; Kumada, Hiromitsu

    2014-08-01

    Using ultra-deep sequencing technology, the present study was designed to investigate the evolution of simeprevir-resistant variants (amino acid substitutions at positions aa80, aa155, aa156, and aa168 in the HCV NS3 region) over time. At Toranomon Hospital, 18 Japanese patients infected with HCV genotype 1b received triple therapy with simeprevir/PEG-IFN/ribavirin (DRAGON or CONCERT study). The sustained virological response rate was 67%, and it was significantly higher in patients with IL28B rs8099917 TT than in those with non-TT. Six patients who did not achieve sustained virological response were tested for resistant variants by ultra-deep sequencing at baseline, at the time of re-elevation of viral load, and at 96 weeks after the completion of treatment. Twelve of 18 resistant variants detected at re-elevation of viral load were de novo resistant variants. Ten of 12 de novo resistant variants became undetectable over time, and five of seven resistant variants detected at baseline persisted over time. In one patient, Q80R variants present at baseline (0.3%) had increased at 96 weeks after the cessation of treatment (10.2%), and de novo resistant variants of D168E (0.3%) had also increased at 96 weeks after the cessation of treatment (9.7%). In conclusion, the present study indicates that the emergence of simeprevir-resistant variants after the start of treatment could not be predicted at baseline, and the majority of de novo resistant variants become undetectable over time. Further large-scale prospective studies should be performed to investigate the clinical utility of detecting simeprevir-resistant variants. © 2014 Wiley Periodicals, Inc.

  19. Sequence of cDNAs for mammalian H2A.Z, an evolutionarily diverged but highly conserved basal histone H2A isoprotein species

    Energy Technology Data Exchange (ETDEWEB)

    Hatch, C L; Bonner, W M

    1988-02-11

    The nucleotide sequences of cDNAs for the evolutionarily diverged but highly conserved basal H2A isoprotein, H2A.Z, have been determined for the rat, cow, and human. As a basal histone, H2A.Z is synthesized throughout the cell cycle at a constant rate, unlinked to DNA replication, and at a much lower rate in quiescent cells. Each of the cDNA isolates encodes the entire H2A.Z polypeptide. The human isolate is about 1.0 kilobases long. It contains a coding region of 387 nucleotides flanked by 106 nucleotides of 5'UTR and 376 nucleotides of 3'UTR, which contains a polyadenylation signal followed by a poly A tail. The bovine and rat cDNAs have 97 and 94% nucleotide positional identity to the human cDNA in the coding region and 98% in the proximal 376 nucleotides of the 3'UTR which includes the polyadenylation signal. A potential stem-forming sequence imbedded in a direct repeat is found centered at 261 nucleotides into the 3'UTR. Each of the cDNA clones could be transcribed and translated in vitro to yield H2A.Z protein. The mammalian H2A.Z cDNA coding sequences are approximately 80% similar to those in chicken and 75% to those in sea urchin.
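The "nucleotide positional identity" figures in this abstract can be illustrated with a small sketch. It assumes the two sequences have already been aligned to equal length and that columns containing a gap are excluded from the count; the abstract does not describe the authors' procedure, and the function name `percent_identity` and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical positions between two pre-aligned sequences.

    Assumption: columns where either sequence has a gap are skipped, so the
    denominator is the number of compared (ungapped) columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0
    matches = 0
    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == gap or base_b == gap:
            continue  # skip gapped columns
        compared += 1
        if base_a == base_b:
            matches += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy alignment (not real H2A.Z sequence): 9 ungapped columns, 8 identical.
toy_identity = percent_identity("ATGGCC-TAA", "ATGGCTCTAA")
```

Other conventions divide by the full alignment length or by the shorter sequence, so reported identities are only comparable when the denominator is stated.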

  20. In vitro and in silico cloning of Xenopus laevis SOD2 cDNA and its phylogenetic analysis.

    Science.gov (United States)

    Purrello, Michele; Di Pietro, Cinzia; Ragusa, Marco; Pulvirenti, Alfredo; Giugno, Rosalba; Di Pietro, Valentina; Emmanuele, Giovanni; Travali, Salvo; Scalia, Marina; Shasha, Dennis; Ferro, Alfredo

    2005-02-01

    By using the methodology of both wet and dry biology (i.e., RT-PCR and cycle sequencing, and biocomputational technology, respectively) and the data obtained through the Genome Projects, we have cloned Xenopus laevis SOD2 (MnSOD) cDNA and determined its nucleotide sequence. These data and the deduced protein primary structure were compared with all the other SOD2 nucleotide and amino acid sequences from eukaryotes and prokaryotes, published in public databases. The analysis was performed by using both Clustal W, a well known and widely used program for sequence analysis, and AntiClustAl, a new algorithm recently created and implemented by our group. Our results demonstrate a very high conservation of the enzyme amino acid sequence during evolution, which proves a close structure-function relationship. This is to be expected for very ancient molecules endowed with critical biological functions, performed through a specific structural organization. The nucleotide sequence conservation is less pronounced: this too was foreseeable, due to neutral mutations and to the species-specific codon usage. The data obtained by using AntiClustAl are comparable with those produced with Clustal W, which validates this algorithm as an important new tool for biocomputational analysis. Finally, it is noteworthy that evolutionary trees, drawn by using all the available data on SOD2 nucleotide and amino acid sequences and either Clustal W or AntiClustAl, are comparable to those obtained through phylogenetic analysis based on fossil records.