WorldWideScience

Sample records for cdna deep sequencing

  1. Proteome-wide Identification of Novel Ceramide-binding Proteins by Yeast Surface cDNA Display and Deep Sequencing.

    Science.gov (United States)

    Bidlingmaier, Scott; Ha, Kevin; Lee, Nam-Kyung; Su, Yang; Liu, Bin

    2016-04-01

    Although the bioactive sphingolipid ceramide is an important cell signaling molecule, relatively few direct ceramide-interacting proteins are known. We used an approach combining yeast surface cDNA display and deep sequencing technology to identify novel proteins binding directly to ceramide. We identified 234 candidate ceramide-binding protein fragments and validated binding for 20. Most (17) bound selectively to ceramide, although a few (3) bound to other lipids as well. Several novel ceramide-binding domains were discovered, including the EF-hand calcium-binding motif, the heat shock chaperonin-binding motif STI1, the SCP2 sterol-binding domain, and the tetratricopeptide repeat region motif. Interestingly, four of the verified ceramide-binding proteins (HPCA, HPCAL1, NCS1, and VSNL1) and an additional three candidate ceramide-binding proteins (NCALD, HPCAL4, and KCNIP3) belong to the neuronal calcium sensor family of EF hand-containing proteins. We used mutagenesis to map the ceramide-binding site in HPCA and to create a mutant HPCA that does not bind to ceramide. We demonstrated selective binding to ceramide by mammalian cell-produced wild type but not mutant HPCA. Intriguingly, we also identified a fragment from prostaglandin D2synthase that binds preferentially to ceramide 1-phosphate. The wide variety of proteins and domains capable of binding to ceramide suggests that many of the signaling functions of ceramide may be regulated by direct binding to these proteins. Based on the deep sequencing data, we estimate that our yeast surface cDNA display library covers ∼60% of the human proteome and our selection/deep sequencing protocol can identify target-interacting protein fragments that are present at extremely low frequency in the starting library. Thus, the yeast surface cDNA display/deep sequencing approach is a rapid, comprehensive, and flexible method for the analysis of protein-ligand interactions, particularly for the study of non-protein ligands. PMID

  2. Preparing unbiased T cell receptor and antibody cDNA libraries for the deep next generation sequencing profiling

    Directory of Open Access Journals (Sweden)

    Ilgar Z Mamedov

    2013-12-01

    Full Text Available High-throughput sequencing has the power to reveal the nature of adaptive immunity as represented by the full complexity of T cell receptor (TCR and antibody (IG repertoires, but is at present severely compromised by the quantitative bias, bottlenecks, and accumulated errors that inevitably occur in the course of library preparation and sequencing. Here we report an optimized protocol for the unbiased preparation of TCR and IG cDNA libraries for high-throughput sequencing, starting from thousands or millions of live cells in an investigated sample. Critical points to control are revealed, along with tips that allow researchers to minimize quantitative bias, accumulated errors, and cross-sample contamination at each stage, and to enhance the subsequent bioinformatic analysis. The protocol is simple, reliable, and can be performed in 1–2 days.

  3. cDNA sequence quality data - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available Budding yeast cDNA sequencing project cDNA sequence quality data Data detail Data name cDNA sequence quality... data Description of data contents Phred's quality score. PHD format, one file to a single cDNA data, and co...ription Download License Update History of This Database Site Policy | Contact Us cDNA sequence quality data - Budding yeast cDNA sequencing project | LSDB Archive ...

  4. 5'-end sequences of budding yeast full-length cDNA clones - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available Budding yeast cDNA sequencing project 5'-end sequences of budding yeast full-length cDNA clones Data detail Data name 5'-end sequence...s of budding yeast full-length cDNA clones Description of data contents cDNA sequence...e Update History of This Database Site Policy | Contact Us 5'-end sequences of budding yeast full-length cDNA clones - Budding yeast cDNA sequencing project | LSDB Archive ...

  5. Vector sequences - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available Budding yeast cDNA sequencing project Vector sequences Data detail Data name Vector sequences Description of data contents Vector seq...wnload License Update History of This Database Site Policy | Contact Us Vector sequences - Budding yeast cDNA sequencing project | LSDB Archive ... ...uences used for sequencing. Multi FASTA format. 7 entries. Data file File name: vec

  6. cDNA cloning and sequencing of ostrich Growth hormone

    Directory of Open Access Journals (Sweden)

    Doosti Abbas

    2012-01-01

    Full Text Available In recent years, industrial breeding of ostrich (Struthio camelus has been widely developed in Iran. Growth hormone (GH is a peptide hormone that stimulates growth and cell reproduction in different animals. The aim of this study was to clone and sequence the ostrich growth hormone gene in E. coli, done for the first time in Iran. The cDNA that encodes ostrich growth hormone was isolated from total mRNA of the pituitary gland and amplified by RT-PCR using GH specific PCR primers. Then GH cDNA was cloned by T/A cloning technique and the construct was transformed into E. coli. Finally, GH cDNA sequence was submitted to the GenBank (Accession number: JN559394. The results of present study showed that GH cDNA was successfully cloned in E. coli. Sequencing confirmed that GH cDNA was cloned and that the length of ostrich GH cDNA was 672 bp; BLAST search showed that the sequence of growth hormone cDNA of the ostrich from Iran has 100% homology with other records existing in GenBank.

  7. cDNA sequence for human erythrocyte ankyrin

    International Nuclear Information System (INIS)

    The cDNA for human erythrocyte ankyrin has been isolated from a series of overlapping clones obtained from a reticulocyte cDNA library. The composite cDNA sequence has a large open reading frame of 5636 base pairs (bp) with the complete coding sequence for a polypeptide of 1879 amino acids with a predicted molecular mass of 206 kDa. The derived amino acid sequence contained 194 residues that were identical to those obtained by direct amino acid sequencing of 11 ankyrin proteolytic peptides. The primary sequence contained 23 highly homologous repeat units of 33 amino acids within the 90-kDa band 3 binding domain. Two cDNA clones showed evidence of apparent mRNA processing, resulting in the deletions of 486 bp and 135 bp, respectively. The 486-bp deletion resulted in the removal of a 16-kDa highly acidic peptide, and the smaller deletion had the effect of altering the COOH terminus of the molecule. Radiolabeled ankyrin cDNAs recognized two erythroid message sizes by RNA blot analysis, one of which was predominantly associated with early erythroid cell types. An ankyrin message was also observed in RNA from the human cerebellum by the same method. The ankyrin gene is assigned to chromosome 8 using genomic DNA from a panel of sorted human chromosomes

  8. Complete nucleotide sequence of cDNA and deduced amino acid sequence of rat liver catalase.

    OpenAIRE

    Furuta, S.; Hayashi, H; Hijikata, M; Miyazawa, S.; Osumi, T; Hashimoto, T.

    1986-01-01

    We have isolated five cDNA clones for rat liver catalase (hydrogen peroxide:hydrogen peroxide oxidoreductase, EC 1.11.1.6). These clones overlapped with each other and covered the entire length of the mRNA, which had been estimated to be 2.4 kilobases long by blot hybridization analysis of electrophoretically fractionated RNA. Nucleotide sequencing was carried out on these five clones and the composite nucleotide sequence of catalase cDNA was determined. The 5' noncoding region contained 83 b...

  9. Sequence analysis of a cDNA coding for a pancreatic precursor to somatostatin.

    OpenAIRE

    Taylor, W.L.; Collier, K J; Deschenes, R J; Weith, H L; Dixon, J. E.

    1981-01-01

    A synthetic oligonucleotide having the sequence d(T-T-C-C-A-G-A-A-G-A-A) deduced from the amino acid sequence Phe-Phe-Trp-Lys of somatostatin-14 was used to prime the synthesis of a cDNA from channel catfish (Ictalurus punctatus) pancreatic poly(A)-RNA. The major product of this reaction was a cDNA fragment of 565 nucleotides. Chemical sequence analysis of the cDNA fragment revealed that it was complementary to a mRNA coding for somatostatin. The 565-nucleotide cDNA hybridizes strongly with a...

  10. Characterization of Expressed Sequence Tags From a Gallus gallus Pineal Gland cDNA Library

    OpenAIRE

    Stefanie Hartman; Greg Touchton; Jessica Wynn; Tuoyu Geng; Chong, Nelson W.; Ed Smith

    2005-01-01

    The pineal gland is the circadian oscillator in the chicken, regulating diverse functions ranging from egg laying to feeding. Here, we describe the isolation and characterization of expressed sequence tags (ESTs) isolated from a chicken pineal gland cDNA library. A total of 192 unique sequences were analysed and submitted to GenBank; 6% of the ESTs matched neither GenBank cDNA sequences nor the newly assembled chicken genomic DNA sequence, three ESTs aligned with sequences d...

  11. Analysis of proteins encoded by full-length cDNA sequence from IRM-2 mouse

    International Nuclear Information System (INIS)

    Objective: To screen and isolate radioresistance-related genes from IRM-2 mouse. Methods: Full-length cDNA products were amplified by PCR from IRM-2 mouse cDNA library according to twenty-one pieces of expressed sequence tags. The property of proteins encoded by full-length cDNA were analyzed by comparing with GenBank database. Results: Five pieces of full-length cDNA which were not the same source as the known mice genes were found out from IRM-2 mouse cDNA library.Amino acid sequence and property of proteins encoded by these five pieces of full-length cDNA were obtained. Conclusion: Proteins encoded by full-length cDNA imply that unknown radioresistance-related genes may exist in IR M-2 mouse. (authors)

  12. The function analysis of full-length cDNA sequence from IRM-2 mouse cDNA library

    International Nuclear Information System (INIS)

    Objective: To identify the function of full-length cDNA sequence from IRM-2 mouse cDNA library. Methods: Full-length cDNA products were amplified by PCR from IRM-2 mouse cDNA library according to twenty-one pieces of expressed sequence tag. The expression of full-length cDNAs were detected after mouse embryonic fibroblasts were exposed to 6.5 Gy γ-ray radiation. And the effect on the growth of radiosensitivity cells AT5B1VA transfected with full-length cDNAs was investigated. Results: The expression of No.4, 5 and 2 full-length cDNAs from IRM-2 mouse were higher than that of parental ICR and 615 mouse after mouse embryonic fibroblasts irradiated with γ-ray radiation. And the survival rate of AT5B1VA cells transfected with No.4, 5 and 2 full-length cDNAs was high. Conclusion: No.4, 5 and 2 full-length cDNAs of IRM-2 mouse are of high radioresistance. (authors)

  13. Cloning, sequencing and expression of cDNA encoding growth hormone from Indian catfish (Heteropneustes fossilis)

    Indian Academy of Sciences (India)

    Vikas Anathy; Thayanithy Venugopal; Ramanathan Koteeswaran; Thavamani J Pandian; Sinnakaruppan Mathavan

    2001-09-01

    A tissue-specific cDNA library was constructed using polyA+ RNA from pituitary glands of the Indian catfish Heteropneustes fossilis (Bloch) and a cDNA clone encoding growth hormone (GH) was isolated. Using polymerase chain reaction (PCR) primers representing the conserved regions of fish GH sequences the 3′ region of catfish GH cDNA (540 bp) was cloned by random amplification of cDNA ends and the clone was used as a probe to isolate recombinant phages carrying the full-length cDNA sequence. The full-length cDNA clone is 1132 bp in length, coding for an open reading frame (ORF) of 603 bp; the reading frame encodes a putative polypeptide of 200 amino acids including the signal sequence of 22 amino acids. The 5′ and 3′ untranslated regions of the cDNA are 58 bp and 456 bp long, respectively. The predicted amino acid sequence of H. fossils GH shared 98% homology with other catfishes. Mature GH protein was efficiently expressed in bacterial and zebrafish systems using appropriate expression vectors. The successful expression of the cloned GH cDNA of catfish confirms the functional viability of the clone.

  14. Cloning, sequencing and expression of cDNA encoding growth hormone from Indian catfish (Heteropneustes fossilis)

    Indian Academy of Sciences (India)

    Vikas Anathy; Thayanithy Venugopal; Ramanathan Koteeswaran; Thavamani J Pandian; Sinnakaruppan Mathavan

    2013-03-01

    A tissue-specific cDNA library was constructed using polyA+ RNA from pituitary glands of the Indian catfish Heteropneustes fossilis (Bloch) and a cDNA clone encoding growth hormone (GH) was isolated. Using polymerase chain reaction (PCR) primers representing the conserved regions of fish GH sequences the 3′ region of catfish GH cDNA (540 bp) was cloned by random amplification of cDNA ends and the clone was used as a probe to isolate recombinant phages carrying the full-length cDNA sequence. The full-length cDNA clone is 1132 bp in length, coding for an open reading frame (ORF) of 603 bp; the reading frame encodes a putative polypeptide of 200 amino acids including the signal sequence of 22 amino acids. The 5′ and 3′ untranslated regions of the cDNA are 58 bp and 456 bp long, respectively. The predicted amino acid sequence of H. fossils GH shared 98% homology with other catfishes. Mature GH protein was efficiently expressed in bacterial and zebrafish systems using appropriate expression vectors. The successful expression of the cloned GH cDNA of catfish confirms the functional viability of the clone.

  15. Systematic sequencing of cDNA clones using the transposon Tn5

    OpenAIRE

    Shevchenko, Yuriy; Bouffard, Gerard G.; Butterfield, Yaron S.N.; Blakesley, Robert W.; Hartley, James L.; Young, Alice C.; Marco A. Marra; Jones, Steven J M; Touchman, Jeffrey W.; Green, Eric D.

    2002-01-01

    In parallel with the production of genomic sequence data, attention is being focused on the generation of comprehensive cDNA-sequence resources. Such efforts are increasingly emphasizing the production of high-accuracy sequence corresponding to the entire insert of cDNA clones, especially those presumed to reflect the full-length mRNA. The complete sequencing of cDNA clones on a large scale presents unique challenges because of the generally small, yet heterogeneous, sizes of the cloned inser...

  16. 5'-end sequences of budding yeast full-length cDNA clones and quality scores - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available Budding yeast cDNA sequencing project 5'-end sequences of budding yeast full-length cDNA clones and quality ...scores Data detail Data name 5'-end sequences of budding yeast full-length cDNA clones and quality scores De...from the budding yeast full-length cDNA library by the vector-capping method, the sequence quality score gen...s accession only. Sequence 5'-end sequence data of budding yeast full-length cDNA clones. FASTA format. Quality Phred's quality... Update History of This Database Site Policy | Contact Us 5'-end sequences of budding yeast full-length cDNA clones and quality

  17. Assessment of clone identity and sequence fidelity for 1189 IMAGE cDNA clones

    OpenAIRE

    Halgren, Robert G.; Fielden, Mark R.; Fong, Cora J.; Zacharewski, Timothy R

    2001-01-01

    This report documents the error rate in a commercially distributed subset of the IMAGE Consortium mouse cDNA clone collection. After isolation of plasmid DNA from 1189 bacterial stock cultures, only 62.2% were uncontaminated and contained cDNA inserts that had significant sequence identity to published data for the ordered clones. An agarose gel electrophoresis pre-screening strategy identified 361 stock cultures that appeared to contain two or more plasmid species. Is...

  18. cDNA sequencing improves the detection of P53 missense mutations in colorectal cancer

    International Nuclear Information System (INIS)

    Recently published data showed discrepancies beteween P53 cDNA and DNA sequencing in glioblastomas. We hypothesised that similar discrepancies may be observed in other human cancers. To this end, we analyzed 23 colorectal cancers for P53 mutations and gene expression using both DNA and cDNA sequencing, real-time PCR and immunohistochemistry. We found P53 gene mutations in 16 cases (15 missense and 1 nonsense). Two of the 15 cases with missense mutations showed alterations based only on cDNA, and not DNA sequencing. Moreover, in 6 of the 15 cases with a cDNA mutation those mutations were difficult to detect in the DNA sequencing, so the results of DNA analysis alone could be misinterpreted if the cDNA sequencing results had not also been available. In all those 15 cases, we observed a higher ratio of the mutated to the wild type template by cDNA analysis, but not by the DNA analysis. Interestingly, a similar overexpression of P53 mRNA was present in samples with and without P53 mutations. In terms of colorectal cancer, those discrepancies might be explained under three conditions: 1, overexpression of mutated P53 mRNA in cancer cells as compared with normal cells; 2, a higher content of cells without P53 mutation (normal cells and cells showing K-RAS and/or APC but not P53 mutation) in samples presenting P53 mutation; 3, heterozygous or hemizygous mutations of P53 gene. Additionally, for heterozygous mutations unknown mechanism(s) causing selective overproduction of mutated allele should also be considered. Our data offer new clues for studying discrepancy in P53 cDNA and DNA sequencing analysis

  19. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase

    Energy Technology Data Exchange (ETDEWEB)

    Finocchiaro, G.; Taroni, F.; Martin, A.L.; Colombo, I.; Tarelli, G.T.; DiDonato, S. (Istituto Nazionale Neurologico C. Besta, Milan (Italy)); Rocchi, M. (Istituto G. Gaslini, Genoa (Italy))

    1991-01-15

    The authors have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA encoding such a peptide. A 60-base-pair (bp) oligonucleotide synthesized on the basis of the sequence from this fragment was used for the screening of a cDNA library from human liver and hybridized to a cDNA insert of 2255 bp. This cDNA contains an open reading frame of 1974 bp that encodes a protein of 658 amino acid residues including 25 residues of an NH{sub 2}-terminal leader peptide. The assignment of this open reading frame to human liver CPTase is confirmed by matches to seven different amino acid sequences of tryptic peptides derived from pure human CPTase and by the 82.2% homology with the amino acid sequence of rat CPTase. The NH{sub 2}-terminal region of CPTase contains a leucine-proline motif that is shared by carnitine acetyl- and octanoyltransferases and by choline acetyltransferase. The gene encoding CPTase was assigned to human chromosome 1, region 1q12-1pter, by hybridization of CPTase cDNA with a DNA panel of 19 human-hanster somatic cell hybrids.

  20. CLONING AND EXPRESSION OF A cDNA SEQUENCE FOR HUMAN THIOREDOXIN

    Institute of Scientific and Technical Information of China (English)

    Liu Qingyong(刘庆勇); Ruan Xiyun(阮喜云); Liu Xiaogong(刘效恭); Ji Zongzheng(纪宗正); Dang Jiangong; Nan Xunyi(南勋义); Wang Quanying(王全颖); Yang Guangxiao(杨广笑)

    2003-01-01

    Objective To clone and determine the sequence and expression of a cDNA segment for human thioredoxin. Methods The cDNA segment of thioredoxin was obtained through amplification by RT-PCR cloning from 143 (TK-) human osteosarcoma cell. The amplified products were cloned into pGEM-T Easy vector and sequenced. Then the expressed vector pBV220-hTRX was constructed and transformed into E.coli strain DH5α for hTRX expression. The hTRX was purified by DEAE-Sephadex A-50 column and the activity of recombinant hTRX was determined by the insulin disulfide reduction assay. Results Comparison of cDNA sequence of the cloned fragments with that of the reported hTRX (GenBank J04026) demonstrated that there were two differences compared to the reported cDNA sequence for hTRX at bp180 and bp284, and the amino acids enceoded altered respectively, but motif of the sequence was identical to that of the reported hTRX. The recombinant hTRX can catalyze insulin reduction by DTT. Conclusion The successful cloning and expression of hTRX cDNA formed a basis for further study on biological functions and utilization of hTRX.

  1. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase

    International Nuclear Information System (INIS)

    The authors have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA encoding such a peptide. A 60-base-pair (bp) oligonucleotide synthesized on the basis of the sequence from this fragment was used for the screening of a cDNA library from human liver and hybridized to a cDNA insert of 2255 bp. This cDNA contains an open reading frame of 1974 bp that encodes a protein of 658 amino acid residues including 25 residues of an NH2-terminal leader peptide. The assignment of this open reading frame to human liver CPTase is confirmed by matches to seven different amino acid sequences of tryptic peptides derived from pure human CPTase and by the 82.2% homology with the amino acid sequence of rat CPTase. The NH2-terminal region of CPTase contains a leucine-proline motif that is shared by carnitine acetyl- and octanoyltransferases and by choline acetyltransferase. The gene encoding CPTase was assigned to human chromosome 1, region 1q12-1pter, by hybridization of CPTase cDNA with a DNA panel of 19 human-hanster somatic cell hybrids

  2. Brain tubulin and actin cDNA sequences: isolation of recombinant plasmids.

    OpenAIRE

    Ginzburg, I.(Sobolev Institute of Mathematics and Novosibirsk State University, 630090, Novosibirsk, Russia); de Baetselier, A; Walker, M D; Behar, L; Lehrach, H; Frischauf, A M; Littauer, U Z

    1980-01-01

    Rat brain mRNA enriched for tubulin and actin sequences was used to prepare double stranded cDNA. A library of recombinant clones was constructed by inserting the dsDNA into the Pst1 site of pBR322 plasmid and transformation of E. coli chi 1776 host. Clones bearing sequences coding for tubulin and actin were identified and characterized.

  3. Cloning, sequencing, and expression of cDNA for human β-glucuronidase

    International Nuclear Information System (INIS)

    The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH2-terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length

  4. cDNA sequence of a new chicken embryonic rho-globin.

    OpenAIRE

    Roninson, I B; Ingram, V M

    1981-01-01

    In order to use specific DNA probes for the study of developmentally regulated gene expression, we have prepared cDNA clones corresponding to chicken embryonic globins by inserting cDNA.mRNA hybrids into the Pst I site of the plasmid pBR322 by using poly(dG) and poly(dC) linkers. The nucleotide sequence of the insert of one clone, representing a nearly full-length copy of an embryonic beta-like globin cDNA, has been determined. The amino acid sequence of the globin encoded by this insert is i...

  5. Cloning and sequencing of complete -crystallin cDNA from embryonic lens of Crocodylus palustris

    Indian Academy of Sciences (India)

    Raman Agrawal; Reena Chandrashekhar; Anurag Kumar Mishra; Jetty Ramadevi; Yogendra Sharma; Ramesh K Aggarwal

    2002-06-01

    -Crystallin is a taxon-specific structural protein found in eye lenses. We present here the cloning and sequencing of complete -crystallin cDNA from the embryonic lens of Crocodylus palustris and establish it to be identical to the -enolase gene from non-lenticular tissues. Quantitatively, the -crystallin was found to be the least abundant crystallin of the crocodilian embryonic lenses. Crocodile -crystallin cDNA was isolated by RT-PCR using primers designed from the only other reported sequence from duck and completed by 5′- and 3′-rapid amplification of cDNA ends (RACE) using crocodile gene specific primers designed in the study. The complete -crystallin cDNA of crocodile comprises 1305 bp long ORF and 92 and 409 bp long untranslated 5′-and 3′-ends respectively. Further, it was found to be identical to its putative counterpart enzyme -enolase, from brain, heart and gonad, suggesting both to be the product of the same gene. The study thus provides the first report on cDNA sequence of -crystallin from a reptilian species and also re-confirms it to be an example of the phenomenon of gene sharing as was demonstrated earlier in the case of peking duck. Moreover, the gene lineage reconstruction analysis helps our understanding of the evolution of crocodilians and avian species.

  6. cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity

    International Nuclear Information System (INIS)

    The hst gene was originally identified as a transforming gene in DNAs from human stomach cancers and from a noncancerous portion of stomach mucosa by DNA-mediated transfection assay using NIH3T3 cells. cDNA clones of hst were isolated from the cDNA library constructed from poly(A)+ RNA of a secondary transformant induced by the DNA from a stomach cancer. The sequence analysis of the hst cDNA revealed the presence of two open reading frames. When this cDNA was inserted into an expression vector containing the simian virus 40 promoter, it efficiently induced the transformation of NIH3T3 cells upon transfection. It was found that one of the reading frames, which coded for 206 amino acids, was responsible for the transforming activity

  7. Molecular cloning and sequence analysis of cDNA encoding human prostatic acid phosphatase.

    Science.gov (United States)

    Vihko, P; Virkkunen, P; Henttu, P; Roiko, K; Solin, T; Huhtala, M L

    1988-08-29

    lambda gt11 clones encoding human prostatic acid phosphatase (PAP) (EC 3.1.3.2) were isolated from human prostatic cDNA libraries by immunoscreening with polyclonal antisera. Sequence data obtained from several overlapping clones indicated that the composite cDNAs contained the complete coding region for PAP, which encodes a 354-residue protein with a calculated molecular mass of 41,126 Da. In the 5'-end, the cDNA codes for a signal peptide of 32 amino acids. Direct protein sequencing of the amino-terminus of the mature protein and its proteolytic fragments confirmed the identity of the predicted protein sequence. PAP has no apparent sequence homology to other known proteins. However, both the cDNA clones coding for human placental alkaline phosphatase and PAP have an alu-type repetitive sequence about 900 nucleotides downstream from the coding region in the 3'-untranslated region. Two of our cDNA clones differed from others at the 5'-ends. RNA blot analysis indicated mRNA of 3.3 kb. We are continuing to study whether acid phosphatases form a gene family as do alkaline phosphatases. PMID:2842184

  8. Cloning and sequencing of human intestinal alkaline phosphatase cDNA

    International Nuclear Information System (INIS)

    Partial protein sequence data obtained on intestinal alkaline phosphatase indicated a high degree of homology with the reported sequence of the placental isoenzyme. Accordingly, placental alkaline phosphatase cDNA was cloned and used as a probe to clone intestinal alkaline phosphatase cDNA. The latter is somewhat larger (3.1 kilobases) than the cDNA for the placental isozyme (2.8 kilobases). Although the 3' untranslated regions are quite different, there is almost 90% homology in the translated regions of the two isozymes. There are, however, significant differences at their amino and carboxyl termini and a substitution of an alanine in intestinal alkaline phosphatase for a glycine in the active site of the placental isozyme

  9. Molecular Cloning and Sequencing of Channel Catfish, Ictalurus punctatus, Cathepsin H and L cDNA

    Science.gov (United States)

    Cathepsin H and L, a lysosomal cysteine endopeptidase of the papain family, are ubiquitously expressed and involve in antigen processing. In this communication, the channel catfish cathepsin H and L transcripts were sequenced and analyzed. Total RNA from tissues was extracted and cDNA libraries we...

  10. Complete amino acid sequence of human intestinal aminopeptidase N as deduced from cloned cDNA

    DEFF Research Database (Denmark)

    Cowell, G M; Kønigshøfer, E; Danielsen, E M;

    1988-01-01

    The complete primary structure (967 amino acids) of an intestinal human aminopeptidase N (EC 3.4.11.2) was deduced from the sequence of a cDNA clone. Aminopeptidase N is anchored to the microvillar membrane via an uncleaved signal for membrane insertion. A domain constituting amino acid 250...

  11. Deep sequencing approach for investigating infectious agents causing fever.

    Science.gov (United States)

    Susilawati, T N; Jex, A R; Cantacessi, C; Pearson, M; Navarro, S; Susianto, A; Loukas, A C; McBride, W J H

    2016-07-01

    Acute undifferentiated fever (AUF) poses a diagnostic challenge due to the variety of possible aetiologies. While the majority of AUFs resolve spontaneously, some cases become prolonged and cause significant morbidity and mortality, necessitating improved diagnostic methods. This study evaluated the utility of deep sequencing in fever investigation. DNA and RNA were isolated from plasma/sera of AUF cases being investigated at Cairns Hospital in northern Australia, including eight control samples from patients with a confirmed diagnosis. Following isolation, DNA and RNA were bulk amplified and RNA was reverse transcribed to cDNA. The resulting DNA and cDNA amplicons were subjected to deep sequencing on an Illumina HiSeq 2000 platform. Bioinformatics analysis was performed using the program Kraken and the CLC assembly-alignment pipeline. The results were compared with the outcomes of clinical tests. We generated between 4 and 20 million reads per sample. The results of Kraken and CLC analyses concurred with diagnoses obtained by other means in 87.5 % (7/8) and 25 % (2/8) of control samples, respectively. Some plausible causes of fever were identified in ten patients who remained undiagnosed following routine hospital investigations, including Escherichia coli bacteraemia and scrub typhus that eluded conventional tests. Achromobacter xylosoxidans, Alteromonas macleodii and Enterobacteria phage were prevalent in all samples. A deep sequencing approach of patient plasma/serum samples led to the identification of aetiological agents putatively implicated in AUFs and enabled the study of microbial diversity in human blood. The application of this approach in hospital practice is currently limited by sequencing input requirements and complicated data analysis. PMID:27180244

  12. Isolation and sequence analysis of a cDNA clone encoding the fifth complement component

    DEFF Research Database (Denmark)

    Lundwall, Åke B; Wetsel, Rick A; Kristensen, Torsten;

    1985-01-01

    clone of 1.85 kilobase pairs was isolated. Hybridization of the mixed-sequence probe to the complementary strand of the plasmid insert and sequence analysis by the dideoxy method predicted the expected protein sequence of C5a (positions 1-12), amino-terminal to the anticipated priming site. The sequence......We have used available protein sequence data for the anaphylatoxin (C5a) portion of the fifth component of human complement (residues 19-25) to synthesize a mixed-sequence oligonucleotide probe. The labeled oligonucleotide was then used to screen a human liver cDNA library, and a single candidate cDNA...... obtained further predicted an arginine-rich sequence (RPRR) immediately upstream of the N-terminal threonine of C5a, indicating that the promolecule form of C5 is synthesized with a beta alpha-chain orientation as previously shown for pro-C3 and pro-C4. The C5 cDNA clone was sheared randomly by sonication...

  13. Budding yeast cDNA sequencing project: S03036-24_C13 [Budding yeast cDNA sequencing project

    Lifescience Database Archive (English)

    Full Text Available S03036-24_C 13 - - - Show S03036-24_C 13 Seqid S03036-24_C 13 Link to SGD - Link to dbEST - Link to ... UC SC ... Genome Browser - Sequenc e >S03036-24_C 13.phd NNN ... NNNNNNNNNNNNNNANNNNNNNNTGNNGGNNNNNTNNNNNN ANTTGNTATC GAC TTNNTTCC NNNNGAGANNNNNNTGGANAANGANGAGNNNNNGNANAC ... TTNNGANAAANANNC NNANGGTGAGAGCC AGATCC NNNNNNGC GGATTGNGAGC AAATC G TTAAG ... TTC AGGTC AAGTAAAAATTGATTTC NAAAAC TAATTTC TC TTATAC NANNNTTT ...

  14. Budding yeast cDNA sequencing project: S03052-35_P07 [Budding yeast cDNA sequencing project

    Lifescience Database Archive (English)

    Full Text Available S03052-35_P07 S03052-35_P07 - - Show S03052-35_P07 Seqid S03052-35_P07 Link to SGD S03052-35_P07 ... Browser - Sequence >S03052-35_P07.phd CGGNNNNNNNTTTTNT CAGAATCTNNNNNNTNNATACGACTCACTNT AGGGCGAATTGGC GGCCAA ... TCATANCTGTTT CCTGTGTGAAATTGTTATCCGCTNAAAATTCCACACAATNT ACGATCNNNAANCATAAAG TGTAAAGCCTGGGGGTGCCTAATGGAGTGAG ... TCCCTCCCGCANNCCTGAGGAGGGAGAGGGGATTTATGGATCTTCTAAGA TNT ... Quality >S03052-35_P07.phd 9 8 8 5 4 4 5 4 4 5 8 1 ...

  15. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase.

    Science.gov (United States)

    Finocchiaro, G; Taroni, F; Rocchi, M; Martin, A L; Colombo, I; Tarelli, G T; DiDonato, S

    1991-01-01

    We have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase (CPTase; palmitoyl-CoA:L-carnitine O-palmitoyltransferase, EC 2.3.1.21), an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA encoding such a peptide. A 60-base-pair (bp) oligonucleotide synthesized on the basis of the sequence from this fragment was used for the screening of a cDNA library from human liver and hybridized to a cDNA insert of 2255 bp. This cDNA contains an open reading frame of 1974 bp that encodes a protein of 658 amino acid residues including 25 residues of an NH2-terminal leader peptide. The assignment of this open reading frame to human liver CPTase is confirmed by matches to seven different amino acid sequences of tryptic peptides derived from pure human CPTase and by the 82.2% homology with the amino acid sequence of rat CPTase. The NH2-terminal region of CPTase contains a leucine-proline motif that is shared by carnitine acetyl- and octanoyltransferases and by choline acetyltransferase. The gene encoding CPTase was assigned to human chromosome 1, region 1q12-1pter, by hybridization of CPTase cDNA with a DNA panel of 19 human-hamster somatic cell hybrids. Images PMID:1988962

  16. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase.

    OpenAIRE

    Finocchiaro, G; Taroni, F; Rocchi, M; Martin, A.L.; Colombo, I; Tarelli, G T; DiDonato, S

    1991-01-01

    We have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase (CPTase; palmitoyl-CoA:L-carnitine O-palmitoyltransferase, EC 2.3.1.21), an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA enco...

  17. Quantitative phenotyping via deep barcode sequencing

    OpenAIRE

    SMITH, ANDREW M.; Heisler, Lawrence E.; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J.; Chee, Mark; Roth, Frederick P.; Giaever, Guri; Nislow, Corey

    2009-01-01

    Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied...

  18. [Construction and sequencing of full-length cDNA of peste des petits ruminants virus].

    Science.gov (United States)

    Zhai, Jun-Jun; Dou, Yong-Xi; Zhang, Hai-Rui; Mao, Li; Meng, Xue-Lian; Luo, Xuo-Nong; Cai, Xue-Peng

    2010-07-01

    To develop a reverse genetics system of Peste des petits ruminants virus(PPRV), five pairs of oligonucleotide primers were designed on the basis of the full-length genomic sequence of PPRV Nigeria 75/ 1 strain. Using RT-PCR technique, five over-lapping cDNA fragments, designated as JF1, JF2, JF3, JF4 and JF5, respectively, were amplified, followed by cloning into pcDNA3.1(+)vector. An AscI restriction enzyme site and a T7 promoter sequence were introduced immediately upstream of 5'-end, while a PacI restriction enzyme site was engineered downstream of 3'-end. Using pok12 as a plasmid vector, the full-length cDNA clone pok12-PPRV of Nigeria 75/1 was assembled by connecting the five cDNA fragments via the unique restriction endonuclease site of PPRV genome. The resultant nucleotide sequence of the PPRV Nigeria 75/1 strain in the study was compared with other members of genus morbillivirus, and phylogenetic analysis was used to examine the evolutionary relationships. The results showed that PPRV Nigeria 75/ 1 was antigenically closely related to Rinderpest virus and Measles virus. Successful construction of full-length cDNA clone of PPRV Nigeria 75/1 strain lays the basis rescuing PPRV effectively and enables further research of PPRV at molecular level. PMID:20836386

  19. Human thrombomodulin: complete cDNA sequence and chromosome localization of the gene

    International Nuclear Information System (INIS)

    A human umbilical vein endothelial cell cDNA library in λgt11 was screened for expression of thrombomodulin antigens with affinity-purified rabbit polyclonal anti-thrombomodulin immunoglobulin G (IgG) and mouse monoclonal anti-human thrombomodulin IgG. Among 7 million recombinant clones screened, 12 were recognized by both antibodies. Two of these, λHTm10 and λHTm12, were shown to encode thrombomodulin by comparison of the amino acid sequence deduced from the nucleotide sequence to the amino acid sequence determined directly from tryptic peptides of thrombomodulin. Thrombomodulin mRNA was estimated to be 3.7 kilobases in length by Northern blot analysis of endothelial cell and placental poly(A) + RNA. Thrombomodulin mRNA was not detected in human brain, HepG2 hepatoma cells, or the monocytic U937 cell line. Additional cDNA clones were selected by hybridization with the 1.2-kilobase insert of λHTm10. One isolate, λHTm15, contained a 3693 base pair cDNA insert with an apparent 5'-noncoding region of 146 base pairs, an open reading frame of 1725 base pairs, a stop codon, a 3'-noncoding region of 1779 base pairs, and a poly(A) tail of 40 base pairs. The cDNA sequence encodes a 60.3-kDa protein of 575 amino acids. The organization of thrombomodulin is similar to that of the low-density lipoprotein receptor, and the protein is homologous to a large number of other proteins that also contain EGF-like domains, including factor VII, factor IX, factor X, factor XII, protein C, tissue plasminogen activator, and urokinase. The gene for thrombomodulin has been localized to chromosome 20 by hybridization of cDNA probes to purified human chromosomes

  20. Cloning and Sequence Analysis of cDNA Encoding MRJP3 of Apis cerana cerana

    Institute of Scientific and Technical Information of China (English)

    SU Song-kun; ZHNEG Huo-qing; CHEN Sheng-lu; ZHONG Bo-xiong; Stefan Albert

    2005-01-01

    By screening the worker (Apis cerana cerana) heads cDNA library using a fragment of the mrjp3 gene ofApis cerana as probe, 120 positive clones were obtained. The clone containing A. cerana cerana MRJP3 (AccMRJP3) cDNA was selected. Based on the sequencing of the inserts of the positive clone, a sequence of AccMRJP3 cDNA which is 1 887 bp long including a poly (A) tail was obtained. The AccMRJP3 cDNA encompassed an open-reading frame (ORF) with 1 779 bp encoding 593 amino acids. The un-translated regions (UTR) of the 5' end and 3' end are 46 bp and 160 bp in length,respectively. Similar to AmMRJP3 and AdMRJP3, the putative AccMRJP3 also has a repetitive region. The comparison of the repetitive region of AccMRJP3, AmMRJP3 and AdMRJP3 shows some differences between them.

  1. Generation and Analysis of Full-length cDNA Sequences from Elephant Shark (Callorhinchus milii)

    KAUST Repository

    Kodzius, Rimantas

    2009-03-17

    Cartilaginous fishes are the oldest living group of jawed vertebrates and therefore is an important group for understanding the evolution of vertebrate genomes including the human genome. Our laboratory has proposed elephant shark (C. milii) as a model cartilaginous fish genome because of its relatively small genome size (910 Mb). The whole genome of C. milii is being sequenced (first cartilaginous fish genome to be sequenced completely). To characterize the transcriptome of C. milii and to assist in annotating exon-intron boundaries, transcriptional start sites and alternatively spliced transcripts, we are generating full-length cDNA sequences from C. milii.

  2. Cloning and sequencing of Indian Water buffalo (Bubalus bubalis) interleukin-3 cDNA

    KAUST Repository

    Sugumar, Thennarasu

    2011-12-12

    Full-length cDNA (435 bp) of the interleukin-3(IL-3) gene of the Indian water buffalo was amplified by reverse transcriptase-polymerase chain reaction and sequenced. This sequence had 96% nucleotide identity and 92% amino acid identity with bovine IL-3. There are 10 amino acid substitutions in buffalo compared with that of bovine. The amino acid sequence of buffalo IL-3 also showed very high identity with that of other ruminants, indicating functional cross-reactivity. Structural homology modelling of buffalo IL-3 protein with human IL-3 showed the presence of five helical structures.

  3. Rat beta casein cDNA: sequence analysis and evolutionary comparisons.

    OpenAIRE

    Blackburn, D E; Hobbs, A A; Rosen, J. M.

    1982-01-01

    The complete sequence of a 1072 nucleotide rat beta-casein cDNA insertion in the hybrid plasmid pC beta 23 has been determined. Primer extension was employed to determine the sequence of an additional 82 5'-terminal nucleotides in beta-casein mRNA. Rat beta-casein mRNA consists of a 696 nucleotide coding region, flanked by 52 nucleotide 5' and 406 nucleotide 3' noncoding regions, including a 40 nucleotide poly(A) tail. The derived 216 amino acid sequence of rat beta-casein was compared to the...

  4. Molecular cloning and sequencing of a cDNA encoding partial putative molt-inhibiting hormone from Penaeus chinensis

    Science.gov (United States)

    Wang, Zai-Zhao; Xiang, Jian-Hai

    2002-09-01

    Total RNA was extracted from eyestalks of shrimp Penaeus chinensis. Eyestalk cDNA was obtained from total RNA by reverse transcription. Reverse transcriptase-polymerase chain reaction (RT-PCR) was initiated using eyestalk cDNA and degenerate primers designed from the amino acid sequence of molt-inhibiting hormone from shrimp Penaeus japonicus. A specific cDNA was obtained and cloned into a T vector for sequencing. The cDNA consisted of 201 base pairs and encoding for a peptide of 67 amino acid residues. The peptide of P. chinensis had the highest identity with molt-inhibiting hormones of P. japonicus. The cDNA could be a partial gene of molt-inhibiting hormones from P. chinensis. This paper reports for the first time cDNA encoding for neuropeptide of P. chinensis.

  5. MOLECULAR CLONING AND SEQUENCING OF A cDNA ENCODING PARTIAL PUTATIVE MOLT-INHIBITING HORMONE FROM PENAEUS CHINENSIS

    Institute of Scientific and Technical Information of China (English)

    王在照; 相建海

    2002-01-01

    Total RNA was extracted from eyestalks of shrimp Penaeue chinensis. Eyestalk cDNA was obtained from total RNA by reverse transcription. Reverse transcriptase-polymerase chain reaction (RT-PCR) was initiated using eyestalk cDNA and degenerate primers designed from the amino acid sequence of molt-inhibiting hormone from shrimp Penaeus japonicus. A specific cDNA was obtained and cloned into a T vector for sequencing. The cDNA consisted of 201 base pairs and encoding for a peptide of 67 amino acid residues. The peptide of P. chinensis had the highest identity with molt-inhibiting hormones of P. japonicus. The cDNA could be a partial gene of molt-inhibiting hormones from P. chinensis. This paper reports for the first time cDNA encoding for neuropeptide of P. chinensis.

  6. MOLECULAR CLONING AND SEQUENCING OF A cDNA ENCODING PARTIAL PUTATIVE MOLT-INHIBITING HORMONE FROM PENAEUS CHINENSIS

    Institute of Scientific and Technical Information of China (English)

    王在照; 相建海

    2002-01-01

    Total RNA was extracted from eyestalks of shrimp Penaeus chinensis. Eyestalk cDNA was obtained from total RNA by reverse transcription. Reverse transcriptase-polymer ase chain reaction (RT-PCR) was initiated using eyestalk cDNA and degenerate primers designed from the amino acid sequence of molt-inhibiting hormone from shrimp Penaeus japonicus. A s pecific cDNA was obtained and cloned into a T vector for sequencing. The cDNA consisted of 201 ba se pairs and encoding for a peptide of 67 amino acid residues. The peptide of P. chinensis had the highest identity with molt-inhibiting hormones of P. japonicus. The cDNA could be a partial gene of molt-inhibiting hormones from P. chinensis. This paper reports for the first time cDNA encoding for neuropeptide of P. chinensis.

  7. Human tissue factor: cDNA sequence and chromosome localization of the gene

    International Nuclear Information System (INIS)

    A human placenta cDNA library in λgt11 was screened for the expression of tissue factor antigens with rabbit polyclonal anti-human tissue factor immunoglobulin G. Among 4 million recombinant clones screened, one positive, λHTF8, expressed a protein that shared epitopes with authentic human brain tissue factor. The 1.1-kilobase cDNA insert of λHTF8 encoded a peptide that contained the amino-terminal protein sequence of human brain tissue factor. Northern blotting identified a major mRNA species of 2.2 kilobases and a minor species of ∼ 3.2 kilobases in poly(A) + RNA of placenta. Only 2.2-kilobase mRNA was detected in human brain and in the human monocytic U937 cell line. In U937 cells, the quantity of tissue factor mRNA was increased several fold by exposure of the cells to phorbol 12-myristate 13-acetate. Additional cDNA clones were selected by hybridization with the cDNA insert of λHTF8. These overlapping isolates span 2177 base pairs of the tissue factor cDNA sequence that includes a 5'-noncoding region of 75 base pairs, an open reading frame of 885 base pairs, a stop codon, a 3'-noncoding region of 1141 base pairs, and a poly(a) tail. The open reading frame encodes a 33-kilodalton protein of 295 amino acids. The predicted sequence includes a signal peptide of 32 or 34 amino acids, a probable extracellular factor VII binding domain of 217 or 219 amino acids, a transmembrane segment of 23 acids, and a cytoplasmic tail of 21 amino acids. There are three potential glycosylation sites with the sequence Asn-X-Thr/Ser. The 3'-noncoding region contains an inverted Alu family repetitive sequence. The tissue factor gene was localized to chromosome 1 by hybridization of the cDNA insert of λHTF8 to flow-sorted human chromosomes

  8. Microarray and cDNA sequence analysis of transcription during nerve-dependent limb regeneration

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2009-01-01

    Full Text Available Abstract Background Microarray analysis and 454 cDNA sequencing were used to investigate a centuries-old problem in regenerative biology: the basis of nerve-dependent limb regeneration in salamanders. Innervated (NR and denervated (DL forelimbs of Mexican axolotls were amputated and transcripts were sampled after 0, 5, and 14 days of regeneration. Results Considerable similarity was observed between NR and DL transcriptional programs at 5 and 14 days post amputation (dpa. Genes with extracellular functions that are critical to wound healing were upregulated while muscle-specific genes were downregulated. Thus, many processes that are regulated during early limb regeneration do not depend upon nerve-derived factors. The majority of the transcriptional differences between NR and DL limbs were correlated with blastema formation; cell numbers increased in NR limbs after 5 dpa and this yielded distinct transcriptional signatures of cell proliferation in NR limbs at 14 dpa. These transcriptional signatures were not observed in DL limbs. Instead, gene expression changes within DL limbs suggest more diverse and protracted wound-healing responses. 454 cDNA sequencing complemented the microarray analysis by providing deeper sampling of transcriptional programs and associated biological processes. Assembly of new 454 cDNA sequences with existing expressed sequence tag (EST contigs from the Ambystoma EST database more than doubled (3935 to 9411 the number of non-redundant human-A. mexicanum orthologous sequences. Conclusion Many new candidate gene sequences were discovered for the first time and these will greatly enable future studies of wound healing, epigenetics, genome stability, and nerve-dependent blastema formation and outgrowth using the axolotl model.

  9. Rice bicoid-related cDNA sequence and its expression during early embryogenesis

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Bicoid is one of the important Drosophila maternal genes involved in the control of embryo polarity and larvae segmentation.To clone and characterize the rice bicoid-related genes,one cDNA clone,Rb24 (EMBL accession number: AJ2771380),was isolated by screening of rice unmature seed cDNA library.Sequence analysis indicates that Rb24 contains a putative amino acid sequence,which is homologous to unique 8 amino acids sequence within Drosophila bicoid homeodomain (50% identity,75% similarity) and involves a lys-9 in putative helix 3.Northern blot analysis of rice RNA has shown that this sequence is expressed in a tissue-specific manner.The transcript was detected strongly in young panicles,but less in young leaves and roots.This results are further confirmed with paraffin section in situ hybridization.The signal is intensive in rice globular embryo and located at the apical tip of the embryo,then,along with the development of embryo,the signal is getting reduced and transfers into both sides of embryo.The existence of bicoid-related sequence in rice embryo and the similarity of polar distribution of bicoid and Rb24 mRNA in early embryo development may implicates a conserved maternal regulation mechanism of body axis presents in Drosophila and in rice.

  10. Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger

    Directory of Open Access Journals (Sweden)

    Chang-Qing Liu, Tao-Feng Lu, Bao-Gang Feng, Dan Liu, Wei-Jun Guan, Yue-Hui Ma

    2010-01-01

    Full Text Available In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×106 pfu/ml and 1.62×109 pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers.

  11. Benchmarking of the Oxford Nanopore MinION sequencing for quantitative and qualitative assessment of cDNA populations.

    Science.gov (United States)

    Oikonomopoulos, Spyros; Wang, Yu Chang; Djambazian, Haig; Badescu, Dunarel; Ragoussis, Jiannis

    2016-01-01

    To assess the performance of the Oxford Nanopore Technologies MinION sequencing platform, cDNAs from the External RNA Controls Consortium (ERCC) RNA Spike-In mix were sequenced. This mix mimics mammalian mRNA species and consists of 92 polyadenylated transcripts with known concentration. cDNA libraries were generated using a template switching protocol to facilitate the direct comparison between different sequencing platforms. The MinION performance was assessed for its ability to sequence the cDNAs directly with good accuracy in terms of abundance and full length. The abundance of the ERCC cDNA molecules sequenced by MinION agreed with their expected concentration. No length or GC content bias was observed. The majority of cDNAs were sequenced as full length. Additionally, a complex cDNA population derived from a human HEK-293 cell line was sequenced on an Illumina HiSeq 2500, PacBio RS II and ONT MinION platforms. We observed that there was a good agreement in the measured cDNA abundance between PacBio RS II and ONT MinION (rpearson = 0.82, isoforms with length more than 700bp) and between Illumina HiSeq 2500 and ONT MinION (rpearson = 0.75). This indicates that the ONT MinION can sequence quantitatively both long and short full length cDNA molecules. PMID:27554526

  12. Benchmarking of the Oxford Nanopore MinION sequencing for quantitative and qualitative assessment of cDNA populations

    Science.gov (United States)

    Oikonomopoulos, Spyros; Wang, Yu Chang; Djambazian, Haig; Badescu, Dunarel; Ragoussis, Jiannis

    2016-01-01

    To assess the performance of the Oxford Nanopore Technologies MinION sequencing platform, cDNAs from the External RNA Controls Consortium (ERCC) RNA Spike-In mix were sequenced. This mix mimics mammalian mRNA species and consists of 92 polyadenylated transcripts with known concentration. cDNA libraries were generated using a template switching protocol to facilitate the direct comparison between different sequencing platforms. The MinION performance was assessed for its ability to sequence the cDNAs directly with good accuracy in terms of abundance and full length. The abundance of the ERCC cDNA molecules sequenced by MinION agreed with their expected concentration. No length or GC content bias was observed. The majority of cDNAs were sequenced as full length. Additionally, a complex cDNA population derived from a human HEK-293 cell line was sequenced on an Illumina HiSeq 2500, PacBio RS II and ONT MinION platforms. We observed that there was a good agreement in the measured cDNA abundance between PacBio RS II and ONT MinION (rpearson = 0.82, isoforms with length more than 700bp) and between Illumina HiSeq 2500 and ONT MinION (rpearson = 0.75). This indicates that the ONT MinION can sequence quantitatively both long and short full length cDNA molecules. PMID:27554526

  13. Cloning and sequence analysis of cDNA for the canine neurotensin/neuromedin N precursor.

    OpenAIRE

    Dobner, P R; Barber, D L; Villa-Komaroff, L; McKiernan, C

    1987-01-01

    Cloned cDNAs encoding neurotensin were isolated from a cDNA library derived from primary cultures of canine enteric mucosa cells. Nucleotide sequence analysis has revealed the primary structure of a 170-amino acid precursor protein that encodes both neurotensin and the neurotensin-like peptide neuromedin N. The peptide-coding domains are located in tandem near the carboxyl terminus of the precursor and are bounded and separated by the paired, basic amino acid residues Lys-Arg. An additional c...

  14. Serine Protease Variants Encoded by Echis ocellatus Venom Gland cDNA: Cloning and Sequencing Analysis

    Directory of Open Access Journals (Sweden)

    S. S. Hasson

    2010-01-01

    Full Text Available Envenoming by Echis saw-scaled viper is the leading cause of death and morbidity in Africa due to snake bite. Despite its medical importance, there have been few investigations into the toxin composition of the venom of this viper. Here, we report the cloning of cDNA sequences encoding four groups or isoforms of the haemostasis-disruptive Serine protease proteins (SPs from the venom glands of Echis ocellatus. All these SP sequences encoded the cysteine residues scaffold that form the 6-disulphide bonds responsible for the characteristic tertiary structure of venom serine proteases. All the Echis ocellatus EoSP groups showed varying degrees of sequence similarity to published viper venom SPs. However, these groups also showed marked intercluster sequence conservation across them which were significantly different from that of previously published viper SPs. Because viper venom SPs exhibit a high degree of sequence similarity and yet exert profoundly different effects on the mammalian haemostatic system, no attempt was made to assign functionality to the new Echis ocellatus EoSPs on the basis of sequence alone. The extraordinary level of interspecific and intergeneric sequence conservation exhibited by the Echis ocellatus EoSPs and analogous serine proteases from other viper species leads us to speculate that antibodies to representative molecules should neutralise (that we will exploit, by epidermal DNA immunization the biological function of this important group of venom toxins in vipers that are distributed throughout Africa, the Middle East, and the Indian subcontinent.

  15. Serine protease variants encoded by Echis ocellatus venom gland cDNA: cloning and sequencing analysis.

    Science.gov (United States)

    Hasson, S S; Mothana, R A; Sallam, T A; Al-balushi, M S; Rahman, M T; Al-Jabri, A A

    2010-01-01

    Envenoming by Echis saw-scaled viper is the leading cause of death and morbidity in Africa due to snake bite. Despite its medical importance, there have been few investigations into the toxin composition of the venom of this viper. Here, we report the cloning of cDNA sequences encoding four groups or isoforms of the haemostasis-disruptive Serine protease proteins (SPs) from the venom glands of Echis ocellatus. All these SP sequences encoded the cysteine residues scaffold that form the 6-disulphide bonds responsible for the characteristic tertiary structure of venom serine proteases. All the Echis ocellatus EoSP groups showed varying degrees of sequence similarity to published viper venom SPs. However, these groups also showed marked intercluster sequence conservation across them which were significantly different from that of previously published viper SPs. Because viper venom SPs exhibit a high degree of sequence similarity and yet exert profoundly different effects on the mammalian haemostatic system, no attempt was made to assign functionality to the new Echis ocellatus EoSPs on the basis of sequence alone. The extraordinary level of interspecific and intergeneric sequence conservation exhibited by the Echis ocellatus EoSPs and analogous serine proteases from other viper species leads us to speculate that antibodies to representative molecules should neutralise (that we will exploit, by epidermal DNA immunization) the biological function of this important group of venom toxins in vipers that are distributed throughout Africa, the Middle East, and the Indian subcontinent. PMID:20936075

  16. Cloning and Sequencing cDNA Encoding for Rhoptry-2 Toxoplasma Gondii Tachyzoite Local Isolate

    Directory of Open Access Journals (Sweden)

    Murwantoko M

    2015-11-01

    Full Text Available Rhoptry protein belongs to an excretory and secretory antigens (ESAs that play an important role during active penetration of parasite into the cell target. This protein an able Toxoplasma gondii to actively penetrate targeted cell, meanwhile ESAs protein stimulates intracellular vacuole modification. It is, therefore, after the parasite successfully enter the cell target then Granule (GRA proteins are responsible for the formation of parasitophorus vacuole, which is protect the fusion with other intracellular compartments such as lysosomal vacuole. Consequently, this parasite is being able to survive and multiply at the cell target. The current study was aimed to clone and sequens cDNA encoding for ROP-2 of local isolated T. gondii tachizoite through DNA recombinant technique. Total ribonucleic acid (RNA was isolated from tachyzoites of local isolated T. gondii that were grown up in Balb/c mice. Messenger RNA was isolated from total RNA using PolyAtract mRNA Isolation System. Messenger RNA was used as a template for synthesis cDNA using Riboclone cDNA Synthesis System AMV-RT. EcoRI adaptor from Riboclone EcoRI Adaptor Ligation System was added to Complementary DNA and than ligated to pUC19. Recombinant plasmid was transformed into E. coli (XL1-Blue. The transformed E. coli XL-1 Blue were plated on LB agar containing X-Gal, IPTG and ampicillin. Recombinant clones (white colony were picked up and grown up in the LB medium at 37oC overnight. Expression of recombinant protein was analysed by immunoblotting in order to identify cDNA recombinant wich is express ESA of T. gondii local isolate. Recombinant plasmid were isolated using alkalilysis method and were elektroforated in 1% agarose gel. The isolated DNA recombinant plasmid was cut using Eco RI and then sequenced through Big Dye Terminator Mix AB1 377A Sequencer using M13 Forward and M13 Reverse primers. The conclusion of this results showed that the recombinant clone was coding for excretory

  17. Cloning and Sequencing cDNA Encoding for Rhoptry-2 Toxoplasma Gondii Tachyzoite Local Isolate

    Directory of Open Access Journals (Sweden)

    Wayan T. Artama

    2015-10-01

    Full Text Available Rhoptry protein belongs to an excretory and secretory antigens (ESAs that play an important role during activepenetration of parasite into the cell target. This protein an able Toxoplasma gondii to actively penetrate targetedcell, meanwhile ESAs protein stimulates intracellular vacuole modification. It is, therefore, after the parasitesuccessfully enter the cell target then Granule (GRA proteins are responsible for the formation of parasitophorusvacuole, which is protect the fusion with other intracellular compartments such as lysosomal vacuole. Consequently,this parasite is being able to survive and multiply at the cell target. The current study was aimed to clone andsequens cDNA encoding for ROP-2 of local isolated T. gondii tachizoite through DNA recombinant technique.Total ribonucleic acid (RNA was isolated from tachyzoites of local isolated T. gondii that were grown up in Balb/c mice. Messenger RNA was isolated from total RNA using PolyAtract mRNA Isolation System. Messenger RNA wasused as a template for synthesis cDNA using Riboclone cDNA Synthesis System AMV-RT. EcoRI adaptor fromRiboclone EcoRI Adaptor Ligation System was added to Complementary DNA and than ligated to pUC19. Recombinantplasmid was transformed into E. coli (XL1-Blue. The transformed E. coli XL-1 Blue were plated on LB agarcontaining X-Gal, IPTG and ampicillin. Recombinant clones (white colony were picked up and grown up in theLB medium at 37oC overnight. Expression of recombinant protein was analysed by immunoblotting in order toidentify cDNA recombinant wich is express ESA of T. gondii local isolate. Recombinant plasmid were isolatedusing alkalilysis method and were elektroforated in 1% agarose gel. The isolated DNA recombinant plasmid wascut using Eco RI and then sequenced through Big Dye Terminator Mix AB1 377A Sequencer using M13 Forward andM13 Reverse primers. The conclusion of this results showed that the recombinant clone was coding for excretoryand secretory

  18. Molecular cloning and sequence analysis of growth hormone cDNA of Neotropical freshwater fish Pacu (Piaractus mesopotamicus

    Directory of Open Access Journals (Sweden)

    Janeth Silva Pinheiro

    2008-01-01

    Full Text Available RT-PCR was used for amplifying Piaractus mesopotamicus growth hormone (GH cDNA obtained from mRNA extracted from pituitary cells. The amplified fragment was cloned and the complete cDNA sequence was determined. The cloned cDNA encompassed a sequence of 543 nucleotides that encoded a polypeptide of 178 amino acids corresponding to mature P. mesopotamicus GH. Comparison with other GH sequences showed a gap of 10 amino acids localized in the N terminus of the putative polypeptide of P. mesopotamicus. This same gap was also observed in other members of the family. Neighbor-joining tree analysis with GH sequences from fishes belonging to different taxonomic groups placed the P. mesopotamicus GH within the Otophysi group. To our knowledge, this is the first GH sequence of a Neotropical characiform fish deposited in GenBank.

  19. Nucleotide sequence of cloned cDNA for human sphingolipid activator protein 1 precursor

    International Nuclear Information System (INIS)

    Two cDNA clones encoding prepro-sphingolipid activator protein 1 (SAP-1) were isolated from a λ gt11 human hepatoma expression library using polyclonal antibodies. These had inserts of ≅ 2 kilobases (λ-S-1.2 and λ-S-1.3) and both were both homologous with a previously isolated clone (λ-S-1.1) for mature SAP-1. The authors report here the nucleotide sequence of the longer two EcoRI fragments of S-1.2 and S-1.3 that were not the same and the derived amino acid sequences of mature SAP-1 and its prepro form. The open reading frame encodes 19 amino acids, which are colinear with the amino-terminal sequence of mature SAP-1, and extends far beyond the predicted carboxyl terminus of mature SAP-1, indicating extensive carboxyl-terminal processing. The nucleotide sequence of cDNA encoding prepro-SAP-1 includes 1449 bases from the assigned initiation codon ATG at base-pair 472 to the stop codon TGA at base-pair 1921. The first 23 amino acids coded after the initiation ATG are characteristic of a signal peptide. The calculated molecular mass for a polypeptide encoded by 1449 bases is ≅ 53 kDa, in keeping with the reported value for pro-SAP-1. The data indicate that after removal of the signal peptide mature SAP-1 is generated by removing an additional 7 amino acids from the amino terminus and ≅ 373 amino acids from the carboxyl terminus. One potential glycosylation site was previously found in mature SAP-1. Three additional potential glycosylation sites are present in the processed carboxyl-terminal polypeptide, which they designate as P-2

  20. Molecular cloning of a human glycophorin B cDNA: nucleotide sequence and genomic relationship to glycophorin A

    International Nuclear Information System (INIS)

    The authors describe the isolation and nucleotide sequence of a human glycophorin B cDNA. The cDNA was identified by differential hybridization of synthetic oligonucleotide probes to a human erythroleukemic cell line (K562) cDNA library constructed in phage vector λgt10. The nucleotide sequence of the glycophorin B cDNA was compared with that of a previously cloned glycophorin A cDNA. The nucleotide sequences encoding the NH2-terminal leader peptide and first 26 amino acids of the two proteins are nearly identical. This homologous region is followed by areas specific to either glycophorin A or B and a number of small regions of homology, which in turn are followed by a very homologous region encoding the presumed membrane-spanning portion of the proteins. They used RNA blot hybridization with both cDNA and synthetic oligonucleotide probes to prove our previous hypothesis that glycophorin B is encoded by a single 0.5- to 0.6-kb mRNA and to show that glycophorins A and B are negatively and coordinately regulated by a tumor-promoting phorbol ester, phorbol 12-myristate 13-acetate. They established the intron/exon structure of the glycophorin A and B genes by oligonucleotide mapping; the results suggest a complex evolution of the glycophorin genes

  1. Cloning and sequence analysis of cDNA for the canine neurotensin/neuromedin N precursor

    Energy Technology Data Exchange (ETDEWEB)

    Dobner, P.R.; Barber, D.L.; Villa-Komaroff, L.; McKiernan, C.

    1987-05-01

    Cloned cDNAs encoding neurotensin were isolated from a cDNA library derived from primary cultures of canine enteric mucosa cells. Nucleotide sequence analysis using /sup 32/P-labeled nucleotides, has revealed the primary structure of a 170-amino acid precursor protein that encodes both neurotensin and the neurotensin-like peptide neuromedin N. The peptide-coding domains are located in tandem near the carboxyl terminus of the precursor and are bounded and separated by the paired, basic amino acid residues Lys-Arg. An additional coding domain, resembling neuromedin N, occurs immediately after an Arg-Arg basic amino acid pair located in the central region of the precursor. Additional amino acid homologies suggest that tandem duplications have contributed to the structure of the gene. RNA blot analysis, using the cloned cDNA probe, has revealed several mRNA species ranging in size from 500 to 980 nucleotides in the canine enteric mucosa. In contrast a single RNA species of 1500 nucleotides was detected in bovine hypothalamus poly-(A)/sup +/ RNA. The ability of the canine probe to cross-hybridize with bovine mRNA suggest that this probe can be used to isolate neurotensin/neuromedin N genes from other mammalian species.

  2. Cloning and sequence analysis of cDNA for the canine neurotensin/neuromedin N precursor

    International Nuclear Information System (INIS)

    Cloned cDNAs encoding neurotensin were isolated from a cDNA library derived from primary cultures of canine enteric mucosa cells. Nucleotide sequence analysis using 32P-labeled nucleotides, has revealed the primary structure of a 170-amino acid precursor protein that encodes both neurotensin and the neurotensin-like peptide neuromedin N. The peptide-coding domains are located in tandem near the carboxyl terminus of the precursor and are bounded and separated by the paired, basic amino acid residues Lys-Arg. An additional coding domain, resembling neuromedin N, occurs immediately after an Arg-Arg basic amino acid pair located in the central region of the precursor. Additional amino acid homologies suggest that tandem duplications have contributed to the structure of the gene. RNA blot analysis, using the cloned cDNA probe, has revealed several mRNA species ranging in size from 500 to 980 nucleotides in the canine enteric mucosa. In contrast a single RNA species of 1500 nucleotides was detected in bovine hypothalamus poly-(A)+ RNA. The ability of the canine probe to cross-hybridize with bovine mRNA suggest that this probe can be used to isolate neurotensin/neuromedin N genes from other mammalian species

  3. Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project

    Science.gov (United States)

    Fernández, Paula; Paniego, Norma; Lew, Sergio; Hopp, H Esteban; Heinz, Ruth A

    2003-01-01

    Background Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of small scale cDNA sequencing projects aimed to the specific identification of useful genes for breeding purposes. The objective of this work is to evaluate alternative strategies to high-throughput sequencing projects for the identification of novel genes differentially expressed in sunflower as a source of organ-specific genetic markers that can be functionally associated to important traits. Results Differential organ-specific ESTs were generated from leaf, stem, root and flower bud at two developmental stages (R1 and R4). The use of different sources of RNA as tester and driver cDNA for the construction of differential libraries was evaluated as a tool for detection of rare or low abundant transcripts. Organ-specificity ranged from 75 to 100% of non-redundant sequences in the different cDNA libraries. Sequence redundancy varied according to the target and driver cDNA used in each case. The R4 flower cDNA library was the less redundant library with 62% of unique sequences. Out of a total of 919 sequences that were edited and annotated, 318 were non-redundant sequences. Comparison against sequences in public databases showed that 60% of non-redundant sequences showed significant similarity to known sequences. The number of predicted novel genes varied among the different cDNA libraries, ranging from 56% in the R4 flower to 16 % in the R1 flower bud library. Comparison with sunflower ESTs on public databases showed that 197 of non-redundant sequences (60%) did not exhibit significant similarity to previously reported sunflower ESTs. This approach helped to successfully isolate a significant number of new reported sequences putatively related to responses

  4. Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project

    Directory of Open Access Journals (Sweden)

    Heinz Ruth A

    2003-09-01

    Full Text Available Abstract Background Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of small scale cDNA sequencing projects aimed to the specific identification of useful genes for breeding purposes. The objective of this work is to evaluate alternative strategies to high-throughput sequencing projects for the identification of novel genes differentially expressed in sunflower as a source of organ-specific genetic markers that can be functionally associated to important traits. Results Differential organ-specific ESTs were generated from leaf, stem, root and flower bud at two developmental stages (R1 and R4. The use of different sources of RNA as tester and driver cDNA for the construction of differential libraries was evaluated as a tool for detection of rare or low abundant transcripts. Organ-specificity ranged from 75 to 100% of non-redundant sequences in the different cDNA libraries. Sequence redundancy varied according to the target and driver cDNA used in each case. The R4 flower cDNA library was the less redundant library with 62% of unique sequences. Out of a total of 919 sequences that were edited and annotated, 318 were non-redundant sequences. Comparison against sequences in public databases showed that 60% of non-redundant sequences showed significant similarity to known sequences. The number of predicted novel genes varied among the different cDNA libraries, ranging from 56% in the R4 flower to 16 % in the R1 flower bud library. Comparison with sunflower ESTs on public databases showed that 197 of non-redundant sequences (60% did not exhibit significant similarity to previously reported sunflower ESTs. This approach helped to successfully isolate a significant number of new reported sequences

  5. Nucleotide sequence of a cDNA clone carrying the glycoprotein gene of infectious hematopoietic necrosis virus, a fish rhabdovirus.

    OpenAIRE

    Koener, J F; Passavant, C W; Kurath, G; Leong, J

    1987-01-01

    The nucleotide sequence of the mRNA encoding the glycoprotein of infectious hematopoietic necrosis virus was determined from a cDNA clone containing the entire coding region. The G-protein cDNA is 1,609 nucleotides long (excluding the polyadenylic acid) and encodes a protein of 508 amino acids. The predicted amino acid sequence was compared with that of the glycoprotein of the Indiana and New Jersey serotypes of vesicular stomatitis virus and with the glycoprotein of rabies virus, using a com...

  6. Isolation and sequencing of cDNA clones encoding alpha and beta subunits of Drosophila melanogaster casein kinase II.

    OpenAIRE

    Saxena, A.; Padmanabha, R; Glover, C V

    1987-01-01

    Cloned cDNAs encoding both subunits of Drosophila melanogaster casein kinase II have been isolated by immunological screening of lambda gt11 expression libraries, and the complete amino acid sequence of both polypeptides has been deduced by DNA sequencing. The alpha cDNA contained an open reading frame of 336 amino acid residues, yielding a predicted molecular weight for the alpha polypeptide of 39,833. The alpha sequence contained the expected semi-invariant residues present in the catalytic...

  7. Analysis of expressed sequence tags from a fetal human heart cDNA library

    Energy Technology Data Exchange (ETDEWEB)

    Hwang, D.M.; Fung, Y.W.; Wang, R.X.; Laurenssen, C.M. [Univ. of Toronto, Ontario (Canada)] [and others

    1995-11-20

    Single-pass sequencing of randomly selected cDNA clones to generate expressed sequence tags (ESTs) has been widely used to identify novel genes and to study gene expression in a variety of tissues. We have generated 2244 ESTs from a human fetal heart library (Gen-Bank Accession Nos. R30692-30774 and R56965-58824), which we present in this report. Of these, 51.7% showed no homology to known genes or were similar only to other ESTs, while 48.4% demonstrated homology to known transcripts. A total of 764 ESTs corresponding to known genes were used to study gene expression patterns in the fetal heart and to analyze differences in these patterns from those observed in the adult heart. These analyses demonstrate the utility of ESTs and sequence-tagged clones in comparative studies of gene expression in the cardiovascular system, and they reveal that differential gene expression underlies the structural and functional characteristics of the developing heart. 48 refs., 1 fig., 3 tabs.

  8. Epitopes of human testis-specific lactate dehydrogenase deduced from a cDNA sequence

    International Nuclear Information System (INIS)

    The sequence and structure of human testis-specific L-lactate dehydrogenase [LDHC4, LDHX; (L)-lactate:NAD+ oxidoreductase, EC 1.1.1.27] has been derived from analysis of a complementary DNA (cDNA) clone comprising the complete protein coding region of the enzyme. From the deduced amino acid sequence, human LDHC4 is as different from rodent LDHC4 (73% homology) as it is from human LDHA4 (76% homology) and porcine LDHB4 (68% homology). Subunit homologies are consistent with the conclusion that the LDHC gene arose by at least two independent duplication events. Furthermore, the lower degree of homology between mouse and human LDHC4 and the appearance of this isozyme late in evolution suggests a higher rate of mutation in the mammalian LDHC genes than in the LDHA and -B genes. Comparison of exposed amino acid residues of discrete anti-genic determinants of mouse and human LDHC4 reveals significant differences. Knowledge of the human LDHC4 sequence will help design human-specific peptides useful in the development of a contraceptive vaccine

  9. Preparation of cDNA libraries for high-throughput RNA sequencing analysis of RNA 5′ ends

    Science.gov (United States)

    Vvedenskaya, Irina O.; Goldman, Seth R.; Nickels, Bryce E.

    2015-01-01

    Summary We provide a detailed protocol for preparing cDNA libraries suitable for high throughput sequencing that are derived specifically from the 5′ ends of RNA (5′ specific RNA-seq). The protocol describes how cDNA libraries for 5′ specific RNA-seq can be tailored to analyze specific classes of RNAs based upon the phosphorylation status of the 5′ end. Thus, the analysis of cDNA libraries generated by these methods provides information regarding both the sequence and phosphorylation status of the 5′ ends of RNAs. 5′ specific RNA-seq can be used to analyze transcription initiation and post-transcriptional processing of RNAs with single base pair resolution on a genome-wide level. PMID:25665566

  10. Update History of This Database - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available [ Credits ] BLAST Search Image Search Home About Archive Update History Contact us ...Budding yeast cDNA sequencing project Update History of This Database Date Update contents 2010/03/29 Buddin...tio About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Update History

  11. Molecular cloning and nucleotide sequence of cDNA for human glucose-6-phosphate dehydrogenase variant A(-)

    International Nuclear Information System (INIS)

    Glucose-6-phosphate dehydrogenase A(-) is a common variant in Blacks that causes sensitivity to drug- and infection-induced hemolytic anemia. A cDNA library was constructed from Epstein-Barr virus-transformed lymphoblastoid cells from a male who was G6PD A(-). One of four cDNA clones isolated contained a sequence not found in the other clones nor in the published cDNA sequence. Consisting of 138 bases and coding 46 amino acids, this segment of cDNA apparently is derived from the alternative splicing involving the 3' end of intron 7. Comparison of the remaining sequences of these clones with the published sequence revealed three nucleotide substitutions: C33 → G, G202 → A, and A376 → G. Each change produces a new restriction site. Genomic DNA from five G6PD A(-) individuals was amplified by the polymerase chain reaction. The findings of the same mutation in G6PD A(-) as is found in G6PD A(+) strongly suggests that the G6PD A(-) mutation arose in an individual with G6PD A(+), adding another mutation that causes the in vivo instability of this enzyme protein

  12. Molecular cloning and sequence analysis of the cDNA for thyroid-stimulating hormone beta subunit of Muscovy duck.

    Science.gov (United States)

    Hsieh, Y L; Chatterjee, A; Lee, G; Yuh-Lin Yu, J

    2000-12-01

    The cDNA-encoding thyroid-stimulating hormone beta subunit (TSHbeta) of the Muscovy duck (Cairina moschata) was cloned and sequenced to investigate the phylogenic diversity and evolution of TSH molecules. Oligonucleotide primers were designed and used for reverse transcription and polymerase chain reaction amplification of a TSHbeta cDNA fragment from the total cellular RNA of pituitary glands. The remaining sequence was completed by rapid amplification of cDNA ends. The nucleotide sequence of Muscovy duck TSHbeta cDNA includes 66 bp of 5'-untranslated region (UTR), 402 bp of coding region, and 82 bp of 3'-UTR, followed by an 18-bp poly(A)(+) tract. The total number of amino acids deduced from the cDNA of duck TSHbeta is 134, with a signal peptide of 19 amino acids and a mature protein of 115 amino acids. The homologies of the amino acid sequence of Muscovy duck TSHbeta compared with those of chicken, quail, mammals, amphibian (frog), and teleosts are 97, 98, 68-69, 56, and 42-47%, respectively. To test for tissue specificity of the duck TSHbeta cDNA, total cellular RNA samples from brain, liver, pituitary gland, testis, and thyroid gland were analyzed by Northern blotting. A band, approximately 700 bases, was found in the pituitary gland alone. The pituitary tissue fragments were treated for 24 h in serum-free medium containing thyrotropin-releasing hormone (TRH), triiodothyronine (T(3)), or thyroxine (T(4)). TRH increased and T(3) and T(4) decreased TSHbeta mRNA levels. This study demonstrated that the amino acid sequence of TSHbeta subunits is highly conserved among birds, exhibiting a high degree of inter-order homology, and that hypothalamic TRH upregulates the TSHbeta mRNA expression in duck. PMID:11121298

  13. Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project

    OpenAIRE

    Heinz Ruth A; Lew Sergio; Hopp H; Paniego Norma; Fernández Paula

    2003-01-01

    Abstract Background Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of small scale cDNA sequencing projects aimed to the specific identification of useful genes for breeding purposes. The objective of this work is to evaluate alternative str...

  14. Identification of glucocorticoid-induced genes in rat hepatoma cells by isolation of cloned cDNA sequences.

    OpenAIRE

    Feinberg, R. F.; Sun, L. H.; Ordahl, C P; Frankel, F R

    1983-01-01

    The expression of specific cellular genes in M1.19 rat hepatoma cells involves glucocorticoid regulation by mechanisms that are not well understood. To approach this problem we cloned cDNA prepared from dexamethasone-induced poly(A)-RNA and used a comparative colony hybridization method to identify recombinant clones containing hormone-regulated sequences. Two such cDNA clones, p1394 and p255, hybridize to a homogeneous RNA species of 900 nucleotides that is present in high abundance in 24-hr...

  15. Identification and isolation of full-length cDNA sequences by sequencing and analysis of expressed sequence tags from guarana (Paullinia cupana).

    Science.gov (United States)

    Figueirêdo, L C; Faria-Campos, A C; Astolfi-Filho, S; Azevedo, J L

    2011-01-01

    The current intense production of biological data, generated by sequencing techniques, has created an ever-growing volume of unanalyzed data. We reevaluated data produced by the guarana (Paullinia cupana) transcriptome sequencing project to identify cDNA clones with complete coding sequences (full-length clones) and complete sequences of genes of biotechnological interest, contributing to the knowledge of biological characteristics of this organism. We analyzed 15,490 ESTs of guarana in search of clones with complete coding regions. A total of 12,402 sequences were analyzed using BLAST, and 4697 full-length clones were identified, responsible for the production of 2297 different proteins. Eighty-four clones were identified as full-length for N-methyltransferase and 18 were sequenced in both directions to obtain the complete genome sequence, and confirm the search made in silico for full-length clones. Phylogenetic analyses were made with the complete genome sequences of three clones, which showed only 0.017% dissimilarity; these are phylogenetically close to the caffeine synthase of Theobroma cacao. The search for full-length clones allowed the identification of numerous clones that had the complete coding region, demonstrating this to be an efficient and useful tool in the process of biological data mining. The sequencing of the complete coding region of identified full-length clones corroborated the data from the in silico search, strengthening its efficiency and utility. PMID:21732283

  16. Generation and Analysis of Expressed Sequence Tags (ESTs) from Muscle Full-Length cDNA Library of Wujin Pig

    Institute of Scientific and Technical Information of China (English)

    ZHAO Su-mei; LIU Yong-gang; PAN Hong-bing; ZHANG Xi; GE Chang-rong; JIA Jun-jing; GAO Shi-zheng

    2014-01-01

    Porcine skeletal muscle genes play a major role in determining muscle growth and meat quality. Construction of a full-length cDNA library is an effective way to understand the expression of functional genes in muscle tissues. In addition, novel genes for further research could be identiifed in the library. In this study, we constructed a full-length cDNA library from porcine muscle tissue. The estimated average size of the cDNA inserts was 1076 bp, and the cDNA fullness ratio was 86.2%. A total of 1058 unique sequences with 342 contigs (32.3%) and 716 singleton (67.7%) expressed sequence tags (EST) were obtained by clustering and assembling. Meanwhile, 826 (78.1%) ESTs were categorized as known genes, and 232 (21.9%) ESTs were categorized as unknown genes. 65 novel porcine genes that exhibit no identity in the TIGR gene index ofSus scrofa and 124 full-length sequences with unknown functions were deposited in the dbEST division of GenBank (accession numbers: EU650784-EU650788, GE843306, GH228978-GH229100). The abundantly expressed genes in porcine muscle tissue were related to muscle ifber development, energy metabolism and protein synthesis. Gene ontology analysis showed that sequences expressed in porcine muscle tissue contained a high percentage of binding activity, catalytic activity, structural molecule activity and motor activity, which involved mainly in metabolic, cellular and developmental process, distributed mainly in intracellular region. The sequence data generated in this study would provide valuable information for identifying porcine genes expressed in muscle tissue and help to advance the study on the structure and function of genes in pigs.

  17. Sequence of MET protooncogene cDNA has features characteristic of the tyrosine kinase family of growth-factor receptors

    International Nuclear Information System (INIS)

    The authors isolated overlapping cDNA clones corresponding to the major MET protooncogene transcript. The cDNA nucleotide sequence contained an open reading frame of 1408 amino acids with features characteristic of the tyrosine kinase family of growth factor receptors. These features include a putative 24-amino acid signal peptide and a candidate, hybrophobic, membrane-spanning segment of 23 amino acids, which defines an extracellular domain of 926 amino acids that could serve as a ligand-binding domain. A putative intracellular domain 435 amino acids long shows high homology with the SRC family of tyrosine kinases and within the kinase domain is most homologous with the human insulin receptor (44%) and v-abl (41%). Despite these similarities, however, they found no apparent sequence homology to other growth factor receptors in the putative ligand-binding domain. They conclude from the results that the MET protooncogene is a cell-surface receptor for an as-yet-unknown ligand

  18. In-depth cDNA Library Sequencing Provides Quantitative Gene Expression Profiling in Cancer Biomarker Discovery

    Institute of Scientific and Technical Information of China (English)

    Wanling Yang; Dingge Ying; Yu-Lung Lau

    2009-01-01

    procedures may allow detection of many expres-sion features for less abundant gene variants. With the reduction of sequencing cost and the emerging of new generation sequencing technology, in-depth sequencing of cDNA pools or libraries may represent a better and powerful tool in gene expression profiling and cancer biomarker detection. We also propose using sequence-specific subtraction to remove hundreds of the most abundant housekeeping genes to in-crease sequencing depth without affecting relative expression ratio of other genes, as transcripts from as few as 300 most abundantly expressed genes constitute about 20% of the total transcriptome. In-depth sequencing also represents a unique ad-vantage of detecting unknown forms of transcripts, such as alternative splicing variants, fusion genes, and regulatory RNAs, as well as detecting mutations and polymorphisms that may play important roles in disease pathogenesis.

  19. Geoseq: a tool for dissecting deep-sequencing datasets

    OpenAIRE

    Homann Robert; George Ajish; Levovitz Chaya; Shah Hardik; Cancio Anthony; Gurtowski James; Sachidanandam Ravi

    2010-01-01

    Abstract Background Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (ddbj). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Results Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments...

  20. Random sequencing of an induced Taxus cell cDNA library for identification of clones involved in Taxol biosynthesis

    OpenAIRE

    Jennewein, Stefan; Wildung, Mark R.; Chau, MyDoanh; Walker, Kevin; Croteau, Rodney

    2004-01-01

    Biosynthesis of the anticancer drug Taxol involves 19 enzymatic steps from the universal diterpenoid progenitor geranylgeranyl diphosphate derived by the plastidial methylerythritol phosphate pathway for isoprenoid precursor supply. To gain further insight about Taxol biosynthesis relevant to the improved production of this drug and to draw inferences about the organization, regulation, and origins of this complex natural product pathway, random sequencing of a cDNA library derived from Taxus...

  1. Amino acid sequence of the serine-repeat antigen (SERA) of Plasmodium falciparum determined from cloned cDNA.

    Science.gov (United States)

    Bzik, D J; Li, W B; Horii, T; Inselburg, J

    1988-09-01

    We report the isolation of cDNA clones for a Plasmodium falciparum gene that encodes the complete amino acid sequence of a previously identified exported blood stage antigen. The Mr of this antigen protein had been determined by sodium dodecylsulphate-polyacrylamide gel electrophoresis analysis, by different workers, to be 113,000, 126,000, and 140,000. We show, by cDNA nucleotide sequence analysis, that this antigen gene encodes a 989 amino acid protein (111 kDa) that contains a potential signal peptide, but not a membrane anchor domain. In the FCR3 strain the serine content of the protein was 11%, of which 57% of the serine residues were localized within a 201 amino acid sequence that included 35 consecutive serine residues. The protein also contained three possible N-linked glycosylation sites and numerous possible O-linked glycosylation sites. The mRNA was abundant during late trophozoite-schizont parasite stages. We propose to identity this antigen, which had been called p126, by the acronym SERA, serine-repeat antigen, based on its complete structure. The usefulness of the cloned cDNA as a source of a possible malaria vaccine is considered in view of the previously demonstrated ability of the antigen to induce parasite-inhibitory antibodies and a protective immune response in Saimiri monkeys. PMID:2847041

  2. Cloning and cDNA sequence of the regulator subunit of cAMP-dependent protein kinase from Dictyostelium discoideum

    International Nuclear Information System (INIS)

    cDNA clones encoding the regulatory subunit of the cAMP-dependent protein kinase (ATP:protein phosphotransferase, EC 2.7.1.37) from Dictyostelium discoideum were isolated by immunoscreening of a cDNA library constructed in the expression vector λgt11, using autoradiography. High-affinity cAMP binding activity was detected in extracts from bacteria lysogenized with these clones. Nucleotide sequence analysis of three overlapping clones allowed the determination of a 1195-base-pair cDNA sequence coding for the entire regulatory subunit and containing nontranslated 5' and 3' sequences. The open reading frame codes for a protein of 327 amino acids, with molecular weight 36,794. The regulatory subunit from Dictyostelium shares a high degree of homology with its mammalian counterparts, but is lacking the NH2-terminal domain required for the association of regulatory subunits into dimers in other eukaryotes. On the basis of the comparison of the regulatory subunits from Dictyostelium, yeast, and bovine tissues, a model for the evolution of these proteins is proposed

  3. Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.

    Science.gov (United States)

    Hargreaves, Adam D; Mulley, John F

    2015-01-01

    Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species. PMID:26623194

  4. Primary structure of cucumber (Cucumis sativus) ascorbate oxidase deduced from cDNA sequence: homology with blue copper proteins and tissue-specific expression.

    OpenAIRE

    Ohkawa, J; Okada, N; Shinmyo, A; Takano, M.

    1989-01-01

    cDNA clones for ascorbate oxidase were isolated from a cDNA library made from cucumber (Cucumis sativus) fruit mRNA. The library was screened with synthetic oligonucleotides that encode the NH2-terminal sequence of this enzyme. Nucleotide sequence analysis of the cloned cDNA inserts revealed a 1761-base-pair open reading frame that encoded an NH2-terminal signal peptide of 33 amino acids and a mature enzyme of 554 amino acids (Mr, 62,258). The amino acid sequence deduced from nucleotide seque...

  5. Molecular cloning and sequence analysis of cDNA encoding molt-inhibiting hormone from Portunus trituberculatus

    Directory of Open Access Journals (Sweden)

    ZHU Dong-Fa

    2008-12-01

    Full Text Available Molt-inhibiting hormone (MIH, which belongs to the crustacean hyperglycemic hormone (CHH neuropeptide family, inhibits the synthesis of ecdysteroids by Y-organs. The CHH family is divided into two major groups, designated I and II. Group Ⅰ is composed of all CHH peptides, and Group Ⅱ is composed of most MIHs, gonad-inhibiting hormone (GIH and mandibular organ-inhibiting hormone (MOIH. The Group Ⅰ peptides consist of a signal sequence, a CHH precursor related peptide (CPRP and a mature hormone. The Group Ⅱ peptides have only a signal sequence and a mature hormone. The full-length molt-inhibiting hormone cDNA of Portunus trituberculatus (Pot-MIH was cloned by the reverse transcription-polymerase chain reaction (RT-PCR and the rapid amplification of cDNA ends (RACE. The degenerate nucleotide primers was designed based on the amino acid sequence of other crustacean MIHs. The full-length sequence of Pot-MIH cDNA (GenBank accession number:EU284117 is 1572 bp in size and consists of a 210 bp 5′-untranslated region (5′-UTR, a 342 bp open reading frame and a 3′- untranslated region (3′-UTR. The deduced polypeptide consisted of a 78-amino acid mature MIH peptide and a 35-amino acid signal ptptide. The mature Pot-MIH peptide shared twelve conserved residues (Cys7, Gly12, Arg14, Cys24, Asp26, Cys27, Asn29, Arg32, Cys40, Cys44, Phe50, and Cys53 with other crustacean MIHs. The mature Pot-MIH peptide had the Gly12 residue and the precursor lacked a CPRP sequence, both of which are characteristic of all Group Ⅱ peptides. Alignment of the amino acid sequence deduced from the full-length Pot-MIH cDNA with MIH amino acid sequences of other crustaceans revealed that they had very high identity. The identities between the putative Pot-MIH amino acid sequence with that of MIH of P.pelagicus, Callinectes sapidus, Charybdis feriatus, Cancer pagurus, C.magister, Carcinus maenas and Gecarcinus lateralis were 97%, 95%, 90%, 82%, 81%, 77% and 60

  6. Generation of a large scale repertoire of Expressed Sequence Tags (ESTs from normalised rainbow trout cDNA libraries

    Directory of Open Access Journals (Sweden)

    Guiguen Yann

    2006-08-01

    Full Text Available Abstract Background Within the framework of a genomics project on livestock species (AGENAE, we initiated a high-throughput DNA sequencing program of Expressed Sequence Tags (ESTs in rainbow trout, Oncorhynchus mykiss. Results We constructed three cDNA libraries including one highly complex pooled-tissue library. These libraries were normalized and subtracted to reduce clone redundancy. ESTs sequences were produced, and 96 472 ESTs corresponding to high quality sequence reads were released on the international database, currently representing 42.5% of the overall sequence knowledge in this species. All these EST sequences and other publicly available ESTs in rainbow trout have been included on a publicly available Website (SIGENAE and have been clustered into a total of 52 930 clusters of putative transcripts groups, including 24 616 singletons. 57.1% of these 52 930 clusters are represented by at least one Agenae EST and 14 343 clusters (27.1% are only composed by Agenae ESTs. Sequence analysis also reveals that normalization and especially subtraction were effective in decreasing redundancy, and that the pooled-tissue library was representative of the initial tissue complexity. Conclusion Due to present work on the construction of rainbow trout normalized cDNA libraries and their extensive sequencing, along with other large scale sequencing programs, rainbow trout is now one of the major fish models in term of EST sequences available in a public database, just after Zebrafish, Danio rerio. This information is now used for the selection of a non redundant set of clones for producing DNA micro-arrays in order to examine global gene expression.

  7. Human thrombopoietin: gene structure, cDNA sequence, expression, and chromosomal localization.

    OpenAIRE

    Foster, D C; Sprecher, C A; Grant, F J; Kramer, J M; Kuijper, J L; Holly, R D; Whitmore, T E; Heipel, M D; Bell, L A; Ching, A F

    1994-01-01

    Thrombopoietin (TPO), a lineage-specific cytokine affecting the proliferation and maturation of megakaryocytes from committed progenitor cells, is believed to be the major physiological regulator of circulating platelet levels. Recently we have isolated a cDNA encoding a ligand for the murine c-mpl protooncogene and shown it to be TPO. By employing a murine cDNA probe, we have isolated a gene encoding human TPO from a human genomic library. The TPO locus spans over 6 kb and has a structure si...

  8. Human uroporphyrinogen III synthase: Molecular cloning, nucleotide sequence, and expression of a full-length cDNA

    International Nuclear Information System (INIS)

    Uroporphyrinogen III synthase, the fourth enzyme in the heme biosynthetic pathway, is responsible for conversion of the linear tetrapyrrole, hydroxymethylbilane, to the cyclic tetrapyrrole, uroporphyrinogen III. The deficient activity of URO-synthase is the enzymatic defect in the autosomal recessive disorder congenital erythropoietic porphyria. To facilitate the isolation of a full-length cDNA for human URO-synthase, the human erythrocyte enzyme was purified to homogeneity and 81 nonoverlapping amino acids were determined by microsequencing the N terminus and four tryptic peptides. Two synthetic oligonucleotide mixtures were used to screen 1.2 x 106 recombinants from a human adult liver cDNA library. Eight clones were positive with both oligonucleotide mixtures. Of these, dideoxy sequencing of the 1.3 kilobase insert from clone pUROS-2 revealed 5' and 3' untranslated sequences of 196 and 284 base pairs, respectively, and an open reading frame of 798 base pairs encoding a protein of 265 amino acids with a predicted molecular mass of 28,607 Da. The isolation and expression of this full-length cDNA for human URO-synthase should facilitate studies of the structure, organization, and chromosomal localization of this heme biosynthetic gene as well as the characterization of the molecular lesions causing congenital erythropoietic porphyria

  9. Human uroporphyrinogen III synthase: Molecular cloning, nucleotide sequence, and expression of a full-length cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Tsai, Shihfeng; Bishop, D.F.; Desnick, R.J. (Mount Sinai School of Medicine, New York, NY (USA))

    1988-10-01

    Uroporphyrinogen III synthase, the fourth enzyme in the heme biosynthetic pathway, is responsible for conversion of the linear tetrapyrrole, hydroxymethylbilane, to the cyclic tetrapyrrole, uroporphyrinogen III. The deficient activity of URO-synthase is the enzymatic defect in the autosomal recessive disorder congenital erythropoietic porphyria. To facilitate the isolation of a full-length cDNA for human URO-synthase, the human erythrocyte enzyme was purified to homogeneity and 81 nonoverlapping amino acids were determined by microsequencing the N terminus and four tryptic peptides. Two synthetic oligonucleotide mixtures were used to screen 1.2 {times} 10{sup 6} recombinants from a human adult liver cDNA library. Eight clones were positive with both oligonucleotide mixtures. Of these, dideoxy sequencing of the 1.3 kilobase insert from clone pUROS-2 revealed 5' and 3' untranslated sequences of 196 and 284 base pairs, respectively, and an open reading frame of 798 base pairs encoding a protein of 265 amino acids with a predicted molecular mass of 28,607 Da. The isolation and expression of this full-length cDNA for human URO-synthase should facilitate studies of the structure, organization, and chromosomal localization of this heme biosynthetic gene as well as the characterization of the molecular lesions causing congenital erythropoietic porphyria.

  10. Molecular cloning, sequence analysis and expression in Escherichia coli of Camelus dromedarius glucose-6-phosphate dehydrogenase cDNA.

    Science.gov (United States)

    Saeed, Hesham Mahmoud; Alanazi, Mohammad Saud; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Khan, Zahid

    2012-06-01

    This study determined the full length sequence of glucose-6-phosphate dehydrogenase cDNA (G6PD) from the Arabian camel Camelus dromedarius using reverse transcription polymerase chain reaction. The C. dromedarius G6PD has an open reading frame of 1545 bp, and the cDNA encodes a protein of 515 amino acid residues with a molecular weight of 59.0 KDa. The amino acid sequence showed the highest identity with Equus caballus (92%) and Homo sapiens (92%). The G6PD cDNA was cloned and expressed into Escherichia coli as a fusion protein and was purified in a single chromatographic step using nickel affinity gel column. The purity and the molecular weight of the enzyme were checked on SDS-PAGE and the purified enzyme showed a single band on the gel with a molecular weight of 63.0 KDa. The specific activity of G6PD was determined to be 289.6 EU/mg protein with a fold purification of 95.45 and yield of 56.8%. PMID:22538316

  11. Detection of reverse transcriptase termination sites using cDNA ligation and massive parallel sequencing

    DEFF Research Database (Denmark)

    Kielpinski, Lukasz J; Boyd, Mette; Sandelin, Albin;

    2013-01-01

    Detection of reverse transcriptase termination sites is important in many different applications, such as structural probing of RNAs, rapid amplification of cDNA 5' ends (5' RACE), cap analysis of gene expression, and detection of RNA modifications and protein-RNA cross-links. The throughput of...

  12. cDNA sequence analysis of a 29-kDa cysteine-rich surface antigen of pathogenic Entamoeba histolytica

    International Nuclear Information System (INIS)

    A λgt11 cDNA library was constructed from poly(U)-Spharose-selected Entamoeba histolytica trophozoite RNA in order to clone and identify surface antigens. The library was screened with rabbit polyclonal anti-E. histolytica serum. A 700-base-pair cDNA insert was isolated and the nucleotide sequence was determined. The deduced amino acid sequence of the cDNA revealed a cysteine-rich protein. DNA hybridizations showed that the gene was specific to E. histolytica since the cDNA probe reacted with DNA from four axenic strains of E. histolytica but did not react with DNA from Entamoeba invadens, Acanthamoeba castellanii, or Trichomonas vaginalis. The insert was subcloned into the expression vector pGEX-1 and the protein was expressed as a fusion with the C terminus of glutathione S-transferase. Purified fusion protein was used to generate 22 monoclonal antibodies (mAbs) and a mouse polyclonal antiserum specific for the E. histolytica portion of the fusion protein. A 29-kDa protein was identified as a surface antigen when mAbs were used to immunoprecipitate the antigen from metabolically 35S-labeled live trophozoites. The surface location of the antigen was corroborated by mAb immunoprecipitation of a 29-kDa protein from surface-125I-labeled whole trophozoites as well as by the reaction of mAbs with live trophozoites in an indirect immunofluorescence assay performed at 4 degree C. Immunoblotting with mAbs demonstrated that the antigen was present on four axenic isolates tested. mAbs recognized epitopes on the 29-kDa native antigen on some but not all clinical isolates tested

  13. cDNA sequence analysis of a 29-kDa cysteine-rich surface antigen of pathogenic Entamoeba histolytica

    Energy Technology Data Exchange (ETDEWEB)

    Torian, B.E.; Stroeher, V.L.; Stamm, W.E. (Univ. of Washington, Seattle (USA)); Flores, B.M. (Louisiana State Univ. Medical Center, New Orleans (USA)); Hagen, F.S. (Zymogenetics Incorporated, Seattle, WA (USA))

    1990-08-01

    A {lambda}gt11 cDNA library was constructed from poly(U)-Spharose-selected Entamoeba histolytica trophozoite RNA in order to clone and identify surface antigens. The library was screened with rabbit polyclonal anti-E. histolytica serum. A 700-base-pair cDNA insert was isolated and the nucleotide sequence was determined. The deduced amino acid sequence of the cDNA revealed a cysteine-rich protein. DNA hybridizations showed that the gene was specific to E. histolytica since the cDNA probe reacted with DNA from four axenic strains of E. histolytica but did not react with DNA from Entamoeba invadens, Acanthamoeba castellanii, or Trichomonas vaginalis. The insert was subcloned into the expression vector pGEX-1 and the protein was expressed as a fusion with the C terminus of glutathione S-transferase. Purified fusion protein was used to generate 22 monoclonal antibodies (mAbs) and a mouse polyclonal antiserum specific for the E. histolytica portion of the fusion protein. A 29-kDa protein was identified as a surface antigen when mAbs were used to immunoprecipitate the antigen from metabolically {sup 35}S-labeled live trophozoites. The surface location of the antigen was corroborated by mAb immunoprecipitation of a 29-kDa protein from surface-{sup 125}I-labeled whole trophozoites as well as by the reaction of mAbs with live trophozoites in an indirect immunofluorescence assay performed at 4{degree}C. Immunoblotting with mAbs demonstrated that the antigen was present on four axenic isolates tested. mAbs recognized epitopes on the 29-kDa native antigen on some but not all clinical isolates tested.

  14. Abiotic stresses induce a thaumatin-like protein in maize; cDNA isolation and sequence analysis

    International Nuclear Information System (INIS)

    The cDNA clone (CHEM4) coding for a stress-induced protein has been isolated from a mercuric chloride-treated maize (Zea mays L.) cDNA library by differential screening. Nucleotide sequence analysis of the 694-bp-cDNA insert predicts an open reading frame of 522 nucleotides encoding a mature protein of 149 amino acids and a signal peptide of 25 amino acid residues. The sequence of CHEM4-encoded protein shows 50 to 60% sequence identity with thaumatin, bifunctional α-amylase/trypsin inhibitor from maize and thaumatin-like (TL) proteins from dicotyledonous plants. Higher sequence identities are found to wheat PWIR2 and barley Hv-1b TL-proteins. CHEM4-mRNA is not only induced by mercuric chloride treatment but also by other stresses such as high salt concentration, high temperatures, UV light, wounding and polluted rainwater. CHEM4 genes appear to be highly inducible by UV light and mercuric chloride, moderately by wounding, salt stress, polluted rainwater and heat shock and very weakly by cooling. (author)

  15. ANDES: Statistical tools for the ANalyses of DEep Sequencing

    Directory of Open Access Journals (Sweden)

    Denison Mark R

    2010-07-01

    Full Text Available Abstract Background The advancements in DNA sequencing technologies have allowed researchers to progress from the analyses of a single organism towards the deep sequencing of a sample of organisms. With sufficient sequencing depth, it is now possible to detect subtle variations between members of the same species, or between mixed species with shared biomarkers, such as the 16S rRNA gene. However, traditional sequencing analyses of samples from largely homogeneous populations are often still based on multiple sequence alignments (MSA, where each sequence is placed along a separate row and similarities between aligned bases can be followed down each column. While this visual format is intuitive for a small set of aligned sequences, the representation quickly becomes cumbersome as sequencing depths cover loci hundreds or thousands of reads deep. Findings We have developed ANDES, a software library and a suite of applications, written in Perl and R, for the statistical ANalyses of DEep Sequencing. The fundamental data structure underlying ANDES is the position profile, which contains the nucleotide distributions for each genomic position resultant from a multiple sequence alignment (MSA. Tools include the root mean square deviation (RMSD plot, which allows for the visual comparison of multiple samples on a position-by-position basis, and the computation of base conversion frequencies (transition/transversion rates, variation (Shannon entropy, inter-sample clustering and visualization (dendrogram and multidimensional scaling (MDS plot, threshold-driven consensus sequence generation and polymorphism detection, and the estimation of empirically determined sequencing quality values. Conclusions As new sequencing technologies evolve, deep sequencing will become increasingly cost-efficient and the inter and intra-sample comparisons of largely homogeneous sequences will become more common. We have provided a software package and demonstrated its

  16. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

    Directory of Open Access Journals (Sweden)

    Bendahmane Abdelhafid

    2011-05-01

    Full Text Available Abstract Background Melon (Cucumis melo, an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs and 3,073 single nucleotide polymorphisms (SNPs in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but

  17. Budding yeast cDNA sequencing project: Y074_N01_F.ab1 [Budding yeast cDNA sequencing project

    Lifescience Database Archive (English)

    Full Text Available Y074_N01_F.ab1 - - - Show Y074_N01_F.ab1 Seqid Y074_N01_F.ab1 Link to SGD - Link to dbEST - Link ... to UC SC ... Genome Browser - Sequenc e >Y074_N01_F.ab1 301 0 ... 301 ABI GAACCCC AACCCCC AAAAAAGGC ATC AGGNATTCC GAAAC AC ACCCCC AA CC AC C AAAC AC ACC AAC ACC AAC AAC AAC AC AC AC AAC AAACCC ACCCCC ... AAAAAAAAC ... AC ACCC AC AC AACCCC AAAC AC AAC AAAC AC AAAAAAAC AA AAAAACC AC AA ...

  18. Primary structure of bovine pituitary secretory protein I (chromogranin A) deduced from the cDNA sequence

    International Nuclear Information System (INIS)

    Secretory protein I (SP-I), also referred to as chromogranin A, is an acidic glycoprotein that has been found in every tissue of endocrine and neuroendocrine origin examined but never in exocrine or epithelial cells. Its co-storage and co-secretion with peptide hormones and neurotransmitters suggest that it has an important endocrine or secretory function. The authors have isolated cDNA clones from a bovine pituitary λgt11 expression library using an antiserum to parathyroid SP-I. The largest clone (SP4B) hybridized to a transcript of 2.1 kilobases in RNA from parathyroid, pituitary, and adrenal medulla. Immunoblots of bacterial lysates derived from SP4B lysognes demonstrated specific antibody binding to an SP4B/β-galactosidase fusion protein (160 kDa) with a cDNA-derived component of 46 kDa. Radioimmunoassay of the bacterial lystates with SP-I antiserum yielded parallel displacement curves of 125I-labeled SP-I by the SP4B lysate and authentic SP-I. SP4B contains a cDNA of 1614 nucleotides that encodes a 449-amino acid protein (calculated mass, 50 kDa). The nucleotide sequences of the pituitary SP-I cDNA and adrenal medullary SP-I cDNAs are nearly identical. Analysis of genomic DNA suggests that pituitary, adrenal, and parathyroid SP-I are products of the same gene

  19. Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

    Directory of Open Access Journals (Sweden)

    Li Xiwen

    2010-03-01

    Full Text Available Abstract Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE and farnesyl-diphosphate synthase (FPS were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13 potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum.

  20. Barcoded cDNA library preparation for small RNA profiling by next-generation sequencing

    OpenAIRE

    Hafner, Markus; Renwick, Neil; Farazi, Thalia A.; Mihailovi, Aleksandra; Pena, John T.G.; Tuschl, Thomas

    2012-01-01

    The characterization of post-transcriptional gene regulation by small regulatory (20–30 nt) RNAs, particularly miRNAs and piRNAs, has become a major focus of research in recent years. A prerequisite for characterizing small RNAs is their identification and quantification across different developmental stages, and in normal and disease tissues, as well as model cell lines. Here we present a step-by-step protocol for generating barcoded small RNA cDNA libraries compatible with Illumina HiSeq se...

  1. Geoseq: a tool for dissecting deep-sequencing datasets

    Directory of Open Access Journals (Sweden)

    Homann Robert

    2010-10-01

    Full Text Available Abstract Background Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO, Sequence Read Archive (SRA hosted by the NCBI, or the DNA Data Bank of Japan (ddbj. Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Results Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Conclusions Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to, a identify differential isoform expression in mRNA-seq datasets, b identify miRNAs (microRNAs in libraries, and identify mature and star sequences in miRNAS and c to identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.

  2. The complete cDNA sequence of laminin alpha 4 and its relationship to the other human laminin alpha chains.

    Science.gov (United States)

    Richards, A; Al-Imara, L; Pope, F M

    1996-06-15

    We previously localised the gene (LAMA4) encoding a novel laminin alpha 4 chain to chromosome 6q21. In this study, we describe the complete coding sequence and compare the protein with the other three known human laminin alpha chains. Although closely linked to LAMA2, the LAMA4 product most closely resembles laminin alpha 3, a constituent of laminin 5. Like laminin alpha 3A, the alpha 4 chain is a truncated version of the alpha 1 and alpha 2 chains, with a much reduced short arm. While the alpha 4 molecule is most similar to alpha 3, it shares some features of the C-terminal domains G4 and G5 in common with alpha 2. Unlike the LAMA3 gene, LAMA4 appears to encode only a single transcript, as determined by 5' rapid amplification of cDNA ends. The cDNA sequence encodes 1816 amino acids, which include a 24-residue signal peptide. The gene is expressed in skin, placenta, heart, lung, skeletal muscle, and pancreas. We have also shown that the mRNA can be readily reverse transcribed and amplified from cultured dermal fibroblasts. PMID:8706685

  3. Deep sequencing increases hepatitis C virus phylogenetic cluster detection compared to Sanger sequencing.

    Science.gov (United States)

    Montoya, Vincent; Olmstead, Andrea; Tang, Patrick; Cook, Darrel; Janjua, Naveed; Grebely, Jason; Jacka, Brendan; Poon, Art F Y; Krajden, Mel

    2016-09-01

    Effective surveillance and treatment strategies are required to control the hepatitis C virus (HCV) epidemic. Phylogenetic analyses are powerful tools for reconstructing the evolutionary history of viral outbreaks and identifying transmission clusters. These studies often rely on Sanger sequencing which typically generates a single consensus sequence for each infected individual. For rapidly mutating viruses such as HCV, consensus sequencing underestimates the complexity of the viral quasispecies population and could therefore generate different phylogenetic tree topologies. Although deep sequencing provides a more detailed quasispecies characterization, in-depth phylogenetic analyses are challenging due to dataset complexity and computational limitations. Here, we apply deep sequencing to a characterized population to assess its ability to identify phylogenetic clusters compared with consensus Sanger sequencing. For deep sequencing, a sample specific threshold determined by the 50th percentile of the patristic distance distribution for all variants within each individual was used to identify clusters. Among seven patristic distance thresholds tested for the Sanger sequence phylogeny ranging from 0.005-0.06, a threshold of 0.03 was found to provide the maximum balance between positive agreement (samples in a cluster) and negative agreement (samples not in a cluster) relative to the deep sequencing dataset. From 77 HCV seroconverters, 10 individuals were identified in phylogenetic clusters using both methods. Deep sequencing analysis identified an additional 4 individuals and excluded 8 other individuals relative to Sanger sequencing. The application of this deep sequencing approach could be a more effective tool to understand onward HCV transmission dynamics compared with Sanger sequencing, since the incorporation of minority sequence variants improves the discrimination of phylogenetically linked clusters. PMID:27282472

  4. Introduction by molecular cloning of artifactual inverted sequences at the 5' terminus of the sense strand of bovine parathyroid hormone cDNA.

    OpenAIRE

    Weaver, C A; Gordon, D. F.; Kemper, B

    1981-01-01

    To study the structure and function of the gene for parathyroid hormone, we obtained recombinant plasmids containing bovine parathyroid hormone cDNA. The nucleotide sequence at the 5' terminus (relative to the sense strand) of the cDNA insert in a recombinant plasmid, pPTHi4, was different from that previously reported for the bovine parathyroid hormone cDNA insert of another recombinant plasmid, pPTHm1 [Kronenberg, H. M., McDevitt, B. F., Majzoub, J. A., Nathans, J., Sharp, P. A., Potts, J. ...

  5. Nucleotide sequence of classical swine fever virus strain Alfort/187 and transcription of infectious RNA from stably cloned full-length cDNA.

    OpenAIRE

    Ruggli, N; Tratschin, J D; Mittelholzer, C.; Hofmann, M A

    1996-01-01

    The complete nucleotide sequence of the genome of classical swine fever virus (CSFV) strain Alfort/187 was determined from three cDNA libraries constructed by cloning of DNA fragments obtained from independent sets of reverse transcription and PCR. The cDNA fragments were then assembled and inserted downstream of a T7 promoter in a P15A-derived plasmid vector to obtain the full-length cDNA clone pA187-1. The first nucleotide of the CSFV genome was positioned at the transcription start site of...

  6. Controlled ribonucleotide tailing of cDNA ends (CRTC) by terminal deoxynucleotidyl transferase: a new approach in PCR-mediated analysis of mRNA sequences.

    OpenAIRE

    Schmidt, W. M.; Mueller, M W

    1996-01-01

    Controlled ribonucleotide tailing of cDNA ends (CRTC) by terminal deoxynucleotidyl transferase is a polymerase chain reaction (PCR)-mediated technique that was developed to facilitate cloning and direct sequence analysis of complete 5'-terminal unknown coding regions of rare RNA molecules. In contrast with standard tailing protocols using dNTPs as the substrate, ribo-tailing of cDNA ends is easily controllable, self-limited (from two to four rNMP incorporations) and highly efficient (>98%). B...

  7. cDNA sequence analysis of ribosomal protein S13 gene in Plutella xylostella (Lepidoptera: Plutellidae)

    Institute of Scientific and Technical Information of China (English)

    SHAO-LIWANG; CHENG-FASHENG; CHUAN-LINGQIAO; MIYATATADASHI

    2005-01-01

    Ribosomal protein S 13 gene has been cloned and analyzed in many organisms,but there are few documents relating to insects. In this communication, the full-length cDNA sequence of ribosomal protein S 13 gene in the diamondback moth, Plutella xylostella(Lepidoptera: Plutellidae), was determined by using PCR amplification technique. The features of the ribosomal protein S 13 gene sequence were analyzed and the deduced amino acids sequence was compared with those from other insects. The results of multi-alignment of the amino acid sequences between the diamondback moth and other insect species revealed that this gene sequence is highly conserved in insects. Based on maximum likelihood method, a phylogenetic tree was constructed from 10 different species using PHYLIP software. It showed that nematode is one separate lineage and the five insect speciesbe long to another lineage, whereas those species higher than insects form the third one. The pattern of this phylogenetic tree evidently represented the evolution of different species.

  8. Deep sequencing and human antibody repertoire analysis.

    Science.gov (United States)

    Boyd, Scott D; Crowe, James E

    2016-06-01

    In the past decade, high-throughput DNA sequencing (HTS) methods and improved approaches for isolating antigen-specific B cells and their antibody genes have been applied in many areas of human immunology. This work has greatly increased our understanding of human antibody repertoires and the specific clones responsible for protective immunity or immune-mediated pathogenesis. Although the principles underlying selection of individual B cell clones in the intact immune system are still under investigation, the combination of more powerful genetic tracking of antibody lineage development and functional testing of the encoded proteins promises to transform therapeutic antibody discovery and optimization. Here, we highlight recent advances in this fast-moving field. PMID:27065089

  9. WebGMAP: a web service for mapping and aligning cDNA sequences to genomes

    OpenAIRE

    Liang, Chun; Liu, Lin; Ji, Guoli

    2009-01-01

    The genomes of thousands of organisms are being sequenced, often with accompanying sequences of cDNAs or ESTs. One of the great challenges in bioinformatics is to make these genomic sequences and genome annotations accessible in a user-friendly manner to general biologists to address interesting biological questions. We have created an open-access web service called WebGMAP (http://www.bioinfolab.org/software/webgmap) that seamlessly integrates cDNA-genome alignment tools, such as GMAP, with ...

  10. cDNA sequence coding for the alpha'-chain of the third complement component in the African lungfish.

    Science.gov (United States)

    Sato, A; Sültmann, H; Mayer, W E; Figueroa, F; Tichy, H; Klein, J

    1999-04-01

    cDNA clones coding for almost the entire C3 alpha-chain of the African lungfish (Protopterus aethiopicus), a representative of the Sarcopterygii (lobe-finned fishes), were sequenced and characterized. From the sequence it is deduced that the lungfish C3 molecule is probably a disulphide-bonded alpha:beta dimer similar to that of the C3 components of other jawed vertebrates. The deduced sequence contains conserved sites presumably recognized by proteolytic enzymes (e.g. factor I) involved in the activation and inactivation of the component. It also contains the conserved thioester region and the putative site for binding properdin. However, the site for the interaction with complement receptor 2 and factor H are poorly conserved. Either complement receptor 2 and factor H are not present in the lungfish or they bind to different residues at the same or a different site than mammalian complement receptor 2 and factor H. The C3 alpha-chain sequences faithfully reflect the phylogenetic relationships among vertebrate classes and can therefore be used to help to resolve the long-standing controversy concerning the origin of the tetrapods. PMID:10219761

  11. Evolution of tissue-specific keratins as deduced from novel cDNA sequences of the lungfish Protopterus aethiopicus.

    Science.gov (United States)

    Schaffeld, Michael; Bremer, Miriam; Hunzinger, Christian; Markl, Jürgen

    2005-03-01

    Lungfishes are possibly the closest extant relatives of the land vertebrates (tetrapods). We report here the cDNA and predicted amino acid sequences of 13 different keratins (ten type I and three type II) of the lungfish Protopterus aethiopicus. These keratins include the orthologs of human K8 and K18. The lungfish keratins were also identified in tissue extracts using two-dimensional polyacrylamide gel electrophoresis, keratin blot binding assays and immunoblotting. The identified keratin spots were analyzed by peptide mass fingerprinting which assigned seven sequences (inclusively Protopterus K8 and K18) to their respective protein spot. The peptide mass fingerprints also revealed the fact that the major epidermal type I and type II keratins of this lungfish have not yet been sequenced. Nevertheless, phylogenetic trees constructed from multiple sequence alignments of keratins from lungfish and distantly related vertebrates such as lamprey, shark, trout, frog, and human reveal new insights into the evolution of K8 and K18, and unravel a variety of independent keratin radiation events. PMID:15819414

  12. Existence of homologous sequences corresponding to cDNA of the ver

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    The presence of DNA homologues corresponding to verc203 (vernalization-related cDNA clone) was investigated by molecular hybridization techniques. The genes were detected in 16 plant species that cover 12 subclasses of the Takhtajan system of angiosperms classification including diverse model species. The results of Southern blot analysis showed a low copy number of this gene existed in rice, wheat, barley and Arabidopsis. The hybridization result of PCR products demonstrated the conservation of the gene corresponding to ver203 in diverse plants. The phylogenetic tree of the ver203 gene in tested plants was supported by evolution relationship of species. The ver203 gene expressed in a vernalized plumule winter wheat, instead of the root. And the endosperm before the treatment was essential for the ver203 expression during vernalization in wheat. In Arabidopsis thaliana, the pattern of expression showed that the gene corresponding to ver203 was expressed at low temperature for 14 days. Gibberellin (GA3) may accelerate the expression of ver203 gene in Arabidopsis exposed to low temperature. However, it could not replace vernalization treatment to initiate the gene expression.

  13. cDNA sequence of the horse (Equus caballus) LAMA3 gene and characterization of two intronic SNP markers.

    Science.gov (United States)

    Milenkovic, Dragan; Mata, Xavier; Chadi, Sead; Guérin, Gérard

    2005-12-01

    Laminins are large heterotrimeric basement membrane glycoproteins composed of alpha, beta and gamma chains. The Laminin 5 isoform has an alpha3beta3gamma2 composition and is essential for the adhesion of basal keratinocytes to the underlying epithelial basement membrane where it is mainly located. Mutations in the genes coding for the 3 chains have been associated with a severe skin blistering disease, Herlitz's junctional epidermolysis bullosa (JEB), observed in different species as man, dog, cat and horse. In this study, we report the sequence of the 5.2 kb horse laminin alpha 3 cDNA (LAMA3) as well as the detection of two intronic SNPs. These data will be useful to further identify causal mutations for the disease in this gene. PMID:16287627

  14. Complete sequence of a cDNA clone specifying sandbar shark immunoglobulin light chain: gene organization and implications for the evolution of light chains.

    OpenAIRE

    Hohman, V S; Schluter, S F; Marchalonis, J J

    1992-01-01

    A full-length cDNA clone specifying sandbar shark (Carcharhinus plumbeus) immunoglobulin light chain has been isolated and sequenced. By alignment with human lambda chains, the leader, framework, complementarity-determining, joining, and constant regions are clearly identified in the shark light chain. Approximately 40-50% identity is shared between the human and shark sequences in the variable and constant regions. We have performed sequence comparisons of the individual segments and constru...

  15. Trichinella spiralis thymidylate synthase: cDNA cloning and sequencing, and developmental pattern of mRNA expression.

    Science.gov (United States)

    Dabrowska, M; Jagielska, E; Cieśla, J; Płucienniczak, A; Kwiatowski, J; Wranicz, M; Boireau, P; Rode, W

    2004-02-01

    The persistent expression of thymidylate synthase activity has previously been demonstrated not only in adult forms, but also in non-developing muscle larvae of Trichinella spiralis and T. pseudospiralis, pointing to an unusual pattern of cell cycle regulation, and prompting further studies on the developmental pattern of T. spiralis thymidylate synthase gene expression. The enzyme cDNA was cloned and sequenced, allowing the characterization of a single open reading frame of 307 amino acids coding for a putative protein of 35,582 Da molecular weight. The amino acid sequence of the parasite enzyme was analysed, the consensus phylogenetic tree built and its stability assessed. The aa sequence identity with thymidylate synthase was confirmed by the enzymatic activity of the recombinant protein expressed in E. coli. As compared with the enzyme purified from muscle larvae, it showed apparently similar Vmax value, but higher Km(app) values desscribing interactions with dUMP (28.8 microM vs. 3.9 microM) and (6RS,alphaS)-N(5,10)-methylenetetrahydrofolate (383 microM vs. 54.7 microM). With the coding region used as a probe, thymidylate synthase mRNA levels, relative to 18S rRNA, were found to be similar in muscle larvae, adult forms and newborn larvae, in agreement with muscle larvae cells being arrested in the cell cycle. PMID:15030008

  16. deepTools: a flexible platform for exploring deep-sequencing data

    OpenAIRE

    Ramírez, Fidel; Dündar, Friederike; Diehl, Sarah; Grüning, Björn A; Manke, Thomas

    2014-01-01

    We present a Galaxy based web server for processing and visualizing deeply sequenced data. The web server's core functionality consists of a suite of newly developed tools, called deepTools, that enable users with little bioinformatic background to explore the results of their sequencing experiments in a standardized setting. Users can upload pre-processed files with continuous data in standard formats and generate heatmaps and summary plots in a straight-forward, yet highly customizable mann...

  17. Constructing and random sequencing analysis of normalized cDNA library of testis tissue from oriental river prawn (Macrobrachium nipponense).

    Science.gov (United States)

    Qiao, Hui; Fu, Hongtuo; Jin, Shubo; Wu, Yan; Jiang, Sufei; Gong, Yongsheng; Xiong, Yiwei

    2012-09-01

    The oriental river prawn, Macrobrachium nipponense, is an important aquaculture species in China. Sexual precocity is a serious problem because of genetic retrogression, which has negative effects on product quality and dramatically affects price. Culture of all-male populations of this species would be economically advantageous, as the males grow faster and reach a much larger size than females. Developing such a culture scheme will require discovery of sex- or reproduction-related genes that affect sexual maturity and sex determination. In this study, a high-quality normalized testis cDNA library was constructed to identify novel transcripts. Of the 5280 successful sequencing reaction yields, 5202 expressed tagged sequences (ESTs) with an average length of 954 bp. Ultimately, 3677 unique sequences, including 891 contigs and 2786 singletons, were identified based on cluster and assembly analyses. Sixteen hundred (43.5%) genes were novel based on the NCBI protein database, thus these unidentified genes may improve basic molecular knowledge about M. nipponense. Of the novel unigenes, 34.4% (715/2077) were homologous to insects, such as Tribolium castaneum, Drosophila spp. and Apis mellifera. Fifty-two genes were identified as sex- or reproduction-related based on Gene Ontology classification and sequence comparison with data from other publications. These genes can be classified into groups based on different functions, including 10 sex-determination related genes, 8 male-reproductive genes, 5 cathepsin-related genes, 20 ubiquitin-related genes, 5 ferritin-related genes, and 4 LRR genes. The results of this study provide new sequence information about M. nipponense, which will be the basis for further genetic studies of this species and other decapods crustaceans. PMID:22632994

  18. Sequencing and comparative genomics analysis in Senecio scandens Buch.-Ham. Ex D. Don, based on full-length cDNA library

    Science.gov (United States)

    Qian, Gang; Ping, Junjiao; Zhang, Zhen; Xu, Delin

    2014-01-01

    Senecio scandens Buch.-Ham. ex D. Don, an important antibacterial source of Chinese traditional medicine, has a widespread distribution in a few ecological habitats of China. We generated a full-length complementary DNA (cDNA) library from a sample of elite individuals with superior antibacterial properties, with satisfactory parameters such as library storage (4.30 × 106 CFU), efficiency of titre (1.30 × 106 CFU/mL), transformation efficiency (96.35%), full-length ratio (64.00%) and redundancy ratio (3.28%). The BLASTN search revealed the facile formation of counterparts between the experimental sample and Arabidopsis thaliana in view of high-homology cDNA sequence (90.79%) with e-values <1e – 50. Sequence similarities to known proteins indicate that the entire sequences of the full-length cDNA clones consist of the major of functional genes identified by a large set of microarray data from the present experimental material. For other Compositae species, a large set of full-length cDNA clones reported in the present article will serve as a useful resource to facilitate further research on the transferability of expressed sequence tag-derived simple sequence repeats (EST-SSR) development, comparative genomics and novel transcript profiles. PMID:26740776

  19. Molecular cloning and sequence analysis of factor C cDNA from the Singapore horseshoe crab, Carcinoscorpius rotundicauda.

    Science.gov (United States)

    Ding, J L; Navas, M A; Ho, B

    1995-03-01

    Two forms of Factor C cDNAs: CrFC21 (3448 bp) and CrFC26 (4182 bp) have been cloned into lambda gt22. CrFC26 includes 568 nucleotides of 5' untranslated region (5' UTR) containing seven ATGs before the real initiation site, an open reading frame (ORF) of 3249 nucleotides, a stop codon, and 365 nucleotides of 3' untranslated sequence. There are four polyadenylation signals and six potential glycosylation sites. The ORF codes for a signal peptide of 24 amino acids and a Factor C zymogen of 1059 residues. The CrFC21 lacks most of the 5' UTR, and has some base changes in its ORF. The predicted secondary mRNA structures of the 5' end of CrFC26 showed numerous stem-and-loop structures, thus obscuring its real start codon. In contrast, CrFC21 has a well-exposed AUG start site, and expresses Factor C in transcription-translation reactions in vitro. There is a typical serine protease catalytic triad of Asp-His-Ser, which is structurally like prothrombin, but catalytically more similar to trypsin. Although an overall homology of 97.7% was observed in comparison with the Tachypleus tridentatus Factor C (TtFC) cDNA, there were notable differences in the restriction sites and subtle base substitutions in the CrFC cDNA. The high degree of homology between Factor C from T. tridentatus and C. rotundicauda substantiates, at the molecular level, the proximity of these two species in the course of evolution. This finding contravenes the apparent disparities with respect to their morphology, ecological habitat, and taxonomical classification. PMID:7538401

  20. cDNA and deduced amino acid sequence of human pulmonary surfactant-associated proteolipid SPL(Phe)

    International Nuclear Information System (INIS)

    Hydrophobic surfactant-associated protein of M/sub r/ 6000-14,000 was isolated from either/ethanol or chloroform/methanol extracts of mammalian pulmonary surfactant. Automated Edman degradation in a gas-phase sequencer showed the major N-terminus of the human low molecular weight protein to be Phe-Pro-Ile-Pro-Leu-Pro-Try-Cys-Trp-Leu-Cys-Arg-Ala-Leu-. Because of the N-terminal phenylalanine, the surfactant protein was designated SPL(Phe). Antiserum generated against hydrophobic surfactant protein(s) from bovine pulmonary surfactant recognized protein of M/sub r/ 6000-14,000 in immunoblot analysis and was used to screen a λgt11 expression library constructed from adult human lung poly(A)+ RNA. This resulted in identification of a 1.4-kilobase cDNA clone that was shown to encode the N-terminus of the surfactant polypeptide SPL(Phe) (Phe-Pro-Ile-Pro-Leu-Pro-) within an open reading frame for a larger protein. Expression of a fused β-galactosidase-SPL (Phe) gene in Escherichia coli yielded an immunoreactive M/sub r/ 34,000 fusion peptide. Hybrid-arrested translation with the cDNA and immunoprecipitation of [35S]methionine-labeled in vitro translation products of human poly(A)+ RNA with a surfactant polyclonal antibody resulted in identification of a M/sub r/ 40,000 precursor protein. Blot hybridization analysis of electrophoretically fractionated RNA from human lung detected a 2.0-kilobase RNA that was more abundant in adult lung than in fetal lung. These proteins, and specifically SPL(Phe), may therefore be useful for synthesis of replacement surfactants for treatment of hyaline membrane disease in newborn infants or of other surfactant-deficient states

  1. cDNA and deduced amino acid sequence of human pulmonary surfactant-associated proteolipid SPL(Phe)

    Energy Technology Data Exchange (ETDEWEB)

    Glasser, S.W.; Korfhagen, T.R.; Weaver, T.; Pilot-Matias, T.; Fox, J.L.; Whitsett, J.A.

    1987-06-01

    Hydrophobic surfactant-associated protein of M/sub r/ 6000-14,000 was isolated from either/ethanol or chloroform/methanol extracts of mammalian pulmonary surfactant. Automated Edman degradation in a gas-phase sequencer showed the major N-terminus of the human low molecular weight protein to be Phe-Pro-Ile-Pro-Leu-Pro-Try-Cys-Trp-Leu-Cys-Arg-Ala-Leu-. Because of the N-terminal phenylalanine, the surfactant protein was designated SPL(Phe). Antiserum generated against hydrophobic surfactant protein(s) from bovine pulmonary surfactant recognized protein of M/sub r/ 6000-14,000 in immunoblot analysis and was used to screen a lambdagt11 expression library constructed from adult human lung poly(A)/sup +/ RNA. This resulted in identification of a 1.4-kilobase cDNA clone that was shown to encode the N-terminus of the surfactant polypeptide SPL(Phe) (Phe-Pro-Ile-Pro-Leu-Pro-) within an open reading frame for a larger protein. Expression of a fused ..beta..-galactosidase-SPL (Phe) gene in Escherichia coli yielded an immunoreactive M/sub r/ 34,000 fusion peptide. Hybrid-arrested translation with the cDNA and immunoprecipitation of (/sup 35/S)methionine-labeled in vitro translation products of human poly(A)/sup +/ RNA with a surfactant polyclonal antibody resulted in identification of a M/sub r/ 40,000 precursor protein. Blot hybridization analysis of electrophoretically fractionated RNA from human lung detected a 2.0-kilobase RNA that was more abundant in adult lung than in fetal lung. These proteins, and specifically SPL(Phe), may therefore be useful for synthesis of replacement surfactants for treatment of hyaline membrane disease in newborn infants or of other surfactant-deficient states.

  2. cDNA: 43397 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.30035 Mus musculus adult male corpora quadrigemina cDNA, RIKEN full-length enri ... FOLATE DEHYDROGENASE (EC 1.5.1.6) (10-FTHFDH) (FBP-CI ) homolog [Rattus norvegicus], full insert sequence ...

  3. cDNA: 36928 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.240850 Mus musculus adult male medulla oblongata cDNA, RIKEN full-length enrich ... MA AMPLIFIED SEQUENCE 1 (NOVEL AMPLIFIED IN BREAST CANCER ... 1) (AMPLIFIED AND OVEREXPRESSED IN BREAST CANCER ) ...

  4. cDNA: 36927 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.240850 Mus musculus adult male stomach cDNA, RIKEN full-length enriched library ... MA AMPLIFIED SEQUENCE 1 (NOVEL AMPLIFIED IN BREAST CANCER ... 1) (AMPLIFIED AND OVEREXPRESSED IN BREAST CANCER ) ...

  5. Evolution of vertebrate IgM: complete amino acid sequence of the constant region of Ambystoma mexicanum mu chain deduced from cDNA sequence.

    Science.gov (United States)

    Fellah, J S; Wiles, M V; Charlemagne, J; Schwager, J

    1992-10-01

    cDNA clones coding for the constant region of the Mexican axolotl (Ambystoma mexicanum) mu heavy immunoglobulin chain were selected from total spleen RNA, using a cDNA polymerase chain reaction technique. The specific 5'-end primer was an oligonucleotide homologous to the JH segment of Xenopus laevis mu chain. One of the clones, JHA/3, corresponded to the complete constant region of the axolotl mu chain, consisting of a 1362-nucleotide sequence coding for a polypeptide of 454 amino acids followed in 3' direction by a 179-nucleotide untranslated region and a polyA+ tail. The axolotl C mu is divided into four typical domains (C mu 1-C mu 4) and can be aligned with the Xenopus C mu with an overall identity of 56% at the nucleotide level. Percent identities were particularly high between C mu 1 (59%) and C mu 4 (71%). The C-terminal 20-amino acid segment which constitutes the secretory part of the mu chain is strongly homologous to the equivalent sequences of chondrichthyans and of other tetrapods, including a conserved N-linked oligosaccharide, the penultimate cysteine and the C-terminal lysine. The four C mu domains of 13 vertebrate species ranging from chondrichthyans to mammals were aligned and compared at the amino acid level. The significant number of mu-specific residues which are conserved into each of the four C mu domains argues for a continuous line of evolution of the vertebrate mu chain. This notion was confirmed by the ability to reconstitute a consistent vertebrate evolution tree based on the phylogenic parsimony analysis of the C mu 4 sequences. PMID:1382992

  6. Deep sequencing reveals complex spurious transcription from transiently transfected plasmids

    Czech Academy of Sciences Publication Activity Database

    Nejepínská, Jana; Malík, Radek; Moravec, Martin

    2012-01-01

    Roč. 7, č. 8 (2012), e43283. E-ISSN 1932-6203 R&D Projects: GA ČR GA204/09/0085 Grant ostatní: EMBO(XE) 0001488 Institutional research plan: CEZ:AV0Z50520514 Institutional support: RVO:68378050 Keywords : transient plasmid transfection * deep sequencing Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.730, year: 2012

  7. Deep sequencing approach for investigating infectious agents causing fever

    OpenAIRE

    Susilawati, T. N.; Jex, A. R.; Cantacessi, C.; Pearson, M.; Navarro, S.; Susianto, A.; Loukas, A. C.; McBride, W. J. H.

    2016-01-01

    Acute undifferentiated fever (AUF) poses a diagnostic challenge due to the variety of possible aetiologies. While the majority of AUFs resolve spontaneously, some cases become prolonged and cause significant morbidity and mortality, necessitating improved diagnostic methods. This study evaluated the utility of deep sequencing in fever investigation. DNA and RNA were isolated from plasma/sera of AUF cases being investigated at Cairns Hospital in northern Australia, including eight control samp...

  8. Sequence of an intestinal cDNA encoding human gastric inhibitory polypeptide precursor

    International Nuclear Information System (INIS)

    Gastric inhibitory polypeptide (GIP) is a 42-amino acid hormone that stimulates insulin secretion in the presence of glucose. Complementary DNA clones encoding human GIP were isolated from a library prepared with RNA from duodenum. The predicted amino acid sequence indicates that GIP is derived by proteolytic processing of a 153-residue precursor, preproGIP. The GIP moiety is flanked by polypeptide segments of 51 and 60 amino acids at its NH2 and COOH termini, respectively. The former includes a signal peptide of about 21 residues and an NH2-terminal propeptide of 30 amino acids. GIP is released from the precursor by processing at single arginine residues. There is a region of nine amino acids in the COOH-terminal propeptide of the GIP precursor that has partial homology with a portion of chromogranin A as well as pancreastatin

  9. Isolating Viral and Host RNA Sequences from Archival Material and Production of cDNA Libraries for High-Throughput DNA Sequencing

    Science.gov (United States)

    Xiao, Yongli; Sheng, Zong-Mei; Taubenberger, Jeffery K.

    2015-01-01

    The vast majority of surgical biopsy and post-mortem tissue samples are formalin-fixed and paraffin-embedded (FFPE), but this process leads to RNA degradation that limits gene expression analysis. As an example, the viral RNA genome of the 1918 pandemic influenza A virus was previously determined in a 9-year effort by overlapping RT-PCR from post-mortem samples. Using the protocols described here, the full genome of the 1918 virus at high coverage was determined in one high-throughput sequencing run of a cDNA library derived from total RNA of a 1918 FFPE sample after duplex-specific nuclease treatments. This basic methodological approach should assist in the analysis of FFPE tissue samples isolated over the past century from a variety of infectious diseases. PMID:26344216

  10. The isolation of transcription factors from lambda gt11 cDNA expression libraries: human steroid 5 alpha-reductase 1 has sequence-specific DNA binding activity.

    OpenAIRE

    Gaston, K; Fried, M

    1992-01-01

    The Surf-1/Surf-2 bi-directional promoter contains binding sites for at least three transcription factors (Su1, Su2, and Su3). By screening a lambda gt11 HeLa cell cDNA expression library with a concatenated Su2 factor binding site, we isolated a cDNA which encodes a protein with sequence-specific DNA binding activity. Gel retardation assays showed that the cloned factor binds specifically to the Su2 factor binding site present in the human Surf-1/Surf-2 promoter but not to an Su2 site contai...

  11. Nucleotide sequence of Phaseolus vulgaris L. alcohol dehydrogenase encoding cDNA and three-dimensional structure prediction of the deduced protein

    Science.gov (United States)

    Amelia, Kassim; Khor, Chin Yin; Shah, Farida Habib; Bhore, Subhash J.

    2015-01-01

    Background: Common beans (Phaseolus vulgaris L.) are widely consumed as a source of proteins and natural products. However, its yield needs to be increased. In line with the agenda of Phaseomics (an international consortium), work of expressed sequence tags (ESTs) generation from bean pods was initiated. Altogether, 5972 ESTs have been isolated. Alcohol dehydrogenase (AD) encoding gene cDNA was a noticeable transcript among the generated ESTs. This AD is an important enzyme; therefore, to understand more about it this study was undertaken. Objective: The objective of this study was to elucidate P. vulgaris L. AD (PvAD) gene cDNA sequence and to predict the three-dimensional (3D) structure of deduced protein. Materials and Methods: positive and negative strands of the PvAD cDNA clone were sequenced using M13 forward and M13 reverse primers to elucidate the nucleotide sequence. Deduced PvAD cDNA and protein sequence was analyzed for their basic features using online bioinformatics tools. Sequence comparison was carried out using bl2seq program, and tree-view program was used to construct a phylogenetic tree. The secondary structures and 3D structure of PvAD protein were predicted by using the PHYRE automatic fold recognition server. Results: The sequencing results analysis showed that PvAD cDNA is 1294 bp in length. It's open reading frame encodes for a protein that contains 371 amino acids. Deduced protein sequence analysis showed the presence of putative substrate binding, catalytic Zn binding, and NAD binding sites. Results indicate that the predicted 3D structure of PvAD protein is analogous to the experimentally determined crystal structure of s-nitrosoglutathione reductase from an Arabidopsis species. Conclusions: The 1294 bp long PvAD cDNA encodes for 371 amino acid long protein that contains conserved domains required for biological functions of AD. The predicted deduced PvAD protein's 3D structure reflects the analogy with the crystal structure of

  12. Sequencing and analysis of 10967 full-length cDNA clones from Xenopus laevis and Xenopus tropicalis

    Energy Technology Data Exchange (ETDEWEB)

    Morin, R D; Chang, E; Petrescu, A; Liao, N; Kirkpatrick, R; Griffith, M; Butterfield, Y; Stott, J; Barber, S; Babakaiff, R; Matsuo, C; Wong, D; Yang, G; Smailus, D; Brown-John, M; Mayo, M; Beland, J; Gibson, S; Olson, T; Tsai, M; Featherstone, R; Chand, S; Siddiqui, A; Jang, W; Lee, E; Klein, S; Prange, C; Myers, R M; Green, E D; Wagner, L; Gerhard, D; Marra, M; Jones, S M; Holt, R

    2005-10-31

    Sequencing of full-insert clones from full-length cDNA libraries from both Xenopus laevis and Xenopus tropicalis has been ongoing as part of the Xenopus Gene Collection initiative. Here we present an analysis of 10967 clones (8049 from X. laevis and 2918 from X. tropicalis). The clone set contains 2013 orthologs between X. laevis and X. tropicalis as well as 1795 paralog pairs within X. laevis. 1199 are in-paralogs, believed to have resulted from an allotetraploidization event approximately 30 million years ago, and the remaining 546 are likely out-paralogs that have resulted from more ancient gene duplications, prior to the divergence between the two species. We do not detect any evidence for positive selection by the Yang and Nielsen maximum likelihood method of approximating d{sub N}/d{sub S}. However, d{sub N}/d{sub S} for X. laevis in-paralogs is elevated relative to X. tropicalis orthologs. This difference is highly significant, and indicates an overall relaxation of selective pressures on duplicated gene pairs. Within both groups of paralogs, we found evidence of subfunctionalization, manifested as differential expression of paralogous genes among tissues, as measured by EST information from public resources. We have observed, as expected, a higher instance of subfunctionalization in out-paralogs relative to in-paralogs.

  13. Sequence analysis of keratin-like proteins and cloning of intermediate filament-like cDNA from higher plant cells

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Two keratin-like proteins of 64 and 55 ku were purified from suspension cells of Daucus carota L.,and their partial amino acid sequences were determined.The homological analysis showed that the sequence from the 64 ku protein was highly homological to b -glucosidase,and that from the 55 ku protein had no significant homologue in GenBank.Using conservative sequence of animal IF proteins as primer,we cloned a cDNA fragment from Daucus carota L.Southern blot and Northern blot results indicated that this cDNA fragment was a single copy gene and expressed both in suspension cells and leaves.Homological analysis revealed that it had moderate homology to a variety of a -helical proteins.Our results might shed more light on molecular characterization of IF existence in higher plant.

  14. Sequence analysis of keratin-like proteins and cloning of intermediate filament-like cDNA from higher plant cells

    Institute of Scientific and Technical Information of China (English)

    赵大中; 陈丹英; 杨橙; 翟中和

    2000-01-01

    Two keratin-like proteins of 64 and 55 ku were purified from suspension cells of Caucus carota L, and their partial amino acid sequences were determined. The homological analysis showed that the sequence from the 64 ku protein was highly homological to p-glucosidase, and that from the 55 ku protein had no significant homologue in GenBank. Using conservative sequence of animal IF proteins as primer, we cloned a cDNA fragment from Daucus carota L. Southern blot and Northern blot results indicated that this cDNA fragment was a single copy gene and expressed both in suspension cells and leaves. Homological analysis revealed that it had moderate homology to a variety of a-helical proteins. Our results might shed more light on molecular characterization of IF existence in higher plant.

  15. deepTools2: a next generation web server for deep-sequencing data analysis.

    Science.gov (United States)

    Ramírez, Fidel; Ryan, Devon P; Grüning, Björn; Bhardwaj, Vivek; Kilpert, Fabian; Richter, Andreas S; Heyne, Steffen; Dündar, Friederike; Manke, Thomas

    2016-07-01

    We present an update to our Galaxy-based web server for processing and visualizing deeply sequenced data. Its core tool set, deepTools, allows users to perform complete bioinformatic workflows ranging from quality controls and normalizations of aligned reads to integrative analyses, including clustering and visualization approaches. Since we first described our deepTools Galaxy server in 2014, we have implemented new solutions for many requests from the community and our users. Here, we introduce significant enhancements and new tools to further improve data visualization and interpretation. deepTools continue to be open to all users and freely available as a web service at deeptools.ie-freiburg.mpg.de The new deepTools2 suite can be easily deployed within any Galaxy framework via the toolshed repository, and we also provide source code for command line usage under Linux and Mac OS X. A public and documented API for access to deepTools functionality is also available. PMID:27079975

  16. Cloning and Sequence Analysis of Interleukin 10 (IL-10) Full-length cDNA from Cyprinus carpio L.

    Institute of Scientific and Technical Information of China (English)

    Xiangru FENG; Yilong CHEN; Xiao ZHAO; Wendong WANG; Junhui ZHANG; Zhenguo YANG SUN; Shengmei JIA; Qiang LU

    2012-01-01

    Abstract [Objective] This study aimed to obtain IL-IO (interleukin 10) full-length cD- NA of common carp (Cyprinus carpio L.) and conduct the sequence analysis. []~lethod] The differentially expressed cDNA fragment was obtained by DD-RTPCR (differential display RT-PCR). The cDNA library of peripheral blood leukocytes which were separated from common carp and stimulated by mitogen was screened with a probe labeled with DIG (digoxigenin). The IL-IO full-length cDNA was cloned from 0.8x104 pfu of recombinant phages, and the sequence analysis and homology com- parison were carried out. [Result] Sequence analysis indicated that the IL-IO full- length cDNA of common carp was 1 117 bp long, containing a.55 bp 5'-UTR, a 522 bp 3"-UTR, and a 540 bp open reading frame(ORF) encoding 179 amino acids. In addition, there were three mRNA instability motifs (ATTTA) in the 3"-untranslated region. The deduced protein sequence shared typical sequence features of the IL-IO family. Homology comparison indicated that the obtained sequence shared 89.1% homology with the carp IL-IO gene from GenBank. [Conclusion] This study laid foun- dation for further study of the expression manner, functional characteristic and regu- lation mechanism of IL-IO in vivo and the interaction mechanism in the inflammatory reaction and immune response.

  17. 虹鳟 Ndufb2基因全长 cDNA 序列的克隆与分析%Cloning and sequence analysis of Ndufb2 full-length cDNA derived from Oncorhynchus mykiss

    Institute of Scientific and Technical Information of China (English)

    王家庆; 边佳; 李代宗; 马爽; 王亮; 那广宁

    2013-01-01

    Summary Rainbow trout belongs to Salmonidae aerobic fish,and it is necessary for high dissolved oxygen content of living water environment.If the dissolved oxygen content of living water is less than 5 mg/L,it will cause the increase of respiratory rate,which is the so-called“aquaculture floating head”phenomenon.Because the fish lives in hypoxia environment and the 90% oxygen consumption is in the mitochondria,the transmission mechanism in composition and electronic respiratory chain may be different from the terrestrial animal.At the mitochondrial inner membrane,electrons from NADH and succinate pass through the electron transport chain to oxygen,which is reduced to water.Complex I is one of the main sites at which premature electron leakage to oxygen occurs,thus being one of the main sites of production of harmful superoxide.The first isolation of mitochondrial complex I since 1 961,its composition and structure have had a primary understanding,but the specific mechanism of its participation in respiration,especially the function of each subunit is not clear.The protein encoded by Ndufb2 gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase(complex I). Mammalian complex I is composed of 45 different subunits.This protein has NADH dehydrogenase activity and oxidoreductase activity.It plays an important role in transferring electrons from NADH to the respiratory chain. Reverse transcription PCR(RT-PCR) and rapid amplification of cDNA ends(RACE)methods were used for the isolation of the whole cDNA of Ndufb2 gene from brain of Oncorhynchus mykiss .The assembly taskes of 3' and 5'-RACE sequence were completed by DNAman program.A pair of gene specific primers were designed to amplify the full-length cDNA sequence.ClustalX 1.81 and MEGA 3.0 software were used to calculate the amino acid sequence differences,and then the phylogenetic relationships of rainbow trout Ndufb2 gene sequence with other species were analyzed.Protein phosphorylation sites and

  18. The cDNA sequence for the protein-tyrosine kinase substrate p36 (calpactin I heavy chain) reveals a multidomain protein with internal repeats

    DEFF Research Database (Denmark)

    Sarin, C T; Tack, B F; Kristensen, Torsten; Glenney Jr., J R; Hunter, T

    1986-01-01

    We have isolated and sequenced a full-length cDNA clone for the protein-tyrosine kinase substrate p36 (calpactin I heavy chain). This sequence predicts a 339 amino acid (Mr 38,493) protein containing an N-terminal region of 20 amino acids, known to interact with a 10 kd protein (light chain), and...... A2 inhibitor lipocortin I were found to be 50% identical in sequence over the C-terminal 300 residues. The function of p36 and its relation to other proteins are discussed....

  19. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    OpenAIRE

    Ridley, R. G.; Patel, H. V.; Gerber, G E; Morton, R C; Freeman, K. B.

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a t...

  20. Human thyroid peroxidase: complete cDNA and protein sequence, chromosome mapping, and identification of two alternately spliced mRNAs

    International Nuclear Information System (INIS)

    Two forms of human thyroid peroxidase cDNAs were isolated from a λgt11 cDNA library, prepared from Graves disease thyroid tissue mRNA, by use of oligonucleotides. The longest complete cDNA, designated phTPO-1, has 3048 nucleotides and an open reading frame consisting of 933 amino acids, which would encode a protein with a molecular weight of 103,026. Five potential asparagine-linked glycosylation sites are found in the deduced amino acid sequence. The second peroxidase cDNA, designated phTPO-2, is almost identical to phTPO-1 beginning 605 base pairs downstream except that it contains 1-base-pair difference and lacks 171 base pairs in the middle of the sequence. This results in a loss of 57 amino acids corresponding to a molecular weight of 6282. Interestingly, this 171-nucleotide sequence has GT and AG at its 5' and 3' boundaries, respectively, that are in good agreement with donor and acceptor splice site consensus sequences. Using specific oligonucleotide probes for the mRNAs derived from the cDNA sequences hTOP-1 and hTOP-2, the authors show that both are expressed in all thyroid tissues examined and the relative level of two mRNAs is different in each sample. The results suggest that two thyroid peroxidase proteins might be generated through alternate splicing of the same gene. By using somatic cell hybrid lines, the thyroid peroxidase gene was mapped to the short arm of human chromosome 2

  1. Analysis of a cDNA clone expressing a human autoimmune antigen: full-length sequence of the U2 small nuclear RNA-associated B antigen

    International Nuclear Information System (INIS)

    A U2 small nuclear RNA-associated protein, designated B'', was recently identified as the target antigen for autoimmune sera from certain patients with systemic lupus erythematosus and other rheumatic diseases. Such antibodies enabled them to isolate cDNA clone λHB''-1 from a phage λgt11 expression library. This clone appeared to code for the B'' protein as established by in vitro translation of hybrid-selected mRNA. The identity of clone λHB''-1 was further confirmed by partial peptide mapping and analysis of the reactivity of the recombinant antigen with monospecific and monoclonal antibodies. Analysis of the nucleotide sequence of the 1015-base-pair cDNA insert of clone λHB''-1 revealed a large open reading frame of 800 nucleotides containing the coding sequence for a polypeptide of 25,457 daltons. In vitro transcription of the λHB''-1 cDNA insert and subsequent translation resulted in a protein product with the molecular size of the B'' protein. These data demonstrate that clone λHB''-1 contains the complete coding sequence of this antigen. The deduced polypeptide sequence contains three very hydrophilic regions that might constitute RNA binding sites and/or antigenic determinants. These findings might have implications both for the understanding of the pathogenesis of rheumatic diseases as well as for the elucidation of the biological function of autoimmune antigens

  2. cDNA sequence of a human skeletal muscle ADP/ATP translocator: lack of a leader peptide, divergence from a fibroblast translocator cDNA, and coevolution with mitochondrial DNA genes

    International Nuclear Information System (INIS)

    The authors have characterized a 1400-nucleotide cDNA for the human skeletal muscle ADP/ATP translocator. The deduced amino acid sequence is 94% homologous to the beef heart ADP/ATP translocator protein and contains only a single additional amino-terminal methionine. This implies that the human translocator lacks an amino-terminal targeting peptide, a conclusion substantiated by measuring the molecular weight of the protein synthesized in vitro. A 1400-nucleotide transcript encoding the skeletal muscle translocator was detected on blots of total RNA from human heart, kidney, skeletal muscle, and HeLa cells by hybridization with oligonucleotide probes homologous to the coding region and 3' noncoding region of the cDNA. However, the level of this mRNA varied substantially among tissues. Comparison of our skeletal muscle translocator sequence with that of a recently published human fibroblast translocator cognate revealed that the two proteins are 88% identical and diverged about 275 million years ago. Hence, tissues vary both in the level of expression of individual translocator genes and in differential expression of cognate translocator genes. Comparison of the base substitution rates of the ADP/ATP translocator and the oxidative phosphorylation genes encoded by mitochondrial DNA revealed that the mitochondrial DNA genes fix 10 times more synonymous substitutions and 12 times more replacement substitutions; yet, these nuclear and cytoplasmic respiration genes experience comparable evolutionary constraints. This suggest that the mitochondrial DNA genes are highly prone to deleterious mutations

  3. Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum

    Directory of Open Access Journals (Sweden)

    Glatz Richard V

    2011-09-01

    Full Text Available Abstract Background Parasitoid insects manipulate their hosts' physiology by injecting various factors into their host upon parasitization. Transcriptomic approaches provide a powerful approach to study insect host-parasitoid interactions at the molecular level. In order to investigate the effects of parasitization by an ichneumonid wasp (Diadegma semiclausum on the host (Plutella xylostella, the larval transcriptome profile was analyzed using a short-read deep sequencing method (Illumina. Symbiotic polydnaviruses (PDVs associated with ichneumonid parasitoids, known as ichnoviruses, play significant roles in host immune suppression and developmental regulation. In the current study, D. semiclausum ichnovirus (DsIV genes expressed in P. xylostella were identified and their sequences compared with other reported PDVs. Five of these genes encode proteins of unknown identity, that have not previously been reported. Results De novo assembly of cDNA sequence data generated 172,660 contigs between 100 and 10000 bp in length; with 35% of > 200 bp in length. Parasitization had significant impacts on expression levels of 928 identified insect host transcripts. Gene ontology data illustrated that the majority of the differentially expressed genes are involved in binding, catalytic activity, and metabolic and cellular processes. In addition, the results show that transcription levels of antimicrobial peptides, such as gloverin, cecropin E and lysozyme, were up-regulated after parasitism. Expression of ichnovirus genes were detected in parasitized larvae with 19 unique sequences identified from five PDV gene families including vankyrin, viral innexin, repeat elements, a cysteine-rich motif, and polar residue rich protein. Vankyrin 1 and repeat element 1 genes showed the highest transcription levels among the DsIV genes. Conclusion This study provides detailed information on differential expression of P. xylostella larval genes following parasitization, Ds

  4. Identification and characterization of cDNA sequences encoding the HIS3 and LEU2 genes of the fungus Alternaria tenuissima

    Institute of Scientific and Technical Information of China (English)

    Ying Wan; Xuli Wang; Yun Huang; Dewen Qiu; Linghuo Jiang

    2008-01-01

    Alternaria tenuissima is a fungus widely present in the environment and could cause diseases in plants and humans.In this study,through a yeast genetic approach,cDNA sequences were isolated and characterized for the AtHIS3 and AtLEU2 genes.AtHIS3 cDNA encodes a protein of 238 amino acids,while AtLEU2 cDNA encodes a protein of 363 amino acids.Based on the phylogenetic analysis of amino acid sequences of AtHis3p and AtLeu2p,A.tenuissima is closely related to the plant pathogenic fungus Phaeosphaeria nodorum.This study provides two genetic markers for studies of functions of genes regulating development,morphology,and virulence of A.tenuissima.

  5. Uroporphyrinogen-III synthase: Molecular cloning, nucleotide sequence, expression of a mouse full-length cDNA, and its localization on mouse chromosome 7

    Energy Technology Data Exchange (ETDEWEB)

    Xu, W.; Desnick, R.J. [Mount Sinai School of Medicine, New York, NY (United States); Kozak, C.A. [National Institute of Health, Bethesda, MD (United States)

    1995-04-10

    Uroporphyrinogen-III synthase, the fourth enzyme in the heme biosynthetic pathway, is responsible for the conversion of hydroxymethylbilane to the cyclic tetrapyrrole, uroporphyrinogen III. The deficient activity of URO-S is the enzymatic defect in congenital erythropoietic porphyria (CEP), an autosomal recessive disorder. For the generation of a mouse model of CEP, the human URO-S cDNA was used to screen 2 X 10{sup 6} recombinants from a mouse adult liver cDNA library. Ten positive clones were isolated, and dideoxy sequencing of the entire 1.6-kb insert of clone pmUROS-1 revealed 5{prime} and 3{prime} untranslated sequences of 144 and 623 bp, respectively, and an open reading frame of 798 bp encoding a 265-amino-acid polypeptide with a predicted molecular mass of 28,501 Da. The mouse and human coding sequences had 80.5 and 77.8% nucleotide and amino acid identity, respectively. The authenticity of the mouse cDNA was established by expression of the active monomeric enzyme in Escherichia coli. In addition, the analysis of two multilocus genetic crosses localized the mouse gene on chromosome 7, consistent with the mapping of the human gene to a position of conserved synteny on chromosome 10. The isolation, expression, and chromosomal mapping of this full-length cDNA should facilitate studies of the structure and organization of the mouse genomic sequence and the development of a mouse model of CEP for characterization of the disease pathogenesis and evaluation of gene therapy. 38 refs., 1 tab.

  6. Deep sequencing of voodoo lily (Amorphophallus konjac): an approach to identify relevant genes involved in the synthesis of the hemicellulose glucomannan

    OpenAIRE

    Gille, Sascha; Cheng, Kun; Skinner, Mary E.; Liepman, Aaron H.; Wilkerson, Curtis G.; Pauly, Markus

    2011-01-01

    A Roche 454 cDNA deep sequencing experiment was performed on a developing corm of Amorphophallus konjac—also known as voodoo lily. The dominant storage polymer in the corm of this plant is the polysaccharide glucomannan, a hemicellulose known to exist in the cell walls of higher plants and a major component of plant biomass derived from softwoods. A total of 246 mega base pairs of sequence data was obtained from which 4,513 distinct contigs were assembled. Within this voodoo lily expressed se...

  7. Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22

    International Nuclear Information System (INIS)

    Two overlapping cDNA clones encoding human DNA topoisomerase II were identified by two independent methods. In one, a human cDNA library in phage λ was screened by hybridization with a mixed oligonucleotide probe encoding a stretch of seven amino acids found in yeast and Drosophila DNA topoisomerase II; in the other, a different human cDNA library in a λgt11 expression vector was screened for the expression of antigenic determinants that are recognized by rabbit antibodies specific to human DNA topoisomerase II. The entire coding sequences of the human DNA topoisomerase II gene were determined from these and several additional clones, identified through the use of the cloned human TOP2 gene sequences as probes. Hybridization between the cloned sequences and mRNA and genomic DNA indicates that the human enzyme is encoded by a single-copy gene. The location of the gene was mapped to chromosome 17q21-22 by in situ hybridization of a cloned fragment to metaphase chromosomes and by hybridization analysis with a panel of mouse-human hybrid cell lines, each retaining a subset of human chromosomes

  8. cDNA cloning,sequence analysis,and recombinant expression of akitonin beta,a C-type lectin-like protein from Agkistrodon acutus

    Institute of Scientific and Technical Information of China (English)

    Xiang-dong ZHA; Jing LIU; Kang-sen XU

    2004-01-01

    AIM: To clone the cDNA of a new member of snake venom C-type lectin-like proteins, to study its structurefunction relationships and to achieve its recombinant production. METHODS: PCR primers were designed based on the homology and cDNA was amplified by RT-PCR using total RNA from snake venom gland as the template.The PCR products were cloned into the plasmid pGEM-T and sequenced. The deduced protein sequence was analyzed with some bioinformatic programs. A recombinant expression plasmid was constructed using pBADTOPO as vector and transformed into E. coli TOP10 competent cells. RESULTS: A novel cDNA sequence encoding akitonin β was found and accepted by GenBank (accession number AF387100). Akitonin β consists of a typical carbohydrate recognition domain (CRD) of C-type lectins, and it is homologous with other snake venom C-type lectin-like proteins. It was predicted to be a platelet antagonist. Upon induction with arabinose rAkitonin β expressing in E coli was achieved at a high level (superior to 150 mg/L). The recombinant fusion protein exhibited inhibitory activities on rat platelet aggregation in vitro. CONCLUSION: A new member of snake venom C-type lectin-like proteins was discovered and characterized, and an efficient recombinant expression system was established for its production.

  9. Cloning, sequencing and expression analysis of cDNA encoding a constitutive heat shock protein 70 (HSC70) in Fenneropenaeus chinensis

    Institute of Scientific and Technical Information of China (English)

    JIAO Chuanzhen; WANG Zaizhao; LI Fuhua; ZHANG Chengsong; XIANG Jianhai

    2004-01-01

    The cDNA encoding hsc70 of Chinese shrimp Fenneropenaeus chinensis was cloned from hepatopancreas by RT-PCR based on its EST sequence. The full length cDNA of 2090 bp contained an open reading frame of 1956 nucleotides and partial 5′- and 3′-untranslated region(5′- and 3′-UTR). PCR amplification and sequencing analysis showed the existence of introns in the region of 1-547 bp, but they did not exist in the region of 548-2090 bp of hsc70 cDNA. When the deduced 652 amino acid sequence of HSC70 was compared with the members of HSP70 family from other organisms, the results showed 85.9% similarity with HSC71 from Oncorhynchus mykiss and HSC70 from Homo sapiens. It also exhibited 85.8% similarity with HSP70 from Mus musculu and 85.4% with HSC70 from Manduca sexta. Expression analysis showed that hsc70 mRNA was espressed constitutively in hepatopancreas, muscle, eyestalks, haemocytes, heart, ovary, intestine and gills in Fenneropenaeus chinensis. No difference could be detected on hsc70 mRNA level in muscle between heat-shocked and control animals.

  10. Generation and analysis of large-scale expressed sequence tags (ESTs from a full-length enriched cDNA library of porcine backfat tissue

    Directory of Open Access Journals (Sweden)

    Lee Hae-Young

    2006-02-01

    Full Text Available Abstract Background Genome research in farm animals will expand our basic knowledge of the genetic control of complex traits, and the results will be applied in the livestock industry to improve meat quality and productivity, as well as to reduce the incidence of disease. A combination of quantitative trait locus mapping and microarray analysis is a useful approach to reduce the overall effort needed to identify genes associated with quantitative traits of interest. Results We constructed a full-length enriched cDNA library from porcine backfat tissue. The estimated average size of the cDNA inserts was 1.7 kb, and the cDNA fullness ratio was 70%. In total, we deposited 16,110 high-quality sequences in the dbEST division of GenBank (accession numbers: DT319652-DT335761. For all the expressed sequence tags (ESTs, approximately 10.9 Mb of porcine sequence were generated with an average length of 674 bp per EST (range: 200–952 bp. Clustering and assembly of these ESTs resulted in a total of 5,008 unique sequences with 1,776 contigs (35.46% and 3,232 singleton (65.54% ESTs. From a total of 5,008 unique sequences, 3,154 (62.98% were similar to other sequences, and 1,854 (37.02% were identified as having no hit or low identity (Sus scrofa. Gene ontology (GO annotation of unique sequences showed that approximately 31.7, 32.3, and 30.8% were assigned molecular function, biological process, and cellular component GO terms, respectively. A total of 1,854 putative novel transcripts resulted after comparison and filtering with the TIGR SsGI; these included a large percentage of singletons (80.64% and a small proportion of contigs (13.36%. Conclusion The sequence data generated in this study will provide valuable information for studying expression profiles using EST-based microarrays and assist in the condensation of current pig TCs into clusters representing longer stretches of cDNA sequences. The isolation of genes expressed in backfat tissue is the

  11. Predicted protein sequence of the murine I-E-beta S-polypeptide chain from cDNA and genomic clones.

    OpenAIRE

    Mengle-Gaw, L; McDevitt, H O

    1985-01-01

    Ia antigens are polymorphic cell-surface glycoprotein complexes, encoded within the I region of the mouse major histocompatibility complex, that control the ability of the organism to mount effective antigen-specific immune responses. We have isolated and determined the nucleotide sequences of cDNA and genomic clones for the I-E beta s gene and we present the predicted protein sequence for most of the E beta s polypeptide chain. The E beta s polypeptide shows 95% protein homology to the other...

  12. Frameshift mutations in infectious cDNA clones of Citrus tristeza virus: a strategy to minimize the toxicity of viral sequences to Escherichia coli

    International Nuclear Information System (INIS)

    The advent of reverse genetics revolutionized the study of positive-stranded RNA viruses that were amenable for cloning as cDNAs into high-copy-number plasmids of Escherichia coli. However, some viruses are inherently refractory to cloning in high-copy-number plasmids due to toxicity of viral sequences to E. coli. We report a strategy that is a compromise between infectivity of the RNA transcripts and toxicity to E. coli effected by introducing frameshift mutations into 'slippery sequences' near the viral 'toxicity sequences' in the viral cDNA. Citrus tristeza virus (CTV) has cDNA sequences that are toxic to E. coli. The original full-length infectious cDNA of CTV and a derivative replicon, CTV-ΔCla, cloned into pUC119, resulted in unusually limited E. coli growth. However, upon sequencing of these cDNAs, an additional uridinylate (U) was found in a stretch of U's between nts 3726 and 3731 that resulted in a change to a reading frame with a stop codon at nt 3734. Yet, in vitro produced RNA transcripts from these clones infected protoplasts, and the resulting progeny virus was repaired. Correction of the frameshift mutation in the CTV cDNA constructs resulted in increased infectivity of in vitro produced RNA transcripts, but also caused a substantial increase of toxicity to E. coli, now requiring 3 days to develop visible colonies. Frameshift mutations created in sequences not suspected to facilitate reading frame shifting and silent mutations introduced into oligo(U) regions resulted in complete loss of infectivity, suggesting that the oligo(U) region facilitated the repair of the frameshift mutation. Additional frameshift mutations introduced into other oligo(U) regions also resulted in transcripts with reduced infectivity similarly to the original clones with the +1 insertion. However, only the frameshift mutations introduced into oligo(U) regions that were near and before the toxicity region improved growth and stability in E. coli. These data demonstrate that

  13. Sequencing analysis of 20,000 full-length cDNA clones from cassava reveals lineage specific expansions in gene families related to stress response

    Directory of Open Access Journals (Sweden)

    Sakaki Yoshiyuki

    2007-12-01

    Full Text Available Abstract Background Cassava, an allotetraploid known for its remarkable tolerance to abiotic stresses is an important source of energy for humans and animals and a raw material for many industrial processes. A full-length cDNA library of cassava plants under normal, heat, drought, aluminum and post harvest physiological deterioration conditions was built; 19968 clones were sequence-characterized using expressed sequence tags (ESTs. Results The ESTs were assembled into 6355 contigs and 9026 singletons that were further grouped into 10577 scaffolds; we found 4621 new cassava sequences and 1521 sequences with no significant similarity to plant protein databases. Transcripts of 7796 distinct genes were captured and we were able to assign a functional classification to 78% of them while finding more than half of the enzymes annotated in metabolic pathways in Arabidopsis. The annotation of sequences that were not paired to transcripts of other species included many stress-related functional categories showing that our library is enriched with stress-induced genes. Finally, we detected 230 putative gene duplications that include key enzymes in reactive oxygen species signaling pathways and could play a role in cassava stress response features. Conclusion The cassava full-length cDNA library here presented contains transcripts of genes involved in stress response as well as genes important for different areas of cassava research. This library will be an important resource for gene discovery, characterization and cloning; in the near future it will aid the annotation of the cassava genome.

  14. Sequencing analysis of 20,000 full-length cDNA clones from cassava reveals lineage specific expansions in gene families related to stress response

    Science.gov (United States)

    Sakurai, Tetsuya; Plata, Germán; Rodríguez-Zapata, Fausto; Seki, Motoaki; Salcedo, Andrés; Toyoda, Atsushi; Ishiwata, Atsushi; Tohme, Joe; Sakaki, Yoshiyuki; Shinozaki, Kazuo; Ishitani, Manabu

    2007-01-01

    Background Cassava, an allotetraploid known for its remarkable tolerance to abiotic stresses is an important source of energy for humans and animals and a raw material for many industrial processes. A full-length cDNA library of cassava plants under normal, heat, drought, aluminum and post harvest physiological deterioration conditions was built; 19968 clones were sequence-characterized using expressed sequence tags (ESTs). Results The ESTs were assembled into 6355 contigs and 9026 singletons that were further grouped into 10577 scaffolds; we found 4621 new cassava sequences and 1521 sequences with no significant similarity to plant protein databases. Transcripts of 7796 distinct genes were captured and we were able to assign a functional classification to 78% of them while finding more than half of the enzymes annotated in metabolic pathways in Arabidopsis. The annotation of sequences that were not paired to transcripts of other species included many stress-related functional categories showing that our library is enriched with stress-induced genes. Finally, we detected 230 putative gene duplications that include key enzymes in reactive oxygen species signaling pathways and could play a role in cassava stress response features. Conclusion The cassava full-length cDNA library here presented contains transcripts of genes involved in stress response as well as genes important for different areas of cassava research. This library will be an important resource for gene discovery, characterization and cloning; in the near future it will aid the annotation of the cassava genome. PMID:18096061

  15. Cloning and sequencing of the cDNA encoding a core protein of the paired helical filament of Alzheimer's disease: Identification as the microtubule-associated protein tau

    International Nuclear Information System (INIS)

    Screening of cDNA libraries prepared from the frontal cortex of an Alzheimer's disease patient and from fetal human brain has led to isolation of the cDNA for a core protein of the paired helical filament of Alzheimer's disease. The partial amino acid sequence of this core protein was used to design synthetic oligonucleotide probes. The cDNA encodes a protein of 352 amino acids that contains a characteristic amino acid repeat in its carboxyl-terminal half. This protein is highly homologous to the sequence of the mouse microtubule-associated protein tau and thus constitutes the human equivalent of mouse tau. RNA blot analysis indicates the presence of two major transcripts, 6 and 2 kilobases long, with a wide distribution in normal human brain. Tau protein mRNAs were found in normal amounts in the frontal cortex from patients with Alzheimer's disease. The proof that at least part of tau protein forms a component of the paired helical filament core opens the way to understanding the mode of formation of paired helical filaments and thus, ultimately, the pathogenesis of Alzheimer's disease

  16. Global Identification of Significantly Expressed Genes in Developing Endosperm of Rice by Expression Sequence Tags and cDNA Array Approaches

    Institute of Scientific and Technical Information of China (English)

    Qichao Tu; Haitao Dong; Haigen Yao; Yongqi Fang; Cheng'en Dai; Hongmei Luo; Jian Yao; Dong Zhao; Debao Li

    2008-01-01

    Rice endosperm plays a very important role in seedling germination and determines the qualities of fice grain.Although studies on specific gene categories in endosperm have been carried out,global view of gene expression at a transcription level in rice endosperm is still limited.To gain a better understanding of the global and tissue-specific gene expression profiles in rice endosperm,a cDNA library from rice endosperm of immature seeds was sequenced.A cDNA array was constructed based on the tentative unique transcripts derived from expression sequence tag (EST) assembling results and then hybridized with cONAs from five different tissues or organs including endosperm,embryo,leaf,stem and root of rice.Significant redundancy was found for genes encoding prolamin,glutelin,allergen,and starch synthesis proteins,accounting for~34% of the total ESTs obtained.The cDNA array revealed 87 significantly expressed genes in endosperm compared with the other four organs or tissues.These genes included 13 prolamin family proteins,17 glutelin family proteins,12 binding proteins,nine catalytic proteins and four ribosomal proteins,indicating a complicated biological processing in rice endosperm.In addition,Northern verification of 1,4-alpha-glucan branching enzyme detected two isoforms in rice endosperm,the larger one of which only existed in endosperm.

  17. Isolation and sequence of a cDNA clone for human tyrosinase that maps at the mouse c-albino locus

    International Nuclear Information System (INIS)

    Screening of a λgt11 human melanocyte cDNA library with antibodies against hamster tyrosinase resulted in the isolation of 16 clones. The cDNA inserts from 13 of the 16 clones cross-hybridized with each other, indicating that they were form related mRNA species. One of the cDNA clones, Pmel34, detected one mRNA species with an approximate length of 2.4 kilobases that was expressed preferentially in normal and malignant melanocytes but not in other cell types. The amino acid sequence deduced from the nucleotide sequence showed that the putative human tyrosinase is composed of 548 amino acids with a molecular weight of 62,610. The deduced protein contains glycosylation sites and histidine-rich sites that could be used for copper binding. Southern blot analysis of DNA derived from newborn mice carrying lethal albino deletion mutations revealed that Pmel34 maps near or at the c-albino locus, the position of the structural gene for tyrosinase

  18. Cloning and sequence analysis of a full-length cDNA of SmPP1cb encoding turbot protein phosphatase 1 beta catalytic subunit

    Science.gov (United States)

    Qi, Fei; Guo, Huarong; Wang, Jian

    2008-02-01

    Reversible protein phosphorylation, catalyzed by protein kinases and phosphatases, is an important and versatile mechanism by which eukaryotic cells regulate almost all the signaling processes. Protein phosphatase 1 (PP1) is the first and well-characterized member of the protein serine/threonine phosphatase family. In the present study, a full-length cDNA encoding the beta isoform of the catalytic subunit of protein phosphatase 1(PP1cb), was for the first time isolated and sequenced from the skin tissue of flatfish turbot Scophthalmus maximus, designated SmPP1cb, by the rapid amplification of cDNA ends (RACE) technique. The cDNA sequence of SmPP1cb we obtained contains a 984 bp open reading frame (ORF), flanked by a complete 39 bp 5' untranslated region and 462 bp 3' untranslated region. The ORF encodes a putative 327 amino acid protein, and the N-terminal section of this protein is highly acidic, Met-Ala-Glu-Gly-Glu-Leu-Asp-Val-Asp, a common feature for PP1 catalytic subunit but absent in protein phosphatase 2B (PP2B). And its calculated molecular mass is 37 193 Da and pI 5.8. Sequence analysis indicated that, SmPP1cb is extremely conserved in both amino acid and nucleotide acid levels compared with the PP1cb of other vertebrates and invertebrates, and its Kozak motif contained in the 5'UTR around ATG start codon is GXXAXXGXX ATGG, which is different from mammalian in two positions A-6 and G-3, indicating the possibility of different initiation of translation in turbot, and also the 3'UTR of SmPP1cb is highly diverse in the sequence similarity and length compared with other animals, especially zebrafish. The cloning and sequencing of SmPP1cb gene lays a good foundation for the future work on the biological functions of PP1 in the flatfish turbot.

  19. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jakob;

    2007-01-01

    BACKGROUND: Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from...... public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. RESULTS: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which...... with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential...

  20. Deep sequencing reveals stepwise mutation acquisition in paroxysmal nocturnal hemoglobinuria

    Science.gov (United States)

    Shen, Wenyi; Clemente, Michael J.; Hosono, Naoko; Yoshida, Kenichi; Przychodzen, Bartlomiej; Yoshizato, Tetsuichi; Shiraishi, Yuichi; Miyano, Satoru; Ogawa, Seishi; Maciejewski, Jaroslaw P.; Makishima, Hideki

    2014-01-01

    Paroxysmal nocturnal hemoglobinuria (PNH) is a nonmalignant clonal disease of hematopoietic stem cells that is associated with hemolysis, marrow failure, and thrombophilia. PNH has been considered a monogenic disease that results from somatic mutations in the gene encoding PIGA, which is required for biosynthesis of glycosylphosphatidylinisotol-anchored (GPI-anchored) proteins. The loss of certain GPI-anchored proteins is hypothesized to provide the mutant clone with an extrinsic growth advantage, but some features of PNH argue that there are intrinsic drivers of clonal expansion. Here, we performed whole-exome sequencing of paired PNH+ and PNH– fractions on samples taken from 12 patients as well as targeted deep sequencing of an additional 36 PNH patients. We identified additional somatic mutations that resulted in a complex hierarchical clonal architecture, similar to that observed in myeloid neoplasms. In addition to mutations in PIGA, mutations were found in genes known to be involved in myeloid neoplasm pathogenesis, including TET2, SUZ12, U2AF1, and JAK2. Clonal analysis indicated that these additional mutations arose either as a subclone within the PIGA-mutant population, or prior to PIGA mutation. Together, our data indicate that in addition to PIGA mutations, accessory genetic events are frequent in PNH, suggesting a stepwise clonal evolution derived from a singular stem cell clone. PMID:25244093

  1. Creation of Functional Viruses from Non-Functional cDNA Clones Obtained from an RNA Virus Population by the Use of Ancestral Reconstruction

    DEFF Research Database (Denmark)

    Fahnøe, Ulrik; Pedersen, Anders Gorm; Dräger, Carolin;

    2015-01-01

    infectious RNA transcripts. Full length sequencing of cDNA clones and deep sequencing of the parental population identified substitutions important for the observed phenotypes. The investigated cDNA clones were furthermore used as the basis for inferring the sequence of functional viruses. Since each unique......-fitness, functional cDNAs and may also pose problems for sequence-based analysis of viral evolution. To address these challenges we have performed a study of the evolution of classical swine fever virus (CSFV) using deep sequencing and analysis of 84 full-length cDNA clones, each representing individual genomes from...... clone must necessarily be the descendant of a functional ancestor, we hypothesized that it should be possible to produce functional clones by reconstructing ancestral sequences. To test this we used phylogenetic methods to infer two ancestral sequences, which were then reconstructed as cDNA clones...

  2. Characterization of small interfering RNAs derived from the geminivirus/betasatellite complex using deep sequencing.

    Directory of Open Access Journals (Sweden)

    Xiuling Yang

    Full Text Available BACKGROUND: Small RNA (sRNA-guided RNA silencing is a critical antiviral defense mechanism employed by a variety of eukaryotic organisms. Although the induction of RNA silencing by bipartite and monopartite begomoviruses has been described in plants, the nature of begomovirus/betasatellite complexes remains undefined. METHODOLOGY/PRINCIPAL FINDINGS: Solanum lycopersicum plant leaves systemically infected with Tomato yellow leaf curl China virus (TYLCCNV alone or together with its associated betasatellite (TYLCCNB, and Nicotiana benthamiana plant leaves systemically infected with TYLCCNV alone, or together with TYLCCNB or with mutant TYLCCNB were harvested for RNA extraction; sRNA cDNA libraries were then constructed and submitted to Solexa-based deep sequencing. Both sense and anti-sense TYLCCNV and TYLCCNB-derived sRNAs (V-sRNAs and S-sRNAs accumulated preferentially as 22 nucleotide species in infected S. lycopersicum and N. benthamiana plants. High resolution mapping of V-sRNAs and S-sRNAs revealed heterogeneous distribution of V-sRNA and S-sRNA sequences across the TYLCCNV and TYLCCNB genomes. In TYLCCNV-infected S. lycopersicum or N. benthamiana and TYLCCNV and βC1-mutant TYLCCNB co-infected N. benthamiana plants, the primary TYLCCNV targets were AV2 and the 5' terminus of AV1. In TYLCCNV and betasatellite-infected plants, the number of V-sRNAs targeting this region decreased and the production of V-sRNAs increased corresponding to the overlapping regions of AC2 and AC3, as well as the 3' terminal of AC1. βC1 is the primary determinant mediating symptom induction and also the primary silencing target of the TYLCCNB genome even in its mutated form. CONCLUSIONS/SIGNIFICANCE: We report the first high-resolution sRNA map for a monopartite begomovirus and its associated betasatellite using Solexa-based deep sequencing. Our results suggest that viral transcript might act as RDR substrates resulting in dsRNA and secondary siRNA production. In

  3. cDNA: 53887 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.196480 Mus musculus adult male testis cDNA, RIKEN full-length enriched library, ... uct:DNA Segment, Chr 15 Massachusetts Institute of Technology ... 260, full insert sequence gnl|UG|Mm#S10837764 AK07 ...

  4. cDNA: 53885 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.196480 Mus musculus adult male testis cDNA, RIKEN full-length enriched library, ... uct:DNA Segment, Chr 15 Massachusetts Institute of Technology ... 260, full insert sequence gnl|UG|Mm#S10837547 AK07 ...

  5. cDNA: 41587 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.30133 Mus musculus 18-day embryo whole body cDNA, RIKEN full-length enriched li ... brary, clone:1110004A14 product:ethanol ... induced 6, full insert sequence gnl|UG|Mm#S9085518 ...

  6. cDNA: 49729 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.286963 Mus musculus 10 days lactation, adult female mammary gland cDNA, RIKEN f ... d library, clone:D730027I09 product:similar to LAK-4P ... [Homo sapiens], full insert sequence gnl|UG|Mm#S10 ...

  7. cDNA: 40377 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.154312 Mus musculus 0 day neonate kidney cDNA, RIKEN full-length enriched libra ... ry, clone:D630023P19 product:HYPERTENSION ... RELATED PROTEIN 1, full insert sequence gnl|UG|Mm# ...

  8. cDNA: 45098 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.334199 Mus musculus adult male aorta and vein cDNA, RIKEN full-length enriched ... library, clone:A530074J19 product:SA rat hypertension -associated homolog, full insert sequence gnl|UG|Mm ...

  9. cDNA: 40378 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.154312 Mus musculus 7 days embryo whole body cDNA, RIKEN full-length enriched l ... ibrary, clone:C430046A10 product:HYPERTENSION ... RELATED PROTEIN 1, full insert sequence gnl|UG|Mm# ...

  10. Identification and characterization of a novel legume-like lectin cDNA sequence from the red marine algae Gracilaria fisheri

    Indian Academy of Sciences (India)

    Sukanya Suttisrisung; Saengchan Senapin; Boonsirm Withyachumnarnkul; Kanokpan Wongprasert

    2011-12-01

    A legume-type lectin (L-lectin) gene of the red algae Gracilaria fisheri (GFL) was cloned by rapid amplification of cDNA ends (RACE). The full-length cDNA of GFL was 1714 bp and contained a 1542 bp open reading frame encoding 513 amino acids with a predicted molecular mass of 56.5 kDa. Analysis of the putative amino acid sequence with NCBI-BLAST revealed a high homology (30–68%) with legume-type lectins (L-lectin) from Griffithsia japonica, Clavispora lusitaniae, Acyrthosiphon pisum, Tetraodon nigroviridis and Xenopus tropicalis. Phylogenetic relationship analysis showed the highest sequence identity to a glycoprotein of the red algae Griffithsia japonica (68%) (GenBank number AAM93989). Conserved Domain Database analysis detected an N-terminal carbohydrate recognition domain (CRD), the characteristic of L-lectins, which contained two sugar binding sites and a metal binding site. The secondary structure prediction of GFL showed a -sheet structure, connected with turn and coil. The most abundant structural element of GFL was the random coil, while the -helixes were distributed at the N- and C-termini, and 21 -sheets were distributed in the CRD. Computer analysis of three-dimensional structure showed a common feature of L-lectins of GFL, which included an overall globular shape that was composed of a -sandwich of two anti-parallel -sheets, monosaccharide binding sites, were on the top of the structure and in proximity with a metal binding site. Northern blot analysis using a DIG-labelled probe derived from a partial GFL sequence revealed a hybridization signal of ∼1.7 kb consistent with the length of the full-length GFL cDNA identified by RACE. No detectable band was observed from control total RNA extracted from filamentous green algae.

  11. Guanine nucleotide-binding proteins that enhance choleragen ADP-ribosyltransferase activity: nucleotide and deduced amino acid sequence of an ADP-ribosylation factor cDNA.

    OpenAIRE

    Price, S R; Nightingale, M.; Tsai, S C; Williamson, K. C.; Adamik, R; H. C. Chen; Moss, J; M. Vaughan

    1988-01-01

    Three (two soluble and one membrane) guanine nucleotide-binding proteins (G proteins) that enhance ADP-ribosylation of the Gs alpha stimulatory subunit of the adenylyl cyclase (EC 4.6.1.1) complex by choleragen have recently been purified from bovine brain. To further define the structure and function of these ADP-ribosylation factors (ARFs), we isolated a cDNA clone (lambda ARF2B) from a bovine retinal library by screening with a mixed heptadecanucleotide probe whose sequence was based on th...

  12. Deep Sequencing of RNA from Ancient Maize Kernels

    OpenAIRE

    Fordyce, Sarah Louise; Avila Arcos, Maria del Carmen; Rasmussen, Morten; CAPPELLINI, Enrico; Romero-Navarro, J. Alberto; Wales, Nathan; Alquezar Planas, David Eugenio; Penfield, Steven; Brown, Terence A.; Vielle-Calzada, Jean-Philippe; Montiel, Rafael; Jørgensen, Tina; Odegaard, Nancy; Jacobs, Michael; Arriaza, Bernardo

    2013-01-01

    The characterization of biomolecules from ancient samples can shed otherwise unobtainable insights into the past. Despite the fundamental role of transcriptomal change in evolution, the potential of ancient RNA remains unexploited - perhaps due to dogma associated with the fragility of RNA. We hypothesize that seeds offer a plausible refuge for long-term RNA survival, due to the fundamental role of RNA during seed germination. Using RNA-Seq on cDNA synthesized from nucleic acid extracts, we v...

  13. Deep-Sea, Deep-Sequencing: Metabarcoding Extracellular DNA from Sediments of Marine Canyons.

    Directory of Open Access Journals (Sweden)

    Magdalena Guardiola

    Full Text Available Marine sediments are home to one of the richest species pools on Earth, but logistics and a dearth of taxonomic work-force hinders the knowledge of their biodiversity. We characterized α- and β-diversity of deep-sea assemblages from submarine canyons in the western Mediterranean using an environmental DNA metabarcoding. We used a new primer set targeting a short eukaryotic 18S sequence (ca. 110 bp. We applied a protocol designed to obtain extractions enriched in extracellular DNA from replicated sediment corers. With this strategy we captured information from DNA (local or deposited from the water column that persists adsorbed to inorganic particles and buffered short-term spatial and temporal heterogeneity. We analysed replicated samples from 20 localities including 2 deep-sea canyons, 1 shallower canal, and two open slopes (depth range 100-2,250 m. We identified 1,629 MOTUs, among which the dominant groups were Metazoa (with representatives of 19 phyla, Alveolata, Stramenopiles, and Rhizaria. There was a marked small-scale heterogeneity as shown by differences in replicates within corers and within localities. The spatial variability between canyons was significant, as was the depth component in one of the canyons where it was tested. Likewise, the composition of the first layer (1 cm of sediment was significantly different from deeper layers. We found that qualitative (presence-absence and quantitative (relative number of reads data showed consistent trends of differentiation between samples and geographic areas. The subset of exclusively benthic MOTUs showed similar patterns of β-diversity and community structure as the whole dataset. Separate analyses of the main metazoan phyla (in number of MOTUs showed some differences in distribution attributable to different lifestyles. Our results highlight the differentiation that can be found even between geographically close assemblages, and sets the ground for future monitoring and conservation

  14. Deep-Sea, Deep-Sequencing: Metabarcoding Extracellular DNA from Sediments of Marine Canyons.

    Science.gov (United States)

    Guardiola, Magdalena; Uriz, María Jesús; Taberlet, Pierre; Coissac, Eric; Wangensteen, Owen Simon; Turon, Xavier

    2015-01-01

    Marine sediments are home to one of the richest species pools on Earth, but logistics and a dearth of taxonomic work-force hinders the knowledge of their biodiversity. We characterized α- and β-diversity of deep-sea assemblages from submarine canyons in the western Mediterranean using an environmental DNA metabarcoding. We used a new primer set targeting a short eukaryotic 18S sequence (ca. 110 bp). We applied a protocol designed to obtain extractions enriched in extracellular DNA from replicated sediment corers. With this strategy we captured information from DNA (local or deposited from the water column) that persists adsorbed to inorganic particles and buffered short-term spatial and temporal heterogeneity. We analysed replicated samples from 20 localities including 2 deep-sea canyons, 1 shallower canal, and two open slopes (depth range 100-2,250 m). We identified 1,629 MOTUs, among which the dominant groups were Metazoa (with representatives of 19 phyla), Alveolata, Stramenopiles, and Rhizaria. There was a marked small-scale heterogeneity as shown by differences in replicates within corers and within localities. The spatial variability between canyons was significant, as was the depth component in one of the canyons where it was tested. Likewise, the composition of the first layer (1 cm) of sediment was significantly different from deeper layers. We found that qualitative (presence-absence) and quantitative (relative number of reads) data showed consistent trends of differentiation between samples and geographic areas. The subset of exclusively benthic MOTUs showed similar patterns of β-diversity and community structure as the whole dataset. Separate analyses of the main metazoan phyla (in number of MOTUs) showed some differences in distribution attributable to different lifestyles. Our results highlight the differentiation that can be found even between geographically close assemblages, and sets the ground for future monitoring and conservation efforts on

  15. Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

    Science.gov (United States)

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  16. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Directory of Open Access Journals (Sweden)

    Changqing Liu

    2013-05-01

    Full Text Available In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.

  17. Structural of the class II enzyme of human liver alcohol dehydrogenase: combined cDNA and protein sequence determination of the π subunit

    International Nuclear Information System (INIS)

    The class II enzyme of human liver alcohol dehydrogenase was isolated, carboxymethylated, and cleaved with CNBr and proteolytic enzymes. Sequence analysis of peptides established structures corresponding to the π subunit. Two segments from the C-terminal region unique to π were selected for synthesis of oligodeoxyribonucleotide probes to screen a human liver cDNA library constructed in plasmid pT4. Sequence analysis of two identical hybridization-positive clones with cDNA inserts of about 2000 nucleotides gave the entire coding region of the π subunit, a 61-nucleotide 5' noncoding region and a 741-nucleotide 3' noncoding region containing four possible polyadenylation sites. Translation of the coding region yields a 391-residue polypeptide, which in all regions except the C-terminal segment corresponds to the protein structure as determined directly by peptide analysis. With the class I numbering system, the exception concerns a residue exchange at position 368, the actual C-terminus which is Phe-374 by peptide data but a 12 residue extension by cDNA data, and possibly two further residue exchanges at positions 303 and 312. The size difference might indicate the existence of posttranslational modifications of the mature protein or, in combination with the residue exchanges, the existence of polymorphism at the locus for class II subunits. The π subunit analyzed directly results in a 379-residue polypeptide and is the only class II size thus far known to occur in the mature protein. Comparison of the π structure with those of the class I subunits (α, β, and γ) reveals a homology with extensive differences. Large variations in segments affecting relationships at the active site and the area of subunit interactions account for the significant alterations of enzymatic specificities and other properties that differentiate class II from class I enzymes

  18. Sequencing and rescuing a highly virulent classical swine fever virus: Chinese strain cF114 from a full-length cDNA clone

    Institute of Scientific and Technical Information of China (English)

    2003-01-01

    The complete nucleotide sequence of classical swine fever virus (CSFV) strain cF114 (F114 strain propa- gated on PK-15 cells) was cloned by RT-PCR. The analyses of nucleotide and amino acids identity between cF114 and F114, Brescia, Alfort or C strain were 99.41%, 96.80%, 86.03%, 95.70% and 99.28%, 98.54%, 93.33%, 97.41% re- spectively. The cDNA fragments with correct sequence were ligated into a full-length cDNA and inserted into pMC18 plasmid (pMC12297). A full-length infectious viral RNA was synthesized by runoff transcription and transfected to PK15 cells. Viruses were recovered from transfected cells which wese titrated on PK-15 cells by endpoint dilution and indirect immunofluorescence with a CSFV-specific monoclonal antibody. The antigenicity and replication kinetics of the plasmid-derived virus (vM12297) were similar to the parental virus in vitro. The E01 or E2 gene was replaced with the genes from strain C and the pM/CE01 and pM/CE2 with chimeric full-length cDNA of cF114 were generated. The infectious viruses were obtained from pM/CE01 and pM/CE2. Both of the chimeric viruses can infect PK-15, SK- 6 and primary testicle cell of swine. The chimeric viruses can grow to a titer of 8×105 F-PFU/mL. These results are very important for understanding the genes related to the CSFV propagation and pathogenesis.

  19. Comparative analysis of gene expression at early seedling stage between a rice hybrid and its parents using a cDNA microarray of 9198 uni-sequences

    Institute of Scientific and Technical Information of China (English)

    HUANG; Yi; LI; Lihua; CHEN; Ying; LI; Xianghua; XU; Caiguo; WANG; Shiping; ZHANG; Qifa

    2006-01-01

    Using a cDNA microarray consisting of 9198 expressed sequence tags, we surveyed the gene expression profiles in shoots and roots of a rice hybrid, Liangyoupei 9 and its parents Peiai 64s and 93-11 at 72 h after germination. A total of 8587 sequences had detectable signals in both shoots and roots of the three genotypes. A total of 1571 sequences exhibited significant (P<0.01) expression differences in shoots or roots among the three genotypes, of which 121 showed expression polymorphisms in both shoots and roots, and 870 revealed significant expression differences between the hybrid and one of the parents. The expression polymorphism of the sequences was associated with the functional categories of the sequences. They occurred more frequently in categories of carbohydrate, energy and lipid metabolisms and stress response than expected, while less frequently in categories of amino acid metabolism, transcription and translation regulation, and signal transduction. A total of 214 sequences exhibited significant (P<0.05) mid-parent heterosis in expression, of which 117 had homology to genes with known functions, assigned in the categories of basic metabolism, genetic information processing, cell growth and death, signal transduction, transportation and stress response. The results may provide useful information for exploring the relationship between gene expression polymorphism and phenotypic variation, and for characterizing the molecular mechanism of seedling development and heterosis in rice.

  20. Identification of mutations leading to the Lesch-Nyhan syndrome by automated direct DNA sequencing of in vitro amplified cDNA

    International Nuclear Information System (INIS)

    The Lesch-Nyhan (LN) syndrome is a severe X chromosome-linked disease that results from a deficiency of the purine salvage enzyme hypoxanthine phosphoribosyltransferase (HPRT). The mutations leading to the disease are heterogeneous and frequently arise as de novo events. The authors have identified nucleotide alterations in 15 independently arising HPRT-deficiency cases by direct DNA sequencing of in vitro amplified HPRT cDNA. They also demonstrate that the direct DNA sequence analysis can be automated, further simplifying the detection of new mutations at this locus. The mutations include DNA base substitutions, small DNA deletions, a single DNA base insertion, and errors in RNA splicing. The application of these procedures allows DNA diagnosis and carrier identification by the direct detection of the mutant alleles within individual families affected by LN

  1. Identification of mutations leading to the Lesch-Nyhan syndrome by automated direct DNA sequencing of in vitro amplified cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Gibbs, R.A. (Baylor College of Medicine, Houston, TX (USA)); Nguyen, Phinga (Howard Hughes Medical Institute, Houston, TX (USA)); McBride, L.J.; Koepf, S.M. (Applied Biosystems, Foster City, CA (USA)); Caskey, C.T. (Baylor College of Medicine, Houston, TX (USA) Howard Hughes Medical Institute, Houston, TX (USA))

    1989-03-01

    The Lesch-Nyhan (LN) syndrome is a severe X chromosome-linked disease that results from a deficiency of the purine salvage enzyme hypoxanthine phosphoribosyltransferase (HPRT). The mutations leading to the disease are heterogeneous and frequently arise as de novo events. The authors have identified nucleotide alterations in 15 independently arising HPRT-deficiency cases by direct DNA sequencing of in vitro amplified HPRT cDNA. They also demonstrate that the direct DNA sequence analysis can be automated, further simplifying the detection of new mutations at this locus. The mutations include DNA base substitutions, small DNA deletions, a single DNA base insertion, and errors in RNA splicing. The application of these procedures allows DNA diagnosis and carrier identification by the direct detection of the mutant alleles within individual families affected by LN.

  2. Transcription profiling of the model cyanobacterium Synechococcus sp. strain PCC 7002 by NextGen (SOLiD™ Sequencing of cDNA

    Directory of Open Access Journals (Sweden)

    Marcus eLudwig

    2011-03-01

    Full Text Available The genome of the unicellular, euryhaline cyanobacterium Synechococcus sp. PCC 7002 encodes about 3200 proteins. Transcripts were detected for nearly all annotated open reading frames by a global transcriptomic analysis by Next-Generation (SOLiDTM sequencing of cDNA. In the cDNA samples sequenced, ~90% of the mapped sequences were derived from the 16S and 23S ribosomal RNAs and ~10% of the sequences were derived from mRNAs. In cells grown photoautotrophically under standard conditions (38 °C, 1% (v/v CO2 in air, 250 µmol photons m-2 s-1, the highest transcript levels (up to 2% of the total mRNA for the most abundantly transcribed genes (e. g., cpcAB, psbA, psaA were generally derived from genes encoding structural components of the photosynthetic apparatus. High light exposure for one hour caused changes in transcript levels for genes encoding proteins of the photosynthetic apparatus, Type-1 NADH dehydrogenase complex and ATP synthase, whereas dark incubation for one hour resulted in a global decrease in transcript levels for photosynthesis-related genes and an increase in transcript levels for genes involved in carbohydrate degradation. Transcript levels for pyruvate kinase and the pyruvate dehydrogenase complex decreased sharply in cells incubated in the dark. Under dark anoxic (fermentative conditions, transcript changes indicated a global decrease in transcripts for respiratory proteins and suggested that cells employ an alternative phosphoenolpyruvate degradation pathway via phosphoenolpyruvate synthase (ppsA and the pyruvate:ferredoxin oxidoreductase (nifJ. Finally, the data suggested that an apparent operon involved in tetrapyrrole biosynthesis and fatty acid desaturation, acsF2-ho2-hemN2-desF, may be regulated by oxygen concentration.

  3. Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

    Directory of Open Access Journals (Sweden)

    Hayashizaki Yoshihide

    2009-06-01

    Full Text Available Abstract Background Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals. Results As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues is under adaptive selection. Conclusion We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the

  4. Construction of full-length cDNA library and development of EST-derived simple sequence repeat (EST-SSR) markers in Senecio scandens.

    Science.gov (United States)

    Qian, Gang; Ping, Junjiao; Lu, Jian; Zhang, Zhen; Wang, Lei; Xu, Delin

    2014-12-01

    Senecio scandens Buch.-Ham. ex D. Don (Compositae) is a crucial source of Chinese traditional medicine with antibacterial properties. We constructed a cDNA library and obtained expressed sequence tags (ESTs) to show the distribution of gene ontology annotations for mRNAs, using an individual plant with superior antibacterial characteristics. Analysis of comparative genomics indicates that the putative uncharacterized proteins (21.07%) might be derived from "molecular function unknown" clones or rare transcripts. Furthermore, the Compositae had high cross-species transferability of EST-derived simple sequence repeats (EST-SSR), based on valid amplifications of 206 primer pairs developed from the newly assembled expressed sequence tag sequences in Artemisia annua L. Among those EST-SSR markers, 52 primers showed polymorphic amplifications between individuals with contrasting diverse antibacterial traits. Our sequence data and molecular markers will be cost-effective tools for further studies such as genome annotation, molecular breeding, and novel transcript profiles within Compositae species. PMID:25007751

  5. Human glutamate pyruvate transaminase (GPT): Localization to 8q24.3, cDNA and genomic sequences, and polymorphic sites

    Energy Technology Data Exchange (ETDEWEB)

    Sohocki, M.M.; Sullivan, L.S.; Daiger, S.P. [Univ. of Texas Health Science Center, Houston, TX (United States)] [and others

    1997-03-01

    Two frequent protein variants of glutamate pyruvate transaminase (GPT) (E.C.2.6.1.2) have been used as genetic markers in humans for more than two decades, although chromosomal mapping of the GPT locus in the 1980s produced conflicting results. To resolve this conflict and develop useful DNA markers for this gene, we isolated and characterized cDNA and genomic clones of GPT. We have definitively mapped human GPT to the terminus of 8q using several methods. First, two cosmids shown to contain the GPT sequence were derived from a chromosome 8-specific library. Second, by fluorescence in situ hybridization, we mapped the cosmid containing the human GPT gene to chromosome band 8q24.3. Third, we mapped the rat gpt cDNA to the syntenic region of rat chromosome 7. Finally, PCR primers specific to human GPT amplify sequences contained within a {open_quotes}half-YAC{close_quotes} from the long arm of chromosome 8, that is, a YAC containing the 8q telomere. The human GPT genomic sequence spans 2.7 kb and consists of 11 exons, ranging in size from 79 to 243 bp. The exonic sequence encodes a protein of 495 amino acids that is nearly identical to the previously reported protein sequence of human GPT-1. The two polymorphic GPT isozymes are the result of a nucleotide substitution in codon 14. In addition, a cosmid containing the GPT sequence also contains a previously unmapped, polymorphic microsatellite sequence, D8S421. The cloned GPT gene and associated polymorphisms will be useful for linkage and physical mapping of disease loci that map to the terminus of 8q, including atypical vitelliform macular dystrophy (VMD1) and epidermolysis bullosa simplex, type Ogna (EBS1). In addition, this will be a useful system for characterizing the telomeric region of 8q. Finally, determination of the molecular basis of the GPT isozyme variants will permit PCR-based detection of this world-wide polymorphism. 22 refs., 3 figs.

  6. Characterization of the venom from the Australian scorpion Urodacus yaschenkoi: Molecular mass analysis of components, cDNA sequences and peptides with antimicrobial activity.

    Science.gov (United States)

    Luna-Ramírez, Karen; Quintero-Hernández, Veronica; Vargas-Jaimes, Leonel; Batista, Cesar V F; Winkel, Kenneth D; Possani, Lourival D

    2013-03-01

    The Urodacidae scorpions are the most widely distributed of the four families in Australia and represent half of the species in the continent, yet their venoms remain largely unstudied. This communication reports the first results of a proteome analysis of the venom of the scorpion Urodacus yaschenkoi performed by mass fingerprinting, after high performance liquid chromatography (HPLC) separation. A total of 74 fractions were obtained by HPLC separation allowing the identification of approximately 274 different molecular masses with molecular weights varying from 287 to 43,437 Da. The most abundant peptides were those from 1 K Da and 4-5 K Da representing antimicrobial peptides and putative potassium channel toxins, respectively. Three such peptides were chemically synthesized and tested against Gram-positive and Gram-negative bacteria showing minimum inhibitory concentration in the low micromolar range, but with moderate hemolytic activity. It also reports a transcriptome analysis of the venom glands of the same scorpion species, undertaken by constructing a cDNA library and conducting random sequencing screening of the transcripts. From the resultant cDNA library 172 expressed sequence tags (ESTs) were analyzed. These transcripts were further clustered into 120 unique sequences (23 contigs and 97 singlets). The identified putative proteins can be assorted in several groups, such as those implicated in common cellular processes, putative neurotoxins and antimicrobial peptides. The scorpion U. yaschenkoi is not known to be dangerous to humans and its venom contains peptides similar to those of Opisthacanthus cayaporum (antibacterial), Scorpio maurus palmatus (maurocalcin), Opistophthalmus carinatus (opistoporines) and Hadrurus gerstchi (scorpine-like molecules), amongst others. PMID:23182832

  7. Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana

    NARCIS (Netherlands)

    Vanhoutte, Kürt; Eggen, BJL; Janssen, JJM; Stavenga, DG

    2002-01-01

    The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Ma

  8. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds

    Directory of Open Access Journals (Sweden)

    Sugantham Priyanka Annabel

    2010-10-01

    Full Text Available Abstract Background Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with an objective of isolating as many functional genes as possible from J. curcas by large scale sequencing of expressed sequence tags (ESTs. Results A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 106 clones and average insert size of the clones was 2.1 kb. Totally 12,084 ESTs were sequenced to average high quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes which did not show similarity with any genes in the public database may encode for unique genes. Functional classification revealed unigenes related to broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes. Conclusions The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding

  9. The cDNA sequence of the transferrin gene in tongue sole (Cynoglossus semilaevis and its transcriptional changes under acute hypoxic stress

    Directory of Open Access Journals (Sweden)

    Xi-Jie Guo

    2012-10-01

    Full Text Available In this study, the cDNA sequence of the transferrin (Tf gene in tongue sole (Cynoglossus semilaevis (CsTf was identified and characterised. The full length of CsTf cDNA was 2281 bp encoding 677 amino acids (aa. The mature CsTf protein was found to be made up of 656 aa and it consisted of two lobes (N- and C-lobes. Two- and three- dimensional (2-D and 3-D structure analysis showed that CsTf possesses a similar structure (multi-α helix/β-sheet to human Tf. Real-time polymerase chain reaction (PCR analysis showed that CsTf was mainly expressed in the liver, and moderately expressed in the heart, spleen and kidney. Moreover, CsTf was up-regulated in different tissues under acute hypoxia [dissolved oxygen (DO=0.8 mg/L]. Our results suggest that CsTf plays a role in fish’s adaptation to hypoxic stress.

  10. Complete sequence of HLA-B27 cDNA identified through the characterization of structural markers unique to the HLA-A, -B, and -C allelic series

    Energy Technology Data Exchange (ETDEWEB)

    Szoets, H.; Reithmueller, G.; Weiss, E.; Meo, T.

    1986-03-01

    Antigen HLA-B27 is a high-risk genetic factor with respect to a group of rheumatoid disorders, especially ankylosing spondylitis. A cDNA library was constructed from an autozygous B-cell line expressing HLA-B27, HLA-Cw1, and the previously cloned HLA-A2 antigen. Clones detected with an HLA probe were isolated and sorted into homology groups by differential hybridization and restriction maps. Nucleotide sequencing allowed the unambiguous assignment of cDNAs to HLA-A, -B, and -C loci. The HLA-B27 mRNA has the structure features and the codon variability typical of an HLA class I transcript but it specifies two uncommon amino acid replacements: a cysteine in position 67 and a serine in position 131. The latter substitution may have functional consequences, because it occurs in a conserved region and at a position invariably occupied by a species-specific arginine in humans and lysine in mice. The availability of the complete sequence of HLA-B27 and of the partial sequence of HLA-Cw1 allows the recognition of locus-specific sequence markers, particularly, but not exclusively, in the transmembrane and cytoplasmic domains.

  11. Construction and packaging of pseudotype retrovirus containing human N—ras cDNA antisense sequence and its biological effects on human hepatoma cells

    Institute of Scientific and Technical Information of China (English)

    JIALIBIN; WANGXIANG; 等

    1990-01-01

    N-ras is one of the transforming genes in human hepatic cancer cells.It has been found that N-ras was overexpressed at the mRNA and protein level in hepatoma cells.In order to explore the biological roles of N-ras in human hepatic carcinogenesis and the potential application in control of cancer cell growth,a preudotype retrovirus containing antisense sequence of human N-ras was constructed and packaged.A recombinant retrovirus vector containing antisense or sense sequences of N-ras cDNA was constructed by pZIP-NeoSV(X)1.The pseudotype virus was packaged ang rescued by transfection and infection in PA317 and ψ 2 helper cells.It has been demonstrated that the pseudotype retrovirus containing antisense N-ras sequence did inhibit the growth of human PLC/PRF/5 hepatoma cells accompanied with inhibition of p21 expression,while the retrovirus containing sense sequence had none.The pseudotype virus had no effect on human diploid fibroblasts.

  12. Deep sequencing of RNA from ancient maize kernels.

    Directory of Open Access Journals (Sweden)

    Sarah L Fordyce

    Full Text Available The characterization of biomolecules from ancient samples can shed otherwise unobtainable insights into the past. Despite the fundamental role of transcriptomal change in evolution, the potential of ancient RNA remains unexploited - perhaps due to dogma associated with the fragility of RNA. We hypothesize that seeds offer a plausible refuge for long-term RNA survival, due to the fundamental role of RNA during seed germination. Using RNA-Seq on cDNA synthesized from nucleic acid extracts, we validate this hypothesis through demonstration of partial transcriptomal recovery from two sources of ancient maize kernels. The results suggest that ancient seed transcriptomics may offer a powerful new tool with which to study plant domestication.

  13. The A-rich RNA sequences of HIV-1 pol are important for the synthesis of viral cDNA

    OpenAIRE

    Keating, Cameron P.; Hill, Melissa K.; Hawkes, David J.; Smyth, Redmond P.; Isel, Catherine; Le, Shu-Yun; Palmenberg, Ann C.; Marshall, John A.; Marquet, Roland; Nabel, Gary J.; Mak, Johnson

    2008-01-01

    The bias of A-rich codons in HIV-1 pol is thought to be a record of hypermutations in viral genomes that lack biological functions. Bioinformatic analysis predicted that A-rich sequences are generally associated with minimal local RNA structures. Using codon modifications to reduce the amount of A-rich sequences within HIV-1 genomes, we have reduced the flexibility of RNA sequences in pol to analyze the functional significance of these A-rich ‘structurally poor’ RNA elements in HIV-1 pol. Our...

  14. Localization of the human fibromodulin gene (FMOD) to chromosome 1q32 and completion of the cDNA sequence

    Energy Technology Data Exchange (ETDEWEB)

    Sztrolovics, R.; Grover, J.; Roughley, P.J. [McGill Univ., Montreal (Canada)] [and others

    1994-10-01

    This report describes the cloning of the 3{prime}-untranslated region of the human fibromodulin cDNA and its use to map the gene. For somatic cell hybrids, the generation of the PCR product was concordant with the presence of chromosome 1 and discordant with the presence of all other chromosomes, confirming that the fibromodulin gene is located within region q32 of chromosome 1. The physical mapping of genes is a critical step in the process of identifying which genes may be responsible for various inherited disorders. Specifically, the mapping of the fibromodulin gene now provides the information necessary to evaluate its potential role in genetic disorders of connective tissues. The analysis of previously reported diseases mapped to chromosome 1 reveals two genes located in the proximity of the fibromodulin locus. These are Usher syndrome type II, a recessive disorder characterized by hearing loss and retinitis pigmentosa, and Van der Woude syndrome, a dominant condition associated with abnormalities such as cleft lip and palate and hyperdontia. The genes for both of these disorders have been projected to be localized to 1q32 of a physical map that integrates available genetic linkage and physical data. However, it seems improbable that either of these disorders, exhibiting restricted tissue involvement, could be linked to the fibromodulin gene, given the wide tissue distribution of the encoded proteoglycan, although it remains possible that the relative importance of the quantity and function of the proteoglycan may avry between tissues. 11 refs., 1 fig.

  15. Integrated analysis of microRNA regulatory network in nasopharyngeal carcinoma with deep sequencing

    OpenAIRE

    Wang, Fan; Lu, Juan; Peng, Xiaohong; Jie WANG; LIU, XIONG; Chen, Xiaomei; Jiang, Yiqi; LI, XIANGPING; Zhang, Bao

    2016-01-01

    Background MicroRNAs (miRNAs) have been shown to play a critical role in the development and progression of nasopharyngeal carcinoma (NPC). Although accumulating studies have been performed on the molecular mechanisms of NPC, the miRNA regulatory networks in cancer progression remain largely unknown. Laser capture microdissection (LCM) and deep sequencing are powerful tools that can help us to detect the integrated view of miRNA-target network. Methods Illumina Hiseq2000 deep sequencing was u...

  16. Partial purification of the chloroplast ATP synthase from Chlamydomonas reinhardtii and the cloning and sequencing of a cDNA encoding the gamma subunit

    International Nuclear Information System (INIS)

    The chloroplast ATP synthase was partially purified from the green alga Chlamydomonas reinhardtii by extracting membranes with deoxycholate and KCl, followed by centrifugation and ammonium sulfate fractionation of the supernatant. The enzyme assay involved the reconstitution of such fractions with bacteriorhodopsin and soybean phospholipids to form vesicles capable of light-dependent [32P]-phosphate esterification. A cDNA for the gamma subunit from Chlamydomonas was isolated, expressed in vitro and sequenced. It contains the entire coding region for the gamma subunit precursor. A 35 amino acid long transit peptide resides at the NH2-terminus of a 323 amino acid long mature peptide that is 77% similar to the spinach gamma subunit. Six cysteines were found; three were conserved in Chlamydomonas and spinach

  17. Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing.

    Science.gov (United States)

    Mohr, Sabine; Ghanem, Eman; Smith, Whitney; Sheeter, Dennis; Qin, Yidan; King, Olga; Polioudakis, Damon; Iyer, Vishwanath R; Hunicke-Smith, Scott; Swamy, Sajani; Kuersten, Scott; Lambowitz, Alan M

    2013-07-01

    Mobile group II introns encode reverse transcriptases (RTs) that function in intron mobility ("retrohoming") by a process that requires reverse transcription of a highly structured, 2-2.5-kb intron RNA with high processivity and fidelity. Although the latter properties are potentially useful for applications in cDNA synthesis and next-generation RNA sequencing (RNA-seq), group II intron RTs have been difficult to purify free of the intron RNA, and their utility as research tools has not been investigated systematically. Here, we developed general methods for the high-level expression and purification of group II intron-encoded RTs as fusion proteins with a rigidly linked, noncleavable solubility tag, and we applied them to group II intron RTs from bacterial thermophiles. We thus obtained thermostable group II intron RT fusion proteins that have higher processivity, fidelity, and thermostability than retroviral RTs, synthesize cDNAs at temperatures up to 81°C, and have significant advantages for qRT-PCR, capillary electrophoresis for RNA-structure mapping, and next-generation RNA sequencing. Further, we find that group II intron RTs differ from the retroviral enzymes in template switching with minimal base-pairing to the 3' ends of new RNA templates, making it possible to efficiently and seamlessly link adaptors containing PCR-primer binding sites to cDNA ends without an RNA ligase step. This novel template-switching activity enables facile and less biased cloning of nonpolyadenylated RNAs, such as miRNAs or protein-bound RNA fragments. Our findings demonstrate novel biochemical activities and inherent advantages of group II intron RTs for research, biotechnological, and diagnostic methods, with potentially wide applications. PMID:23697550

  18. Complete cDNA sequence of human complement C1s and close physical linkage of the homologous genes C1s and C1r

    International Nuclear Information System (INIS)

    Overlapping molecular clones encoding the complement subcomponent C1s were isolated from a human liver cDNA library. The nucleotide sequence reconstructed from these clones spans about 85% of the length of the liver C1s messenger RNAs, which occur in three distinct size classes around 3 kilobases in length. Comparisons with the sequence of C1r, the other enzymatic subcomponent of C1, reveal 40% amino acid identity and conservation of all the cysteine residues. Beside the serine protease domain, the following sequence motifs, previously described in C1r, were also found in C1s: (a) two repeats of the type found in the Ba fragment of complement factor B and in several other complement but also noncomplement proteins, (b) a cysteine-rich segment homologous to the repeats of epidermal growth factor precursor, and (c) a duplicated segment found only in C1r and C1s. Differences in each of these structural motifs provide significant clues for the interpretation of the functional divergence of these interacting serine protease zymogens. Hybridizations of C1r and C1s probes to restriction endonuclease fragments of genomic DNA demonstrate close physical linkage of the corresponding genes. The implications of this finding are discussed with respect to the evolution of C1r and C1s after their origin by tandem gene duplication and to the previously observed combined hereditary deficiencies of Clr and Cls

  19. Comparative profiling of the transcriptional response to infection in two species of Drosophila by short-read cDNA sequencing

    Directory of Open Access Journals (Sweden)

    Sackton Timothy B

    2009-06-01

    Full Text Available Abstract Background Homology-based comparisons of the genes involved in innate immunity across many insect taxa with fully sequenced genomes has revealed a striking pattern of gene gain and loss, particularly among genes that encode proteins involved in clearing pathogens (effectors. However, limited functional annotation in non-model systems has hindered understanding of evolutionary novelties in the insect innate immune system. Results We use short read sequencing technology (Illumina/Solexa to compare the transcriptional response to infection between the well studied model system Drosophila melanogaster and the distantly related drosophilid D. virilis. We first demonstrate that Illumina/Solexa sequencing of cDNA from infected and uninfected D. melanogaster recapitulates previously published microarray studies of the transcriptional response to infection in this species, validating our approach. We then show that patterns of transcription of homologous genes differ considerably between D. melanogaster and D. virilis, and identify potential candidates for novel components of the D. virilis immune system based on transcriptional data. Finally, we use a proteomic approach to characterize the protein constituents of the D. virilis hemolymph and validate our transcriptional data. Conclusion These results suggest that the acquisition of novel components of the immune system, and particularly novel effector proteins, may be a common evolutionary phenomenon.

  20. HIV-1 Quasispecies Delineation by Tag Linkage Deep Sequencing

    Science.gov (United States)

    Wu, Nicholas C.; De La Cruz, Justin; Al-Mawsawi, Laith Q.; Olson, C. Anders; Qi, Hangfei; Luan, Harding H.; Nguyen, Nguyen; Du, Yushen; Le, Shuai; Wu, Ting-Ting; Li, Xinmin; Lewis, Martha J.; Yang, Otto O.; Sun, Ren

    2014-01-01

    Trade-offs between throughput, read length, and error rates in high-throughput sequencing limit certain applications such as monitoring viral quasispecies. Here, we describe a molecular-based tag linkage method that allows assemblage of short sequence reads into long DNA fragments. It enables haplotype phasing with high accuracy and sensitivity to interrogate individual viral sequences in a quasispecies. This approach is demonstrated to deduce ∼2000 unique 1.3 kb viral sequences from HIV-1 quasispecies in vivo and after passaging ex vivo with a detection limit of ∼0.005% to ∼0.001%. Reproducibility of the method is validated quantitatively and qualitatively by a technical replicate. This approach can improve monitoring of the genetic architecture and evolution dynamics in any quasispecies population. PMID:24842159

  1. Estimation of the methylation pattern distribution from deep sequencing data

    OpenAIRE

    Lin, Peijie; Forêt, Sylvain; Wilson, Susan R; Burden, Conrad J.

    2015-01-01

    Background Bisulphite sequencing enables the detection of cytosine methylation. The sequence of the methylation states of cytosines on any given read forms a methylation pattern that carries substantially more information than merely studying the average methylation level at individual positions. In order to understand better the complexity of DNA methylation landscapes in biological samples, it is important to study the diversity of these methylation patterns. However, the accurate quantific...

  2. Classifying and Visualizing Motion Capture Sequences using Deep Neural Networks

    OpenAIRE

    Cho, Kyunghyun; Chen, Xi

    2013-01-01

    The gesture recognition using motion capture data and depth sensors has recently drawn more attention in vision recognition. Currently most systems only classify dataset with a couple of dozens different actions. Moreover, feature extraction from the data is often computational complex. In this paper, we propose a novel system to recognize the actions from skeleton data with simple, but effective, features using deep neural networks. Features are extracted for each frame based on the relative...

  3. Nucleotide sequence of the cDNA encoding the precursor of the beta subunit of rat lutropin.

    OpenAIRE

    Chin, W W; Godine, J E; Klein, D. R.; Chang, A S; Tan, L K; Habener, J F

    1983-01-01

    We have determined the nucleotide sequences of cDNAs encoding the precursor of the beta subunit of rat lutropin, a polypeptide hormone that regulates gonadal function, including the development of gametes and the production of steroid sex hormones. The cDNAs were prepared from poly(A)+ RNA derived from the pituitary glands of rats 4 weeks after ovariectomy and were cloned in bacterial plasmids. Bacterial colonies containing transfected plasmids were screened by hybridization with a 32P-labele...

  4. Hybridization-based antibody cDNA recovery for the production of recombinant antibodies identified by repertoire sequencing

    OpenAIRE

    Valdés-Alemán, Javier; Téllez-Sosa, Juan; Ovilla-Muñoz, Marbella; Godoy-Lozano, Elizabeth; Velázquez-Ramírez, Daniel; Valdovinos-Torres, Humberto; Gómez-Barreto, Rosa E; Martinez-Barnetche, Jesús

    2013-01-01

    High-throughput sequencing of the antibody repertoire is enabling a thorough analysis of B cell diversity and clonal selection, which may improve the novel antibody discovery process. Theoretically, an adequate bioinformatic analysis could allow identification of candidate antigen-specific antibodies, requiring their recombinant production for experimental validation of their specificity. Gene synthesis is commonly used for the generation of recombinant antibodies identified in silico. Novel ...

  5. cDNA Cloning, Sequence Analysis of the Porcine LIM and Cysteine-rich Domain 1 Gene

    Institute of Scientific and Technical Information of China (English)

    Jun WANG; Chang-Yan DENG; Yuan-Zhu XIONG; Bo ZUO; Lei XING; Feng-E LI; Ming-Gang LEI; Rong ZHENG; Si-Wen JIANG

    2005-01-01

    LIM domain proteins are important regulators in cell growth, cell fate determination, cell differentiation and remodeling of the cell cytoskeleton by their interaction with various structural proteins, kinases and transcriptional regulators. Using molecular biology combined with in silico cloning, we have cloned the complete coding sequence of pig LIM and the cysteine-rich domain 1 gene (LMCD1) which encodes a 363 amino acid protein. The estimated molecular weight of the LMCD1 protein is 40,788 Da with a pI of 8.39. It was found to be highly expressed in both skeletal muscle and cardiac muscle. Alignment analysis revealed that the deduced protein sequence shares 86%, 91% and 93% homology with that of its human, mouse and rat counterparts, respectively. The LMCD1 protein was predicted by bioinformatics software to contain a novel cysteine-rich domain in the N-terminal region, two LIM domains in the C-terminal region, nine potential protein kinase C phosphorylation sites, seven casein kinase Ⅱ phosphorylation sites, a tyrosine kinase phosphorylation site, seven N-glycosylation and N-myristoylation sites and a single potential N-glycosylation site, which is similar to the protein's human counterpart. Phylogenetic tree was constructed by aligning the amino acid sequences of the LIM domain from different species. In addition, four base mutations were detected by comparing the sequences of Large White pigs with those of Chinese Meishan pigs. The G294A mutation site was confirmed by polymerase chain reaction-single-strand conformation polymorphism analysis. Its allele frequencies were studied in five pig breeds.

  6. Low molecular weight human pulmonary surfactant protein (SP5): isolation, characterization, and cDNA and amino acid sequences

    International Nuclear Information System (INIS)

    Pulmonary surfactant is a lipid-protein complex that promotes alveolar stability by lowering the surface tension at the air-fluid interface in the peripheral air spaces. A group of hydrophobic surfactant-associated proteins has been shown to be essential for rapid surface film formation by surfactant phospholipids. The authors have purified a hydrophobic surfactant protein of ≅ 5kDa that they term SP5 from bronchopulmonary lavage fluid from a patient with alveolar proteinosis and shown that it promotes rapid surface film formation by simple mixtures of phospholipids. They have derived the full amino acid sequence of human SP5 from the nucleotide sequence of cDNAs identified with oligonucleotide probes based on the NH2-terminal sequence of SP5. SP5 isolated from surfactant is a fragment of a much larger precursor protein (21 kDa). The precursor contains an extremely hydrophobic region of 34 amino acids that comprises most the mature SP5. This hydrophobicity explains the unusual solubility characteristics of SP5 and the fact that it is lipid-associated when isolated from lung

  7. Low molecular weight human pulmonary surfactant protein (SP5): isolation, characterization, and cDNA and amino acid sequences

    Energy Technology Data Exchange (ETDEWEB)

    Warr, R.G.; Hawgood, S.; Buckley, D.I.; Crisp, T.M.; Schilling, J.; Benson, B.J.; Ballard, P.L.; Clements, J.A.; White, R.T.

    1987-11-01

    Pulmonary surfactant is a lipid-protein complex that promotes alveolar stability by lowering the surface tension at the air-fluid interface in the peripheral air spaces. A group of hydrophobic surfactant-associated proteins has been shown to be essential for rapid surface film formation by surfactant phospholipids. The authors have purified a hydrophobic surfactant protein of approx. = 5kDa that they term SP5 from bronchopulmonary lavage fluid from a patient with alveolar proteinosis and shown that it promotes rapid surface film formation by simple mixtures of phospholipids. They have derived the full amino acid sequence of human SP5 from the nucleotide sequence of cDNAs identified with oligonucleotide probes based on the NH/sub 2/-terminal sequence of SP5. SP5 isolated from surfactant is a fragment of a much larger precursor protein (21 kDa). The precursor contains an extremely hydrophobic region of 34 amino acids that comprises most the mature SP5. This hydrophobicity explains the unusual solubility characteristics of SP5 and the fact that it is lipid-associated when isolated from lung.

  8. LookSeq: A browser-based viewer for deep sequencing data

    OpenAIRE

    Manske, Heinrich Magnus; Dominic P Kwiatkowski

    2009-01-01

    Sequencing a genome to great depth can be highly informative about heterogeneity within an individual or a population. Here we address the problem of how to visualize the multiple layers of information contained in deep sequencing data. We propose an interactive AJAX-based web viewer for browsing large data sets of aligned sequence reads. By enabling seamless browsing and fast zooming, the LookSeq program assists the user to assimilate information at different levels of resolution, from an ov...

  9. cDNA and deduced amino acid sequence of human pulmonary surfactant-associated proteolipid SPL(Phe).

    OpenAIRE

    Glasser, S W; Korfhagen, T R; Weaver, T.; Pilot-Matias, T; Fox, J L; Whitsett, J A

    1987-01-01

    Hydrophobic surfactant-associated protein of Mr 6000-14,000 was isolated from ether/ethanol or chloroform/methanol extracts of mammalian pulmonary surfactant. Automated Edman degradation in a gas-phase sequencer showed the major N-terminus of the human low molecular weight protein to be Phe-Pro-Ile-Pro-Leu-Pro-Tyr-Cys-Trp-Leu-Cys-Arg-Ala-Leu-. Because of the N-terminal phenylalanine, the surfactant protein was designated SPL(Phe). Antiserum generated against hydrophobic surfactant protein(s) ...

  10. Molecular cloning and sequence analysis of cDNA encoding molt-inhibiting hormone from Portunus trituberculatus

    OpenAIRE

    ZHU Dong-Fa; Shen, Jian-Ming; Yang, Ji-Feng; Su, Qing

    2008-01-01

    Molt-inhibiting hormone (MIH), which belongs to the crustacean hyperglycemic hormone (CHH) neuropeptide family, inhibits the synthesis of ecdysteroids by Y-organs. The CHH family is divided into two major groups, designated I and II. Group Ⅰ is composed of all CHH peptides, and Group Ⅱ is composed of most MIHs, gonad-inhibiting hormone (GIH) and mandibular organ-inhibiting hormone (MOIH). The Group Ⅰ peptides consist of a signal sequence, a CHH precursor related peptide (CPRP) and a mature ho...

  11. cDNA sequence and tissue distribution of the mRNA for bovine and murine p11, the S100-related light chain of the protein-tyrosine kinase substrate p36 (calpactin I)

    DEFF Research Database (Denmark)

    Saris, Chris J M; Kristensen, Torsten; D’Eustachio, Peter;

    1987-01-01

    We have isolated and sequenced cDNA clones of bovine nd murine pl 1 mRNAs. The nonpolyadenylated mRNAs are predicted to be 614 and 600 nucleotides, respectively. The p l l mRNAs both contain a 291 nucleotide open reading frame, preceded by a 5”untranslated region of 73 nucleotides in bovine p l l m...

  12. Complete cDNA sequence of the preproform of human pregnancy-associated plasma protein-A. Evidence for expression in the brain and induction by cAMP

    DEFF Research Database (Denmark)

    Haaning, Jesper; Oxvig, Claus; Overgaard, Michael Toft;

    1996-01-01

    A cDNA that encodes the prepropeptide of pregnancy-associated plasma protein-A (preproPAPP-A), a putative metalloproteinase, has been cloned and sequenced. PAPP-A is synthesized in the placenta as a 1627-residue precursor preproprotein with a putative 22-residue signal peptide and a highly basic...

  13. Cloning and sequence of cDNA encoding 1-aminocyclo- propane-1-carboxylate oxidase in Vanda flowers

    Directory of Open Access Journals (Sweden)

    Pattana Srifah Huehne

    2013-08-01

    Full Text Available The 1-aminocyclopropane-1-carboxylate oxidase (ACO gene in the final step of ethylene biosynthesis was isolated from ethylene-sensitive Vanda Miss Joaquim flowers. This consists of 1,242 base pairs (bp encoding for 326 amino acid residues. To investigate the specific divergence in orchid ACO sequences, the deduced Vanda ACO was aligned with five other orchid ACOs. The results reveal that the ACO sequences within Doritaenopsis, Phalaenopsis and Vanda show highly conserved and almost 95% identical homology, while the ACOs isolated from Cymbidium, Dendrobium and Cattleya are 8788% identical to Vanda ACO. In addition, the 2-oxoglutarate- Fe(II_oxygenase (Oxy domain of orchid ACOs consists of a higher degree of amino acid conservation than that of the non-haem dioxygenase (DIOX_N domain. The overall homology regions of Vanda ACO are commonly folded into 12 α-helices and 12 β-sheets similar to the three dimensional template-structure of Petunia ACO. This Vanda ACO cloned gene is highly expressed in flower tissue compared with root and leaf tissues. In particular, there is an abundance of ACO transcript accumulation in the column followed by the lip and the perianth of Vanda Miss Joaquim flowers at the fully-open stage.

  14. Whitefly (Bemisia tabaci genome project: analysis of sequenced clones from egg, instar, and adult (viruliferous and non-viruliferous cDNA libraries

    Directory of Open Access Journals (Sweden)

    Czosnek Henryk

    2006-04-01

    Full Text Available Abstract Background The past three decades have witnessed a dramatic increase in interest in the whitefly Bemisia tabaci, owing to its nature as a taxonomically cryptic species, the damage it causes to a large number of herbaceous plants because of its specialized feeding in the phloem, and to its ability to serve as a vector of plant viruses. Among the most important plant viruses to be transmitted by B. tabaci are those in the genus Begomovirus (family, Geminiviridae. Surprisingly, little is known about the genome of this whitefly. The haploid genome size for male B. tabaci has been estimated to be approximately one billion bp by flow cytometry analysis, about five times the size of the fruitfly Drosophila melanogaster. The genes involved in whitefly development, in host range plasticity, and in begomovirus vector specificity and competency, are unknown. Results To address this general shortage of genomic sequence information, we have constructed three cDNA libraries from non-viruliferous whiteflies (eggs, immature instars, and adults and two from adult insects that fed on tomato plants infected by two geminiviruses: Tomato yellow leaf curl virus (TYLCV and Tomato mottle virus (ToMoV. In total, the sequence of 18,976 clones was determined. After quality control, and removal of 5,542 clones of mitochondrial origin 9,110 sequences remained which included 3,843 singletons and 1,017 contigs. Comparisons with public databases indicated that the libraries contained genes involved in cellular and developmental processes. In addition, approximately 1,000 bases aligned with the genome of the B. tabaci endosymbiotic bacterium Candidatus Portiera aleyrodidarum, originating primarily from the egg and instar libraries. Apart from the mitochondrial sequences, the longest and most abundant sequence encodes vitellogenin, which originated from whitefly adult libraries, indicating that much of the gene expression in this insect is directed toward the production

  15. Using Small RNA Deep Sequencing Data to Detect Human Viruses

    OpenAIRE

    Fang Wang; Yu Sun; Jishou Ruan; Rui Chen; Xin Chen; Chengjie Chen; Jan F. Kreuze; ZhangJun Fei; Xiao Zhu; Shan Gao

    2016-01-01

    Small RNA sequencing (sRNA-seq) can be used to detect viruses in infected hosts without the necessity to have any prior knowledge or specialized sample preparation. The sRNA-seq method was initially used for viral detection and identification in plants and then in invertebrates and fungi. However, it is still controversial to use sRNA-seq in the detection of mammalian or human viruses. In this study, we used 931 sRNA-seq runs of data from the NCBI SRA database to detect and identify viruses i...

  16. Determining mutant spectra of three RNA viral samples using ultra-deep sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Chen, H

    2012-06-06

    RNA viruses have extremely high mutation rates that enable the virus to adapt to new host environments and even jump from one species to another. As part of a viral transmission study, three viral samples collected from naturally infected animals were sequenced using Illumina paired-end technology at ultra-deep coverage. In order to determine the mutant spectra within the viral quasispecies, it is critical to understand the sequencing error rates and control for false positive calls of viral variants (point mutantations). I will estimate the sequencing error rate from two control sequences and characterize the mutant spectra in the natural samples with this error rate.

  17. Ultra-deep sequencing of VHSV isolates contributes to understanding the role of viral quasispecies

    DEFF Research Database (Denmark)

    Schönherz, Anna A.; Lorenzen, Niels; Guldbrandtsen, Bernt;

    2016-01-01

    better understand the nature of a virus quasispecies related to the evolutionary potential of VHSV, a deep-sequencing protocol specific to VHSV was established and applied to 4 VHSV isolates, 2 originating from rainbow trout and 2 from Atlantic herring. Each isolate was subjected to Illumina paired end...... shotgun sequencing after PCR amplification and the 11.1 kb genome was successfully sequenced with an average coverage of 0.5–1.9 × 106 sequenced copies. Differences in single nucleotide polymorphism (SNP) frequency were detected both within and between isolates, possibly related to their stage of...

  18. Phylogeny of immunoglobulin heavy chain isotypes: structure of the constant region of Ambystoma mexicanum upsilon chain deduced from cDNA sequence.

    Science.gov (United States)

    Fellah, J S; Kerfourn, F; Wiles, M V; Schwager, J; Charlemagne, J

    1993-01-01

    An RNA polymerase chain reaction strategy was used to amplify and clone a cDNA segment encoding for the complete constant part of the axolotl IgY heavy (C upsilon) chain. C upsilon is 433 amino acids long and organized into four domains (C upsilon 1-C upsilon 4); each has the typical internal disulfide bond and invariant tryptophane residues. Axolotl C upsilon is most closely related to Xenopus C upsilon (40% identical amino acid residues) and C upsilon 1 shares 46.4% amino acid residues among these species. The presence of additional cysteines in C upsilon 1 and C upsilon 2 domains is consistent with an additional intradomain S-S bond similar to that suggested for Xenopus C upsilon and C chi, and for the avian C upsilon and the human C epsilon. C upsilon 4 ends with the Gly-Lys dipeptide characteristic of secreted mammalian C gamma 3, human C epsilon 4, and avian and anuran C upsilon 4, and contains the consensus [G/GT(AA)] nucleotide splice signal sequence for joining C upsilon 4 to the transmembrane region. These results are consistent with the hypothesis of an ancestral structural relationship between amphibian, avian upsilon chains, and mammalian epsilon chains. However, these molecules have different biological properties: axolotl IgY is secretory Ig, anuran and avian IgY behave like mammalian IgG, and mammalian IgE is implicated in anaphylactic reactions. PMID:8344718

  19. Salmo salar and Esox lucius full-length cDNA sequences reveal changes in evolutionary pressures on a post-tetraploidization genome

    Directory of Open Access Journals (Sweden)

    Holt Robert A

    2010-04-01

    Full Text Available Abstract Background Salmonids are one of the most intensely studied fish, in part due to their economic and environmental importance, and in part due to a recent whole genome duplication in the common ancestor of salmonids. This duplication greatly impacts species diversification, functional specialization, and adaptation. Extensive new genomic resources have recently become available for Atlantic salmon (Salmo salar, but documentation of allelic versus duplicate reference genes remains a major uncertainty in the complete characterization of its genome and its evolution. Results From existing expressed sequence tag (EST resources and three new full-length cDNA libraries, 9,057 reference quality full-length gene insert clones were identified for Atlantic salmon. A further 1,365 reference full-length clones were annotated from 29,221 northern pike (Esox lucius ESTs. Pairwise dN/dS comparisons within each of 408 sets of duplicated salmon genes using northern pike as a diploid out-group show asymmetric relaxation of selection on salmon duplicates. Conclusions 9,057 full-length reference genes were characterized in S. salar and can be used to identify alleles and gene family members. Comparisons of duplicated genes show that while purifying selection is the predominant force acting on both duplicates, consistent with retention of functionality in both copies, some relaxation of pressure on gene duplicates can be identified. In addition, there is evidence that evolution has acted asymmetrically on paralogs, allowing one of the pair to diverge at a faster rate.

  20. Cloning and Sequence Analysis of Tocopherol Cyclase cDNA from Wheat (T. Aestivum L.)%小麦生育酚环化酶基因全长cDNA的克隆与分析

    Institute of Scientific and Technical Information of China (English)

    邹礼平; 高和平

    2009-01-01

    生育酚环化酶(TC)是维生素E生物合成途径中的%Tocopherol cyclase (TC) is a key enzyme for plant vitamin E biosynthesis. The cDNA of wheat TC was cloned by RACE-PCR with primers designed based on a EST (BQ161011) sequence. The full-length cDNA sequence contained a complete open reading frame (ORF) of 1404 bp, which encoding 467 amino acid residues. Homology analysis of the deduced amino acid showed 91%, 88%, 71%, 68% and 67% identity with TC from rice, maize, sesame, potato and Arabidopsis thaliana, respectively. The result of phylogenetic analysis indicated that various TC sequences were clustered into three groups.

  1. Noninvasive Detection of Fetal Subchromosome Abnormalities via Deep Sequencing of Maternal Plasma

    OpenAIRE

    Srinivasan, Anupama; Bianchi, Diana W.; Huang, Hui; Sehnert, Amy J.; Rava, Richard P.

    2013-01-01

    The purpose of this study was to determine the deep sequencing and analytic conditions needed to detect fetal subchromosome abnormalities across the genome from a maternal blood sample. Cell-free (cf) DNA was isolated from the plasma of 11 pregnant women carrying fetuses with subchromosomal duplications and deletions, translocations, mosaicism, and trisomy 20 diagnosed by metaphase karyotype. Massively parallel sequencing (MPS) was performed with 25-mer tags at approximately 109 tags per samp...

  2. Deep sequencing reveals 50 novel genes for recessive cognitive disorders.

    Science.gov (United States)

    Najmabadi, Hossein; Hu, Hao; Garshasbi, Masoud; Zemojtel, Tomasz; Abedini, Seyedeh Sedigheh; Chen, Wei; Hosseini, Masoumeh; Behjati, Farkhondeh; Haas, Stefan; Jamali, Payman; Zecha, Agnes; Mohseni, Marzieh; Püttmann, Lucia; Vahid, Leyla Nouri; Jensen, Corinna; Moheb, Lia Abbasi; Bienek, Melanie; Larti, Farzaneh; Mueller, Ines; Weissmann, Robert; Darvish, Hossein; Wrogemann, Klaus; Hadavi, Valeh; Lipkowitz, Bettina; Esmaeeli-Nieh, Sahar; Wieczorek, Dagmar; Kariminejad, Roxana; Firouzabadi, Saghar Ghasemi; Cohen, Monika; Fattahi, Zohreh; Rost, Imma; Mojahedi, Faezeh; Hertzberg, Christoph; Dehghan, Atefeh; Rajab, Anna; Banavandi, Mohammad Javad Soltani; Hoffer, Julia; Falah, Masoumeh; Musante, Luciana; Kalscheuer, Vera; Ullmann, Reinhard; Kuss, Andreas Walter; Tzschach, Andreas; Kahrizi, Kimia; Ropers, H Hilger

    2011-10-01

    Common diseases are often complex because they are genetically heterogeneous, with many different genetic defects giving rise to clinically indistinguishable phenotypes. This has been amply documented for early-onset cognitive impairment, or intellectual disability, one of the most complex disorders known and a very important health care problem worldwide. More than 90 different gene defects have been identified for X-chromosome-linked intellectual disability alone, but research into the more frequent autosomal forms of intellectual disability is still in its infancy. To expedite the molecular elucidation of autosomal-recessive intellectual disability, we have now performed homozygosity mapping, exon enrichment and next-generation sequencing in 136 consanguineous families with autosomal-recessive intellectual disability from Iran and elsewhere. This study, the largest published so far, has revealed additional mutations in 23 genes previously implicated in intellectual disability or related neurological disorders, as well as single, probably disease-causing variants in 50 novel candidate genes. Proteins encoded by several of these genes interact directly with products of known intellectual disability genes, and many are involved in fundamental cellular processes such as transcription and translation, cell-cycle control, energy metabolism and fatty-acid synthesis, which seem to be pivotal for normal brain development and function. PMID:21937992

  3. Deep sequencing of the murine olfactory receptor neuron transcriptome.

    Directory of Open Access Journals (Sweden)

    Ninthujah Kanageswaran

    Full Text Available The ability of animals to sense and differentiate among thousands of odorants relies on a large set of olfactory receptors (OR and a multitude of accessory proteins within the olfactory epithelium (OE. ORs and related signaling mechanisms have been the subject of intensive studies over the past years, but our knowledge regarding olfactory processing remains limited. The recent development of next generation sequencing (NGS techniques encouraged us to assess the transcriptome of the murine OE. We analyzed RNA from OEs of female and male adult mice and from fluorescence-activated cell sorting (FACS-sorted olfactory receptor neurons (ORNs obtained from transgenic OMP-GFP mice. The Illumina RNA-Seq protocol was utilized to generate up to 86 million reads per transcriptome. In OE samples, nearly all OR and trace amine-associated receptor (TAAR genes involved in the perception of volatile amines were detectably expressed. Other genes known to participate in olfactory signaling pathways were among the 200 genes with the highest expression levels in the OE. To identify OE-specific genes, we compared olfactory neuron expression profiles with RNA-Seq transcriptome data from different murine tissues. By analyzing different transcript classes, we detected the expression of non-olfactory GPCRs in ORNs and established an expression ranking for GPCRs detected in the OE. We also identified other previously undescribed membrane proteins as potential new players in olfaction. The quantitative and comprehensive transcriptome data provide a virtually complete catalogue of genes expressed in the OE and present a useful tool to uncover candidate genes involved in, for example, olfactory signaling, OR trafficking and recycling, and proliferation.

  4. Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction.

    Science.gov (United States)

    Laehnemann, David; Borkhardt, Arndt; McHardy, Alice Carolyn

    2016-01-01

    Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159

  5. Deep-sequencing of the peach latent mosaic viroid reveals new aspects of population heterogeneity.

    Directory of Open Access Journals (Sweden)

    Jean-Pierre Sehi Glouzon

    Full Text Available Viroids are small circular single-stranded infectious RNAs characterized by a relatively high mutation level. Knowledge of their sequence heterogeneity remains largely elusive and previous studies, using Sanger sequencing, were based on a limited number of sequences. In an attempt to address sequence heterogeneity from a population dynamics perspective, a GF305-indicator peach tree was infected with a single variant of the Avsunviroidae family member Peach latent mosaic viroid (PLMVd. Six months post-inoculation, full-length circular conformers of PLMVd were isolated and deep-sequenced. We devised an original approach to the bioinformatics refinement of our sequence libraries involving important phenotypic data, based on the systematic analysis of hammerhead self-cleavage activity. Two distinct libraries yielded a total of 3,939 different PLMVd variants. Sequence variants exhibiting up to ∼17% of mutations relative to the inoculated viroid were retrieved, clearly illustrating the high level of divergence dynamics within a unique population. While we initially assumed that most positions of the viroid sequence would mutate, we were surprised to discover that ∼50% of positions remained perfectly conserved, including several small stretches as well as a small motif reminiscent of a GNRA tetraloop which are the result of various selective pressures. Using a hierarchical clustering algorithm, the different variants harvested were subdivided into 7 clusters. We found that most sequences contained an average of 4.6 to 6.4 mutations compared to the variant used to initially inoculate the plant. Interestingly, it was possible to reconstitute and compare the sequence evolution of each of these clusters. In doing so, we identified several key mutations. This study provides a reliable pipeline for the treatment of viroid deep-sequencing. It also sheds new light on the extent of sequence variation that a viroid population can sustain, and which may give

  6. Deep-sequencing of the peach latent mosaic viroid reveals new aspects of population heterogeneity.

    Science.gov (United States)

    Glouzon, Jean-Pierre Sehi; Bolduc, François; Wang, Shengrui; Najmanovich, Rafael J; Perreault, Jean-Pierre

    2014-01-01

    Viroids are small circular single-stranded infectious RNAs characterized by a relatively high mutation level. Knowledge of their sequence heterogeneity remains largely elusive and previous studies, using Sanger sequencing, were based on a limited number of sequences. In an attempt to address sequence heterogeneity from a population dynamics perspective, a GF305-indicator peach tree was infected with a single variant of the Avsunviroidae family member Peach latent mosaic viroid (PLMVd). Six months post-inoculation, full-length circular conformers of PLMVd were isolated and deep-sequenced. We devised an original approach to the bioinformatics refinement of our sequence libraries involving important phenotypic data, based on the systematic analysis of hammerhead self-cleavage activity. Two distinct libraries yielded a total of 3,939 different PLMVd variants. Sequence variants exhibiting up to ∼17% of mutations relative to the inoculated viroid were retrieved, clearly illustrating the high level of divergence dynamics within a unique population. While we initially assumed that most positions of the viroid sequence would mutate, we were surprised to discover that ∼50% of positions remained perfectly conserved, including several small stretches as well as a small motif reminiscent of a GNRA tetraloop which are the result of various selective pressures. Using a hierarchical clustering algorithm, the different variants harvested were subdivided into 7 clusters. We found that most sequences contained an average of 4.6 to 6.4 mutations compared to the variant used to initially inoculate the plant. Interestingly, it was possible to reconstitute and compare the sequence evolution of each of these clusters. In doing so, we identified several key mutations. This study provides a reliable pipeline for the treatment of viroid deep-sequencing. It also sheds new light on the extent of sequence variation that a viroid population can sustain, and which may give rise to a

  7. Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

    Energy Technology Data Exchange (ETDEWEB)

    Shi, CY; Yang, H; Wei, CL; Yu, O; Zhang, ZZ; Sun, J; Wan, XC

    2011-01-01

    Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Using high-throughput Illumina RNA-seq, the transcriptome from poly (A){sup +} RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real

  8. Deep sequencing of the Camellia sinensis transcriptome revealed candidate genes for major metabolic pathways of tea-specific compounds

    Directory of Open Access Journals (Sweden)

    Chen Qi

    2011-02-01

    Full Text Available Abstract Background Tea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes. Results Using high-throughput Illumina RNA-seq, the transcriptome from poly (A+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs. Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010. Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were

  9. Isolation and sequence of cDNA clones coding for a member of the family of high mobility group proteins (HMG-T) in trout and analysis of HMG-T-mRNA's in trout tissues.

    OpenAIRE

    Pentecost, B T; Wright, J. M.; Dixon, G H

    1985-01-01

    A specific oligonucleotide has been used to isolate a cDNA prepared from the mRNA for a trout High Mobility Group (HMG) protein closely related to trout HMG-T and bovine HMG 1 and 2 proteins. The sequence isolated more closely resembles bovine HMG-1 than the previously sequenced HMG-T protein in regions corresponding to the N terminal half of the protein. Northern blot analysis at low stringency indicated that 2 related sequences are expressed in total trout testis mRNA. Southern blots of tot...

  10. Metagenomes obtained by "deep sequencing" - what do they tell about the EBPR communities

    DEFF Research Database (Denmark)

    Albertsen, Mads; Saunders, Aaron Marc; Nielsen, Kåre Lehmann;

    Metagenomes obtained by "deep sequencing" - what do they tell about the EBPR communities? Mads Albertsen1, Aaron M. Saunders1, Kåre L. Nielsen1 and Per H. Nielsen1 1 Department of Biotechnology, Chemistry and Environmental Engineering, Aalborg University, Aalborg, Denmark Presenting Author: Mads...... Albertsen Keywords: Metagenomics; Accumulibacter; Micro-diversity; Enhanced Biological Phosphorus Removal Introduction Metagenomics, or environmental genomics, provides comprehensive information about the entire microbial community of a certain ecosystem, e.g. a wastewater treatment plant. So far......, metagenomic analyses have been hampered by high costs and high level of expertise needed to conduct the investigations, but it is changing now with development of new technologies allowing analyses of billions of DNA sequences (deep-sequencing) and user-friendly pipelines for analyses of the huge data sets...

  11. Classification of ncRNAs using position and size information in deep sequencing data

    Science.gov (United States)

    Erhard, Florian; Zimmer, Ralf

    2010-01-01

    Motivation: Small non-coding RNAs (ncRNAs) play important roles in various cellular functions in all clades of life. With next-generation sequencing techniques, it has become possible to study ncRNAs in a high-throughput manner and by using specialized algorithms ncRNA classes such as miRNAs can be detected in deep sequencing data. Typically, such methods are targeted to a certain class of ncRNA. Many methods rely on RNA secondary structure prediction, which is not always accurate and not all ncRNA classes are characterized by a common secondary structure. Unbiased classification methods for ncRNAs could be important to improve accuracy and to detect new ncRNA classes in sequencing data. Results: Here, we present a scoring system called ALPS (alignment of pattern matrices score) that only uses primary information from a deep sequencing experiment, i.e. the relative positions and lengths of reads, to classify ncRNAs. ALPS makes no further assumptions, e.g. about common structural properties in the ncRNA class and is nevertheless able to identify ncRNA classes with high accuracy. Since ALPS is not designed to recognize a certain class of ncRNA, it can be used to detect novel ncRNA classes, as long as these unknown ncRNAs have a characteristic pattern of deep sequencing read lengths and positions. We evaluate our scoring system on publicly available deep sequencing data and show that it is able to classify known ncRNAs with high sensitivity and specificity. Availability: Calculated pattern matrices of the datasets hESC and EB are available at the project web site http://www.bio.ifi.lmu.de/ALPS. An implementation of the described method is available upon request from the authors. Contact: florian.erhard@bio.ifi.lmu.de PMID:20823303

  12. Deep RNA sequencing improved the structural annotation of the Tuber melanosporum transcriptome.

    Science.gov (United States)

    Tisserant, E; Da Silva, C; Kohler, A; Morin, E; Wincker, P; Martin, F

    2011-02-01

    • The functional complexity of the Tuber melanosporum transcriptome has not yet been fully elucidated. Here, we applied high-throughput Illumina RNA-sequencing (RNA-Seq) to the transcriptome of T. melanosporum at different major developmental stages, that is free-living mycelium, fruiting body and ectomycorrhiza. • Sequencing of cDNA libraries generated a total of c. 24 million sequence reads representing > 882 Mb of sequence data. To construct a coverage signal profile across the genome, all reads were then aligned to the reference genome assembly of T. melanosporum Mel28. • We were able to identify a substantial number of novel transcripts, antisense transcripts, new exons, untranslated regions (UTRs), alternative upstream initiation codons and upstream open reading frames. • This RNA-Seq analysis allowed us to improve the genome annotation. It also provided us with a genome-wide view of the transcriptional and post-transcriptional mechanisms generating an increased number of transcript isoforms during major developmental transitions in T. melanosporum. PMID:21223284

  13. Identification of Multiple Stress Responsive Genes by Sequencing a Normalized cDNA Library from Sea-Land Cotton (Gossypium barbadense L..

    Directory of Open Access Journals (Sweden)

    Bin Zhou

    Full Text Available Plants often face multiple stresses including drought, extreme temperature, salinity, nutrition deficiency and biotic stresses during growth and development. All the stresses result in a series of physiological and metabolic reactions and then generate reversible inhibition of metabolism and growth and can cause seriously irreversible damage, even death. At each stage of cotton growth, environmental stress conditions pose devastating threats to plant growth and development, especially yield and quality. Due to the complex stress conditions and unclear molecular mechanisms of stress response, there is an urgent need to explore the mechanisms of cotton response against abiotic stresses.A normalized cDNA library was constructed using Gossypium barbadense Hai-7124 treated with different stress conditions (heat, cold, salt, drought, potassium and phosphorus deficit and Verticillium dahliae infection. Random sequencing of this library generated 6,047 high-quality expressed sequence tags (ESTs. The ESTs were clustered and assembled into 3,135 uniESTs, composed of 2,497 contigs and 638 singletons. The blastx results demonstrated 2,746 unigenes showing significant similarity to known genes, 74 uniESTs displaying significant similarity to genes of predicted proteins, and 315 uniESTs remain uncharacterized. Functional classification unveiled the abundance of uniESTs in binding, catalytic activity, and structural molecule activity. Annotations of the uniESTs by the plant transcription factor database (PlantTFDB and Plant Stress Protein Database (PSPDB disclosed that transcription factors and stress-related genes were enriched in the current library. The expression of some transcription factors and specific stress-related genes were verified by RT-PCR under various stress conditions.Annotation results showed that a huge number of genes respond to stress in our study, such as MYB-related, C2H2, FAR1, bHLH, bZIP, MADS, and mTERF. These results will improve our

  14. Identification of Multiple Stress Responsive Genes by Sequencing a Normalized cDNA Library from Sea-Land Cotton (Gossypium barbadense L.)

    Science.gov (United States)

    Zhou, Bin; Zhang, Lin; Ullah, Abid; Jin, Xin; Yang, Xiyan; Zhang, Xianlong

    2016-01-01

    Background Plants often face multiple stresses including drought, extreme temperature, salinity, nutrition deficiency and biotic stresses during growth and development. All the stresses result in a series of physiological and metabolic reactions and then generate reversible inhibition of metabolism and growth and can cause seriously irreversible damage, even death. At each stage of cotton growth, environmental stress conditions pose devastating threats to plant growth and development, especially yield and quality. Due to the complex stress conditions and unclear molecular mechanisms of stress response, there is an urgent need to explore the mechanisms of cotton response against abiotic stresses. Methodology and Principal Findings A normalized cDNA library was constructed using Gossypium barbadense Hai-7124 treated with different stress conditions (heat, cold, salt, drought, potassium and phosphorus deficit and Verticillium dahliae infection). Random sequencing of this library generated 6,047 high-quality expressed sequence tags (ESTs). The ESTs were clustered and assembled into 3,135 uniESTs, composed of 2,497 contigs and 638 singletons. The blastx results demonstrated 2,746 unigenes showing significant similarity to known genes, 74 uniESTs displaying significant similarity to genes of predicted proteins, and 315 uniESTs remain uncharacterized. Functional classification unveiled the abundance of uniESTs in binding, catalytic activity, and structural molecule activity. Annotations of the uniESTs by the plant transcription factor database (PlantTFDB) and Plant Stress Protein Database (PSPDB) disclosed that transcription factors and stress-related genes were enriched in the current library. The expression of some transcription factors and specific stress-related genes were verified by RT-PCR under various stress conditions. Conclusions/Significance Annotation results showed that a huge number of genes respond to stress in our study, such as MYB-related, C2H2, FAR1, b

  15. cDNA cloning of a mouse mammary epithelial cell surface protein reveals the existence of epidermal growth factor-like domains linked to factor VIII-like sequences

    International Nuclear Information System (INIS)

    A 2.1-kilobase cDNA coding for a surface protein of mammary epithelial cells has been isolated from a mouse mammary gland λgt11 cDNA library. Sequence analysis of this cDNA reveals an open reading frame of 1,389 base pairs that defines a protein with a molecular mass of 51.5 dKa. Structural analysis of the predicted sequence identifies two putative functional domains of the protein: (i) an N-terminal cysteine-rich region that is similar to epidermal growth factor-like domains of Drosophila Notch-1 protein and (ii) a large segment of the sequence that exhibited 54.5% identify with C-terminal domains of human coagulation factors VIII and V. These similarities in structure are used to predict the possible functions of the protein and its means of interaction with the cell surface. mRNA expression was detectable in mammary tissue from nonpregnant animals but was maximal in the lactating gland. In cultured cells, mRNA levels also correlated with the degree of cellular differentiation

  16. cDNA: 35986 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.122430 Mus musculus adult male corpora quadrigemina cDNA, RIKEN full-length enr ... 0341B09 product:hypothetical Ubiquitin-conjugating enzymes ... containing protein, full insert sequence gnl|UG|Mm ...

  17. cDNA: 35981 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.122430 Mus musculus adult retina cDNA, RIKEN full-length enriched library, clon ... 0014I21 product:hypothetical Ubiquitin-conjugating enzymes ... containing protein, full insert sequence gnl|UG|Mm ...

  18. cDNA: 35985 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.122430 Mus musculus adult male testis cDNA, RIKEN full-length enriched library, ... 0440H19 product:hypothetical Ubiquitin-conjugating enzymes ... containing protein, full insert sequence gnl|UG|Mm ...

  19. Galaxy Environments in DEEP2: The Birth of the Red Sequence

    CERN Document Server

    Cooper, M; Cooper, Michael; Newman, Jeff

    2006-01-01

    The DEEP2 Galaxy Redshift Survey is the first project to study the distant Universe by obtaining a data set comparable in size and nature to recent generations of local surveys. Made possible by the largest ground-based optical telescopes and new instrumentation, DEEP2 was designed to measure both the properties of galaxies at z ~ 1 and their distribution in space, enabling a number of unique tests of galaxy formation and evolution. Here, we first provide an overview of the survey, including the planned second major data release scheduled for early 2007. We then present new results from DEEP2 pertaining to the relationship between galaxy properties and environment at intermediate redshift, revealing where and when typical ~L* galaxies began quenching and moved onto the red sequence in significant number.

  20. Hybrid DNA virus in Chinese patients with seronegative hepatitis discovered by deep sequencing

    OpenAIRE

    Xu, Baoyan; Zhi, Ning; Hu, Gangqing; Wan, Zhihong; Zheng, Xiaobin; Liu, Xiaohong; Wong, Susan; Kajigaya, Sachiko; Zhao, Keji; Mao, Qing; Young, Neal S

    2013-01-01

    Seronegative hepatitis—non-A, non-B, non-C, non-D, non-E hepatitis—is poorly characterized but strongly associated with serious complications. We collected 92 sera specimens from patients with non-A–E hepatitis in Chongqing, China between 1999 and 2007. Ten sera pools were screened by Solexa deep sequencing. We discovered a 3,780-bp contig present in all 10 pools that yielded BLASTx E scores of 7e-05–0.008 against parvoviruses. The complete sequence of the in silico-assembled 3,780-bp contig ...

  1. Classification of ncRNAs using position and size information in deep sequencing data

    OpenAIRE

    Erhard, Florian; Zimmer, Ralf

    2010-01-01

    Motivation: Small non-coding RNAs (ncRNAs) play important roles in various cellular functions in all clades of life. With next-generation sequencing techniques, it has become possible to study ncRNAs in a high-throughput manner and by using specialized algorithms ncRNA classes such as miRNAs can be detected in deep sequencing data. Typically, such methods are targeted to a certain class of ncRNA. Many methods rely on RNA secondary structure prediction, which is not always accurate and not all...

  2. miRBase: integrating microRNA annotation and deep-sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2011-01-01

    miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15,000 microRNA gene loci in over 140 species, and over 17,000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/. PMID:21037258

  3. Polymorphism Identification and Improved Genome Annotation of Brassica rapa Through Deep RNA Sequencing

    OpenAIRE

    Devisetty, Upendra Kumar; Covington, Michael F.; An V Tat; Lekkala, Saradadevi; Maloof, Julin N.

    2014-01-01

    The mapping and functional analysis of quantitative traits in Brassica rapa can be greatly improved with the availability of physically positioned, gene-based genetic markers and accurate genome annotation. In this study, deep transcriptome RNA sequencing (RNA-Seq) of Brassica rapa was undertaken with two objectives: SNP detection and improved transcriptome annotation. We performed SNP detection on two varieties that are parents of a mapping population to aid in development of a marker system...

  4. Cloning, sequence analysis and expression of a cDNA encoding a novel insulin-like growth factor binding protein (IGFBP-2).

    OpenAIRE

    Binkert, C; Landwehr, J; Mary, J L; J. Schwander; Heinrich, G

    1989-01-01

    Insulin-like growth factors bind with high affinity to specific binding proteins in extracellular fluids. To identify structural characteristics of IGF-binding proteins that might define their physiological roles, we determined the complete primary structure of a novel human IGF-binding protein (IGFBP-2) from a cloned cDNA. The cDNA encodes a 328 amino acid IGF-binding protein precursor which contains a 39-residue signal peptide. The mature 289 amino acid IGFBP-2 has a predicted Mr of 31,325....

  5. Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics.

    Directory of Open Access Journals (Sweden)

    Ehsaneddin Asgari

    Full Text Available We introduce a new representation and feature extraction method for biological sequences. Named bio-vectors (BioVec to refer to biological sequences in general with protein-vectors (ProtVec for proteins (amino-acid sequences and gene-vectors (GeneVec for gene sequences, this representation can be widely used in applications of deep learning in proteomics and genomics. In the present paper, we focus on protein-vectors that can be utilized in a wide array of bioinformatics investigations such as family classification, protein visualization, structure prediction, disordered protein identification, and protein-protein interaction prediction. In this method, we adopt artificial neural network approaches and represent a protein sequence with a single dense n-dimensional vector. To evaluate this method, we apply it in classification of 324,018 protein sequences obtained from Swiss-Prot belonging to 7,027 protein families, where an average family classification accuracy of 93%±0.06% is obtained, outperforming existing family classification methods. In addition, we use ProtVec representation to predict disordered proteins from structured proteins. Two databases of disordered sequences are used: the DisProt database as well as a database featuring the disordered regions of nucleoporins rich with phenylalanine-glycine repeats (FG-Nups. Using support vector machine classifiers, FG-Nup sequences are distinguished from structured protein sequences found in Protein Data Bank (PDB with a 99.8% accuracy, and unstructured DisProt sequences are differentiated from structured DisProt sequences with 100.0% accuracy. These results indicate that by only providing sequence data for various proteins into this model, accurate information about protein structure can be determined. Importantly, this model needs to be trained only once and can then be applied to extract a comprehensive set of information regarding proteins of interest. Moreover, this representation can be

  6. Efficient selection of biomineralizing DNA aptamers using deep sequencing and population clustering.

    Science.gov (United States)

    Bawazer, Lukmaan A; Newman, Aaron M; Gu, Qian; Ibish, Abdullah; Arcila, Mary; Cooper, James B; Meldrum, Fiona C; Morse, Daniel E

    2014-01-28

    DNA-based information systems drive the combinatorial optimization processes of natural evolution, including the evolution of biominerals. Advances in high-throughput DNA sequencing expand the power of DNA as a potential information platform for combinatorial engineering, but many applications remain to be developed due in part to the challenge of handling large amounts of sequence data. Here we employ high-throughput sequencing and a recently developed clustering method (AutoSOME) to identify single-stranded DNA sequence families that bind specifically to ZnO semiconductor mineral surfaces. These sequences were enriched from a diverse DNA library after a single round of screening, whereas previous screening approaches typically require 5-15 rounds of enrichment for effective sequence identification. The consensus sequence of the largest cluster was poly d(T)30. This consensus sequence exhibited clear aptamer behavior and was shown to promote the synthesis of crystalline ZnO from aqueous solution at near-neutral pH. This activity is significant, as the crystalline form of this wide-bandgap semiconductor is not typically amenable to solution synthesis in this pH range. High-resolution TEM revealed that this DNA synthesis route yields ZnO nanoparticles with an amorphous-crystalline core-shell structure, suggesting that the mechanism of mineralization involves nanoscale coacervation around the DNA template. We thus demonstrate that our new method, termed Single round Enrichment of Ligands by deep Sequencing (SEL-Seq), can facilitate biomimetic synthesis of technological nanomaterials by accelerating combinatorial selection of biomolecular-mineral interactions. Moreover, by enabling direct characterization of sequence family demographics, we anticipate that SEL-Seq will enhance aptamer discovery in applications employing additional rounds of screening. PMID:24341560

  7. Deep sequencing analysis of HBV genotype shift and correlation with antiviral efficiency during adefovir dipivoxil therapy.

    Directory of Open Access Journals (Sweden)

    Yuwei Wang

    Full Text Available Viral genotype shift in chronic hepatitis B (CHB patients during antiviral therapy has been reported, but the underlying mechanism remains elusive.38 CHB patients treated with ADV for one year were selected for studying genotype shift by both deep sequencing and Sanger sequencing method.Sanger sequencing method found that 7.9% patients showed mixed genotype before ADV therapy. In contrast, all 38 patients showed mixed genotype before ADV treatment by deep sequencing. 95.5% mixed genotype rate was also obtained from additional 200 treatment-naïve CHB patients. Of the 13 patients with genotype shift, the fraction of the minor genotype in 5 patients (38% increased gradually during the course of ADV treatment. Furthermore, responses to ADV and HBeAg seroconversion were associated with the high rate of genotype shift, suggesting drug and immune pressure may be key factors to induce genotype shift. Interestingly, patients with genotype C had a significantly higher rate of genotype shift than genotype B. In genotype shift group, ADV treatment induced a marked enhancement of genotype B ratio accompanied by a reduction of genotype C ratio, suggesting genotype C may be more sensitive to ADV than genotype B. Moreover, patients with dominant genotype C may have a better therapeutic effect. Finally, genotype shifts was correlated with clinical improvement in terms of ALT.Our findings provided a rational explanation for genotype shift among ADV-treated CHB patients. The genotype and genotype shift might be associated with antiviral efficiency.

  8. miRBase: annotating high confidence microRNAs using deep sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2014-01-01

    We describe an update of the miRBase database (http://www.mirbase.org/), the primary microRNA sequence repository. The latest miRBase release (v20, June 2013) contains 24 521 microRNA loci from 206 species, processed to produce 30 424 mature microRNA products. The rate of deposition of novel microRNAs and the number of researchers involved in their discovery continue to increase, driven largely by small RNA deep sequencing experiments. In the face of these increases, and a range of microRNA annotation methods and criteria, maintaining the quality of the microRNA sequence data set is a significant challenge. Here, we describe recent developments of the miRBase database to address this issue. In particular, we describe the collation and use of deep sequencing data sets to assign levels of confidence to miRBase entries. We now provide a high confidence subset of miRBase entries, based on the pattern of mapped reads. The high confidence microRNA data set is available alongside the complete microRNA collection at http://www.mirbase.org/. We also describe embedding microRNA-specific Wikipedia pages on the miRBase website to encourage the microRNA community to contribute and share textual and functional information. PMID:24275495

  9. Exploring the Mechanisms of Gastrointestinal Cancer Development Using Deep Sequencing Analysis

    International Nuclear Information System (INIS)

    Next-generation sequencing (NGS) technologies have revolutionized cancer genomics due to their high throughput sequencing capacity. Reports of the gene mutation profiles of various cancers by many researchers, including international cancer genome research consortia, have increased over recent years. In addition to detecting somatic mutations in tumor cells, NGS technologies enable us to approach the subject of carcinogenic mechanisms from new perspectives. Deep sequencing, a method of optimizing the high throughput capacity of NGS technologies, allows for the detection of genetic aberrations in small subsets of premalignant and/or tumor cells in noncancerous chronically inflamed tissues. Genome-wide NGS data also make it possible to clarify the mutational signatures of each cancer tissue by identifying the precise pattern of nucleotide alterations in the cancer genome, providing new information regarding the mechanisms of tumorigenesis. In this review, we highlight these new methods taking advantage of NGS technologies, and discuss our current understanding of carcinogenic mechanisms elucidated from such approaches

  10. Exploring the Mechanisms of Gastrointestinal Cancer Development Using Deep Sequencing Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Matsumoto, Tomonori; Shimizu, Takahiro; Takai, Atsushi; Marusawa, Hiroyuki, E-mail: maru@kuhp.kyoto-u.ac.jp [Department of Gastroenterology and Hepatology, Graduate School of Medicine, Kyoto University, 54 Shogoin-Kawahara-cho, Sakyo-ku, Kyoto 606-8507 (Japan)

    2015-06-15

    Next-generation sequencing (NGS) technologies have revolutionized cancer genomics due to their high throughput sequencing capacity. Reports of the gene mutation profiles of various cancers by many researchers, including international cancer genome research consortia, have increased over recent years. In addition to detecting somatic mutations in tumor cells, NGS technologies enable us to approach the subject of carcinogenic mechanisms from new perspectives. Deep sequencing, a method of optimizing the high throughput capacity of NGS technologies, allows for the detection of genetic aberrations in small subsets of premalignant and/or tumor cells in noncancerous chronically inflamed tissues. Genome-wide NGS data also make it possible to clarify the mutational signatures of each cancer tissue by identifying the precise pattern of nucleotide alterations in the cancer genome, providing new information regarding the mechanisms of tumorigenesis. In this review, we highlight these new methods taking advantage of NGS technologies, and discuss our current understanding of carcinogenic mechanisms elucidated from such approaches.

  11. Galaxies in SDSS and DEEP2: a quiet life on the blue sequence?

    CERN Document Server

    Blanton, M R

    2005-01-01

    In the six billion years between redshifts z=1 and z=0.1, galaxies change due to the aging of their stellar populations, the formation of new stars, and mergers with other galaxies. Here I explore the relative importance of these various effects, finding that while mergers are likely to be important for the red galaxy sequence they are unlikely to affect more than 10% of the blue galaxy sequence. I compare the galaxy population at redshift z=0.1 from the Sloan Digital Sky Survey to that at z=1 from the Deep Extragalactic Evolutionary Probe 2. Galaxies are bluer at z=1: the blue sequence by about 0.3 mag and the red sequence by about 0.1 mag, in redshift z=0.1 (u-g) color. I evaluate the change in color and in the luminosity functions of the two sequences using some simplistic stellar population synthesis models. These models indicate that the luminous end of the red sequence fades less than passive evolution allows by about 0.2 mag. Due to a lack of luminous blue progenitors, ``dry'' mergers betweeen red gala...

  12. Use of deep sequencing data for routine analysis of HIV resistance in newly diagnosed patients

    Directory of Open Access Journals (Sweden)

    Jose-Angel Fernández-Caballero

    2014-11-01

    Full Text Available Introduction: Use of deep sequencing is becoming a critical tool in clinical virology, with an important impact in the HIV field for routine diagnostic purposes. Here, we present the comparison of deep and Sanger sequencing in newly diagnosed HIV patients, and the use of DeepChek v1.3 & VisibleChek for their interpretation and integration with virological and clinical data. Patients and Methods: Plasma samples from 88 newly diagnosed HIV-1-infected patients were included in the study. Median age (IQR was 37 (27–47, median CD4 count (IQR was 387 (220–554, and 85% were males. Median Viral Load (Log, IQR was 5.03 (4.51–5.53. Deep sequencing was obtained using a GS-Junior (Roche. Sequences were preprocessed with the 454 AVA software; aligned reads were uploaded into the DeepChek v1.3 system (ABL SA. Sanger sequences (Trugene, were uploaded in parallel. Stanford algorithm (version 7.0 resistance interpretation to first line drugs and all the mutations (score≥5 were analyzed. For deep sequencing, 1%, 5% and 10% thresholds were chosen for resistance interpretation. Results: Using VisibleChek for analysis, we were able to describe the detection of any mutation using Sanger in 37/88 patients, with a total number of 50 Stanford ≥5 mutations, K103N and E138A being the most prevalent (n=4. Using UDS-1%, we found 72/88 patients with at least one mutation (total of 206 Stanford ≥5 mutations. Using Sanger data, 9/88 patients (10.22% showed any resistance to NNRTIs, while none showed resistance to NRTIs or PIs. Using UDS-10% increased resistance to NRTIs [3/88 (3.40%], to NNRTIs 12/88 (13.63%, and to a lesser extent to PIs [1/88 (1.13%]. Using UDS-5% increased resistance to NRTIs [4/88 (4.54%] and to NNRTIs [12/88 (13.63%], but not to PIs. Using UDS-1% increased resistance to all classes: NRTIs [14/88 (15.90%], NNRTIs [26/88 (30.68%], and PIs [6/88 (6.81]. Conclusions: DeepChek and VisibleChek allow for an easy, reliable and rapid analysis of UDS

  13. Deep sequence characterisation of a divergent HPIV-4a from an adult with prolonged influenza-like illness

    Directory of Open Access Journals (Sweden)

    Katherine E. Arden

    2015-12-01

    Deep sequencing allowed identification and genomic characterisation of a possible pathogen from an ILI as well as being an important tool to aid future understanding of the linkages between viral genetic variation, transmission and disease prognosis.

  14. High-throughput sequencing and analysis of the gill tissue transcriptome from the deep-sea hydrothermal vent mussel Bathymodiolus azoricus

    Directory of Open Access Journals (Sweden)

    Gomes Paula

    2010-10-01

    Full Text Available Abstract Background Bathymodiolus azoricus is a deep-sea hydrothermal vent mussel found in association with large faunal communities living in chemosynthetic environments at the bottom of the sea floor near the Azores Islands. Investigation of the exceptional physiological reactions that vent mussels have adopted in their habitat, including responses to environmental microbes, remains a difficult challenge for deep-sea biologists. In an attempt to reveal genes potentially involved in the deep-sea mussel innate immunity we carried out a high-throughput sequence analysis of freshly collected B. azoricus transcriptome using gills tissues as the primary source of immune transcripts given its strategic role in filtering the surrounding waterborne potentially infectious microorganisms. Additionally, a substantial EST data set was produced and from which a comprehensive collection of genes coding for putative proteins was organized in a dedicated database, "DeepSeaVent" the first deep-sea vent animal transcriptome database based on the 454 pyrosequencing technology. Results A normalized cDNA library from gills tissue was sequenced in a full 454 GS-FLX run, producing 778,996 sequencing reads. Assembly of the high quality reads resulted in 75,407 contigs of which 3,071 were singletons. A total of 39,425 transcripts were conceptually translated into amino-sequences of which 22,023 matched known proteins in the NCBI non-redundant protein database, 15,839 revealed conserved protein domains through InterPro functional classification and 9,584 were assigned with Gene Ontology terms. Queries conducted within the database enabled the identification of genes putatively involved in immune and inflammatory reactions which had not been previously evidenced in the vent mussel. Their physical counterpart was confirmed by semi-quantitative quantitative Reverse-Transcription-Polymerase Chain Reactions (RT-PCR and their RNA transcription level by quantitative PCR (q

  15. Deep sequencing discovery of novel and conserved microRNAs in trifoliate orange (Citrus trifoliata

    Directory of Open Access Journals (Sweden)

    Yu Huaping

    2010-07-01

    Full Text Available Abstract Background MicroRNAs (miRNAs play a critical role in post-transcriptional gene regulation and have been shown to control many genes involved in various biological and metabolic processes. There have been extensive studies to discover miRNAs and analyze their functions in model plant species, such as Arabidopsis and rice. Deep sequencing technologies have facilitated identification of species-specific or lowly expressed as well as conserved or highly expressed miRNAs in plants. Results In this research, we used Solexa sequencing to discover new microRNAs in trifoliate orange (Citrus trifoliata which is an important rootstock of citrus. A total of 13,106,753 reads representing 4,876,395 distinct sequences were obtained from a short RNA library generated from small RNA extracted from C. trifoliata flower and fruit tissues. Based on sequence similarity and hairpin structure prediction, we found that 156,639 reads representing 63 sequences from 42 highly conserved miRNA families, have perfect matches to known miRNAs. We also identified 10 novel miRNA candidates whose precursors were all potentially generated from citrus ESTs. In addition, five miRNA* sequences were also sequenced. These sequences had not been earlier described in other plant species and accumulation of the 10 novel miRNAs were confirmed by qRT-PCR analysis. Potential target genes were predicted for most conserved and novel miRNAs. Moreover, four target genes including one encoding IRX12 copper ion binding/oxidoreductase and three genes encoding NB-LRR disease resistance protein have been experimentally verified by detection of the miRNA-mediated mRNA cleavage in C. trifoliata. Conclusion Deep sequencing of short RNAs from C. trifoliata flowers and fruits identified 10 new potential miRNAs and 42 highly conserved miRNA families, indicating that specific miRNAs exist in C. trifoliata. These results show that regulatory miRNAs exist in agronomically important trifoliate orange

  16. Molecular cloning, cDNA sequence, and chromosomal localization of the human phosphatidylinositol 3-kinase p110{alpha} (PIK3CA) gene

    Energy Technology Data Exchange (ETDEWEB)

    Volinia, S.; Hiles, I.; Waterfield, M.D. [Ludwig Institute for Cancer Research, London (United Kingdom)] [and others

    1994-12-01

    Phosphatidylinositol (PI) 3-kinase is a heterodimeric enzyme comprising a 110-kDa catalytic subunit and an 85-kDa regulatory subunit that binds to tyrosine phosphopeptide sites linked directly or indirectly to receptors serving diverse signal functions. Knowledge of the structure and function of PI 3-kinase was greatly advanced by the purification, cDNA cloning, and subsequent expression of the bovine enzyme. Here the cloning of the cDNA for the human p110{alpha}subunit of PI 3-kinase (PIK3CA), encoding a protein 99% identical to the bovine p110, and of its gene in YAC is described. The chromosomal localization of the gene for PIK3CA is shown to be at 3q21-qter as determined using somatic cell hybrids. In situ hybridization performed using Alu-PCR from the YAC DNA located the gene in 3q26.3. 30 refs., 3 figs., 1 tab.

  17. Complete Genome Sequence of a Reference Stock of Simian Immunodeficiency Virus RNA (SIVmac251/32H/L28) Determined by Deep Sequencing.

    Science.gov (United States)

    Jenkins, Adrian; Ham, Claire; Almond, Neil; Berry, Neil

    2016-01-01

    A reference preparation for simian immunodeficiency virus (SIV) RNA nucleic acid assays was characterized by complete genome deep sequencing. The entire coding sequence and flanking long terminal repeats, including minority species, were determined. This information will inform SIV research investigations and aid evaluation and development of amplification assays for SIV RNA quantification. PMID:27231355

  18. HomozygosityMapper2012--bridging the gap between homozygosity mapping and deep sequencing.

    Science.gov (United States)

    Seelow, Dominik; Schuelke, Markus

    2012-07-01

    Homozygosity mapping is a common method to map recessive traits in consanguineous families. To facilitate these analyses, we have developed HomozygosityMapper, a web-based approach to homozygosity mapping. HomozygosityMapper allows researchers to directly upload the genotype files produced by the major genotyping platforms as well as deep sequencing data. It detects stretches of homozygosity shared by the affected individuals and displays them graphically. Users can interactively inspect the underlying genotypes, manually refine these regions and eventually submit them to our candidate gene search engine GeneDistiller to identify the most promising candidate genes. Here, we present the new version of HomozygosityMapper. The most striking new feature is the support of Next Generation Sequencing *.vcf files as input. Upon users' requests, we have implemented the analysis of common experimental rodents as well as of important farm animals. Furthermore, we have extended the options for single families and loss of heterozygosity studies. Another new feature is the export of *.bed files for targeted enrichment of the potential disease regions for deep sequencing strategies. HomozygosityMapper also generates files for conventional linkage analyses which are already restricted to the possible disease regions, hence superseding CPU-intensive genome-wide analyses. HomozygosityMapper is freely available at http://www.homozygositymapper.org/. PMID:22669902

  19. Isolation and sequence of a cDNA encoding the precursor of a bombesinlike peptide from brain and early embryos of Xenopus laevis.

    OpenAIRE

    Wechselberger, C; Kreil, G; Richter, K.

    1992-01-01

    A cDNA encoding the precursor of a bombesinlike peptide was isolated from brain of Xenopus laevis. The predicted end product resembles neuromedin B, which was originally isolated from mammalian spinal cord. The mRNA for this precursor was also present in gastrointestinal tract and in ovaries. Moreover, it could be detected in early embryos (stage 2 and stage 10) of X. laevis. These findings suggest novel roles for peptides of the bombesin family in oocyte maturation and early amphibian develo...

  20. Genetic variation of human papillomavirus type 16 in individual clinical specimens revealed by deep sequencing.

    Directory of Open Access Journals (Sweden)

    Iwao Kukimoto

    Full Text Available Viral genetic diversity within infected cells or tissues, called viral quasispecies, has been mostly studied for RNA viruses, but has also been described among DNA viruses, including human papillomavirus type 16 (HPV16 present in cervical precancerous lesions. However, the extent of HPV genetic variation in cervical specimens, and its involvement in HPV-induced carcinogenesis, remains unclear. Here, we employ deep sequencing to comprehensively analyze genetic variation in the HPV16 genome isolated from individual clinical specimens. Through overlapping full-circle PCR, approximately 8-kb DNA fragments covering the whole HPV16 genome were amplified from HPV16-positive cervical exfoliated cells collected from patients with either low-grade squamous intraepithelial lesion (LSIL or invasive cervical cancer (ICC. Deep sequencing of the amplified HPV16 DNA enabled de novo assembly of the full-length HPV16 genome sequence for each of 7 specimens (5 LSIL and 2 ICC samples. Subsequent alignment of read sequences to the assembled HPV16 sequence revealed that 2 LSILs and 1 ICC contained nucleotide variations within E6, E1 and the non-coding region between E5 and L2 with mutation frequencies of 0.60% to 5.42%. In transient replication assays, a novel E1 mutant found in ICC, E1 Q381E, showed reduced ability to support HPV16 origin-dependent replication. In addition, partially deleted E2 genes were detected in 1 LSIL sample in a mixed state with the intact E2 gene. Thus, the methods used in this study provide a fundamental framework for investigating the influence of HPV somatic genetic variation on cervical carcinogenesis.

  1. Genotyping Influenza Virus by Next-Generation Deep Sequencing in Clinical Specimens.

    Science.gov (United States)

    Seong, Moon Woo; Cho, Sung Im; Park, Hyunwoong; Seo, Soo Hyun; Lee, Seung Jun; Kim, Eui Chong; Park, Sung Sup

    2016-05-01

    Rapid and accurate identification of an influenza outbreak is essential for patient care and treatment. We describe a next-generation sequencing (NGS)-based, unbiased deep sequencing method in clinical specimens to investigate an influenza outbreak. Nasopharyngeal swabs from patients were collected for molecular epidemiological analysis. Total RNA was sequenced by using the NGS technology as paired-end 250 bp reads. Total of 7 to 12 million reads were obtained. After mapping to the human reference genome, we analyzed the 3-4% of reads that originated from a non-human source. A BLAST search of the contigs reconstructed de novo revealed high sequence similarity with that of the pandemic H1N1 virus. In the phylogenetic analysis, the HA gene of our samples clustered closely with that of A/Senegal/VR785/2010(H1N1), A/Wisconsin/11/2013(H1N1), and A/Korea/01/2009(H1N1), and the NA gene of our samples clustered closely with A/Wisconsin/11/2013(H1N1). This study suggests that NGS-based unbiased sequencing can be effectively applied to investigate molecular characteristics of nosocomial influenza outbreak by using clinical specimens such as nasopharyngeal swabs. PMID:26915615

  2. A simple method for the parallel deep sequencing of full influenza A genomes

    DEFF Research Database (Denmark)

    Kampmann, Marie-Louise; Fordyce, Sarah Louise; Avila Arcos, Maria del Carmen;

    2011-01-01

    Given the major threat of influenza A to human and animal health, and its ability to evolve rapidly through mutation and reassortment, tools that enable its timely characterization are necessary to help monitor its evolution and spread. For this purpose, deep sequencing can be a very valuable tool...... virus from human and H1N1 virus from swine, on a single lane of a GAIIx flow cell to an average depth of 122-fold. This technique can be applied to cultivated and uncultivated virus....

  3. Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Whitehead, Timothy A.; Chevalier, Aaron; Song, Yifan; Dreyfus, Cyrille; Fleishman, Sarel J.; De Mattos, Cecilia; Myers, Chris A.; Kamisetty, Hetunandan; Blair, Patrick; Wilson, Ian A.; Baker, David (UWASH); (Scripps); (NRL)

    2012-06-19

    We show that comprehensive sequence-function maps obtained by deep sequencing can be used to reprogram interaction specificity and to leapfrog over bottlenecks in affinity maturation by combining many individually small contributions not detectable in conventional approaches. We use this approach to optimize two computationally designed inhibitors against H1N1 influenza hemagglutinin and, in both cases, obtain variants with subnanomolar binding affinity. The most potent of these, a 51-residue protein, is broadly cross-reactive against all influenza group 1 hemagglutinins, including human H2, and neutralizes H1N1 viruses with a potency that rivals that of several human monoclonal antibodies, demonstrating that computational design followed by comprehensive energy landscape mapping can generate proteins with potential therapeutic utility.

  4. Inside the intraterrestrials: The deep biosphere seen through massively parallel sequencing

    Science.gov (United States)

    Biddle, J.

    2009-12-01

    Deeply buried marine sediments may house a large amount of the Earth’s microbial population. Initial studies based on 16S rRNA clone libraries suggest that these sediments contain unique phylotypes of microorganisms, particularly from the archaeal domain. Since this environment is so difficult to study, microbiologists are challenged to find ways to examine these populations remotely. A major approach taken to study this environment uses massively parallel sequencing to examine the inner genetic workings of these microorganisms after the sediment has been drilled. Both metagenomics and tagged amplicon sequencing have been employed on deep sediments, and initial results show that different geographic regions can be differentiated through genomics and also minor populations may cause major geochemical changes.

  5. Metatranscriptomic analysis of small RNAs present in soybean deep sequencing libraries

    Directory of Open Access Journals (Sweden)

    Lorrayne Gomes Molina

    2012-01-01

    Full Text Available A large number of small RNAs unrelated to the soybean genome were identified after deep sequencing of soybean small RNA libraries. A metatranscriptomic analysis was carried out to identify the origin of these sequences. Comparative analyses of small interference RNAs (siRNAs present in samples collected in open areas corresponding to soybean field plantations and samples from soybean cultivated in greenhouses under a controlled environment were made. Different pathogenic, symbiotic and free-living organisms were identified from samples of both growth systems. They included viruses, bacteria and different groups of fungi. This approach can be useful not only to identify potentially unknown pathogens and pests, but also to understand the relations that soybean plants establish with microorganisms that may affect, directly or indirectly, plant health and crop production.

  6. Rapid fine conformational epitope mapping using comprehensive mutagenesis and deep sequencing.

    Science.gov (United States)

    Kowalsky, Caitlin A; Faber, Matthew S; Nath, Aritro; Dann, Hailey E; Kelly, Vince W; Liu, Li; Shanker, Purva; Wagner, Ellen K; Maynard, Jennifer A; Chan, Christina; Whitehead, Timothy A

    2015-10-30

    Knowledge of the fine location of neutralizing and non-neutralizing epitopes on human pathogens affords a better understanding of the structural basis of antibody efficacy, which will expedite rational design of vaccines, prophylactics, and therapeutics. However, full utilization of the wealth of information from single cell techniques and antibody repertoire sequencing awaits the development of a high throughput, inexpensive method to map the conformational epitopes for antibody-antigen interactions. Here we show such an approach that combines comprehensive mutagenesis, cell surface display, and DNA deep sequencing. We develop analytical equations to identify epitope positions and show the method effectiveness by mapping the fine epitope for different antibodies targeting TNF, pertussis toxin, and the cancer target TROP2. In all three cases, the experimentally determined conformational epitope was consistent with previous experimental datasets, confirming the reliability of the experimental pipeline. Once the comprehensive library is generated, fine conformational epitope maps can be prepared at a rate of four per day. PMID:26296891

  7. Identification of miRNAs and their target genes in developing soybean seeds by deep sequencing

    Directory of Open Access Journals (Sweden)

    Chen Shou-Yi

    2011-01-01

    Full Text Available Abstract Background MicroRNAs (miRNAs regulate gene expression by mediating gene silencing at transcriptional and post-transcriptional levels in higher plants. miRNAs and related target genes have been widely studied in model plants such as Arabidopsis and rice; however, the number of identified miRNAs in soybean (Glycine max is limited, and global identification of the related miRNA targets has not been reported in previous research. Results In our study, a small RNA library and a degradome library were constructed from developing soybean seeds for deep sequencing. We identified 26 new miRNAs in soybean by bioinformatic analysis and further confirmed their expression by stem-loop RT-PCR. The miRNA star sequences of 38 known miRNAs and 8 new miRNAs were also discovered, providing additional evidence for the existence of miRNAs. Through degradome sequencing, 145 and 25 genes were identified as targets of annotated miRNAs and new miRNAs, respectively. GO analysis indicated that many of the identified miRNA targets may function in soybean seed development. Additionally, a soybean homolog of Arabidopsis SUPPRESSOR OF GENE SLIENCING 3 (AtSGS3 was detected as a target of the newly identified miRNA Soy_25, suggesting the presence of feedback control of miRNA biogenesis. Conclusions We have identified large numbers of miRNAs and their related target genes through deep sequencing of a small RNA library and a degradome library. Our study provides more information about the regulatory network of miRNAs in soybean and advances our understanding of miRNA functions during seed development.

  8. Cloning and Sequencing of Xylanase cDNA from Volvariella volvacea Using Conserved Sequences in Cellulose-Binding Domain%利用真菌纤维素结合域(CBD)保守性序列进行草菇木聚糖酶cDNA的克隆

    Institute of Scientific and Technical Information of China (English)

    丁少军; J.A.BUSWELL

    2004-01-01

    Cellulose-binding domains (CBDs) are present in the majority of fungal cellulases and hemicellulases. Based on the conserved region of CBDs, degenerate primers were designed and used to amplify the 5′-end cDNA fragment of xylanase from Volvariella volvacea by 5′-RACE. Gene specific primer was then designed based on extreme region of 5′-end cDNA fragment and used to amplify the full length cDNA of xylanase. The cDNA of xynl was 1287 bp in length, including 3′and 5′-non-coding region. The xynl cDNA contained an ORF of 1101 bp encoding 367 amino acids, in which there was a putative signal peptide with 19 amino acids. Alignment of the deduced amino acid sequence of xynl with other xylanases showed that the homology with family-10 xylanases from Agaricus bisporus xyll, AspergiUus sojae xynl, Aspergillus kawachii xynA, Fusarium oxysporumf, sp xyl3 was 64% ,55% ,52% ,55% , respectively.

  9. 真鲷天然抗性相关巨噬蛋白全长cDNA的克隆与序列分析%Cloning and sequence analysis of Nramp cDNA from Pagrus major

    Institute of Scientific and Technical Information of China (English)

    徐美瑜; 陈松林; 沙珍霞; 季相山

    2005-01-01

    Natural resistance associated macrophage protein (Nmmp) is an innate resistance protein to intracellular parasites, which is expressed plentifully in macrophage ceils. Nramp has been studied in mouse, human, cattle, rainbow trout and channel catfish.However, tittle was known about the structure of Pagrus major Nramp. In order to get the complete sequence of Pagrus major Nramp, a pair of primer is designed according to a 200bp known sequence of Pagrus major Nramp cDNA. By the use of SMART RACE, the full Nramp of Pagrus major cDNA about 5 000 bp was obtained, including about 200 bp 5' terminal region (UTR),complete encoding region and 3' terminal region. There were 3 ployA signals, which showed many possibilities of cutting at 3' terminal region. The character of Pagrus major Nramp nucleotide sequence and deduced amino acid sequence are analyzed. 12 putative transmembrane(TM) regions, a consensus transport motif (CTM), a predicted protein kinase C phosphroylation site and three predicted N-link glycosylation sites are indicated in its deduced amino acid sequence. The ‘consense transport motif' CTM is located etween TM8 and TM9. Furthermore, a protein kinase C phosphroylation site and three N-link glycosylation sites were predicted. The lignment of amino acid sequences between Pagrus major Nramp cDNA and several animals is analyzed and the deduced amino acid equence of Pagrus major Nrarnp had 77.8%, 83.0%, 82.3%, 80.0%, 81.1%, 60.4%, 70.3%, 58.5%, 69.5% identity ith rainbow trout α(AAD20721), rainbow trout β(AAD20722), channel catfish(AF400108), fathead minnow (AAF01778),common carp (CABal96), mouse 1 ( AAA39838 ), mouse 2 ( AAC42051 ), human 1 ( D50403 ), human 2 ( NP - 0(106(18 ),respectively. The alignment reveals high conservation in TM and CTM regions. Analysis result makes us get familiar with the structure nd character of fish Nramp, furthermore, offers some infonnat/on for the enhancement of immunity of fish and genetic amelioration on fish breeding.

  10. Population-genomic variation within RNA viruses of the Western honey bee, Apis mellifera, inferred from deep sequencing

    Science.gov (United States)

    Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RN...

  11. First Complete Genome Sequence of an Emerging Cucumber Green Mottle Mosaic Virus Isolate in North America

    OpenAIRE

    Li, Rugang; Zheng, Yi; Fei, Zhangjun; Ling, Kai-Shu

    2015-01-01

    The complete genome sequence (6,423 nucleotides [nt]) of an emerging cucumber green mottle mosaic virus (CGMMV) isolate on cucumber in North America was determined through deep sequencing of small (sRNA) and rapid amplification of cDNA ends. The virus shares 99% nucleotide sequence identity with the Asian genotype but only 90% with the European genotype.

  12. Human norovirus hyper-mutation revealed by ultra-deep sequencing.

    Science.gov (United States)

    Cuevas, José M; Combe, Marine; Torres-Puente, Manoli; Garijo, Raquel; Guix, Susana; Buesa, Javier; Rodríguez-Díaz, Jesús; Sanjuán, Rafael

    2016-07-01

    Human noroviruses (NoVs) are a major cause of gastroenteritis worldwide. It is thought that, similar to other RNA viruses, high mutation rates allow NoVs to evolve fast and to undergo rapid immune escape at the population level. However, the rate and spectrum of spontaneous mutations of human NoVs have not been quantified previously. Here, we analyzed the intra-patient diversity of the NoV capsid by carrying out RT-PCR and ultra-deep sequencing with 100,000-fold coverage of 16 stool samples from symptomatic patients. This revealed the presence of low-frequency sequences carrying large numbers of U-to-C or A-to-G base transitions, suggesting a role for hyper-mutation in NoV diversity. To more directly test for hyper-mutation, we performed transfection assays in which the production of mutations was restricted to a single cell infection cycle. This confirmed the presence of sequences with multiple U-to-C/A-to-G transitions, and suggested that hyper-mutation contributed a large fraction of the total NoV spontaneous mutation rate. The type of changes produced and their sequence context are compatible with ADAR-mediated editing of the viral RNA. PMID:27094861

  13. Analysis of the full-length genome sequence of papaya lethal yellowing virus (PLYV), determined by deep sequencing, confirms its classification in the genus Sobemovirus.

    Science.gov (United States)

    Pereira, Alvaro J; Alfenas-Zerbini, Poliane; Cascardo, Renan S; Andrade, Eduardo C; Murilo Zerbini, F

    2012-10-01

    Papaya lethal yellowing virus (PLYV) causes an economically important disease in papayas in northeastern Brazil. Based on biological and molecular properties, PLYV has been tentatively assigned to the genus Sobemovirus. We report the sequence of the full-length genome of a PLYV isolate from Brazil, determined by deep sequencing. The PLYV genome is 4,145 nt long and contains four ORFs, with an arrangement identical to that of sobemoviruses. The polyprotein and CP display significant sequence identity with the corresponding proteins of other sobemoviruses. Pairwise comparisons and phylogenetic analysis based on complete nucleotide sequences confirm the classification of PLYV in the genus Sobemovirus. PMID:22743825

  14. Breaking the 1000-gene barrier for Mimivirus using ultra-deep genome and transcriptome sequencing

    Directory of Open Access Journals (Sweden)

    Claverie Jean-Michel

    2011-03-01

    Full Text Available Abstract Background Mimivirus, a giant dsDNA virus infecting Acanthamoeba, is the prototype of the mimiviridae family, the latest addition to the family of the nucleocytoplasmic large DNA viruses (NCLDVs. Its 1.2 Mb-genome was initially predicted to encode 917 genes. A subsequent RNA-Seq analysis precisely mapped many transcript boundaries and identified 75 new genes. Findings We now report a much deeper analysis using the SOLiD™ technology combining RNA-Seq of the Mimivirus transcriptome during the infectious cycle (202.4 Million reads, and a complete genome re-sequencing (45.3 Million reads. This study corrected the genome sequence and identified several single nucleotide polymorphisms. Our results also provided clear evidence of previously overlooked transcription units, including an important RNA polymerase subunit distantly related to Euryarchea homologues. The total Mimivirus gene count is now 1018, 11% greater than the original annotation. Conclusions This study highlights the huge progress brought about by ultra-deep sequencing for the comprehensive annotation of virus genomes, opening the door to a complete one-nucleotide resolution level description of their transcriptional activity, and to the realistic modeling of the viral genome expression at the ultimate molecular level. This work also illustrates the need to go beyond bioinformatics-only approaches for the annotation of short protein and non-coding genes in viral genomes.

  15. Comprehensive survey of human brain microRNA by deep sequencing

    Directory of Open Access Journals (Sweden)

    Menzel Corinna

    2010-06-01

    Full Text Available Abstract Background MicroRNA (miRNA play an important role in gene expression regulation. At present, the number of annotated miRNA continues to grow rapidly, in part due to advances of high-throughput sequencing techniques. Here, we use deep sequencing to characterize a population of small RNA expressed in human and rhesus macaques brain cortex. Results Based on a total of more than 150 million sequence reads we identify 197 putative novel miRNA, in humans and rhesus macaques, that are highly conserved among mammals. These putative miRNA have significant excess of conserved target sites in genes' 3'UTRs, supporting their functional role in gene regulation. Additionally, in humans and rhesus macaques respectively, we identify 41 and 22 conserved putative miRNA originating from non-coding RNA (ncRNA transcripts. While some of these molecules might function as conventional miRNA, others might be harmful and result in target avoidance. Conclusions Here, we further extend the repertoire of conserved human and rhesus macaque miRNA. Even though our study is based on a single tissue, the coverage depth of our study allows identification of functional miRNA present in brain tissue at background expression levels. Therefore, our study might cover large proportion of the yet unannotated conserved miRNA present in the human genome.

  16. Analysis of transmitted HIV-1 drug resistance using 454 ultra-deep-sequencing and the DeepChek®-HIV system

    Directory of Open Access Journals (Sweden)

    Ana Garcia-Diaz

    2014-11-01

    Full Text Available Introduction: Next-generation sequencing (NGS is capable of detecting resistance-associated mutations (RAMs present at frequencies of 1% or below. Several studies have found that baseline low-frequency RAMs are associated with failure to first-line HAART. One major limitation to the expansion of this technology in routine diagnostics is the complexity and laboriousness integral to bioinformatics analysis. DeepChek (ABL, TherapyEdge is a CE-marked software that allows automated analysis and resistance interpretation of NGS data. Objective: To evaluate the use of 454 ultra-deep-sequencing (Roche® 454, Life Sciences; 454-UDS and DeepChek for routine baseline resistance testing in a clinical diagnostic laboratory. Methods: 107 newly diagnosed HIV-1-infected patients (subtypes: A, n=9; B, n=52; C, n=21; D, n=2; F, n=3; G, n=1; CRF01, n=7; CRF02, n=7; CRF06, n=1; CRF07, n=1; CRF10, n=1 and unassigned complex, n=2 with a median plasma viral load of 88,727 copies/mL (range: 1380–2,143,543 were tested by 454-UDS and Sanger sequencing for the detection of protease and reverse transcriptase RAMs. In addition, integrase RAMs were investigated in 57 of them. Sequence analysis and resistance interpretation were performed using DeepChek applying 1% and 20% thresholds for variant detections; filters applied were comparison between Sanger and 454-UDS, and Stanford and IAS list for resistance interpretation. Results: The time elapsed from generation of raw 454 data (between 2,000–5,000 sequences/sample to elaboration of a resistance report was approximately 10 minutes per sample, equivalent to the time required for the same process using Sanger sequencing. Four patients (3.7% showed baseline resistance by Sanger and 454-UDS at frequencies above 20%, which affected both NRTIs (n=2 and NNRTIs (n=2. In addition, 12 patients (11.2% showed transmitted drug resistance (TDR by 454-UDS at frequencies below 20% affecting NRTIs (n=9, NNRTIs (n=7 and PIs (n=2

  17. Advancing Eucalyptus genomics: identification and sequencing of lignin biosynthesis genes from deep-coverage BAC libraries

    Directory of Open Access Journals (Sweden)

    Kudrna David

    2011-03-01

    Full Text Available Abstract Background Eucalyptus species are among the most planted hardwoods in the world because of their rapid growth, adaptability and valuable wood properties. The development and integration of genomic resources into breeding practice will be increasingly important in the decades to come. Bacterial artificial chromosome (BAC libraries are key genomic tools that enable positional cloning of important traits, synteny evaluation, and the development of genome framework physical maps for genetic linkage and genome sequencing. Results We describe the construction and characterization of two deep-coverage BAC libraries EG_Ba and EG_Bb obtained from nuclear DNA fragments of E. grandis (clone BRASUZ1 digested with HindIII and BstYI, respectively. Genome coverages of 17 and 15 haploid genome equivalents were estimated for EG_Ba and EG_Bb, respectively. Both libraries contained large inserts, with average sizes ranging from 135 Kb (Eg_Bb to 157 Kb (Eg_Ba, very low extra-nuclear genome contamination providing a probability of finding a single copy gene ≥ 99.99%. Libraries were screened for the presence of several genes of interest via hybridizations to high-density BAC filters followed by PCR validation. Five selected BAC clones were sequenced and assembled using the Roche GS FLX technology providing the whole sequence of the E. grandis chloroplast genome, and complete genomic sequences of important lignin biosynthesis genes. Conclusions The two E. grandis BAC libraries described in this study represent an important milestone for the advancement of Eucalyptus genomics and forest tree research. These BAC resources have a highly redundant genome coverage (> 15×, contain large average inserts and have a very low percentage of clones with organellar DNA or empty vectors. These publicly available BAC libraries are thus suitable for a broad range of applications in genetic and genomic research in Eucalyptus and possibly in related species of Myrtaceae

  18. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing

    Science.gov (United States)

    Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O’Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis; Borrmann, Steffen; Kiara, Steven M.; Marsh, Kevin; Jiang, Hongying; Su, Xin-Zhuan; Amaratunga, Chanaki; Fairhurst, Rick; Socheat, Duong; Nosten, Francois; Imwong, Mallika; White, Nicholas J.; Sanders, Mandy; Anastasi, Elisa; Alcock, Dan; Drury, Eleanor; Oyola, Samuel; Quail, Michael A.; Turner, Daniel J.; Rubio, Valentin Ruano; Jyothi, Dushyanth; Amenga-Etego, Lucas; Hubbart, Christina; Jeffreys, Anna; Rowlands, Kate; Sutherland, Colin; Roper, Cally; Mangano, Valentina; Modiano, David; Tan, John C.; Ferdig, Michael T.; Amambua-Ngwa, Alfred; Conway, David J.; Takala-Harrison, Shannon; Plowe, Christopher V.; Rayner, Julian C.; Rockett, Kirk A.; Clark, Taane G.; Newbold, Chris I.; Berriman, Matthew; MacInnis, Bronwyn; Kwiatkowski, Dominic P.

    2013-01-01

    Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance. 1,2 Here we describe methods for large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short term culture. Analysis of 86,158 exonic SNPs that passed genotyping quality control in 227 samples from Africa, Asia and Oceania provides genome-wide estimates of allele frequency distribution, population structure and linkage disequilibrium. By comparing the genetic diversity of individual infections with that of the local parasite population, we derive a metric of within-host diversity that is related to the level of inbreeding in the population. An open-access web application has been established for exploration of regional differences in allele frequency and of highly differentiated loci in the P. falciparum genome. PMID:22722859

  19. Screening Target Specificity of siRNAs by Rapid Amplification of cDNA Ends (RACE) for Non-Sequenced Species

    OpenAIRE

    Sabirzhanov, Boris; Sabirzhanova, Inna B.; Keifer, Joyce

    2011-01-01

    RNA interference (RNAi) is the process of sequence-specific posttranslational gene silencing triggered by double-stranded RNAs (dsRNAs). RNAi is a widely used approach for studying gene function. However, studies have shown that using siRNA can lead to off-target effects when the siRNA contains sufficient sequence identity to non-target mRNA sequences. One of the important steps in designing dsRNA is verification that it has sequence identity to only the target mRNA. In this report, we propos...

  20. Deep sequencing of New World screw-worm transcripts to discover genes involved in insecticide resistance

    Directory of Open Access Journals (Sweden)

    Azeredo-Espin Ana Maria L

    2010-12-01

    Full Text Available Abstract Background The New World screw-worm (NWS, Cochliomyia hominivorax, is one of the most important myiasis-causing flies, causing severe losses to the livestock industry. In its current geographical distribution, this species has been controlled by the application of insecticides, mainly organophosphate (OP compounds, but a number of lineages have been identified that are resistant to such chemicals. Despite its economic importance, only limited genetic information is available for the NWS. Here, as a part of an effort to characterize the C. hominivorax genome and identify putative genes involved in insecticide resistance, we sampled its transcriptome by deep sequencing of polyadenylated transcripts using the 454 sequencing technology. Results Deep sequencing on the 454 platform of three normalized libraries (larval, adult male and adult female generated a total of 548,940 reads. Eighteen candidate genes coding for three metabolic detoxification enzyme families, cytochrome P450 monooxygenases, glutathione S-transferases and carboxyl/cholinesterases were selected and gene expression levels were measured using quantitative real-time polymerase chain reaction (qRT-PCR. Of the investigated candidates, only one gene was expressed differently between control and resistant larvae with, at least, a 10-fold down-regulation in the resistant larvae. The presence of mutations in the acetylcholinesterase (target site and carboxylesterase E3 genes was investigated and all of the resistant flies presented E3 mutations previously associated with insecticide resistance. Conclusions Here, we provided the largest database of NWS expressed sequence tags that is an important resource, not only for further studies on the molecular basis of the OP resistance in NWS fly, but also for functional and comparative studies among Calliphoridae flies. Among our candidates, only one gene was found differentially expressed in resistant individuals, and its role on

  1. Exploring the Gastrointestinal "Nemabiome": Deep Amplicon Sequencing to Quantify the Species Composition of Parasitic Nematode Communities.

    Science.gov (United States)

    Avramenko, Russell W; Redman, Elizabeth M; Lewis, Roy; Yazwinski, Thomas A; Wasmuth, James D; Gilleard, John S

    2015-01-01

    Parasitic helminth infections have a considerable impact on global human health as well as animal welfare and production. Although co-infection with multiple parasite species within a host is common, there is a dearth of tools with which to study the composition of these complex parasite communities. Helminth species vary in their pathogenicity, epidemiology and drug sensitivity and the interactions that occur between co-infecting species and their hosts are poorly understood. We describe the first application of deep amplicon sequencing to study parasitic nematode communities as well as introduce the concept of the gastro-intestinal "nemabiome". The approach is analogous to 16S rDNA deep sequencing used to explore microbial communities, but utilizes the nematode ITS-2 rDNA locus instead. Gastro-intestinal parasites of cattle were used to develop the concept, as this host has many well-defined gastro-intestinal nematode species that commonly occur as complex co-infections. Further, the availability of pure mono-parasite populations from experimentally infected cattle allowed us to prepare mock parasite communities to determine, and correct for, species representation biases in the sequence data. We demonstrate that, once these biases have been corrected, accurate relative quantitation of gastro-intestinal parasitic nematode communities in cattle fecal samples can be achieved. We have validated the accuracy of the method applied to field-samples by comparing the results of detailed morphological examination of L3 larvae populations with those of the sequencing assay. The results illustrate the insights that can be gained into the species composition of parasite communities, using grazing cattle in the mid-west USA as an example. However, both the technical approach and the concept of the 'nemabiome' have a wide range of potential applications in human and veterinary medicine. These include investigations of host-parasite and parasite-parasite interactions during co

  2. cDNA: 34099 [

    Lifescience Database Archive (English)

    Full Text Available H. sapiens + Hs.15282 Homo sapiens cDNA FLJ44214 fis, clone THYMU3003309, moderately similar to ... Homo sapiens sarcoma antigen (SAGE ) gnl|UG|Hs#S16886502 AK126202 23/5622_34099.png ...

  3. cDNA: 33377 [

    Lifescience Database Archive (English)

    Full Text Available H. sapiens + Hs.181243 Homo sapiens full open reading frame cDNA clone RZPDo834H102D for gene AT ... F4, activating transcription factor 4 (tax -responsive enhancer element B67); complete cds; wi ...

  4. cDNA: 17527 [

    Lifescience Database Archive (English)

    Full Text Available H. sapiens + Hs.134229 Homo sapiens cDNA FLJ44146 fis, clone THYMU2027734, weakly similar to Hom ... o sapiens SA hypertension -associated homolog (rat) (SAH) gnl|UG|Hs#S16886570 ...

  5. cDNA: 47992 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.228067 Mus musculus 15 days embryo male testis cDNA, RIKEN full-length enriched ... lone:8030476B22 product:hypothetical Mitochondrial energy ... transfer proteins (carrier protein) containing pro ...

  6. cDNA: 40711 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.41523 Mus musculus adult male thymus cDNA, RIKEN full-length enriched library, ... lone:5830492N08 product:hypothetical Mitochondrial energy ... transfer proteins (carrier protein) containing pro ...

  7. cDNA: 47994 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.228067 Mus musculus 0 day neonate eyeball cDNA, RIKEN full-length enriched libr ... lone:E130118D21 product:hypothetical Mitochondrial energy ... transfer proteins (carrier protein) containing pro ...

  8. cDNA: 47991 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.228067 Mus musculus adult male diencephalon cDNA, RIKEN full-length enriched li ... lone:9330189G22 product:hypothetical Mitochondrial energy ... transfer proteins (carrier protein) containing pro ...

  9. cDNA: 40220 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.207654 Mus musculus adult male olfactory brain ... cDNA, RIKEN full-length enriched ... library, clone:6430704M03 product:similar to BRAIN ... PROTEIN (FRAGMENT) [Homo sapiens], full insert seq ...

  10. cDNA: 52275 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.275648 Mus musculus 11 days embryo head cDNA, RIKEN full-length enriched librar ... y, clone:6230400I01 product:SIMILAR TO ENIGMA ... (LIM DOMAIN PROTEIN) homolog [Homo sapiens], full ...

  11. cDNA: 52278 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.275648 Mus musculus adult male tongue cDNA, RIKEN full-length enriched library, ... clone:2310073F10 product:SIMILAR TO ENIGMA ... (LIM DOMAIN PROTEIN) homolog [Homo sapiens], full ...

  12. cDNA: 52277 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.275648 Mus musculus 18-day embryo whole body cDNA, RIKEN full-length enriched l ... ibrary, clone:1110003B01 product:SIMILAR TO ENIGMA ... (LIM DOMAIN PROTEIN) homolog [Homo sapiens], full ...

  13. cDNA: 52276 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.275648 Mus musculus adult male brain cDNA, RIKEN full-length enriched library, ... clone:0710007K04 product:SIMILAR TO ENIGMA ... (LIM DOMAIN PROTEIN) homolog [Homo sapiens], full ...

  14. Isolation, cDNA sequence analysis and tissue expression profile of a novel swine gene differentially expressed in the Longissimus dorsi muscle tissues from Large White × Meishan cross combination

    Institute of Scientific and Technical Information of China (English)

    LIU Yonggang; LEI Minggang; XIONG Yuanzhu; DENG Changyan

    2005-01-01

    In order to study the molecular mechanism of heterosis in pigs, the mRNA differential display technique was performed to investigate the differences in gene expression in the Longissimus dorsi muscle tissues from Large White × Meishan cross combination.One novel gene differentially expressed between the hybrids and the purebreds was isolated and subsequently identified using semi-quantitative reverse transcriptase polymerase chain reaction (RT-PCR) and its complete cDNA sequence was obtained using the rapid amplification of cDNA ends (RACE) method. The nucleotide sequence of the gene is not homologous to any of the known porcine genes. The sequence prediction revealed that the open reading frame of this gene encodes a protein of 188 amino acids that contains the putative conserved domain of the PRA1 family protein and this protein has high homology with the PRA1 family protein 3 of three species-rat (88 % ), human(88 % ), and mouse (87 % ), -so that it can be defined as swine PRA1 family protein 3. The phylogenetic tree analysis revealed that the swine PRA1 family protein 3 has a closer genetic relationship with the human PRA1 family protein 3 than with those of mouse and rat.The tissue expression analysis indicated that swine PRA1family protein 3 gene is highly-expressed in muscle and fat, moderately in spleen,weakly in heart, kidney, ovary, lung, and almost not expressed in small intestine and liver. The function of this gene and the relationship between this gene and heterosis are also discussed.

  15. Main: Sequences [KOME

    Lifescience Database Archive (English)

    Full Text Available Sequences Nucleotide Sequence Nucleotide sequence of full length cDNA (trimmed sequence) kome_ine_full_seque...nce_db.fasta.zip kome_ine_full_sequence_db.zip kome_ine_full_sequence_db ...

  16. Uncovering microRNA-mediated response to SO2 stress in Arabidopsis thaliana by deep sequencing.

    Science.gov (United States)

    Li, Lihong; Xue, Meizhao; Yi, Huilan

    2016-10-01

    Sulfur dioxide (SO2) is a major air pollutant and has significant impacts on plants. MicroRNAs (miRNAs) are a class of gene expression regulators that play important roles in response to environmental stresses. In this study, deep sequencing was used for genome-wide identification of miRNAs and their expression profiles in response to SO2 stress in Arabidopsis thaliana shoots. A total of 27 conserved miRNAs and 5 novel miRNAs were found to be differentially expressed under SO2 stress. qRT-PCR analysis showed mostly negative correlation between miRNA accumulation and target gene mRNA abundance, suggesting regulatory roles of these miRNAs during SO2 exposure. The target genes of SO2-responsive miRNAs encode transcription factors and proteins that regulate auxin signaling and stress response, and the miRNAs-mediated suppression of these genes could improve plant resistance to SO2 stress. Promoter sequence analysis of genes encoding SO2-responsive miRNAs showed that stress-responsive and phytohormone-related cis-regulatory elements occurred frequently, providing additional evidence of the involvement of miRNAs in adaption to SO2 stress. This study represents a comprehensive expression profiling of SO2-responsive miRNAs in Arabidopsis and broads our perspective on the ubiquitous regulatory roles of miRNAs under stress conditions. PMID:27232729

  17. Metagenomes obtained by "deep sequencing" - what do they tell about the EBPR communities?

    DEFF Research Database (Denmark)

    Albertsen, Mads; Saunders, Aaron Marc; Nielsen, Kåre Lehmann;

    2013-01-01

    Metagenomics enables studies of the genomic potential of complex microbial communities by sequencing bulk genomic DNA directly from the environment. Knowledge of the genetic potential of a community can be used to formulate and test ecological hypotheses about stability and performance. In this s......Metagenomics enables studies of the genomic potential of complex microbial communities by sequencing bulk genomic DNA directly from the environment. Knowledge of the genetic potential of a community can be used to formulate and test ecological hypotheses about stability and performance....... In this study deep metagenomics and fluorescence in situ hybridization (FISH) were used to study a full-scale wastewater treatment plant with enhanced biological phosphorus removal (EBPR) and compared to an existing EBPR metagenome. EBPR is a widely used process that relies on a complex community...... of microorganisms to function properly. Insight into community and species level stability and dynamics is valuable for knowledge driven optimization of the EBPR process. The metagenomes of the EBPR communities were distinct compared to metagenomes of communities from a wide range of other environments, which could...

  18. Ultra Deep Sequencing of a Baculovirus Population Reveals Widespread Genomic Variations

    Directory of Open Access Journals (Sweden)

    Aurélien Chateigner

    2015-07-01

    Full Text Available Viruses rely on widespread genetic variation and large population size for adaptation. Large DNA virus populations are thought to harbor little variation though natural populations may be polymorphic. To measure the genetic variation present in a dsDNA virus population, we deep sequenced a natural strain of the baculovirus Autographa californica multiple nucleopolyhedrovirus. With 124,221X average genome coverage of our 133,926 bp long consensus, we could detect low frequency mutations (0.025%. K-means clustering was used to classify the mutations in four categories according to their frequency in the population. We found 60 high frequency non-synonymous mutations under balancing selection distributed in all functional classes. These mutants could alter viral adaptation dynamics, either through competitive or synergistic processes. Lastly, we developed a technique for the delimitation of large deletions in next generation sequencing data. We found that large deletions occur along the entire viral genome, with hotspots located in homologous repeat regions (hrs. Present in 25.4% of the genomes, these deletion mutants presumably require functional complementation to complete their infection cycle. They might thus have a large impact on the fitness of the baculovirus population. Altogether, we found a wide breadth of genomic variation in the baculovirus population, suggesting it has high adaptive potential.

  19. Transcript analysis of a goat mesenteric lymph node by deep next-generation sequencing.

    Science.gov (United States)

    E, G X; Zhao, Y J; Na, R S; Huang, Y F

    2016-01-01

    Deep RNA sequencing (RNA-seq) provides a practical and inexpensive alternative for exploring genomic data in non-model organisms. The functional annotation of non-model mammalian genomes, such as that of goats, is still poor compared to that of humans and mice. In the current study, we performed a whole transcriptome analysis of an intestinal mucous membrane lymph node to comprehensively characterize the transcript catalogue of this tissue in a goat. Using an Illumina HiSeq 4000 sequencing platform, 9.692 GB of raw reads were acquired. A total of 57,526 lymph transcripts were obtained, and the majority of these were mapped to known transcriptional units (42.67%). A comparison of the mRNA expression of the mesenteric lymph nodes during the juvenile and post-adolescent stages revealed 8949 transcripts that were differentially expressed, including 6174 known genes. In addition, we functionally classified these transcripts using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) terms. A total of 6174 known genes were assigned to 64 GO terms, and 3782 genes were assigned to 303 KEGG pathways, including some related to immunity. Our results reveal the complex transcriptome profile of the lymph node and suggest that the immune system is immature in the mesenteric lymph nodes of juvenile goats. PMID:27173308

  20. Analysis of 4,664 high-quality sequence-finished poplar full-length cDNA clones and their utility for the discovery of genes responding to insect feeding

    Directory of Open Access Journals (Sweden)

    Douglas Carl J

    2008-01-01

    Full Text Available Abstract Background The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar has been established as a model system for genomics studies of growth, development, and adaptation of woody perennial plants including secondary xylem formation, dormancy, adaptation to local environments, and biotic interactions. Results As part of the poplar genome sequencing project and the development of genomic resources for poplar, we have generated a full-length (FL-cDNA collection using the biotinylated CAP trapper method. We constructed four FLcDNA libraries using RNA from xylem, phloem and cambium, and green shoot tips and leaves from the P. trichocarpa Nisqually-1 genotype, as well as insect-attacked leaves of the P. trichocarpa × P. deltoides hybrid. Following careful selection of candidate cDNA clones, we used a combined strategy of paired end reads and primer walking to generate a set of 4,664 high-accuracy, sequence-verified FLcDNAs, which clustered into 3,990 putative unique genes. Mapping FLcDNAs to the poplar genome sequence combined with BLAST comparisons to previously predicted protein coding sequences in the poplar genome identified 39 FLcDNAs that likely localize to gaps in the current genome sequence assembly. Another 173 FLcDNAs mapped to the genome sequence but were not included among the previously predicted genes in the poplar genome. Comparative sequence analysis against Arabidopsis thaliana and other species in the non-redundant database of GenBank revealed that 11.5% of the poplar FLcDNAs display no significant sequence similarity to other plant proteins. By mapping the poplar FLcDNAs against transcriptome data previously obtained with a 15.5 K cDNA microarray, we identified 153 FLcDNA clones

  1. Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples

    Directory of Open Access Journals (Sweden)

    Nacu Serban

    2011-01-01

    Full Text Available Abstract Background Readthrough fusions across adjacent genes in the genome, or transcription-induced chimeras (TICs, have been estimated using expressed sequence tag (EST libraries to involve 4-6% of all genes. Deep transcriptional sequencing (RNA-Seq now makes it possible to study the occurrence and expression levels of TICs in individual samples across the genome. Methods We performed single-end RNA-Seq on three human prostate adenocarcinoma samples and their corresponding normal tissues, as well as brain and universal reference samples. We developed two bioinformatics methods to specifically identify TIC events: a targeted alignment method using artificial exon-exon junctions within 200,000 bp from adjacent genes, and genomic alignment allowing splicing within individual reads. We performed further experimental verification and characterization of selected TIC and fusion events using quantitative RT-PCR and comparative genomic hybridization microarrays. Results Targeted alignment against artificial exon-exon junctions yielded 339 distinct TIC events, including 32 gene pairs with multiple isoforms. The false discovery rate was estimated to be 1.5%. Spliced alignment to the genome was less sensitive, finding only 18% of those found by targeted alignment in 33-nt reads and 59% of those in 50-nt reads. However, spliced alignment revealed 30 cases of TICs with intervening exons, in addition to distant inversions, scrambled genes, and translocations. Our findings increase the catalog of observed TIC gene pairs by 66%. We verified 6 of 6 predicted TICs in all prostate samples, and 2 of 5 predicted novel distant gene fusions, both private events among 54 prostate tumor samples tested. Expression of TICs correlates with that of the upstream gene, which can explain the prostate-specific pattern of some TIC events and the restriction of the SLC45A3-ELK4 e4-e2 TIC to ERG-negative prostate samples, as confirmed in 20 matched prostate tumor and normal

  2. Generation and analysis of expressed sequence tags from a normalized cDNA library of young leaf from Ma bamboo (Dendrocalamus latiflorus Munro).

    Science.gov (United States)

    Gao, Z M; Li, C L; Peng, Z H

    2011-11-01

    Ma bamboo (Dendrocalamus latiflorus Munro) belongs to Dendrocalamus genus, Bambusease tribe, Bambusoideae subfamily, Poaceae family. It is a representative species of clumping bamboo, and a principal commercial species for various construction purposes using mature culms and for human consumption using young shoots. A normalized cDNA library was constructed from young leaves of Ma bamboo and 9,574 high-quality ESTs were generated, from which 5,317 unigenes including 1,502 contigs and 3,815 singletons were assembled. The unigenes were assigned into different gene ontology (GO) categories and summarized into 13 broad biologically functional groups according to similar functional characteristics or cellular roles by BLAST search against public databases. Eight hundred and ninety-one unigenes were assigned by KO identifiers and mapped to six KEGG biochemical pathways. The transcripts involved in biosynthesis of secondary metabolites such as cytochrome 450, flavonol synthase/flavanone 3-hydroxylase, and dihydroflavonol-4-reductase were well represented by 14 unigenes in the unigene set. The candidate genes involved in phytohormone metabolism, signal transduction and encoding cell wall-associated receptor kinases were also identified. Sixty-seven unigenes related to plant resistance (R) genes, including RPP genes, RGAs and RDL/RF genes, were discovered. These results will provide genome-wide knowledge about the molecular physiology of Ma bamboo young leaves and tools for advanced studies of molecular mechanism underlying leaf growth and development. PMID:21713530

  3. Complete genome sequence and construction of infectious full-length cDNA clones of tobacco ringspot Nepovirus, a viral pathogen causing bud blight in soybean.

    Science.gov (United States)

    Zhao, Fumei; Hwang, Un Sun; Lim, Seungmo; Yoo, Ran Hee; Igori, Davaajargal; Lee, Su-Heon; Lim, Hyoun-Sub; Moon, Jae Sun

    2015-08-01

    Tobacco ringspot virus (TRSV, genus Nepovirus), causes severe diseases in soybean and tobacco plants. TRSV-induced bud blight disease significantly reduced both the yield and quality of soybeans. The function of the encoded viral gene product involved in TRSV infection was unclear due to the limitation of reverse genetics studies on the viral genome. Here, we represent the successful construction of infectious full-length cDNA clones of TRSV genome (RNA1 and RNA2). The cDNAs of TRSV RNA1 and RNA2 were cloned into the binary vector pPZP211 immediately downstream of a double cauliflower mosaic virus 35S promoter and upstream of the nopaline synthase terminator. Seven days after agrobacterium-mediated co-inoculation of these two constructs, Nicotiana benthamiana plants developed a systemic infection with necrotic ringspot symptoms and weak stunting of the leaves, similar to that induced by natural TRSV. The systemic infection was confirmed by transmission electron microscopy and Western blot analysis. Simultaneously, soybean, tomato, and Arabidopsis ecotype Estland were mechanically inoculated with sap prepared from TRSV-agroinfiltrated N. benthamiana leaves, showing typical symptoms of bud blight, necrotic spots, and lethal systemic necrosis, respectively. The system developed herein will be an appealing way to determine TRSV viral gene functions and study host-TRSV interactions. PMID:26159876

  4. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jacob; Gilchrist, Michael J.; Panitz, Frank; Jørgensen, Claus; Scheibye-Knudsen, Karsten; Arvin, Troels; Lumholdt, Steen; Sawera, Milena; Green, Trine; Nielsen, Bente J.; Havgaard, Jacob H.; Rosenkilde, Carina; Wang, Jun; Li, Heng; Li, Ruiqiang; Liu, Bin; Hu, Songnian; Dong, Wei; Li, Wei; Yu, Jun; Wang, Jian; Stærfeldt, Hans Henrik; Wernersson, Rasmus; Madsen, Lone B.; Thomsen, Bo; Hornhoj, Henrik; Zhan, Bujie; Wang, Xuegang; Wang, Xuefei; Bolund, Lars; Brunak, Søren; Yang, Huanming; Bendixen, Christian; Fredholm, Merete

    BACKGROUND: Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from ......: This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies....

  5. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jacob;

    2007-01-01

    BACKGROUND: Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from ......: This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies....

  6. Whole-genome sequence of Sunxiuqinia dokdonensis DH1(T), isolated from deep sub-seafloor sediment in Dokdo Island.

    Science.gov (United States)

    Lim, Sooyeon; Chang, Dong-Ho; Kim, Byoung-Chan

    2016-09-01

    Sunxiuqinia dokdonensis DH1(T) was isolated from deep sub-seafloor sediment at a depth of 900 m below the seafloor off Seo-do (the west part of Dokdo Island) in the East Sea of the Republic of Korea and subjected to whole genome sequencing on HiSeq platform and annotated on RAST. The nucleotide sequence of this genome was deposited into DDBJ/EMBL/GenBank under the accession LGIA00000000. PMID:27437183

  7. Whole-genome sequence of Sunxiuqinia dokdonensis DH1T, isolated from deep sub-seafloor sediment in Dokdo Island

    Directory of Open Access Journals (Sweden)

    Sooyeon Lim

    2016-09-01

    Full Text Available Sunxiuqinia dokdonensis DH1T was isolated from deep sub-seafloor sediment at a depth of 900 m below the seafloor off Seo-do (the west part of Dokdo Island in the East Sea of the Republic of Korea and subjected to whole genome sequencing on HiSeq platform and annotated on RAST. The nucleotide sequence of this genome was deposited into DDBJ/EMBL/GenBank under the accession LGIA00000000.

  8. Transcriptome walking: a laboratory-oriented GUI-based approach to mRNA identification from deep-sequenced data

    Directory of Open Access Journals (Sweden)

    French Andrew S

    2012-12-01

    Full Text Available Abstract Background Deep sequencing technology provides efficient and economical production of large numbers of randomly positioned, relatively short, estimates of base identities in DNA molecules. Application of this technology to mRNA samples allows rapid examination of the molecular genetic environment in individual cells or tissues, the transcriptome. However, assembly of such short sequences into complete mRNA creates a challenge that limits the usefulness of the technology, particularly when no, or limited, genomic data is available. Several approaches to this problem have been developed, but there is still no general method to rapidly obtain an mRNA sequence from deep sequence data when a specific molecule, or family of molecules, are of interest. A frequent requirement is to identify specific mRNA molecules from tissues that are being investigated by methods such as electrophysiology, immunocytology and pharmacology. To be widely useful, any approach must be relatively simple to use in the laboratory by operators without extensive statistical or bioinformatics knowledge, and with readily available hardware. Findings An approach was developed that allows de novo assembly of individual mRNA sequences in two linked stages: sequence discovery and sequence completion. Both stages rely on computer assisted, Graphical User Interface (GUI-guided, user interaction with the data, but proceed relatively efficiently once discovery is complete. The method grows a discovered sequence by repeated passes through the complete raw data in a series of steps, and is hence termed ‘transcriptome walking’. All of the operations required for transcriptome analysis are combined in one program that presents a relatively simple user interface and runs on a standard desktop, or laptop computer, but takes advantage of multi-core processors, when available. Complete mRNA sequence identifications usually require less than 24 hours. This approach has already

  9. Generation and analysis of a large-scale expressed sequence tags from a full-length enriched cDNA library of Siberian tiger (Panthera tigris altaica).

    Science.gov (United States)

    Guo, Yu; Liu, Changqing; Lu, Taofeng; Liu, Dan; Bai, Chunyu; Li, Xiangchen; Ma, Yuehui; Guan, Weijun

    2014-05-15

    In this study, a full-length enriched cDNA library was successfully constructed from Siberian tiger, the world's most endangered species. The titers of primary and amplified libraries were 1.28×10(6)pfu/mL and 1.59×10(10)pfu/mL respectively. The proportion of recombinants from unamplified library was 91.3% and the average length of exogenous inserts was 1.06kb. A total of 279 individual ESTs with sizes ranging from 316 to 1258bps were then analyzed. Furthermore, 204 unigenes were successfully annotated and involved in 49 functions of the GO classification, cell (175, 85.5%), cellular process (165, 80.9%), and binding (152, 74.5%) are the dominant terms. 198 unigenes were assigned to 156 KEGG pathways, and the pathways with the most representation are metabolic pathways (18, 9.1%). The proportion pattern of each COG subcategory was similar among Panthera tigris altaica, P. tigris tigris and Homo sapiens, and general function prediction only cluster (44, 15.8%) represents the largest group, followed by translation, ribosomal structure and biogenesis (33, 11.8%), replication, recombination and repair (24, 8.6%), and only 7.2% ESTs classified as novel genes. Moreover, the recombinant plasmid pET32a-TAT-COL6A2 was constructed, coded for the Trx-TAT-COL6A2 fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-COL6A2 recombinant protein was 2.64±0.18mg/mL. This library will provide a useful platform for the functional genome and transcriptome research of for the P. tigris and other felid animals in the future. PMID:24630959

  10. Development of a candidate reference material for adventitious virus detection in vaccine and biologicals manufacturing by deep sequencing

    OpenAIRE

    Edward T Mee; Preston, Mark D.; Minor, Philip D.; ,; Huang, Xuening; Nguyen, Jenny; Wall, David; Hargrove, Stacey; Fu, Thomas; Xu, George; Li, Li; Cote, Colette; Delwart, Eric; Li, Linlin; Hewlett, Indira

    2016-01-01

    Background Unbiased deep sequencing offers the potential for improved adventitious virus screening in vaccines and biotherapeutics. Successful implementation of such assays will require appropriate control materials to confirm assay performance and sensitivity. Methods A common reference material containing 25 target viruses was produced and 16 laboratories were invited to process it using their preferred adventitious virus detection assay. Results Fifteen laboratories returned results, obtai...

  11. Small RNA deep sequencing revealed that mixed infection of known and unknown viruses were common in field collected vegetable samples

    Science.gov (United States)

    In an effort to characterize the causal agents for plant diseases in field collected samples using the small RNA deep sequencing technology, numerous known or novel viruses and viroids were identified. In many cases, a mixed infection with multiple pathogen species was common. Such situation compl...

  12. Genome-wide characterization of the biggest grass, bamboo, based on 10,608 putative full-length cDNA sequences

    Directory of Open Access Journals (Sweden)

    Zhu Chuanrang

    2010-06-01

    Full Text Available Abstract Background With the availability of rice and sorghum genome sequences and ongoing efforts to sequence genomes of other cereal and energy crops, the grass family (Poaceae has become a model system for comparative genomics and for better understanding gene and genome evolution that underlies phenotypic and ecological divergence of plants. While the genomic resources have accumulated rapidly for almost all major lineages of grasses, bamboo remains the only large subfamily of Poaceae with little genomic information available in databases, which seriously hampers our ability to take a full advantage of the wealth of grass genomic data for effective comparative studies. Results Here we report the cloning and sequencing of 10,608 putative full length cDNAs (FL-cDNAs primarily from Moso bamboo, Phyllostachys heterocycla cv. pubescens, a large woody bamboo with the highest ecological and economic values of all bamboos. This represents the third largest FL-cDNA collection to date of all plant species, and provides the first insight into the gene and genome structures of bamboos. We developed a Moso bamboo genomic resource database that so far contained the sequences of 10,608 putative FL-cDNAs and nearly 38,000 expressed sequence tags (ESTs generated in this study. Conclusion Analysis of FL-cDNA sequences show that bamboo diverged from its close relatives such as rice, wheat, and barley through an adaptive radiation. A comparative analysis of the lignin biosynthesis pathway between bamboo and rice suggested that genes encoding caffeoyl-CoA O-methyltransferase may serve as targets for genetic manipulation of lignin content to reduce pollutants generated from bamboo pulping.

  13. Molecular cloning of lupin leghemoglobin cDNA

    DEFF Research Database (Denmark)

    Konieczny, A; Jensen, E O; Marcker, K A;

    1987-01-01

    Poly(A)+ RNA isolated from root nodules of yellow lupin (Lupinus luteus, var. Ventus) has been used as a template for the construction of a cDNA library. The ds cDNA was synthesized and inserted into the Hind III site of plasmid pBR 322 using synthetic Hind III linkers. Clones containing sequences...... specific for nodules were selected by differential colony hybridization using 32P-labeled cDNA synthesized either from nodule poly(A)+ RNA or from poly(A)+ RNA of uninfected root as probes. Among the recombinant plasmids, the cDNA gene for leghemoglobin was identified. The protein structure derived from...... its nucleotide sequence was consistent with known amino acid sequence of lupin Lb II. The cloned lupin Lb cDNA hybridized to poly(A)+ RNA from nodules only, which is in accordance with the general concept, that leghemoglobin is expressed exclusively in nodules. Udgivelsesdato: 1987-null...

  14. cDNA: 46042 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.290868 Mus musculus 12 days embryo spinal ganglion cDNA, RIKEN full-length enri ... , clone:D130020P04 product:target of myb1 homolog (chicken ), full insert sequence gnl|UG|Mm#S10839663 AK05124 ...

  15. cDNA: 37672 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.315626 Mus musculus adult male thymus cDNA, RIKEN full-length enriched library, ... 10 product:MODULATOR OF ANTIGEN RECEPTOR SIGNALING MARS , full insert sequence gnl|UG|Mm#S10847064 AK030877 ...

  16. cDNA: 37677 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.315626 Mus musculus adult retina cDNA, RIKEN full-length enriched library, clon ... 21 product:MODULATOR OF ANTIGEN RECEPTOR SIGNALING MARS , full insert sequence gnl|UG|Mm#S9072626 AK020837 ...

  17. cDNA: 37673 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.315626 Mus musculus 2 days neonate thymus thymic cells cDNA, RIKEN full-length ... 24 product:MODULATOR OF ANTIGEN RECEPTOR SIGNALING MARS , full insert sequence gnl|UG|Mm#S10833741 AK088672 ...

  18. cDNA: 35980 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.122430 Mus musculus 0 day neonate cerebellum cDNA, RIKEN full-length enriched l ... 0027A22 product:hypothetical Ubiquitin-conjugating enzymes ... containing protein, full insert sequence gnl|UG|Mm ...

  19. cDNA: 35899 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.337238 Mus musculus 13 days embryo liver cDNA, RIKEN full-length enriched libra ... 0010F15 product:hypothetical Ubiquitin-conjugating enzymes ... containing protein, full insert sequence gnl|UG|Mm ...

  20. cDNA: 35982 [ASTRA[Archive

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.122430 Mus musculus 14 days embryo thymus cDNA, RIKEN full-length enriched libr ... 0401J04 product:hypothetical Ubiquitin-conjugating enzymes ... containing protein, full insert sequence gnl|UG|Mm ...

  1. DeepSNVMiner: a sequence analysis tool to detect emergent, rare mutations in subsets of cell populations

    Science.gov (United States)

    Andrews, T. Daniel; Jeelall, Yogesh; Talaulikar, Dipti; Goodnow, Christopher C.

    2016-01-01

    Background. Massively parallel sequencing technology is being used to sequence highly diverse populations of DNA such as that derived from heterogeneous cell mixtures containing both wild-type and disease-related states. At the core of such molecule tagging techniques is the tagging and identification of sequence reads derived from individual input DNA molecules, which must be first computationally disambiguated to generate read groups sharing common sequence tags, with each read group representing a single input DNA molecule. This disambiguation typically generates huge numbers of reads groups, each of which requires additional variant detection analysis steps to be run specific to each read group, thus representing a significant computational challenge. While sequencing technologies for producing these data are approaching maturity, the lack of available computational tools for analysing such heterogeneous sequence data represents an obstacle to the widespread adoption of this technology. Results. Using synthetic data we successfully detect unique variants at dilution levels of 1 in a 1,000,000 molecules, and find DeeepSNVMiner obtains significantly lower false positive and false negative rates compared to popular variant callers GATK, SAMTools, FreeBayes and LoFreq, particularly as the variant concentration levels decrease. In a dilution series with genomic DNA from two cells lines, we find DeepSNVMiner identifies a known somatic variant when present at concentrations of only 1 in 1,000 molecules in the input material, the lowest concentration amongst all variant callers tested. Conclusions. Here we present DeepSNVMiner; a tool to disambiguate tagged sequence groups and robustly identify sequence variants specific to subsets of starting DNA molecules that may indicate the presence of a disease. DeepSNVMiner is an automated workflow of custom sequence analysis utilities and open source tools able to differentiate somatic DNA variants from artefactual sequence

  2. DeepSNVMiner: a sequence analysis tool to detect emergent, rare mutations in subsets of cell populations.

    Science.gov (United States)

    Andrews, T Daniel; Jeelall, Yogesh; Talaulikar, Dipti; Goodnow, Christopher C; Field, Matthew A

    2016-01-01

    Background. Massively parallel sequencing technology is being used to sequence highly diverse populations of DNA such as that derived from heterogeneous cell mixtures containing both wild-type and disease-related states. At the core of such molecule tagging techniques is the tagging and identification of sequence reads derived from individual input DNA molecules, which must be first computationally disambiguated to generate read groups sharing common sequence tags, with each read group representing a single input DNA molecule. This disambiguation typically generates huge numbers of reads groups, each of which requires additional variant detection analysis steps to be run specific to each read group, thus representing a significant computational challenge. While sequencing technologies for producing these data are approaching maturity, the lack of available computational tools for analysing such heterogeneous sequence data represents an obstacle to the widespread adoption of this technology. Results. Using synthetic data we successfully detect unique variants at dilution levels of 1 in a 1,000,000 molecules, and find DeeepSNVMiner obtains significantly lower false positive and false negative rates compared to popular variant callers GATK, SAMTools, FreeBayes and LoFreq, particularly as the variant concentration levels decrease. In a dilution series with genomic DNA from two cells lines, we find DeepSNVMiner identifies a known somatic variant when present at concentrations of only 1 in 1,000 molecules in the input material, the lowest concentration amongst all variant callers tested. Conclusions. Here we present DeepSNVMiner; a tool to disambiguate tagged sequence groups and robustly identify sequence variants specific to subsets of starting DNA molecules that may indicate the presence of a disease. DeepSNVMiner is an automated workflow of custom sequence analysis utilities and open source tools able to differentiate somatic DNA variants from artefactual sequence

  3. Distinctive Drug-resistant Mutation Profiles and Interpretations of HIV-1 Proviral DNA Revealed by Deep Sequencing in Reverse Transcriptase

    Institute of Scientific and Technical Information of China (English)

    YIN Qian Qian; SHAO Yi Ming; MA Li Ying; LI Zhen Peng; ZHAO Hai; PAN Dong; WANG Yan; XU Wei Si; XING Hui; FENGYi; JIANG Shi Bo

    2016-01-01

    ObjectiveTo investigate distinctive features in drug-resistant mutations(DRMs) and interpretations for reverse transcriptase inhibitors (RTIs) between proviral DNA and paired viral RNA in HIV-1-infected patients. MethodsForty-three HIV-1-infected individuals receiving first-line antiretroviral therapy were recruited to participate in a multicenter AIDS Cohort Study in Anhui and Henan Provinces in China in 2004. Drug resistance genotyping was performed by bulk sequencing and deep sequencing on the plasma and whole blood of 77 samples, respectively. Drug-resistance interpretation was compared between viral RNA and paired proviral DNA. ResultsCompared with bulk sequencing, deep sequencing could detect more DRMs and samples with DRMs in both viral RNA and proviral DNA. The mutations M184I and M230I were more prevalent in proviral DNA than in viral RNA (Fisher’s exact test,P ConclusionCompared with viral RNA, the distinctive information of DRMsand drug resistance interpretations for proviral DNA could be obtained by deep sequencing, which could provide more detailed and precise information for drug resistance monitoring and the rational design of optimal antiretroviral therapy regimens.

  4. Genomic region operation kit for flexible processing of deep sequencing data.

    Science.gov (United States)

    Ovaska, Kristian; Lyly, Lauri; Sahu, Biswajyoti; Jänne, Olli A; Hautaniemi, Sampsa

    2013-01-01

    Computational analysis of data produced in deep sequencing (DS) experiments is challenging due to large data volumes and requirements for flexible analysis approaches. Here, we present a mathematical formalism based on set algebra for frequently performed operations in DS data analysis to facilitate translation of biomedical research questions to language amenable for computational analysis. With the help of this formalism, we implemented the Genomic Region Operation Kit (GROK), which supports various DS-related operations such as preprocessing, filtering, file conversion, and sample comparison. GROK provides high-level interfaces for R, Python, Lua, and command line, as well as an extension C++ API. It supports major genomic file formats and allows storing custom genomic regions in efficient data structures such as red-black trees and SQL databases. To demonstrate the utility of GROK, we have characterized the roles of two major transcription factors (TFs) in prostate cancer using data from 10 DS experiments. GROK is freely available with a user guide from >http://csbi.ltdk.helsinki.fi/grok/. PMID:23702556

  5. Multiple platform assessment of the EGF dependent transcriptome by microarray and deep tag sequencing analysis

    Directory of Open Access Journals (Sweden)

    Iraola Susana

    2011-06-01

    Full Text Available Abstract Background Epidermal Growth Factor (EGF is a key regulatory growth factor activating many processes relevant to normal development and disease, affecting cell proliferation and survival. Here we use a combined approach to study the EGF dependent transcriptome of HeLa cells by using multiple long oligonucleotide based microarray platforms (from Agilent, Operon, and Illumina in combination with digital gene expression profiling (DGE with the Illumina Genome Analyzer. Results By applying a procedure for cross-platform data meta-analysis based on RankProd and GlobalAncova tests, we establish a well validated gene set with transcript levels altered after EGF treatment. We use this robust gene list to build higher order networks of gene interaction by interconnecting associated networks, supporting and extending the important role of the EGF signaling pathway in cancer. In addition, we find an entirely new set of genes previously unrelated to the currently accepted EGF associated cellular functions. Conclusions We propose that the use of global genomic cross-validation derived from high content technologies (microarrays or deep sequencing can be used to generate more reliable datasets. This approach should help to improve the confidence of downstream in silico functional inference analyses based on high content data.

  6. Deep sequencing reveals unique small RNA repertoire that is regulated during head regeneration in Hydra magnipapillata.

    Science.gov (United States)

    Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda

    2013-01-01

    Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem-loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping-pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration. PMID:23166307

  7. cDNA sequences of two arylphorin subunits of an insect biliprotein: phylogenetic differences and gene duplications during evolution of hexamerins-implications for hexamer formation.

    Science.gov (United States)

    Lieb, Bernhard; Ebner, Bettina; Kayser, Hartmut

    2016-03-01

    Arylphorins represent a conserved class of hexameric ∼500 kDa insect hemolymph glycoproteins, rich in aromatic amino acids, which are produced in large quantities at the larval stage as reserves for metamorphosis and egg development. The recently isolated arylphorin from the moth Cerura vinula is unique in being complexed to a novel farnesylated bilin. Protein sequencing suggested the presence of two different ∼85 kDa subunits. Here, we report the complete coding sequences of two cDNAs encoding two arylphorins subunits with 67% identity and calculated physicochemical characteristics in agreement with the isolated holoprotein. Our phylogenetic analyses of the hexamerins revealed monophyletic origins not only for each of the arylphorins and methionine-rich proteins (H-type and M-type), the two major classes of hexamerins, but also for the minor groups of arylphorin-like and riboflavin-binding hexamerins. We named the latter proteins X-type (mixed type) hexamerins because they share sequence features with both major groups, and they show unique deletions and insertions at conserved sites located on the protein surface. We present a phylogenetic tree of lepidopteran hexamerins, which is in agreement with actual systematics. Overall, duplications of hexamerin genes occurred independently in several lepidopteran lineages. We also analyzed the hexamerin sequences for key parameters, which characterize each type of hexamerins. Based on the crystal structure of the homomeric arylphorin from Antheraea pernyi, we present a model for the heteromeric Cerura protein focusing on the role of N-glycan structures in stabilizing the hexamer structure. PMID:27062544

  8. A comparative study of 2',3'-cyclic-nucleotide 3'-phosphodiesterase in vertebrates: cDNA cloning and amino acid sequences for chicken and bullfrog enzymes.

    Science.gov (United States)

    Kasama-Yoshida, H; Tohyama, Y; Kurihara, T; Sakuma, M; Kojima, H; Tamai, Y

    1997-10-01

    In mammalian brain, two 2',3'-cyclic-nucleotide 3'-phosphodiesterase (EC 3.1.4.37) isoforms, CNP1 and CNP2, are translated, respectively, from the two mRNAs, which have been transcribed and processed by alternative use of the two transcription start points and by differential splicing. In the present study, the cDNAs encoding chicken CNP2 and bullfrog CNP1, respectively, were isolated, and the amino acid sequences of chicken CNP2 and bullfrog CNP1 were deduced. Western blot analysis showed that chicken brain contains a major CNP2-type protein together with a minor unidentified isoform, and bullfrog brain contains only a CNP1-type protein. All available amino acid sequences of vertebrate 2',3'-cyclic-nucleotide 3'-phosphodiesterases were aligned and compared. Three conserved motif sequences were noted: (a) an ATP-binding site near the amino terminus, (b) an isoprenylation site at the carboxyl terminus, and (c) a probable catalytic site resembling the active site of beta-ketoacyl synthase (EC 2.3.1.41). The second and the third motifs are conserved also in goldfish RICH (regeneration-induced 2',3'-cyclic-nucleotide 3'-phosphodiesterase homologue), which has been shown recently to have 2',3'-cyclic-nucleotide 3'-phosphodiesterase activity. The third motif (probably catalytic site) was assigned for the first time in the present report. PMID:9326261

  9. Genome-wide identification of Schistosoma japonicum microRNAs using a deep-sequencing approach.

    Directory of Open Access Journals (Sweden)

    Jian Huang

    Full Text Available BACKGROUND: Human schistosomiasis is one of the most prevalent and serious parasitic diseases worldwide. Schistosoma japonicum is one of important pathogens of this disease. MicroRNAs (miRNAs are a large group of non-coding RNAs that play important roles in regulating gene expression and protein translation in animals. Genome-wide identification of miRNAs in a given organism is a critical step to facilitating our understanding of genome organization, genome biology, evolution, and posttranscriptional regulation. METHODOLOGY/PRINCIPAL FINDINGS: We sequenced two small RNA libraries prepared from different stages of the life cycle of S. japonicum, immature schistosomula and mature pairing adults, through a deep DNA sequencing approach, which yielded approximately 12 million high-quality short sequence reads containing a total of approximately 2 million non-redundant tags. Based on a bioinformatics pipeline, we identified 176 new S. japonicum miRNAs, of which some exhibited a differential pattern of expression between the two stages. Although 21 S. japonicum miRNAs are orthologs of known miRNAs within the metazoans, some nucleotides at many positions of Schistosoma miRNAs, such as miR-8, let-7, miR-10, miR-31, miR-92, miR-124, and miR-125, are indeed significantly distinct from other bilaterian orthologs. In addition, both miR-71 and some miR-2 family members in tandem are found to be clustered in a reversal direction model on two genomic loci, and two pairs of novel S. japonicum miRNAs were derived from sense and antisense DNA strands at the same genomic loci. CONCLUSIONS/SIGNIFICANCE: The collection of S. japonicum miRNAs could be used as a new platform to study the genomic structure, gene regulation and networks, evolutionary processes, development, and host-parasite interactions. Some S. japonicum miRNAs and their clusters could represent the ancestral forms of the conserved orthologues and a model for the genesis of novel miRNAs.

  10. cDNA: 12295 [

    Lifescience Database Archive (English)

    Full Text Available H. sapiens + Hs.267288 Homo sapiens full open reading ... frame cDNA clone RZPDo834G0212D for gene C ... 6orf55, chromosome 6 open reading ... frame 55; complete cds, incl. stopcodon gnl|UG|Hs# ...

  11. cDNA: 16610 [

    Lifescience Database Archive (English)

    Full Text Available H. sapiens + Hs.62595 Homo sapiens full open reading ... frame cDNA clone RZPDo834H088D for gene C9o ... rf9, chromosome 9 open reading ... frame 9; complete cds, incl. stopcodon gnl|UG|Hs#S ...

  12. cDNA: 3940 [

    Lifescience Database Archive (English)

    Full Text Available H. sapiens - Hs.7188 Homo sapiens cDNA PSEC0078 fis, clone NT2RP2004036, moderately similar to M ... -Sema F=a factor in neural network ... development. gnl|UG|Hs#S4806431 AK075388 2/887_394 ...

  13. cDNA: 39377 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.7091 Mus musculus adult pancreas islet cells cDNA, RIKEN fu ll-length enriched l ... R BETA SUBUNIT) (SSR-BETA) homolog [Homo sapiens], fu ... ... gnl|UG|Mm#S10839939 AK050505 3/6436_39377.png ...

  14. cDNA: 46817 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.30092 Mus musculus adult male cerebellum cDNA, RIKEN fu ll-length enriched libra ... 1 PRECURSOR (EC 3.4.21.-) homolog [Homo sapiens], fu ... ... gnl|UG|Mm#S10838326 AK075719 8/7837_46817.png ...

  15. cDNA: 56670 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083407 AK006184 17/9809_56670.png ...

  16. cDNA: 56898 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083407 AK006184 17/9810_56898.png ...

  17. cDNA: 56671 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083172 AK006562 17/9809_56671.png ...

  18. cDNA: 56677 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083407 AK006184 17/9809_56677.png ...

  19. cDNA: 56899 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083172 AK006562 17/9810_56899.png ...

  20. cDNA: 46327 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.292517 Mus musculus 12 days embryo spinal ganglion cDNA, RIKEN fu ll-length enri ... PHA CHAIN) (PHERS) (CML33) homolog [Homo sapiens], fu ... ... gnl|UG|Mm#S10835133 AK084031 8/7824_46327.png ...

  1. cDNA: 56891 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083407 AK006184 17/9810_56891.png ...

  2. cDNA: 56678 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083172 AK006562 17/9809_56678.png ...

  3. cDNA: 41699 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.246636 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... e/G-protein beta WD-40 repeats containing protein, fu ... ... gnl|UG|Mm#S9075317 AK016965 5/6970_41699.png ...

  4. cDNA: 56892 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus - Mm.10747 Mus musculus adult male testis cDNA, RIKEN fu ll-length enriched library, ... PONSE FACTOR) (MRF-1) homolog [Rattus norvegicus], fu ... ... gnl|UG|Mm#S9083172 AK006562 17/9810_56892.png ...

  5. cDNA: 36690 [

    Lifescience Database Archive (English)

    Full Text Available M. musculus + Mm.345070 Mus musculus 16 days embryo head cDNA, RIKEN fu ll-length enriched librar ... SPLICEOSOME ASSOCIATED PROTEIN 49) [Homo sapiens], fu ... ... gnl|UG|Mm#S10835972 AK081584 2/6005_36690.png ...

  6. Genome-wide detection and analysis of hippocampus core promoters using DeepCAGE

    DEFF Research Database (Denmark)

    Valen, Eivind; Pascarella, Giovanni; Chalk, Alistair;

    2009-01-01

    given tissue. Here, we present a new method for high-throughput sequencing of 5' cDNA tags-DeepCAGE: merging the Cap Analysis of Gene Expression method with ultra-high-throughput sequence technology. We apply DeepCAGE to characterize 1.4 million sequenced TSS from mouse hippocampus and reveal a wealth...... of novel core promoters that are preferentially used in hippocampus: This is the most comprehensive promoter data set for any tissue to date. Using these data, we present evidence indicating a key role for the Arnt2 transcription factor in hippocampus gene regulation. DeepCAGE can also detect...

  7. Long-range organization of reiterated sequences, including the SSX1 cDNA, at the OATL1 cluster in Xp11.23

    Energy Technology Data Exchange (ETDEWEB)

    Chand, A.; Craig, Ian W. [Genetics Lab., Oxford (United Kingdom); Clark, J. [Haddow Labs., Surrey (United Kingdom)] [and others

    1995-12-10

    Analysis of four YACs at the OATL1 locus was undertaken to determine the organization of locally repeated sequences within the OATL1 cluster. A restriction map of ICRFy900C0874, a 600-kb YAC, was constructed using a range of rare-cutting enzymes. Several markers were isolated from the OATL1 cluster, and the YAC map was used for their localization. Markers are shown to be reiterated within the OATL1 cluster, and fine-scale mapping has identified the accurate map position and organization of these repeated sequences within a 275-kb interval. The SSX1 gene involved in synovial sarcoma tumorigenesis and localized at the OATL1 cluster is also shown to be present in a minimum of five copies, all of which map to the aforementioned region. Several markers generated in this study have homologous counterparts at OATL2; they have been utilized to delimit the duplicated region at the OATL clusters. Although instability of one of the OATL1 YACs, ICRFy900F0501, has precluded a precise sizing of this interval, it has been possible to place an upper limit of 520 kb on this region of duplication. 33 refs., 4 figs., 3 tabs.

  8. Microbial Dark Matter: Unusual intervening sequences in 16S rRNA genes of candidate phyla from the deep subsurface

    Energy Technology Data Exchange (ETDEWEB)

    Jarett, Jessica; Stepanauskas, Ramunas; Kieft, Thomas; Onstott, Tullis; Woyke, Tanja

    2014-03-17

    The Microbial Dark Matter project has sequenced genomes from over 200 single cells from candidate phyla, greatly expanding our knowledge of the ecology, inferred metabolism, and evolution of these widely distributed, yet poorly understood lineages. The second phase of this project aims to sequence an additional 800 single cells from known as well as potentially novel candidate phyla derived from a variety of environments. In order to identify whole genome amplified single cells, screening based on phylogenetic placement of 16S rRNA gene sequences is being conducted. Briefly, derived 16S rRNA gene sequences are aligned to a custom version of the Greengenes reference database and added to a reference tree in ARB using parsimony. In multiple samples from deep subsurface habitats but not from other habitats, a large number of sequences proved difficult to align and therefore to place in the tree. Based on comparisons to reference sequences and structural alignments using SSU-ALIGN, many of these ?difficult? sequences appear to originate from candidate phyla, and contain intervening sequences (IVSs) within the 16S rRNA genes. These IVSs are short (39 - 79 nt) and do not appear to be self-splicing or to contain open reading frames. IVSs were found in the loop regions of stem-loop structures in several different taxonomic groups. Phylogenetic placement of sequences is strongly affected by IVSs; two out of three groups investigated were classified as different phyla after their removal. Based on data from samples screened in this project, IVSs appear to be more common in microbes occurring in deep subsurface habitats, although the reasons for this remain elusive.

  9. Deep Sequencing Reveals the Complete Genome and Evidence for Transcriptional Activity of the First Virus-Like Sequences Identified in Aristotelia chilensis (Maqui Berry

    Directory of Open Access Journals (Sweden)

    Javier Villacreses

    2015-04-01

    Full Text Available Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence as Aristotelia chilensis Virus 1 (AcV1. High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs: ORFs 1 and 2 shares 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV, Petuvirus genus. ORF1 encodes a movement protein (MP; ORF2 a Reverse Transcriptase (RT and a Ribonuclease H (RNase H domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs, AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq. Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggests that AcV1 could be a new member of Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant.

  10. Deep sequencing reveals the complete genome and evidence for transcriptional activity of the first virus-like sequences identified in Aristotelia chilensis (Maqui Berry).

    Science.gov (United States)

    Villacreses, Javier; Rojas-Herrera, Marcelo; Sánchez, Carolina; Hewstone, Nicole; Undurraga, Soledad F; Alzate, Juan F; Manque, Patricio; Maracaja-Coutinho, Vinicius; Polanco, Victor

    2015-04-01

    Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence as Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs): ORFs 1 and 2 shares 66%-73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV), Petuvirus genus. ORF1 encodes a movement protein (MP); ORF2 a Reverse Transcriptase (RT) and a Ribonuclease H (RNase H) domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs), AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq). Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggests that AcV1 could be a new member of Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant. PMID:25855242

  11. cDNA Clone of Prophenoloxidase for Litopenaeus Stylirostris and Sequence Structure Analysis%细角滨对虾酚氧化酶原cDNA 克隆及序列结构分析

    Institute of Scientific and Technical Information of China (English)

    许尤厚; 胡超群

    2015-01-01

    采用 RT-PCR 原理和长片段扩增技术克隆细角滨对虾酚氧化酶原基因。结果表明,细角滨对虾血淋巴细胞内存在2个 proPO 基因。 proPO gene 1的 cDNA 序列包含有372氨基酸,前190个氨基酸为一个M 家族血蓝蛋白,是一个铜结合位点区域,191-372为一个 C 家族的血蓝蛋白,是一个免疫球蛋白样的区域。proPO gene 2的2个功能位点之间的序列有重叠,proPO gene 2 cDNA 序列的6-935bp 包含了第一个功能位点,928-1464bp 则包含了第二个功能位点。系统进化树比对分析发现2个基因之间的序列差异非常大。细角滨对虾和凡纳滨对虾的 proPO gene 2同处于一个密切相关的群,proPO gene 1则和其他几种对虾的 proPO gene 处于一个群。 proPO gene 2与 proPO gene 1在对虾免疫活动中是否存在不同的功能还有待于进一步的研究。%Prophenoloxidase (proPO) is one of the important factors on humoral immunity of shrimp, so far there are no re-ports for Litopenaeus stylirostris. Depend on techniques of RT-PCR and long fragment amplification cloning, prophenoloxidase gene of L. stylirostris was cloned. The results show that, there are two proPO genes in the lymphocytes of L. stylirostris. ProPO gene 1 cDNA sequence contains 372 amino acids, the first 190 amino acids are a family of M hemocyanin, a copper binding site region, 191-372 is one of the C family of hemocyanin, is an immunoglobulin like region. There are sequence overlap between the 2 functional sites of proPO gene 2, which means that 6-935bp contains the first functional sites, while 928-1464bp contains sec-ond functional sites. The phylogenetic tree alignment analysis showed that sequence structures of two genes is very different. Pro-PO gene 2 of L. stylirostris and L. vannamei was in a closely related group; but proPO gene 1 of L. stylirostris and L. vannamei was in another group with other several shrimp. The function of ProPO gene 2 and proPO gene 1 in shrimp immune

  12. 杜仲HDR基因全长cDNA克隆与序列分析%Cloning and Sequence Analysis of 1-Hydroxy-2-Methyl-2-E-Butenyl-4-Diphosphate Reductase Gene cDNA from Eucommia ulmoides

    Institute of Scientific and Technical Information of China (English)

    刘攀峰; 杜红岩; 乌云塔娜; 杜兰英; 孙志强

    2013-01-01

    以杜仲叶片cDNA为模板,采用反转录RCR及RACE技术分离出HDR基因的cDNA克隆,命名为EuHDR.EuHDR基因cDNA全长1 653 bp,5'端非编码区长82 bp,3'端非编码区长188 bp,编码460个氨基酸,与喜树HDR基因序列相似性最高,达82%;推导EuHDR氨基酸序列中包含转运肽序列(A1-A33)及植物HDR蛋白多个保守的功能位点(A117,A208,A262,A345);EuHDR蛋白二级结构α-螺旋占35.65%,β-折叠占19.78%,螺环结构占44.57%;EuHDR蛋白三级结构为单体形式,呈不规则的三叶草形状;系统进化分析表明EuHDR蛋白与葡萄HDR蛋白的亲缘关系最为接近.%1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate reductase (HDR) synthesizes IPP and DMAPP in the last step of the plant 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway.Homologous HDR gene cDNA was isolated from the leaves of Eucommia ulmoides by the method of reverse transcription polymerase chain reaction (RTPCR) and rapid amplification of cDNA ends (RACE) technique,and named as EuHDR.With the highest gene sequence similarity to Camptotheca acuminata (82%),the full-length cDNA of EuHDR was 1 653 bp including 5'non-coding region of 82 bp and 3' non-coding region of 188 bp and encoded 460 amino acids.The transit peptide sequence (A1-A33) and multiple conserved functional sites(A117,A208,A262,A345)of plant HDR protein were found in the deduced coding sequence of EuHDR.The secondary structure of EuHDR protein was predicted with proportion of α-helix to 35.65%,β-sheet to 19.78% and loop/coil to 44.57%.The calculated protein tertiary structure of EuHDR was formed as monomer,which in space displayed asymmetrical shamrock-like shape.Phylogenetic analysis revealed that the evolutionary relationship of EuHDR protein was the closest to Vitis vinifera HDR protein.

  13. The cDNA Cloning and Analysis of Sequence Information and Quantitative Express of Chrysanthemum Rhythms Clock Output Gene CmGI (GIGANTEA)%菊花节律钟输出基因CmGI(GIGANTEA)的 cDNA 全长克隆、序列信息及定量表达分析

    Institute of Scientific and Technical Information of China (English)

    孙霞; 王秀峰; 郑成淑; 邢世岩; 束怀瑞

    2012-01-01

    [目的]克隆菊花节律钟输出基因GIGANTEA的cDNA全长序列,进行序列信息学分析,研究其mRNA的相对定量表达.[方法]利用多聚酶链式反应(PCR)结合5′RACE、3′RACE技术,克隆节律钟输出基因GIGANTEA的cDNA全长序列,应用生物信息学软件对获得的基因核苷酸序列及编码的蛋白质序列进行分析;通过在线建模软件对蛋白质的三维结构进行建模预测;利用实时荧光定量PCR技术,用2-△△Ct法进行GIGANTEA的mRNA相对定量表达分析.[结果]从菊花品种‘Jinba’中克隆得到节律钟输出基因GIGANTEA的cDNA全长序列,核苷酸序列长度3461bp,开放阅读框3453bp,编码1150个氨基酸.氨基酸序列分析显示,该基因编码的蛋白与植物节律钟输出基因GIGANTEA同源,命名为CmGI基因,序列提交到GenBank,登录号为JQ043439.序列比对显示与葡萄、蓖麻等的GI的相似度依次为76%、75%.构建类似蛋白系统进化树显示,菊花CmGI与拟南芥(Arabidopsis thaliana GIGANTEA,ABP96482.1)分子进化距离最近,其次是白菜(Brassica rapa GIGANTEA,AEB33730.1);预测CmGI蛋白有6个跨膜螺旋多次跨膜;为转录因子,定位在细胞核中,为非分泌性蛋白质;不具备信号肤;对CmGI三级结构建模预测表明,蛋白核心结构符合转录因子与DNA结合常见的功能域HTH、HLH;采用荧光相对定量分析,菊花CmGI的表达呈昼夜节律表达模式;不同花芽分化阶段叶片中CmGI基因mRNA水平差异大,两个高峰值分别出现在花芽分化启动期和小花原基分化中期;营养生长的组培苗、长日照条件下的叶、芽、花蕾期均是痕量表达;盛花期表达量依次为叶片>舌状花>筒状花.[结论]从菊花中克隆得到节律钟输出基因CmGI,对该基因的进一步深入研究有助于探索光周期途径菊花成花的分子调控机制,可作为切花菊花期调控分子育种的目标基因.%[ Objective ] The cDNA sequence of

  14. Deep sequencing reveals a novel closterovirus associated with wild rose leaf rosette disease.

    Science.gov (United States)

    He, Yan; Yang, Zuokun; Hong, Ni; Wang, Guoping; Ning, Guogui; Xu, Wenxing

    2015-06-01

    A bizarre virus-like symptom of a leaf rosette formed by dense small leaves on branches of wild roses (Rosa multiflora Thunb.), designated as 'wild rose leaf rosette disease' (WRLRD), was observed in China. To investigate the presumed causal virus, a wild rose sample affected by WRLRD was subjected to deep sequencing of small interfering RNAs (siRNAs) for a complete survey of the infecting viruses and viroids. The assembly of siRNAs led to the reconstruction of the complete genomes of three known viruses, namely Apple stem grooving virus (ASGV), Blackberry chlorotic ringspot virus (BCRV) and Prunus necrotic ringspot virus (PNRSV), and of a novel virus provisionally named 'rose leaf rosette-associated virus' (RLRaV). Phylogenetic analysis clearly placed RLRaV alongside members of the genus Closterovirus, family Closteroviridae. Genome organization of RLRaV RNA (17,653 nucleotides) showed 13 open reading frames (ORFs), except ORF1 and the quintuple gene block, most of which showed no significant similarities with known viral proteins, but, instead, had detectable identities to fungal or bacterial proteins. Additional novel molecular features indicated that RLRaV seems to be the most complex virus among the known genus members. To our knowledge, this is the first report of WRLRD and its associated closterovirus, as well as two ilarviruses and one capilovirus, infecting wild roses. Our findings present novel information about the closterovirus and the aetiology of this rose disease which should facilitate its control. More importantly, the novel features of RLRaV help to clarify the molecular and evolutionary features of the closterovirus. PMID:25187347

  15. Development of a candidate reference material for adventitious virus detection in vaccine and biologicals manufacturing by deep sequencing

    Science.gov (United States)

    Mee, Edward T.; Preston, Mark D.; Minor, Philip D.; Schepelmann, Silke; Huang, Xuening; Nguyen, Jenny; Wall, David; Hargrove, Stacey; Fu, Thomas; Xu, George; Li, Li; Cote, Colette; Delwart, Eric; Li, Linlin; Hewlett, Indira; Simonyan, Vahan; Ragupathy, Viswanath; Alin, Voskanian-Kordi; Mermod, Nicolas; Hill, Christiane; Ottenwälder, Birgit; Richter, Daniel C.; Tehrani, Arman; Jacqueline, Weber-Lehmann; Cassart, Jean-Pol; Letellier, Carine; Vandeputte, Olivier; Ruelle, Jean-Louis; Deyati, Avisek; La Neve, Fabio; Modena, Chiara; Mee, Edward; Schepelmann, Silke; Preston, Mark; Minor, Philip; Eloit, Marc; Muth, Erika; Lamamy, Arnaud; Jagorel, Florence; Cheval, Justine; Anscombe, Catherine; Misra, Raju; Wooldridge, David; Gharbia, Saheer; Rose, Graham; Ng, Siemon H.S.; Charlebois, Robert L.; Gisonni-Lex, Lucy; Mallet, Laurent; Dorange, Fabien; Chiu, Charles; Naccache, Samia; Kellam, Paul; van der Hoek, Lia; Cotten, Matt; Mitchell, Christine; Baier, Brian S.; Sun, Wenping; Malicki, Heather D.

    2016-01-01

    Background Unbiased deep sequencing offers the potential for improved adventitious virus screening in vaccines and biotherapeutics. Successful implementation of such assays will require appropriate control materials to confirm assay performance and sensitivity. Methods A common reference material containing 25 target viruses was produced and 16 laboratories were invited to process it using their preferred adventitious virus detection assay. Results Fifteen laboratories returned results, obtained using a wide range of wet-lab and informatics methods. Six of 25 target viruses were detected by all laboratories, with the remaining viruses detected by 4–14 laboratories. Six non-target viruses were detected by three or more laboratories. Conclusion The study demonstrated that a wide range of methods are currently used for adventitious virus detection screening in biological products by deep sequencing and that they can yield significantly different results. This underscores the need for common reference materials to ensure satisfactory assay performance and enable comparisons between laboratories. PMID:26709640

  16. Absence of major histocompatibility complex class II mediated immunity in pipefish, Syngnathus typhle: evidence from deep transcriptome sequencing

    OpenAIRE

    Haase, David; Roth, Olivia; Kalbe, Martin; Schmiedeskamp, Gisela; Scharsack, Jörn P; Rosenstiel, Philip; Reusch, Thorsten B. H.

    2013-01-01

    The major histocompatibility complex (MHC)-mediated adaptive immune system is the hallmark of gnathostome immune defence. Recent work suggests that cod-like fishes (Gadidae) lack important components of the MHC class II mediated immunity. Here, we report a putative independent loss of functionality of this pathway in another species, the pipefish Syngnathus typhle, that belongs to a distantly related fish family (Syngnathidae). In a deep transcriptome sequencing approach comprising several in...

  17. Identification of MicroRNAs and Transcript Targets in Camelina sativa by Deep Sequencing and Computational Methods

    OpenAIRE

    Poudel, Saroj; Aryal, Niranjan; Lu, Chaofu

    2015-01-01

    Camelina sativa is an annual oilseed crop that is under intensive development for renewable resources of biofuels and industrial oils. MicroRNAs, or miRNAs, are endogenously encoded small RNAs that play key roles in diverse plant biological processes. Here, we conducted deep sequencing on small RNA libraries prepared from camelina leaves, flower buds and two stages of developing seeds corresponding to initial and peak storage products accumulation. Computational analyses identified 207 known ...

  18. High-throughput verification of transcriptional starting sites by Deep-RACE

    DEFF Research Database (Denmark)

    Olivarius, Signe; Plessy, Charles; Carninci, Piero

    2009-01-01

    We present a high-throughput method for investigating the transcriptional starting sites of genes of interest, which we named Deep-RACE (Deep–rapid amplification of cDNA ends). Taking advantage of the latest sequencing technology, it allows the parallel analysis of multiple genes and is free of...

  19. Deep sequencing of immune repertoires during bovine development and in response to respiratory pathogen challenge

    Science.gov (United States)

    Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Single-molecule circular consensus sequencing permits the sequencing of expressed antibody repertoires at previously unattainable depths of coverage and accuracy. We examined...

  20. Identification of Genes Potentially Responsible for extra-Oral Digestion and Overcoming Plant Defense from Salivary Glands of the Tarnished Plant Bug (Hemiptera: Miridae) Using cDNA Sequencing.

    Science.gov (United States)

    Zhu, Yu-Cheng; Yao, Jianxiu; Luttrell, Randall

    2016-01-01

    Saliva is known to play a crucial role in tarnished plant bug (TPB, Lygus lineolaris [Palisot de Beauvois]) feeding. By facilitating the piercing, the enzyme-rich saliva may be used for extra-oral digestion and for overcoming plant defense before the plant fluids are ingested by TPBs. To identify salivary gland genes, mRNA was extracted from salivary glands and cDNA library clones were sequenced. A de novo-assembling of 7,000 Sanger sequences revealed 666 high-quality unique cDNAs with an average size of 624 bp, in which the identities of 347 cDNAs were determined using Blast2GO. Kyoto Encyclopedia of Genes and Genomes analysis indicated that these genes participate in eighteen metabolic pathways. Identifications of large number of enzyme genes in TPB salivary glands evidenced functions for extra-oral digestion and feeding damage mechanism, including 45 polygalacturonase, two α- amylase, one glucosidase, one glycan enzyme, one aminopeptidase, four lipase, and many serine protease cDNAs. The presence of multiple transcripts, multigene members, and high abundance of cell wall degradation enzymes (polygalacturonases) indicated that the enzyme-rich saliva may cause damage to plants by breaking down plant cell walls to make nutrients available for feeding. We also identified genes potentially involved in insect adaptation and detoxifying xenobiotics that may allow insects to overcome plant defense responses, including four glutathione S-transferases, three esterases, one cytochrome P450, and several serine proteases. The gene profiles of TPB salivary glands revealed in this study provides a foundation for further understanding and potential development of novel enzymatic inhibitors, or other RNAi approaches that may interrupt or minimize TPB feeding damage. PMID:27324587

  1. Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts.

    Science.gov (United States)

    Lin, Chenwei; Mueller, Lukas A; Mc Carthy, James; Crouzillat, Dominique; Pétiard, Vincent; Tanksley, Steven D

    2005-12-01

    An EST database has been generated for coffee based on sequences from approximately 47,000 cDNA clones derived from five different stages/tissues, with a special focus on developing seeds. When computationally assembled, these sequences correspond to 13,175 unigenes, which were analyzed with respect to functional annotation, expression profile and evolution. Compared with Arabidopsis, the coffee unigenes encode a higher proportion of proteins related to protein modification/turnover and metabolism-an observation that may explain the high diversity of metabolites found in coffee and related species. Several gene families were found to be either expanded or unique to coffee when compared with Arabidopsis. A high proportion of these families encode proteins assigned to functions related to disease resistance. Such families may have expanded and evolved rapidly under the intense pathogen pressure experienced by a tropical, perennial species like coffee. Finally, the coffee gene repertoire was compared with that of Arabidopsis and Solanaceous species (e.g. tomato). Unlike Arabidopsis, tomato has a nearly perfect gene-for-gene match with coffee. These results are consistent with the facts that coffee and tomato have a similar genome size, chromosome karyotype (tomato, n=12; coffee n=11) and chromosome architecture. Moreover, both belong to the Asterid I clade of dicot plant families. Thus, the biology of coffee (family Rubiacaeae) and tomato (family Solanaceae) may be united into one common network of shared discoveries, resources and information. PMID:16273343

  2. Deep sequencing extends the diversity of human papillomaviruses in human skin.

    OpenAIRE

    Bzhalava, Davit; Mühr, Laila Sara Arroyo; Lagheden, Camilla; Ekström, Johanna; Forslund, Ola; Dillner, Joakim; Hultin, Emilie

    2014-01-01

    Most viruses in human skin are known to be human papillomaviruses (HPVs). Previous sequencing of skin samples has identified 273 different cutaneous HPV types, including 47 previously unknown types. In the present study, we wished to extend prior studies using deeper sequencing. This deeper sequencing without prior PCR of a pool of 142 whole genome amplified skin lesions identified 23 known HPV types, 3 novel putative HPV types and 4 non-HPV viruses. The complete sequence was obtained for one...

  3. Reproducibility of Illumina platform deep sequencing errors allows accurate determination of DNA barcodes in cells.

    OpenAIRE

    Beltman, J.B.; J. Urbanus; Velds, A.; de, Rooij, R.; Rohr, J.C.; S.H. Naik; T.N. Schumacher.

    2016-01-01

    BACKGROUND Next generation sequencing (NGS) of amplified DNA is a powerful tool to describe genetic heterogeneity within cell populations that can both be used to investigate the clonal structure of cell populations and to perform genetic lineage tracing. For applications in which both abundant and rare sequences are biologically relevant, the relatively high error rate of NGS techniques complicates data analysis, as it is difficult to distinguish rare true sequences from spurious sequences t...

  4. Tibetan antelope cystathionine γ -lyase:complete cDNA sequences%高原藏羚羊胱硫醚-γ-裂解酶基因克隆与全序列测定

    Institute of Scientific and Technical Information of China (English)

    李肃; 格日力

    2013-01-01

    Objective To identify the Cystathionine-γ-Lyase(CSE) genes coding sequences molecular cloning , exam the tissues expression spectrums and discuss the hypoxic adaptations mechanisms in Tibetan antelope .Methods The total RNA was extracted ,and the cDNA was captured by reverse transcription RT-PCR ,then identified ,se-quenced and cloned .Results There was 96 .47% homology between the Tibetan antelope gene fragment containing the purpose gene and the cattle gene in gen banks ,thus the result mean the gene which cloned before was CSE gene protein .The length of the CSE gene protein was been detected by designing primers according to the human ,mous , wild boar ,cattle CSEcDNA sequences ,and the CSE gene primers of tibetan antelope which tesied by Pnaman .Conclu-sion CSE gene protein might play an important role in the body of the Tibetan anteplope ,which provide experiment basis to the gene study about adaptation in high altitude hypoxia environment .in the future .%目的:探讨克隆高原藏羚羊胱硫醚-γ-裂解酶(CSE)基因编码区并检测其在成年高原藏羚羊组织中的表达,同时探讨高原藏羚羊低氧适应的分子生物学机制。方法从高原藏羚羊组织中提取总 RNA ,通过逆转录聚合酶链反应(RT-PCR)获得高原藏羚羊 cDNA ,并进行鉴定和测序。结果将含有目的片段克隆后经测序和 Blast分析,结果显示其部分编码序列与 GenBank 中牛 CSE 蛋白基因序列同源性96.47%,表明本实验所克隆的序列为CSE 蛋白基因。根据已知人、褐家鼠、小鼠、野猪、食蟹猴、家牛 CSEcDNA 序列和 Pnaman 软件设计高原藏羚羊 cse基因的引物。结论 CSE mRNA 可能在高原藏羚羊机体较为广泛的区域中发挥着作用,同时为高原低氧适应相关基因的研究提供了实验依据。

  5. Comparison of illumina and 454 deep sequencing in participants failing raltegravir-based antiretroviral therapy.

    Directory of Open Access Journals (Sweden)

    Jonathan Z Li

    Full Text Available The impact of raltegravir-resistant HIV-1 minority variants (MVs on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs.A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegravir resistance at the time of virologic failure. A control library was created by pooling integrase clones at predefined proportions. Multiplexed sequencing was performed with Illumina and 454 platforms at comparable costs. Illumina sequence analysis was performed with the novel snp-assess tool and 454 sequencing was analyzed with V-Phaser.Illumina sequencing resulted in significantly higher sequence coverage and a 0.095% limit of detection. Illumina accurately detected all MVs in the control library at ≥0.5% and 7/10 MVs expected at 0.1%. 454 sequencing failed to detect any MVs at 0.1% with 5 false positive calls. For MVs detected in the patient samples by both 454 and Illumina, the correlation in the detected variant frequencies was high (R2 = 0.92, P<0.001. Illumina sequencing detected 2.4-fold greater nucleotide MVs and 2.9-fold greater amino acid MVs compared to 454. The only raltegravir-resistant MV detected was an E138K mutation in one participant by Illumina sequencing, but not by 454.In participants of A5262 with raltegravir resistance at virologic failure, baseline raltegravir-resistant MVs were rarely detected. At comparable costs to 454 sequencing, Illumina demonstrated greater depth of coverage, increased sensitivity for detecting HIV MVs, and fewer false positive variant calls.

  6. Comparison of Illumina and 454 Deep Sequencing in Participants Failing Raltegravir-Based Antiretroviral Therapy

    OpenAIRE

    Li, Jonathan Z.; Chapman, Brad; Charlebois, Patrick; Hofmann, Oliver; Weiner, Brian; Porter, Alyssa J.; Samuel, Reshmi; Vardhanabhuti, Saran; ZHENG, LU; Eron, Joseph; Taiwo, Babafemi; Zody, Michael C; Henn, Matthew R.; Daniel R Kuritzkes; Hide, Winston

    2014-01-01

    Background: The impact of raltegravir-resistant HIV-1 minority variants (MVs) on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs. Methods: A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegra...

  7. Comparison of Illumina and 454 Deep Sequencing in Participants Failing Raltegravir-Based Antiretroviral Therapy

    OpenAIRE

    Li, Jonathan Z.; Brad Chapman; Patrick Charlebois; Oliver Hofmann; Brian Weiner; Porter, Alyssa J.; Reshmi Samuel; Saran Vardhanabhuti; Lu Zheng; Joseph Eron; Babafemi Taiwo; Zody, Michael C; Henn, Matthew R.; Daniel R Kuritzkes; Winston Hide

    2014-01-01

    BACKGROUND: The impact of raltegravir-resistant HIV-1 minority variants (MVs) on raltegravir treatment failure is unknown. Illumina sequencing offers greater throughput than 454, but sequence analysis tools for viral sequencing are needed. We evaluated Illumina and 454 for the detection of HIV-1 raltegravir-resistant MVs. METHODS: A5262 was a single-arm study of raltegravir and darunavir/ritonavir in treatment-naïve patients. Pre-treatment plasma was obtained from 5 participants with raltegra...

  8. HPV Population Profiling in Healthy Men by Next-Generation Deep Sequencing Coupled with HPV-QUEST.

    Science.gov (United States)

    Yin, Li; Yao, Jin; Chang, Kaifen; Gardner, Brent P; Yu, Fahong; Giuliano, Anna R; Goodenow, Maureen M

    2016-02-01

    Multiple-type human papillomaviruses (HPV) infection presents a greater risk for persistence in asymptomatic individuals and may accelerate cancer development. To extend the scope of HPV types defined by probe-based assays, multiplexing deep sequencing of HPV L1, coupled with an HPV-QUEST genotyping server and a bioinformatic pipeline, was established and applied to survey the diversity of HPV genotypes among a subset of healthy men from the HPV in Men (HIM) Multinational Study. Twenty-one HPV genotypes (12 high-risk and 9 low-risk) were detected in the genital area from 18 asymptomatic individuals. A single HPV type, either HPV16, HPV6b or HPV83, was detected in 7 individuals, while coinfection by 2 to 5 high-risk and/or low-risk genotypes was identified in the other 11 participants. In two individuals studied for over one year, HPV16 persisted, while fluctuations of coinfecting genotypes occurred. HPV L1 regions were generally identical between query and reference sequences, although nonsynonymous and synonymous nucleotide polymorphisms of HPV16, 18, 31, 35h, 59, 70, 73, cand85, 6b, 62, 81, 83, cand89 or JEB2 L1 genotypes, mostly unidentified by linear array, were evident. Deep sequencing coupled with HPV-QUEST provides efficient and unambiguous classification of HPV genotypes in multiple-type HPV infection in host ecosystems. PMID:26821041

  9. High-Throughput Plasmid cDNA Library Screening

    Energy Technology Data Exchange (ETDEWEB)

    Wan, Kenneth H.; Yu, Charles; George, Reed A.; Carlson, JosephW.; Hoskins, Roger A.; Svirskas, Robert; Stapleton, Mark; Celniker, SusanE.

    2006-05-24

    Libraries of cDNA clones are valuable resources foranalysing the expression, structure, and regulation of genes, as well asfor studying protein functions and interactions. Full-length cDNA clonesprovide information about intron and exon structures, splice junctionsand 5'- and 3'-untranslated regions (UTRs). Open reading frames (ORFs)derived from cDNA clones can be used to generate constructs allowingexpression of native proteins and N- or C-terminally tagged proteins.Thus, obtaining full-length cDNA clones and sequences for most or allgenes in an organism is critical for understanding genome functions.Expressed sequence tag (EST) sequencing samples cDNA libraries at random,which is most useful at the beginning of large-scale screening projects.However, as projects progress towards completion, the probability ofidentifying unique cDNAs via EST sequencing diminishes, resulting in poorrecovery of rare transcripts. We describe an adapted, high-throughputprotocol intended for recovery of specific, full-length clones fromplasmid cDNA libraries in five days.

  10. Deep sequencing reveals important roles of microRNAs in response to drought and salinity stress in cotton

    OpenAIRE

    Xie, Fuliang; Wang, Qinglian; Sun, Runrun; Zhang, Baohong

    2014-01-01

    Drought and salinity are two major environmental factors adversely affecting plant growth and productivity. However, the regulatory mechanism is unknown. In this study, the potential roles of small regulatory microRNAs (miRNAs) in cotton response to those stresses were investigated. Using next-generation deep sequencing, a total of 337 miRNAs with precursors were identified, comprising 289 known miRNAs and 48 novel miRNAs. Of these miRNAs, 155 miRNAs were expressed differentially. Target pred...

  11. Simultaneous identification of DNA and RNA viruses present in pig faeces using process-controlled deep sequencing.

    Directory of Open Access Journals (Sweden)

    Jana Sachsenröder

    Full Text Available BACKGROUND: Animal faeces comprise a community of many different microorganisms including bacteria and viruses. Only scarce information is available about the diversity of viruses present in the faeces of pigs. Here we describe a protocol, which was optimized for the purification of the total fraction of viral particles from pig faeces. The genomes of the purified DNA and RNA viruses were simultaneously amplified by PCR and subjected to deep sequencing followed by bioinformatic analyses. The efficiency of the method was monitored using a process control consisting of three bacteriophages (T4, M13 and MS2 with different morphology and genome types. Defined amounts of the bacteriophages were added to the sample and their abundance was assessed by quantitative PCR during the preparation procedure. RESULTS: The procedure was applied to a pooled faecal sample of five pigs. From this sample, 69,613 sequence reads were generated. All of the added bacteriophages were identified by sequence analysis of the reads. In total, 7.7% of the reads showed significant sequence identities with published viral sequences. They mainly originated from bacteriophages (73.9% and mammalian viruses (23.9%; 0.8% of the sequences showed identities to plant viruses. The most abundant detected porcine viruses were kobuvirus, rotavirus C, astrovirus, enterovirus B, sapovirus and picobirnavirus. In addition, sequences with identities to the chimpanzee stool-associated circular ssDNA virus were identified. Whole genome analysis indicates that this virus, tentatively designated as pig stool-associated circular ssDNA virus (PigSCV, represents a novel pig virus. CONCLUSION: The established protocol enables the simultaneous detection of DNA and RNA viruses in pig faeces including the identification of so far unknown viruses. It may be applied in studies investigating aetiology, epidemiology and ecology of diseases. The implemented process control serves as quality control, ensures

  12. Using deep RNA sequencing for the structural annotation of the Laccaria bicolor mycorrhizal transcriptome.

    Directory of Open Access Journals (Sweden)

    Peter E Larsen

    Full Text Available BACKGROUND: Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. METHODOLOGY: We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96% successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. CONCLUSIONS: 69% of expressed mycorrhizal JGI "best" gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene

  13. Using deep RNA sequencing for the structural annotation of the laccaria bicolor mycorrhizal transcriptome.

    Energy Technology Data Exchange (ETDEWEB)

    Larsen, P. E.; Trivedi, G.; Sreedasyam, A.; Lu, V.; Podila, G. K.; Collart, F. R.; Biosciences Division; Univ. of Alabama

    2010-07-06

    Accurate structural annotation is important for prediction of function and required for in vitro approaches to characterize or validate the gene expression products. Despite significant efforts in the field, determination of the gene structure from genomic data alone is a challenging and inaccurate process. The ease of acquisition of transcriptomic sequence provides a direct route to identify expressed sequences and determine the correct gene structure. We developed methods to utilize RNA-seq data to correct errors in the structural annotation and extend the boundaries of current gene models using assembly approaches. The methods were validated with a transcriptomic data set derived from the fungus Laccaria bicolor, which develops a mycorrhizal symbiotic association with the roots of many tree species. Our analysis focused on the subset of 1501 gene models that are differentially expressed in the free living vs. mycorrhizal transcriptome and are expected to be important elements related to carbon metabolism, membrane permeability and transport, and intracellular signaling. Of the set of 1501 gene models, 1439 (96%) successfully generated modified gene models in which all error flags were successfully resolved and the sequences aligned to the genomic sequence. The remaining 4% (62 gene models) either had deviations from transcriptomic data that could not be spanned or generated sequence that did not align to genomic sequence. The outcome of this process is a set of high confidence gene models that can be reliably used for experimental characterization of protein function. 69% of expressed mycorrhizal JGI 'best' gene models deviated from the transcript sequence derived by this method. The transcriptomic sequence enabled correction of a majority of the structural inconsistencies and resulted in a set of validated models for 96% of the mycorrhizal genes. The method described here can be applied to improve gene structural annotation in other species, provided

  14. Cloning and expression of cDNA for salmon growth hormone in Escherichia coli

    OpenAIRE

    Sekine, Susumu; Mizukami, Tamio; Nishi, Tatsunari; Kuwana, Yoshihisa; Saito, Akiko; Sato, Moriyuki; Itoh, Seiga; Kawauchi, Hiroshi

    1985-01-01

    cDNA clones encoding chum salmon (Oncorhynchus keta) growth hormone (sGH) have been isolated from a cDNA library prepared from chum salmon pituitary gland poly(A)+ RNA. Synthetic oligodeoxynucleotide mixtures based on amino acid residues 23-28 of sGH were used as hybridization probes to select recombinant plasmids carrying the sGH coding sequence. The complete nucleotide sequence of sGH cDNA has been determined. The cDNA sequence codes for a polypeptide of 210 amino acids, including a putativ...

  15. High diversity of picornaviruses in rats from different continents revealed by deep sequencing

    DEFF Research Database (Denmark)

    Arn Hansen, Thomas; Mollerup, Sarah; Nguyen, Nam-Phuong;

    2016-01-01

    Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus......) collected from two continents by analyzing 2.2 billion next-generation sequencing reads derived from both DNA and RNA. Among other virus families, we found sequences from members of the Picornaviridae to be abundant in the microbiome of all the samples. Here we describe the diversity of the picornavirus...

  16. 梅花鹿卵泡刺激素α-亚基cDNA的分子克隆与序列分析%Nucleotide sequence of cloned cDNA for α-subunit of sika follicle stimulating hormone

    Institute of Scientific and Technical Information of China (English)

    关洪斌; 李庆章; 张莉

    2002-01-01

    从新屠宰的母梅花鹿脑垂体中提取总RNA,反转录获得cDNA,以此cDNA为模板用PCR法扩增目的片段,获得长为380 bp的梅花鹿卵泡刺激素α--亚基cDNA片段,将它克隆至pMD-18-T-Verctor.随机挑选3个阳性重组子进行测序,并将测序结果与绵羊、牛、猪等多种哺乳动物该基因的核苷酸序列及相应氨基酸序列进行比较.结果表明,梅花鹿卵泡刺激素α--亚基基因编码的氨基酸序列与绵羊、水牛的该基因同源性最高,达97%,只有4个氨基酸不同;与牛的该基因同源性达96%.与人的该基因氨基酸序列同源性较低,为75%.其编码的核苷酸序列与绵羊、水牛、牛的同源性最高,达96%,只有14~16个碱基不同;与人的该基因核苷酸同源性最低,为84%.总的来说,哺乳动物的卵泡刺激素α-亚基具有很高的同源性.%Total RNA was prepared from pituitary gland of new butchered sika.cDNA was synthesized by RT-PCRreaction and this cDNA was used as model in PCR amplification for α-subunit of sika follicle stimulating hormone.The PCR product was 380bp in 1.2% agarose gel electro-phoresis which just was the target fragment of predictedFSHα-subunit. It was cloned it to pMD-18-T vector. 3 positive recombinant was selected at random to analyze itssequence by DNA analysis apparatus. Its amino acid sequence was compared with some other mammalian. The resultshows that it has the highest homology with sheep and buffalo,which it reaches 97%. There are only 4 amino acidsdifference among sika ,sheep and buffalo. It has lower homology in amino acid with human, its homology is 75%. Ithas the highest homology among sika ,sheep, buffalo and bovine in nucleotide sequence, which it reaches 96%.There are 14-16 nucleotides difference among them. It has lower homology in nucleotide sequence with human, it isonly 84%. It was found that the nucleotide sequence of the o-subunit in these mammalian species are highly con-servative. According to our

  17. Increasing Clinical Severity during a Dengue Virus Type 3 Cuban Epidemic: Deep Sequencing of Evolving Viral Populations

    Science.gov (United States)

    Blanc, Hervé; Bordería, Antonio V.; Díaz, Gisell; Henningsson, Rasmus; Gonzalez, Daniel; Santana, Emidalys; Alvarez, Mayling; Castro, Osvaldo; Fontes, Magnus; Vignuzzi, Marco; Guzman, Maria G.

    2016-01-01

    ABSTRACT During the dengue virus type 3 (DENV-3) epidemic that occurred in Havana in 2001 to 2002, severe disease was associated with the infection sequence DENV-1 followed by DENV-3 (DENV-1/DENV-3), while the sequence DENV-2/DENV-3 was associated with mild/asymptomatic infections. To determine the role of the virus in the increasing severity demonstrated during the epidemic, serum samples collected at different time points were studied. A total of 22 full-length sequences were obtained using a deep-sequencing approach. Bayesian phylogenetic analysis of consensus sequences revealed that two DENV-3 lineages were circulating in Havana at that time, both grouped within genotype III. The predominant lineage is closely related to Peruvian and Ecuadorian strains, while the minor lineage is related to Venezuelan strains. According to consensus sequences, relatively few nonsynonymous mutations were observed; only one was fixed during the epidemic at position 4380 in the NS2B gene. Intrahost genetic analysis indicated that a significant minor population was selected and became predominant toward the end of the epidemic. In conclusion, greater variability was detected during the epidemic's progression in terms of significant minority variants, particularly in the nonstructural genes. An increasing trend of genetic diversity toward the end of the epidemic was observed only for synonymous variant allele rates, with higher variability in secondary cases. Remarkably, significant intrahost genetic variation was demonstrated within the same patient during the course of secondary infection with DENV-1/DENV-3, including changes in the structural proteins premembrane (PrM) and envelope (E). Therefore, the dynamic of evolving viral populations in the context of heterotypic antibodies could be related to the increasing clinical severity observed during the epidemic. IMPORTANCE Based on the evidence that DENV fitness is context dependent, our research has focused on the study of viral

  18. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species.

    Science.gov (United States)

    Hirakawa, Hideki; Shirasawa, Kenta; Kosugi, Shunichi; Tashiro, Kosuke; Nakayama, Shinobu; Yamada, Manabu; Kohara, Mistuyo; Watanabe, Akiko; Kishida, Yoshie; Fujishiro, Tsunakazu; Tsuruoka, Hisano; Minami, Chiharu; Sasamoto, Shigemi; Kato, Midori; Nanri, Keiko; Komaki, Akiko; Yanagi, Tomohiro; Guoxin, Qin; Maeda, Fumi; Ishikawa, Masami; Kuhara, Satoru; Sato, Shusei; Tabata, Satoshi; Isobe, Sachiko N

    2014-01-01

    Cultivated strawberry (Fragaria x ananassa) is octoploid and shows allogamous behaviour. The present study aims at dissecting this octoploid genome through comparison with its wild relatives, F. iinumae, F. nipponica, F. nubicola, and F. orientalis by de novo whole-genome sequencing on an Illumina and Roche 454 platforms. The total length of the assembled Illumina genome sequences obtained was 698 Mb for F. x ananassa, and ∼200 Mb each for the four wild species. Subsequently, a virtual reference genome termed FANhybrid_r1.2 was constructed by integrating the sequences of the four homoeologous subgenomes of F. x ananassa, from which heterozygous regions in the Roche 454 and Illumina genome sequences were eliminated. The total length of FANhybrid_r1.2 thus created was 173.2 Mb with the N50 length of 5137 bp. The Illumina-assembled genome sequences of F. x ananassa and the four wild species were then mapped onto the reference genome, along with the previously published F. vesca genome sequence to establish the subgenomic structure of F. x ananassa. The strategy adopted in this study has turned out to be successful in dissecting the genome of octoploid F. x ananassa and appears promising when applied to the analysis of other polyploid plant species. PMID:24282021

  19. RECOGNITION OF CDNA MICROARRAY IMAGE USING FEEDFORWARD ARTIFICIAL NEURAL NETWORK

    Directory of Open Access Journals (Sweden)

    R. M. Farouk

    2014-07-01

    Full Text Available The complementary DNA (cDNA sequence considered th e magic biometric technique for personal identification. Microarray image processing used fo r the concurrent genes identification. In this pape r, we present a new method for cDNA recognition based on the artificial neural network (ANN. We have segmented the location of the spots in a cDNA micro array. Thus, a precise localization and segmenting of a spot are essential to obtain a more exact intensity measurement, leading to a more accurate gene expression measurement. The segmented cDNA microarr ay image resized and used as an input for the proposed artificial neural network. For matching an d recognition, we have trained the artificial neura l network. Recognition results are given for the gall eries of cDNA sequences . The numerical results sho w that, the proposed matching technique is an effecti ve in the cDNA sequences process. The experimental results of our matching approach using different da tabases shows that, the proposed technique is an effective matching performance.

  20. RECOGNITION OF CDNA MICROARRAY IMAGE USING FEEDFORWARD ARTIFICIAL NEURAL NETWORK

    Directory of Open Access Journals (Sweden)

    R. M. Farouk

    2014-09-01

    Full Text Available The complementary DNA (cDNA sequence considered the magic biometric technique for personal identification. Microarray image processing used for the concurrent genes identification. In this paper, we present a new method for cDNA recognition based on the artificial neural network (ANN. We have segmented the location of the spots in a cDNA microarray. Thus, a precise localization and segmenting of a spot are essential to obtain a more exact intensity measurement, leading to a more accurate gene expression measurement. The segmented cDNA microarray image resized and used as an input for the proposed artificial neural network. For matching and recognition, we have trained the artificial neural network. Recognition results are given for the galleries of cDNA sequences . The numerical results show that, the proposed matching technique is an effective in the cDNA sequences process. The experimental results of our matching approach using different databases shows that, the proposed technique is an effective matching performance.

  1. Characterization of microRNAs and their targets in wild barley (Hordeum vulgare subsp. spontaneum) using deep sequencing.

    Science.gov (United States)

    Deng, Pingchuan; Bian, Jianxin; Yue, Hong; Feng, Kewei; Wang, Mengxing; Du, Xianghong; Weining, Song; Nie, Xiaojun

    2016-05-01

    MicroRNAs (miRNA) are a class of small, endogenous RNAs that play a negative regulatory role in various developmental and metabolic processes of plants. Wild barley (Hordeum vulgare subsp. spontaneum), as the progenitor of cultivated barley (Hordeum vulgare subsp. vulgare), has served as a valuable germplasm resource for barley genetic improvement. To survey miRNAs in wild barley, we sequenced the small RNA library prepared from wild barley using the Illumina deep sequencing technology. A total of 70 known miRNAs and 18 putative novel miRNAs were identified. Sequence analysis revealed that all of the miRNAs identified in wild barley contained the highly conserved hairpin sequences found in barley cultivars. MiRNA target predictions showed that 12 out of 52 miRNA families were predicted to target transcription factors, including 8 highly conserved miRNA families in plants and 4 wheat-barley conserved miRNA families. In addition to transcription factors, other predicted target genes were involved in diverse physiological and metabolic processes and stress defense. Our study for the first time reported the large-scale investigation of small RNAs in wild barley, which will provide essential information for understanding the regulatory role of miRNAs in wild barley and also shed light on future practical utilization of miRNAs for barley improvement. PMID:27100818

  2. Characterization and Development of EST-SSRs by Deep Transcriptome Sequencing in Chinese Cabbage (Brassica rapa L. ssp. pekinensis

    Directory of Open Access Journals (Sweden)

    Qian Ding

    2015-01-01

    Full Text Available Simple sequence repeats (SSRs are among the most important markers for population analysis and have been widely used in plant genetic mapping and molecular breeding. Expressed sequence tag-SSR (EST-SSR markers, located in the coding regions, are potentially more efficient for QTL mapping, gene targeting, and marker-assisted breeding. In this study, we investigated 51,694 nonredundant unigenes, assembled from clean reads from deep transcriptome sequencing with a Solexa/Illumina platform, for identification and development of EST-SSRs in Chinese cabbage. In total, 10,420 EST-SSRs with over 12 bp were identified and characterized, among which 2744 EST-SSRs are new and 2317 are known ones showing polymorphism with previously reported SSRs. A total of 7877 PCR primer pairs for 1561 EST-SSR loci were designed, and primer pairs for twenty-four EST-SSRs were selected for primer evaluation. In nineteen EST-SSR loci (79.2%, amplicons were successfully generated with high quality. Seventeen (89.5% showed polymorphism in twenty-four cultivars of Chinese cabbage. The polymorphic alleles of each polymorphic locus were sequenced, and the results showed that most polymorphisms were due to variations of SSR repeat motifs. The EST-SSRs identified and characterized in this study have important implications for developing new tools for genetics and molecular breeding in Chinese cabbage.

  3. Geochemical features and effects on deep-seated fluids during the May-June 2012 southern Po Valley seismic sequence

    Directory of Open Access Journals (Sweden)

    Francesco Italiano

    2012-10-01

    Full Text Available A periodic sampling of the groundwaters and dissolved and free gases in selected deep wells located in the area affected by the May-June 2012 southern Po Valley seismic sequence has provided insight into seismogenic-induced changes of the local aquifer systems. The results obtained show progressive changes in the fluid geochemistry, allowing it to be established that deep-seated fluids were mobilized during the seismic sequence and reached surface layers along faults and fractures, which generated significant geochemical anomalies. The May-June 2012 seismic swarm (mainshock on May 29, 2012, M 5.8; 7 shocks M >5, about 200 events 3 > M > 5 induced several modifications in the circulating fluids. This study reports the preliminary results obtained for the geochemical features of the waters and gases collected over the epicentral area from boreholes drilled at different depths, thus intercepting water and gases with different origins and circulation. The aim of the investigations was to improve our knowledge of the fluids circulating over the seismic area (e.g. origin, provenance, interactions, mixing of different components, temporal changes. This was achieved by collecting samples from both shallow and deep-drilled boreholes, and then, after the selection of the relevant sites, we looked for temporal changes with mid-to-long-term monitoring activity following a constant sampling rate. This allowed us to gain better insight into the relationships between the fluid circulation and the faulting activity. The sampling sites are listed in Table 1, along with the analytical results of the gas phase. […

  4. SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing

    Science.gov (United States)

    Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

    2016-01-01

    Motivation: Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns can be detected called read mapping profiles, which are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. Results: We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5′-end processing and 3′-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. Availability and Implementation: The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA

  5. Mitochondrial genome sequences reveal deep divergences among Anopheles punctulatus sibling species in Papua New Guinea

    Directory of Open Access Journals (Sweden)

    Logue Kyle

    2013-02-01

    Full Text Available Abstract Background Members of the Anopheles punctulatus group (AP group are the primary vectors of human malaria in Papua New Guinea. The AP group includes 13 sibling species, most of them morphologically indistinguishable. Understanding why only certain species are able to transmit malaria requires a better comprehension of their evolutionary history. In particular, understanding relationships and divergence times among Anopheles species may enable assessing how malaria-related traits (e.g. blood feeding behaviours, vector competence have evolved. Methods DNA sequences of 14 mitochondrial (mt genomes from five AP sibling species and two species of the Anopheles dirus complex of Southeast Asia were sequenced. DNA sequences from all concatenated protein coding genes (10,770 bp were then analysed using a Bayesian approach to reconstruct phylogenetic relationships and date the divergence of the AP sibling species. Results Phylogenetic reconstruction using the concatenated DNA sequence of all mitochondrial protein coding genes indicates that the ancestors of the AP group arrived in Papua New Guinea 25 to 54 million years ago and rapidly diverged to form the current sibling species. Conclusion Through evaluation of newly described mt genome sequences, this study has revealed a divergence among members of the AP group in Papua New Guinea that would significantly predate the arrival of humans in this region, 50 thousand years ago. The divergence observed among the mtDNA sequences studied here may have resulted from reproductive isolation during historical changes in sea-level through glacial minima and maxima. This leads to a hypothesis that the AP sibling species have evolved independently for potentially thousands of generations. This suggests that the evolution of many phenotypes, such as insecticide resistance will arise independently in each of the AP sibling species studied here.

  6. Galaxies in SDSS and DEEP2: a quiet life on the blue sequence?

    OpenAIRE

    Blanton, Michael R.

    2005-01-01

    In the six billion years between redshifts z=1 and z=0.1, galaxies change due to the aging of their stellar populations, the formation of new stars, and mergers with other galaxies. Here I explore the relative importance of these various effects, finding that while mergers are likely to be important for the red galaxy sequence they are unlikely to affect more than 10% of the blue galaxy sequence. I compare the galaxy population at redshift z=0.1 from the Sloan Digital Sky Survey to that at z=...

  7. Novel insight into the non-coding repertoire through deep sequencing analysis

    OpenAIRE

    Isakov, Ofer; Ronen, Roy; Kovarsky, Judit; Gabay, Aviram; Gan, Ido; Modai, Shira; Shomron, Noam

    2012-01-01

    Non-coding RNAs (ncRNA) account for a large portion of the transcribed genomic output. This diverse family of untranslated RNA molecules play a crucial role in cellular function. The use of ‘deep sequencing’ technology (also known as ‘next generation sequencing’) to infer transcript expression levels in general, and ncRNA specifically, is becoming increasingly common in molecular and clinical laboratories. We developed a software termed ‘RandA’ (which stands for ncRNA Read-and-Analyze) that p...

  8. Exome and deep sequencing of clinically aggressive neuroblastoma reveal somatic mutations that affect key pathways involved in cancer progression

    Science.gov (United States)

    Lasorsa, Vito Alessandro; Formicola, Daniela; Pignataro, Piero; Cimmino, Flora; Calabrese, Francesco Maria; Mora, Jaume; Esposito, Maria Rosaria; Pantile, Marcella; Zanon, Carlo; De Mariano, Marilena; Longo, Luca; Hogarty, Michael D.; de Torres, Carmen; Tonini, Gian Paolo; Iolascon, Achille; Capasso, Mario

    2016-01-01

    The spectrum of somatic mutation of the most aggressive forms of neuroblastoma is not completely determined. We sought to identify potential cancer drivers in clinically aggressive neuroblastoma. Whole exome sequencing was conducted on 17 germline and tumor DNA samples from high-risk patients with adverse events within 36 months from diagnosis (HR-Event3) to identify somatic mutations and deep targeted sequencing of 134 genes selected from the initial screening in additional 48 germline and tumor pairs (62.5% HR-Event3 and high-risk patients), 17 HR-Event3 tumors and 17 human-derived neuroblastoma cell lines. We revealed 22 significantly mutated genes, many of which implicated in cancer progression. Fifteen genes (68.2%) were highly expressed in neuroblastoma supporting their involvement in the disease. CHD9, a cancer driver gene, was the most significantly altered (4.0% of cases) after ALK. Other genes (PTK2, NAV3, NAV1, FZD1 and ATRX), expressed in neuroblastoma and involved in cell invasion and migration were mutated at frequency ranged from 4% to 2%. Focal adhesion and regulation of actin cytoskeleton pathways, were frequently disrupted (14.1% of cases) thus suggesting potential novel therapeutic strategies to prevent disease progression. Notably BARD1, CHEK2 and AXIN2 were enriched in rare, potentially pathogenic, germline variants. In summary, whole exome and deep targeted sequencing identified novel cancer genes of clinically aggressive neuroblastoma. Our analyses show pathway-level implications of infrequently mutated genes in leading neuroblastoma progression. PMID:27009842

  9. Deep-sequencing revealed Citrus bark cracking viroid (CBCVd) as a highly aggressive pathogen on hop

    Czech Academy of Sciences Publication Activity Database

    Jakše, J.; Radišek, S.; Pokorn, T.; Matoušek, Jaroslav; Javornik, B.

    2015-01-01

    Roč. 64, č. 4 (2015), s. 831-842. ISSN 0032-0862 R&D Projects: GA MŠk(CZ) LH14255 Institutional support: RVO:60077344 Keywords : Bioinformatic * Citrus bark cracking viroid * Hop * Next-generation sequencing Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.121, year: 2014

  10. Reproducibility of Illumina platform deep sequencing errors allows accurate determination of DNA barcodes in cells.

    NARCIS (Netherlands)

    Beltman, J.B.; Urbanus, J.; Velds, A.; Rooij, van N.; Rohr, J.C.; Naik, S.H.; Schumacher, T.N.

    2016-01-01

    BACKGROUND Next generation sequencing (NGS) of amplified DNA is a powerful tool to describe genetic heterogeneity within cell populations that can both be used to investigate the clonal structure of cell populations and to perform genetic lineage tracing. For applications in which both abundant and

  11. Characterisation of the gut microflora in rainbow trout fry (Oncorhynchus mykiss) using deep-sequencing

    DEFF Research Database (Denmark)

    Ingerslev, Hans-Christian; Dalsgaard, Inger; Boye, Mette;

    2012-01-01

    an environmental sample or a tissue / fecal sample from an animal. These metagenomic studies have provided new and deeper insight into the microbial ecology and the influence of the microbiota in warm-blooded animals. In this study next-generation sequencing of the 16S rDNA gene on the Illumina HiSeq platform...

  12. Characterisation of the gut microflora in rainbow trout fry (Oncorhynchus mykiss) using deep-sequencing

    DEFF Research Database (Denmark)

    Ingerslev, Hans-Christian; Dalsgaard, Inger; Boye, Mette;

    2012-01-01

    environmental sample or a tissue / fecal sample from an animal. These metagenomic studies have provided new and deeper insight into the microbial ecology and the influence of the microbiota in warm-blooded animals. In this study next-generation sequencing of the 16S rDNA gene on the Illumina HiSeq platform was...

  13. Identification and profiling of novel microRNAs in the Brassica rapa genome based on small RNA deep sequencing

    Directory of Open Access Journals (Sweden)

    Kim Bumjin

    2012-11-01

    Full Text Available Abstract Background MicroRNAs (miRNAs are one of the functional non-coding small RNAs involved in the epigenetic control of the plant genome. Although plants contain both evolutionary conserved miRNAs and species-specific miRNAs within their genomes, computational methods often only identify evolutionary conserved miRNAs. The recent sequencing of the Brassica rapa genome enables us to identify miRNAs and their putative target genes. In this study, we sought to provide a more comprehensive prediction of B. rapa miRNAs based on high throughput small RNA deep sequencing. Results We sequenced small RNAs from five types of tissue: seedlings, roots, petioles, leaves, and flowers. By analyzing 2.75 million unique reads that mapped to the B. rapa genome, we identified 216 novel and 196 conserved miRNAs that were predicted to target approximately 20% of the genome’s protein coding genes. Quantitative analysis of miRNAs from the five types of tissue revealed that novel miRNAs were expressed in diverse tissues but their expression levels were lower than those of the conserved miRNAs. Comparative analysis of the miRNAs between the B. rapa and Arabidopsis thaliana genomes demonstrated that redundant copies of conserved miRNAs in the B. rapa genome may have been deleted after whole genome triplication. Novel miRNA members seemed to have spontaneously arisen from the B. rapa and A. thaliana genomes, suggesting the species-specific expansion of miRNAs. We have made this data publicly available in a miRNA database of B. rapa called BraMRs. The database allows the user to retrieve miRNA sequences, their expression profiles, and a description of their target genes from the five tissue types investigated here. Conclusions This is the first report to identify novel miRNAs from Brassica crops using genome-wide high throughput techniques. The combination of computational methods and small RNA deep sequencing provides robust predictions of miRNAs in the genome. The

  14. Deep sequencing of Ptilidium (Ptilidiaceae) suggests evolutionary stasis in liverwort plastid genome structure

    OpenAIRE

    Forrest, L. L.; Wickett, N. J.; Cox, C. J.; Goffinet, B

    2011-01-01

    Background and aims – Organellar genome sampling is patchy for non-vascular groups, with the earliest land plants poorly represented; currently only two liverworts, two mosses and one hornwort have sequenced, annotated plastid genomes. This is in part due to methodological difficulties that have hampered attempts to generate plastid genome data from liverworts. In this paper we present a method that overcomes some of the inherent difficulties by circumventing the need for plastid enrichment, ...

  15. Deep sequencing of the oral microbiome reveals signatures of periodontal disease.

    Directory of Open Access Journals (Sweden)

    Bo Liu

    Full Text Available The oral microbiome, the complex ecosystem of microbes inhabiting the human mouth, harbors several thousands of bacterial types. The proliferation of pathogenic bacteria within the mouth gives rise to periodontitis, an inflammatory disease known to also constitute a risk factor for cardiovascular disease. While much is known about individual species associated with pathogenesis, the system-level mechanisms underlying the transition from health to disease are still poorly understood. Through the sequencing of the 16S rRNA gene and of whole community DNA we provide a glimpse at the global genetic, metabolic, and ecological changes associated with periodontitis in 15 subgingival plaque samples, four from each of two periodontitis patients, and the remaining samples from three healthy individuals. We also demonstrate the power of whole-metagenome sequencing approaches in characterizing the genomes of key players in the oral microbiome, including an unculturable TM7 organism. We reveal the disease microbiome to be enriched in virulence factors, and adapted to a parasitic lifestyle that takes advantage of the disrupted host homeostasis. Furthermore, diseased samples share a common structure that was not found in completely healthy samples, suggesting that the disease state may occupy a narrow region within the space of possible configurations of the oral microbiome. Our pilot study demonstrates the power of high-throughput sequencing as a tool for understanding the role of the oral microbiome in periodontal disease. Despite a modest level of sequencing (~2 lanes Illumina 76 bp PE and high human DNA contamination (up to ~90% we were able to partially reconstruct several oral microbes and to preliminarily characterize some systems-level differences between the healthy and diseased oral microbiomes.

  16. Deep sequencing of the oral microbiome reveals signatures of periodontal disease.

    Science.gov (United States)

    Liu, Bo; Faller, Lina L; Klitgord, Niels; Mazumdar, Varun; Ghodsi, Mohammad; Sommer, Daniel D; Gibbons, Theodore R; Treangen, Todd J; Chang, Yi-Chien; Li, Shan; Stine, O Colin; Hasturk, Hatice; Kasif, Simon; Segrè, Daniel; Pop, Mihai; Amar, Salomon

    2012-01-01

    The oral microbiome, the complex ecosystem of microbes inhabiting the human mouth, harbors several thousands of bacterial types. The proliferation of pathogenic bacteria within the mouth gives rise to periodontitis, an inflammatory disease known to also constitute a risk factor for cardiovascular disease. While much is known about individual species associated with pathogenesis, the system-level mechanisms underlying the transition from health to disease are still poorly understood. Through the sequencing of the 16S rRNA gene and of whole community DNA we provide a glimpse at the global genetic, metabolic, and ecological changes associated with periodontitis in 15 subgingival plaque samples, four from each of two periodontitis patients, and the remaining samples from three healthy individuals. We also demonstrate the power of whole-metagenome sequencing approaches in characterizing the genomes of key players in the oral microbiome, including an unculturable TM7 organism. We reveal the disease microbiome to be enriched in virulence factors, and adapted to a parasitic lifestyle that takes advantage of the disrupted host homeostasis. Furthermore, diseased samples share a common structure that was not found in completely healthy samples, suggesting that the disease state may occupy a narrow region within the space of possible configurations of the oral microbiome. Our pilot study demonstrates the power of high-throughput sequencing as a tool for understanding the role of the oral microbiome in periodontal disease. Despite a modest level of sequencing (~2 lanes Illumina 76 bp PE) and high human DNA contamination (up to ~90%) we were able to partially reconstruct several oral microbes and to preliminarily characterize some systems-level differences between the healthy and diseased oral microbiomes. PMID:22675498

  17. Deep Sequencing of HIV-Infected Cells: Insights into Nascent Transcription and Host-Directed Therapy

    OpenAIRE

    Peng, Xinxia; Sova, Pavel; Green, Richard R.; Thomas, Matthew J.; Korth, Marcus J.; Proll, Sean; Xu, Jiabao; Cheng, Yanbing; Yi, Kang; Chen, Li; Peng, Zhiyu; Wang, Jun; Palermo, Robert E.; Katze, Michael G.

    2014-01-01

    Polyadenylated mature mRNAs are the focus of standard transcriptome analyses. However, the profiling of nascent transcripts, which often include nonpolyadenylated RNAs, can unveil novel insights into transcriptional regulation. Here, we separately sequenced total RNAs (Total RNAseq) and mRNAs (mRNAseq) from the same HIV-1-infected human CD4+ T cells. We found that many nonpolyadenylated RNAs were differentially expressed upon HIV-1 infection, and we identified 8 times more differentially expr...

  18. Molecular cloning of growth hormone encoding cDNA of Indian major carps by a modified rapid amplification of cDNA ends strategy

    Indian Academy of Sciences (India)

    T Venugopal; S Mathavan; T J Pandian

    2002-06-01

    A modified rapid amplification of cDNA ends (RACE) strategy has been developed for cloning highly conserved cDNA sequences. Using this modified method, the growth hormone (GH) encoding cDNA sequences of Labeo rohita, Cirrhina mrigala and Catla catla have been cloned, characterized and overexpressed in Escherichia coli. These sequences show 96–98% homology to each other and are about 85% homologous to that of common carp. Besides, an attempt has been made for the first time to describe a 3-D model of the fish GH protein.

  19. Identification of microRNAs and transcript targets in Camelina sativa by deep sequencing and computational methods.

    Directory of Open Access Journals (Sweden)

    Saroj Poudel

    Full Text Available Camelina sativa is an annual oilseed crop that is under intensive development for renewable resources of biofuels and industrial oils. MicroRNAs, or miRNAs, are endogenously encoded small RNAs that play key roles in diverse plant biological processes. Here, we conducted deep sequencing on small RNA libraries prepared from camelina leaves, flower buds and two stages of developing seeds corresponding to initial and peak storage products accumulation. Computational analyses identified 207 known miRNAs belonging to 63 families, as well as 5 novel miRNAs. These miRNAs, especially members of the miRNA families, varied greatly in different tissues and developmental stages. The predicted miRNA target genes are involved in a broad range of physiological functions including lipid metabolism. This report is the first step toward elucidating roles of miRNAs in C. sativa and will provide additional tools to improve this oilseed crop for biofuels and biomaterials.

  20. Deep sequencing of mRNA in CD24− and CD24+ mammary carcinoma Mvt1 cell line

    Directory of Open Access Journals (Sweden)

    Ran Rostoker

    2015-09-01

    Full Text Available CD24 is an anchored cell surface marker that is highly expressed in cancer cells (Lee et al., 2009 and its expression is associated with poorer outcome of cancer patients (Kristiansen et al., 2003. Phenotype comparison between two subpopulations derived from the Mvt1 cell line, CD24− cells (with no CD24 cell surface expression and the CD24+ cells, identified high tumorigenic capacity for the CD24+ cells. In order to reveal the transcripts that support the CD24+ aggressive and invasive phenotype we compared the gene profiles of these two subpopulations. mRNA profiles of CD24− and CD24+ cells were generated by deep sequencing, in triplicate, using an Illumina HiSeq 2500. Here we provide a detailed description of the mRNA-seq analysis from our recent study (Rostoker et al., 2015. The mRNA-seq data have been deposited in the NCBI GEO database (accession number GSE68746.

  1. Identification of microRNAs Involved in the Host Response to Enterovirus 71 Infection by a Deep Sequencing Approach

    Directory of Open Access Journals (Sweden)

    Lunbiao Cui

    2010-01-01

    Full Text Available Role of microRNA (miRNA has been highlighted in pathogen-host interactions recently. To identify cellular miRNAs involved in the host response to enterovirus 71 (EV71 infection, we performed a comprehensive miRNA profiling in EV71-infected Hep2 cells through deep sequencing. 64 miRNAs were found whose expression levels changed for more than 2-fold in response to EV71 infection. Gene ontology analysis revealed that many of these mRNAs play roles in neurological process, immune response, and cell death pathways, which are known to be associated with the extreme virulence of EV71. To our knowledge, this is the first study on host miRNAs expression alteration response to EV71 infection. Our findings supported the hypothesis that certain miRNAs might be essential in the host-pathogen interactions.

  2. Deep sequencing and in silico analyses identify MYB-regulated gene networks and signaling pathways in pancreatic cancer.

    Science.gov (United States)

    Azim, Shafquat; Zubair, Haseeb; Srivastava, Sanjeev K; Bhardwaj, Arun; Zubair, Asif; Ahmad, Aamir; Singh, Seema; Khushman, Moh'd; Singh, Ajay P

    2016-01-01

    We have recently demonstrated that the transcription factor MYB can modulate several cancer-associated phenotypes in pancreatic cancer. In order to understand the molecular basis of these MYB-associated changes, we conducted deep-sequencing of transcriptome of MYB-overexpressing and -silenced pancreatic cancer cells, followed by in silico pathway analysis. We identified significant modulation of 774 genes upon MYB-silencing (p RELA was validated by both qPCR and immunoblotting and they were both shown to be under direct transcriptional control of MYB. These observations were further confirmed in a converse approach wherein MYB was overexpressed ectopically in a MYB-null pancreatic cancer cell line. Our findings thus suggest that MYB potentially regulates growth and genomic stability of pancreatic cancer cells via targeting complex gene networks and signaling pathways. Further in-depth functional studies are warranted to fully understand MYB signaling in pancreatic cancer. PMID:27354262

  3. Identification of microRNAs and transcript targets in Camelina sativa by deep sequencing and computational methods.

    Science.gov (United States)

    Poudel, Saroj; Aryal, Niranjan; Lu, Chaofu

    2015-01-01

    Camelina sativa is an annual oilseed crop that is under intensive development for renewable resources of biofuels and industrial oils. MicroRNAs, or miRNAs, are endogenously encoded small RNAs that play key roles in diverse plant biological processes. Here, we conducted deep sequencing on small RNA libraries prepared from camelina leaves, flower buds and two stages of developing seeds corresponding to initial and peak storage products accumulation. Computational analyses identified 207 known miRNAs belonging to 63 families, as well as 5 novel miRNAs. These miRNAs, especially members of the miRNA families, varied greatly in different tissues and developmental stages. The predicted miRNA target genes are involved in a broad range of physiological functions including lipid metabolism. This report is the first step toward elucidating roles of miRNAs in C. sativa and will provide additional tools to improve this oilseed crop for biofuels and biomaterials. PMID:25826400

  4. Exploring the Gastrointestinal “Nemabiome”: Deep Amplicon Sequencing to Quantify the Species Composition of Parasitic Nematode Communities

    Science.gov (United States)

    Avramenko, Russell W.; Redman, Elizabeth M.; Lewis, Roy; Yazwinski, Thomas A.; Wasmuth, James D.; Gilleard, John S.

    2015-01-01

    Parasitic helminth infections have a considerable impact on global human health as well as animal welfare and production. Although co-infection with multiple parasite species within a host is common, there is a dearth of tools with which to study the composition of these complex parasite communities. Helminth species vary in their pathogenicity, epidemiology and drug sensitivity and the interactions that occur between co-infecting species and their hosts are poorly understood. We describe the first application of deep amplicon sequencing to study parasitic nematode communities as well as introduce the concept of the gastro-intestinal “nemabiome”. The approach is analogous to 16S rDNA deep sequencing used to explore microbial communities, but utilizes the nematode ITS-2 rDNA locus instead. Gastro-intestinal parasites of cattle were used to develop the concept, as this host has many well-defined gastro-intestinal nematode species that commonly occur as complex co-infections. Further, the availability of pure mono-parasite populations from experimentally infected cattle allowed us to prepare mock parasite communities to determine, and correct for, species representation biases in the sequence data. We demonstrate that, once these biases have been corrected, accurate relative quantitation of gastro-intestinal parasitic nematode communities in cattle fecal samples can be achieved. We have validated the accuracy of the method applied to field-samples by comparing the results of detailed morphological examination of L3 larvae populations with those of the sequencing assay. The results illustrate the insights that can be gained into the species composition of parasite communities, using grazing cattle in the mid-west USA as an example. However, both the technical approach and the concept of the ‘nemabiome’ have a wide range of potential applications in human and veterinary medicine. These include investigations of host-parasite and parasite-parasite interactions

  5. Congruent Deep Relationships in the Grape Family (Vitaceae Based on Sequences of Chloroplast Genomes and Mitochondrial Genes via Genome Skimming.

    Directory of Open Access Journals (Sweden)

    Ning Zhang

    Full Text Available Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera. The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina HiSeq 2500 instrument. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs.

  6. Deep sequencing of plant and animal DNA contained within traditional Chinese medicines reveals legality issues and health safety concerns.

    Directory of Open Access Journals (Sweden)

    Megan L Coghlan

    Full Text Available Traditional Chinese medicine (TCM has been practiced for thousands of years, but only within the last few decades has its use become more widespread outside of Asia. Concerns continue to be raised about the efficacy, legality, and safety of many popular complementary alternative medicines, including TCMs. Ingredients of some TCMs are known to include derivatives of endangered, trade-restricted species of plants and animals, and therefore contravene the Convention on International Trade in Endangered Species (CITES legislation. Chromatographic studies have detected the presence of heavy metals and plant toxins within some TCMs, and there are numerous cases of adverse reactions. It is in the interests of both biodiversity conservation and public safety that techniques are developed to screen medicinals like TCMs. Targeting both the p-loop region of the plastid trnL gene and the mitochondrial 16S ribosomal RNA gene, over 49,000 amplicon sequence reads were generated from 15 TCM samples presented in the form of powders, tablets, capsules, bile flakes, and herbal teas. Here we show that second-generation, high-throughput sequencing (HTS of DNA represents an effective means to genetically audit organic ingredients within complex TCMs. Comparison of DNA sequence data to reference databases revealed the presence of 68 different plant families and included genera, such as Ephedra and Asarum, that are potentially toxic. Similarly, animal families were identified that include genera that are classified as vulnerable, endangered, or critically endangered, including Asiatic black bear (Ursus thibetanus and Saiga antelope (Saiga tatarica. Bovidae, Cervidae, and Bufonidae DNA were also detected in many of the TCM samples and were rarely declared on the product packaging. This study demonstrates that deep sequencing via HTS is an efficient and cost-effective way to audit highly processed TCM products and will assist in monitoring their legality and safety

  7. Deep sequencing-based analysis of the Cymbidium ensifolium floral transcriptome.

    Directory of Open Access Journals (Sweden)

    Xiaobai Li

    Full Text Available Cymbidium ensifolium is a Chinese Cymbidium with an elegant shape, beautiful appearance, and a fragrant aroma. C. ensifolium has a long history of cultivation in China and it has excellent commercial value as a potted plant and cut flower. The development of C. ensifolium genomic resources has been delayed because of its large genome size. Taking advantage of technical and cost improvement of RNA-Seq, we extracted total mRNA from flower buds and mature flowers and obtained a total of 9.52 Gb of filtered nucleotides comprising 98,819,349 filtered reads. The filtered reads were assembled into 101,423 isotigs, representing 51,696 genes. Of the 101,423 isotigs, 41,873 were putative homologs of annotated sequences in the public databases, of which 158 were associated with floral development and 119 were associated with flowering. The isotigs were categorized according to their putative functions. In total, 10,212 of the isotigs were assigned into 25 eukaryotic orthologous groups (KOGs, 41,690 into 58 gene ontology (GO terms, and 9,830 into 126 Arabidopsis Kyoto Encyclopedia of Genes and Genomes (KEGG pathways, and 9,539 isotigs into 123 rice pathways. Comparison of the isotigs with those of the two related orchid species P. equestris and C. sinense showed that 17,906 isotigs are unique to C. ensifolium. In addition, a total of 7,936 SSRs and 16,676 putative SNPs were identified. To our knowledge, this transcriptome database is the first major genomic resource for C. ensifolium and the most comprehensive transcriptomic resource for genus Cymbidium. These sequences provide valuable information for understanding the molecular mechanisms of floral development and flowering. Sequences predicted to be unique to C. ensifolium would provide more insights into C. ensifolium gene diversity. The numerous SNPs and SSRs identified in the present study will contribute to marker development for C. ensifolium.

  8. Hemocytome: deep sequencing analysis of mosquito blood cells in Indian malarial vector Anopheles stephensi.

    Science.gov (United States)

    Thomas, Tina; De, Tanwee Das; Sharma, Punita; Lata, Suman; Saraswat, Priyanka; Pandey, Kailash C; Dixit, Rajnikant

    2016-07-10

    Hemocytes are tiny circulating blood cells of insects known to play multiple roles in physiological as well as cellular immune responses. However, the molecular nature of hemocytes in blood feeding insects, especially mosquitoes which transmit several deadly diseases such as malaria, dengue etc. is still limited. Therefore, to know the basic molecular composition of naïve mosquito hemocyte encoded proteins, we sequenced RNA-Seq library and analyzed a total of 13,105,858 Illumina sequencing reads in the mosquito Anopheles stephensi, an urban malarial vector in India. Denovo assembly approach yielded a buildup of 3025 contigs, for molecular and functional annotation. A total of 1829 contigs (48%) could be mapped to the mosquito transcript database, while out of remaining 1196 unmatched contigs, at least 1108 contigs i.e. 40% of total contigs, yielded a significant match to the available draft genome. ImmunoDB analysis predicted a total of 88 putative hemocyte transcripts belonging to 11 immune family proteins. A comprehensive molecular analysis of several unique transcripts including novel LRR, Holotricin, OBP, NiFU, that are involved in immunity, chemo sensing, cell-cell communication, nitrogen fixation/metabolism etc. provides initial evidence that mosquito hemocytes carry unique ability to meet and manage cell specific diverse functions of the mosquito blood. An unexpected observation of abundant transcripts encoding hypothetical proteins with unknown functions indicated that a much of the hemocyte biology remains to be understood. PMID:26915489

  9. Evolutionary Relations of Hexanchiformes Deep-Sea Sharks Elucidated by Whole Mitochondrial Genome Sequences

    Directory of Open Access Journals (Sweden)

    Keiko Tanaka

    2013-01-01

    Full Text Available Hexanchiformes is regarded as a monophyletic taxon, but the morphological and genetic relationships between the five extant species within the order are still uncertain. In this study, we determined the whole mitochondrial DNA (mtDNA sequences of seven sharks including representatives of the five Hexanchiformes, one squaliform, and one carcharhiniform and inferred the phylogenetic relationships among those species and 12 other Chondrichthyes (cartilaginous fishes species for which the complete mitogenome is available. The monophyly of Hexanchiformes and its close relation with all other Squaliformes sharks were strongly supported by likelihood and Bayesian phylogenetic analysis of 13,749 aligned nucleotides of 13 protein coding genes and two rRNA genes that were derived from the whole mDNA sequences of the 19 species. The phylogeny suggested that Hexanchiformes is in the superorder Squalomorphi, Chlamydoselachus anguineus (frilled shark is the sister species to all other Hexanchiformes, and the relations within Hexanchiformes are well resolved as Chlamydoselachus, (Notorynchus, (Heptranchias, (Hexanchus griseus, H. nakamurai. Based on our phylogeny, we discussed evolutionary scenarios of the jaw suspension mechanism and gill slit numbers that are significant features in the sharks.

  10. Deep-sequence profiling of miRNAs and their target prediction in Monotropa hypopitys.

    Science.gov (United States)

    Shchennikova, Anna V; Beletsky, Alexey V; Shulga, Olga A; Mazur, Alexander M; Prokhortchouk, Egor B; Kochieva, Elena Z; Ravin, Nikolay V; Skryabin, Konstantin G

    2016-07-01

    Myco-heterotroph Monotropa hypopitys is a widely spread perennial herb used to study symbiotic interactions and physiological mechanisms underlying the development of non-photosynthetic plant. Here, we performed, for the first time, transcriptome-wide characterization of M. hypopitys miRNA profile using high throughput Illumina sequencing. As a result of small RNA library sequencing and bioinformatic analysis, we identified 55 members belonging to 40 families of known miRNAs and 17 putative novel miRNAs unique for M. hypopitys. Computational screening revealed 206 potential mRNA targets for known miRNAs and 31 potential mRNA targets for novel miRNAs. The predicted target genes were described in Gene Ontology terms and were found to be involved in a broad range of metabolic and regulatory pathways. The identification of novel M. hypopitys-specific miRNAs, some with few target genes and low abundances, suggests their recent evolutionary origin and participation in highly specialized regulatory mechanisms fundamental for non-photosynthetic biology of M. hypopitys. This global analysis of miRNAs and their potential targets in M. hypopitys provides a framework for further investigation of miRNA role in the evolution and establishment of non-photosynthetic myco-heterotrophs. PMID:27097902

  11. Deep sequencing of subseafloor eukaryotic rRNA reveals active Fungi across marine subsurface provinces.

    Directory of Open Access Journals (Sweden)

    William Orsi

    Full Text Available The deep marine subsurface is a vast habitat for microbial life where cells may live on geologic timescales. Because DNA in sediments may be preserved on long timescales, ribosomal RNA (rRNA is suggested to be a proxy for the active fraction of a microbial community in the subsurface. During an investigation of eukaryotic 18S rRNA by amplicon pyrosequencing, unique profiles of Fungi were found across a range of marine subsurface provinces including ridge flanks, continental margins, and abyssal plains. Subseafloor fungal populations exhibit statistically significant correlations with total organic carbon (TOC, nitrate, sulfide, and dissolved inorganic carbon (DIC. These correlations are supported by terminal restriction length polymorphism (TRFLP analyses of fungal rRNA. Geochemical correlations with fungal pyrosequencing and TRFLP data from this geographically broad sample set suggests environmental selection of active Fungi in the marine subsurface. Within the same dataset, ancient rRNA signatures were recovered from plants and diatoms in marine sediments ranging from 0.03 to 2.7 million years old, suggesting that rRNA from some eukaryotic taxa may be much more stable than previously considered in the marine subsurface.

  12. Cloning and bioinformatics analysis of cDNA encoding cattle Smad4 gene

    Institute of Scientific and Technical Information of China (English)

    Xiaohui ZHANG; Shangzhong XU; Xue GAO; Hongyan REN; Jinbao CHEN

    2008-01-01

    The cDNA of cattle Smad4 gene was cloned by RT-PCR, 3' RACE and 5' RACE and got a 3503-bp full-long cDNA sequence. The cloned cattle Smad4 cDNA sequence had been send to GenBank and got an accession number: DQ494856. Cattle Smad4 gene consists of 12 exons and codes 553 amino acids. Cattle Smad4 cDNA shares 99%, 96%, 95%, 91% and 91% similarity in nucleic acid sequences, and 99%, 98%, 98%, 99% and 98% sim-ilarity in amino acid sequences with sheep, pig, human, rat and mouse, respectively. Smad4 cDNA was found in the testes, pancreas, liver, small intestine, ovary, lymph, car-diac muscle, skeleton muscle and thymus gland, which indicated that Smad4 was broadly expressed in cattle.

  13. Genome-wide analysis of SRSF10-regulated alternative splicing by deep sequencing of chicken transcriptome

    Directory of Open Access Journals (Sweden)

    Xuexia Zhou

    2014-12-01

    Full Text Available Splicing factor SRSF10 is known to function as a sequence-specific splicing activator that is capable of regulating alternative splicing both in vitro and in vivo. We recently used an RNA-seq approach coupled with bioinformatics analysis to identify the extensive splicing network regulated by SRSF10 in chicken cells. We found that SRSF10 promoted both exon inclusion and exclusion. Functionally, many of the SRSF10-verified alternative exons are linked to pathways of response to external stimulus. Here we describe in detail the experimental design, bioinformatics analysis and GO/pathway enrichment analysis of SRSF10-regulated genes to correspond with our data in the Gene Expression Omnibus with accession number GSE53354. Our data thus provide a resource for studying regulation of alternative splicing in vivo that underlines biological functions of splicing regulatory proteins in cells.

  14. Identification of Retinopathy of Prematurity Related miRNAs in Hyperoxia-Induced Neonatal Rats by Deep Sequencing

    Directory of Open Access Journals (Sweden)

    Ruibin Zhao

    2014-12-01

    Full Text Available Retinopathy of prematurity (ROP remains a major problem for many preterm infants. MicroRNAs (miRNAs are a class of small noncoding RNAs that regulate gene expression at the posttranscriptional level and have been studied in many diseases. To understand the roles of miRNAs in ROP model rats, we constructed two small RNA libraries from the plasma of hyperoxia-induced rats and normal controls. Sequencing data revealed that 44 down-regulated microRNAs and 22 up-regulated microRNAs from the hyperoxia-induced rats were identified by deep sequencing technology. Some of the differentially expressed miRNAs were confirmed by quantitative reverse transcription-PCR (qRT-PCR. A total of 594 target genes of the differentially expressed microRNAs were identified using a bioinformatics approach. Functional annotation analysis indicated that a number of pathways might be involved in angiogenesis, cell proliferation and cell differentiation, which might be involved in the genesis and development of ROP. The elevated expression level of the vascular endothelial growth factor (VEGF protein in the hyperoxia-induced neonatal rats was also confirmed by enzyme linked immunosorbent assay (ELISA. This study provides some insights into the molecular mechanisms that underlie ROP development, thereby aiding the diagnosis and treatment of this disease.

  15. Preoperative evaluation for volume reduction surgery of pulmonary emphysema using MRI. Usefulness of HASTE (Half-Fourier Single-Shot Turbo SE) sequence during deep respiration

    International Nuclear Information System (INIS)

    Volume reduction surgery has recently been an important surgical procedure for patients with severe pulmonary emphysema. We compared the sagittal and coronal images taken by the HASTE sequence with those obtained by turbo FLASH during deep breathing and with CT images obtained under deep inspiration. Clear images were obtained from both sequences, without cardiac or respiratory motion artifacts. The emphysematous areas were demonstrated as low signal intensity areas, as in CT images. The ratio of signal intensity in the expiratory phase to that in the inspiratory phase was lower than that of volunteers in the HASTE sequence. The HASTE sequence provides useful information about respiratory movement as well as about changes in the pulmonary parenchyma when used for preoperative examination. (author)

  16. Molecular cloning and mammalian expression of human beta 2-glycoprotein I cDNA

    DEFF Research Database (Denmark)

    Kristensen, Torsten; Schousboe, Inger; Boel, Espen;

    1991-01-01

    Human β2-glycoprotein (β2gpI) cDNA was isolated from a liver cDNA library and sequenced. The cDNA encoded a 19-residue hydrophobic signal peptide followed by the mature β2gpI of 326 amino acid residues. In liver and in the hepatoma cell line HepG2 there are two mRNA species of about 1.4 and 4.3 k...

  17. Small RNA and transcriptome deep sequencing proffers insight into floral gene regulation in Rosa cultivars

    Directory of Open Access Journals (Sweden)

    Kim Jungeun

    2012-11-01

    Full Text Available Abstract Background Roses (Rosa sp., which belong to the family Rosaceae, are the most economically important ornamental plants—making up 30% of the floriculture market. However, given high demand for roses, rose breeding programs are limited in molecular resources which can greatly enhance and speed breeding efforts. A better understanding of important genes that contribute to important floral development and desired phenotypes will lead to improved rose cultivars. For this study, we analyzed rose miRNAs and the rose flower transcriptome in order to generate a database to expound upon current knowledge regarding regulation of important floral characteristics. A rose genetic database will enable comprehensive analysis of gene expression and regulation via miRNA among different Rosa cultivars. Results We produced more than 0.5 million reads from expressed sequences, totalling more than 110 million bp. From these, we generated 35,657, 31,434, 34,725, and 39,722 flower unigenes from Rosa hybrid: ‘Vital’, ‘Maroussia’, and ‘Sympathy’ and Rosa rugosa Thunb. , respectively. The unigenes were assigned functional annotations, domains, metabolic pathways, Gene Ontology (GO terms, Plant Ontology (PO terms, and MIPS Functional Catalogue (FunCat terms. Rose flower transcripts were compared with genes from whole genome sequences of Rosaceae members (apple, strawberry, and peach and grape. We also produced approximately 40 million small RNA reads from flower tissue for Rosa, representing 267 unique miRNA tags. Among identified miRNAs, 25 of them were novel and 242 of them were conserved miRNAs. Statistical analyses of miRNA profiles revealed both shared and species-specific miRNAs, which presumably effect flower development and phenotypes. Conclusions In this study, we constructed a Rose miRNA and transcriptome database, and we analyzed the miRNAs and transcriptome generated from the flower tissues of four Rosa cultivars. The database provides a

  18. Deep sequencing of small RNA libraries reveals dynamic regulation of conserved and novel microRNAs and microRNA-stars during silkworm development

    OpenAIRE

    Soulages Jose L; Arrese Estela L; Jiang Haobo; Sumathipala Niranji; Zheng Yun; Jagadeeswaran Guru; Zhang Weixiong; Sunkar Ramanjulu

    2010-01-01

    Abstract Background In eukaryotes, microRNAs (miRNAs) have emerged as critical regulators of gene expression. The Silkworm (Bombyx mori L.) is one of the most suitable lepidopteran insects for studying the molecular aspects of metamorphosis because of its large size, availability of mutants and genome sequence. Besides, this insect also has been amply studied from a physiological and biochemical perspective. Deep sequencing of small RNAs isolated from different stages of silkworm is a powerfu...

  19. Deep sequencing reveals microbiota dysbiosis of tongue coat in patients with liver carcinoma

    Science.gov (United States)

    Lu, Haifeng; Ren, Zhigang; Li, Ang; Zhang, Hua; Jiang, Jianwen; Xu, Shaoyan; Luo, Qixia; Zhou, Kai; Sun, Xiaoli; Zheng, Shusen; Li, Lanjuan

    2016-01-01

    Liver carcinoma (LC) is a common malignancy worldwide, associated with high morbidity and mortality. Characterizing microbiome profiles of tongue coat may provide useful insights and potential diagnostic marker for LC patients. Herein, we are the first time to investigate tongue coat microbiome of LC patients with cirrhosis based on 16S ribosomal RNA (rRNA) gene sequencing. After strict inclusion and exclusion criteria, 35 early LC patients with cirrhosis and 25 matched healthy subjects were enrolled. Microbiome diversity of tongue coat in LC patients was significantly increased shown by Shannon, Simpson and Chao 1 indexes. Microbiome on tongue coat was significantly distinguished LC patients from healthy subjects by principal component analysis. Tongue coat microbial profiles represented 38 operational taxonomic units assigned to 23 different genera, distinguishing LC patients. Linear discriminant analysis (LDA) effect size (LEfSe) reveals significant microbial dysbiosis of tongue coats in LC patients. Strikingly, Oribacterium and Fusobacterium could distinguish LC patients from healthy subjects. LEfSe outputs show microbial gene functions related to categories of nickel/iron_transport, amino_acid_transport, energy produced system and metabolism between LC patients and healthy subjects. These findings firstly identify microbiota dysbiosis of tongue coat in LC patients, may providing novel and non-invasive potential diagnostic biomarker of LC. PMID:27605161

  20. THE CLONING OF HRNT-1 USING A COMBINATION OF cDNA LIBRARY SCREENING WITH BIOTIN-LABELED PROBE AND RAPID AMPLIFICATION OF cDNA ENDS

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Objective: To clone the human counterpart of rat ZA73, EST cloned from rat tracheal epithelial (RTE) neoplastic transformed cell model induced by (a-particles radiation by using mRNA differential display. Methods: According to the sequence of rat ZA73, a probe was biotin-labeled to screen human cDNA library, and then the gene sequence was extended by RACE (rapid amplification of cDNA ends). Result: Human gene HRNT-1 (GenBank Accession Number: AF223393) is 4.256 kb in length, with an ORF located in the region between 254 and 3013 bp. 5' UTS (untranslated sequences) is 253 bp, 3' UTS is 1243 bp. Conclusion: The combination of cDNA library screening with biotin-labeled probes and RACE is an effective method to clone full-length cDNA, especially for sequences longer than 2 kb.

  1. Main: Sequences [KOME

    Lifescience Database Archive (English)

    Full Text Available Sequences Amino Acid Sequence Amino Acid sequence of full length cDNA (Longest ORF) kome_ine_full_sequence..._amino_db.fasta.zip kome_ine_full_sequence_amino_db.zip kome_ine_full_sequence_amino_db ...

  2. Draft Genome Sequence of Pseudoalteromonas sp. Strain XI10 Isolated from the Brine-Seawater Interface of Erba Deep in the Red Sea

    KAUST Repository

    Zhang, Guishan

    2016-03-10

    Pseudoalteromonas sp. strain XI10 was isolated from the brine-seawater interface of Erba Deep in the Red Sea, Saudi Arabia. Here, we present the draft genome sequence of strain XI10, a gammaproteobacterium that synthesizes polysaccharides for biofilm formation when grown in liquid culture.

  3. Molecular cloning and characterization of duck hepatitis virus cDNA

    International Nuclear Information System (INIS)

    The aim was to determine the complete nucleotide sequence of duck hepatitis virus (DHV) and compare this sequence with that of the other picornaviruses. The DHV cDNA has been cloned, which will enable us to demonstrate the genome structure of the DHV RNA and the translation mechanisms of the viral protein. Also, we can use the cDNA probes to detect DHV in clinical samples. DHV cDNA has been cloned with a size of the insert fragments of 0.5-0.7 kb. The cDNA clones were authenticated by northern hybridization. The availability of cloned DHV cDNA should make it possible to deduce RNA and amino acid sequences, both for use as a probe of clinical specimens and as to the presence of DHV genome

  4. Deep Sea Coral voucher sequence dataset - Identification of deep-sea corals collected during the 2009 - 2014 West Coast Groundfish Bottom Trawl Survey

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Data for this project resides in the West Coast Groundfish Bottom Trawl Survey Database. Deep-sea corals are often components of trawling bycatch, though their...

  5. Refining transcriptional programs in kidney development by integration of deep RNA-sequencing and array-based spatial profiling

    Directory of Open Access Journals (Sweden)

    Rumballe Bree A

    2011-09-01

    Full Text Available Abstract Background The developing mouse kidney is currently the best-characterized model of organogenesis at a transcriptional level. Detailed spatial maps have been generated for gene expression profiling combined with systematic in situ screening. These studies, however, fall short of capturing the transcriptional complexity arising from each locus due to the limited scope of microarray-based technology, which is largely based on "gene-centric" models. Results To address this, the polyadenylated RNA and microRNA transcriptomes of the 15.5 dpc mouse kidney were profiled using strand-specific RNA-sequencing (RNA-Seq to a depth sufficient to complement spatial maps from pre-existing microarray datasets. The transcriptional complexity of RNAs arising from mouse RefSeq loci was catalogued; including 3568 alternatively spliced transcripts and 532 uncharacterized alternate 3' UTRs. Antisense expressions for 60% of RefSeq genes was also detected including uncharacterized non-coding transcripts overlapping kidney progenitor markers, Six2 and Sall1, and were validated by section in situ hybridization. Analysis of genes known to be involved in kidney development, particularly during mesenchymal-to-epithelial transition, showed an enrichment of non-coding antisense transcripts extended along protein-coding RNAs. Conclusion The resulting resource further refines the transcriptomic cartography of kidney organogenesis by integrating deep RNA sequencing data with locus-based information from previously published expression atlases. The added resolution of RNA-Seq has provided the basis for a transition from classical gene-centric models of kidney development towards more accurate and detailed "transcript-centric" representations, which highlights the extent of transcriptional complexity of genes that direct complex development events.

  6. Deep sequencing reveals important roles of microRNAs in response to drought and salinity stress in cotton.

    Science.gov (United States)

    Xie, Fuliang; Wang, Qinglian; Sun, Runrun; Zhang, Baohong

    2015-02-01

    Drought and salinity are two major environmental factors adversely affecting plant growth and productivity. However, the regulatory mechanism is unknown. In this study, the potential roles of small regulatory microRNAs (miRNAs) in cotton response to those stresses were investigated. Using next-generation deep sequencing, a total of 337 miRNAs with precursors were identified, comprising 289 known miRNAs and 48 novel miRNAs. Of these miRNAs, 155 miRNAs were expressed differentially. Target prediction, Gene Ontology (GO)-based functional classification, and Kyoto Encyclopedia of Genes and Genomes (KEGG)-based functional enrichment show that these miRNAs might play roles in response to salinity and drought stresses through targeting a series of stress-related genes. Degradome sequencing analysis showed that at least 55 predicted target genes were further validated to be regulated by 60 miRNAs. CitationRank-based literature mining was employed to determinhe the importance of genes related to drought and salinity stress. The NAC, MYB, and MAPK families were ranked top under the context of drought and salinity, indicating their important roles for the plant to combat drought and salinity stress. According to target prediction, a series of cotton miRNAs are associated with these top-ranked genes, including miR164, miR172, miR396, miR1520, miR6158, ghr-n24, ghr-n56, and ghr-n59. Interestingly, 163 cotton miRNAs were also identified to target 210 genes that are important in fibre development. These results will contribute to cotton stress-resistant breeding as well as understanding fibre development. PMID:25371507

  7. Deep Sequencing of the Scutellaria baicalensis Georgi Transcriptome Reveals Flavonoid Biosynthetic Profiling and Organ-Specific Gene Expression.

    Directory of Open Access Journals (Sweden)

    Jinxin Liu

    Full Text Available Scutellaria baicalensis Georgi has long been used in traditional medicine to treat various such widely varying diseases and has been listed in the Chinese Pharmacopeia, the Japanese Pharmacopeia, the Korean Pharmacopoeia and the European Pharmacopoeia. Flavonoids, especially wogonin, wogonoside, baicalin, and baicalein, are its main functional ingredients with various pharmacological activities. Although pharmaological studies for these flavonoid components have been well conducted, the molecular mechanism of their biosynthesis remains unclear in S. baicalensis. In this study, Illumina/Solexa deep sequencing generated more than 91 million paired-end reads and 49,507 unigenes from S. baicalensis roots, stems, leaves and flowers. More than 70% unigenes were annotated in at least one of the five public databases and 13,627 unigenes were assigned to 3,810 KEGG genes involved in 579 different pathways. 54 unigenes that encode 12 key enzymes involved in the pathway of flavonoid biosynthesis were discovered. One baicalinase and three baicalein 7-O-glucuronosyltransferases genes potentially involved in the transformation between baicalin/wogonoside and baicalein/wogonin were identified. Four candidate 6-hydroxylase genes for the formation of baicalin/baicalein and one candidate 8-O-methyltransferase gene for the biosynthesis of wogonoside/wogonin were also recognized. Our results further support the conclusion that, in S. baicalensis, 3,5,7-trihydroxyflavone was the precursor of the four above compounds. Then, the differential expression models and simple sequence repeats associated with these genes were carefully analyzed. All of these results not only enrich the gene resource but also benefit research into the molecular genetics and functional genomics in S. baicalensis.

  8. Characterization and heterospecific expression of cDNA clones of genes in the maize GSH S-transferase multigene family.

    OpenAIRE

    Grove, G; Zarlengo, R P; Timmerman, K P; Li, N Q; Tam, M F; Tu, C P

    1988-01-01

    We have isolated from a constructed lambda gt11 expression library two classes of cDNA clones encoding the entire sequence of the maize GSH S-transferases GST I and GST III. Expression of a full-length GST I cDNA in E. coli resulted in the synthesis of enzymatically active maize GST I that is immunologically indistinguishable from the native GST I. Another GST I cDNA with a truncated N-terminal sequence is also active in heterospecific expression. Our GST III cDNA sequence differs from the ve...

  9. Deep sequencing of pyrethroid-resistant bed bugs reveals multiple mechanisms of resistance within a single population.

    Directory of Open Access Journals (Sweden)

    Zach N Adelman

    Full Text Available A frightening resurgence of bed bug infestations has occurred over the last 10 years in the U.S. and current chemical methods have been inadequate for controlling this pest due to widespread insecticide resistance. Little is known about the mechanisms of resistance present in U.S. bed bug populations, making it extremely difficult to develop intelligent strategies for their control. We have identified bed bugs collected in Richmond, VA which exhibit both kdr-type (L925I and metabolic resistance to pyrethroid insecticides. Using LD(50 bioassays, we determined that resistance ratios for Richmond strain bed bugs were ∼5200-fold to the insecticide deltamethrin. To identify metabolic genes potentially involved in the detoxification of pyrethroids, we performed deep-sequencing of the adult bed bug transcriptome, obtaining more than 2.5 million reads on the 454 titanium platform. Following assembly, analysis of newly identified gene transcripts in both Harlan (susceptible and Richmond (resistant bed bugs revealed several candidate cytochrome P450 and carboxylesterase genes which were significantly over-expressed in the resistant strain, consistent with the idea of increased metabolic resistance. These data will accelerate efforts to understand the biochemical basis for insecticide resistance in bed bugs, and provide molecular markers to assist in the surveillance of metabolic resistance.

  10. Identiifcation of microRNAs in two species of tomato,Solanum lycopersicum and Solanum habrochaites, by deep sequencing

    Institute of Scientific and Technical Information of China (English)

    FAN Shan-shan; LI Qian-nan; GUO Guang-jun; GAO Jian-chang; WANG Xiao-xuan; GUO Yan-mei; John C Snyder; DU Yong-chen

    2015-01-01

    MicroRNAs (miRNAs) are ~21 nucleotide (nt), endogenous RNAs that regulate gene expression in plants. Increasing evidence suggests that miRNAs play an important role in species-speciifc development in plants. However, the detailed miRNA proifle divergence has not been performed among tomato species. In this study, the smal RNA (sRNA) proifles of Solanum lycopersicumcultivar 9706 andSolanum habrochaites species PI 134417 were obtained by deep sequencing. Sixty-three known miRNA families were identiifed from these two species, of which 39 were common. Further miRNA proifle comparison showed that 24 known non-conserved miRNA families were species-speciifc between these two tomato species. In addition, six conserved miRNA families displayed an apparent divergent expression pattern between the two tomato species. Our results suggested that species-speciifc, non-conserved miRNAs and divergent expression of conserved miRNAs might contribute to developmental changes and phenotypic variation between the two tomato species. Twenty new miRNAs were also identiifed inS. lycopersicum. This research signiifcantly increases the number of known miRNA families in tomato and provides the ifrst set of smal RNAs inS. habrochaites. It also suggests that miRNAs have an important role in species-speciifc plant developmental regulation.

  11. Molecular cloning of cDNA for human prothymosin alpha.

    OpenAIRE

    Goodall, G J; Dominguez, F.; Horecker, B L

    1986-01-01

    A cDNA library was constructed from human spleen mRNA and screened for clones containing cDNAs coding for prothymosin alpha. A clone containing a 503-base-pair insert including the entire coding sequence for the translated portion of the mRNA was isolated. The deduced amino acid sequence confirms and completes the partial sequence of human prothymosin alpha determined by protein sequencing methods. The presence of an initiator codon immediately preceding the codon for the NH2-terminal serine ...

  12. Cloning and expression analysis of MBLL cDNA

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    The mbl (muscleblind) gene of Drosophila encodes a nuclear protein which contains two Cys3His motifs. The mutation of mbl gene will disturb the differentiation of all the Drosophila's photoreceptors. Primers have been designed according to human EST086139, which is highly homologous to mbl gene. Human fetal brain cDNA library has been screened and a novel cDNA clone has been obtained. The 2595 bp cDNA, designated MBLL (muscleblind-like), contains an open reading frame which encodes 255 amino acids and has 4 Cys3His motifs (GenBank Acc. AF061261). The amino acids sequence shares high homology to Drosophila's mbl. The Northern blot and RNA dot blot hybridization of 43 human adult tissues and 7 fetal tissues show that MBLL is a widely expressed gene, but the expression amounts differ in these tissues.

  13. A deep sequencing approach to comparatively analyze the transcriptome of lifecycle stages of the filarial worm, Brugia malayi.

    Directory of Open Access Journals (Sweden)

    Young-Jun Choi

    2011-12-01

    Full Text Available BACKGROUND: Developing intervention strategies for the control of parasitic nematodes continues to be a significant challenge. Genomic and post-genomic approaches play an increasingly important role for providing fundamental molecular information about these parasites, thus enhancing basic as well as translational research. Here we report a comprehensive genome-wide survey of the developmental transcriptome of the human filarial parasite Brugia malayi. METHODOLOGY/PRINCIPAL FINDINGS: Using deep sequencing, we profiled the transcriptome of eggs and embryos, immature (≤3 days of age and mature microfilariae (MF, third- and fourth-stage larvae (L3 and L4, and adult male and female worms. Comparative analysis across these stages provided a detailed overview of the molecular repertoires that define and differentiate distinct lifecycle stages of the parasite. Genome-wide assessment of the overall transcriptional variability indicated that the cuticle collagen family and those implicated in molting exhibit noticeably dynamic stage-dependent patterns. Of particular interest was the identification of genes displaying sex-biased or germline-enriched profiles due to their potential involvement in reproductive processes. The study also revealed discrete transcriptional changes during larval development, namely those accompanying the maturation of MF and the L3 to L4 transition that are vital in establishing successful infection in mosquito vectors and vertebrate hosts, respectively. CONCLUSIONS/SIGNIFICANCE: Characterization of the transcriptional program of the parasite's lifecycle is an important step toward understanding the developmental processes required for the infectious cycle. We find that the transcriptional program has a number of stage-specific pathways activated during worm development. In addition to advancing our understanding of transcriptome dynamics, these data will aid in the study of genome structure and organization by facilitating

  14. RNA deep sequencing reveals novel candidate genes and polymorphisms in boar testis and liver tissues with divergent androstenone levels.

    Directory of Open Access Journals (Sweden)

    Asep Gunawan

    Full Text Available Boar taint is an unpleasant smell and taste of pork meat derived from some entire male pigs. The main causes of boar taint are the two compounds androstenone (5α-androst-16-en-3-one and skatole (3-methylindole. It is crucial to understand the genetic mechanism of boar taint to select pigs for lower androstenone levels and thus reduce boar taint. The aim of the present study was to investigate transcriptome differences in boar testis and liver tissues with divergent androstenone levels using RNA deep sequencing (RNA-Seq. The total number of reads produced for each testis and liver sample ranged from 13,221,550 to 33,206,723 and 12,755,487 to 46,050,468, respectively. In testis samples 46 genes were differentially regulated whereas 25 genes showed differential expression in the liver. The fold change values ranged from -4.68 to 2.90 in testis samples and -2.86 to 3.89 in liver samples. Differentially regulated genes in high androstenone testis and liver samples were enriched in metabolic processes such as lipid metabolism, small molecule biochemistry and molecular transport. This study provides evidence for transcriptome profile and gene polymorphisms of boars with divergent androstenone level using RNA-Seq technology. Digital gene expression analysis identified candidate genes in flavin monooxygenease family, cytochrome P450 family and hydroxysteroid dehydrogenase family. Moreover, polymorphism and association analysis revealed mutation in IRG6, MX1, IFIT2, CYP7A1, FMO5 and KRT18 genes could be potential candidate markers for androstenone levels in boars. Further studies are required for proving the role of candidate genes to be used in genomic selection against boar taint in pig breeding programs.

  15. Ultra-deep T cell receptor sequencing reveals the complexity and intratumour heterogeneity of T cell clones in renal cell carcinomas.

    Science.gov (United States)

    Gerlinger, Marco; Quezada, Sergio A; Peggs, Karl S; Furness, Andrew J S; Fisher, Rosalie; Marafioti, Teresa; Shende, Vishvesh H; McGranahan, Nicholas; Rowan, Andrew J; Hazell, Steven; Hamm, David; Robins, Harlan S; Pickering, Lisa; Gore, Martin; Nicol, David L; Larkin, James; Swanton, Charles

    2013-12-01

    The recognition of cancer cells by T cells can impact upon prognosis and be exploited for immunotherapeutic approaches. This recognition depends on the specific interaction between antigens displayed on the surface of cancer cells and the T cell receptor (TCR), which is generated by somatic rearrangements of TCR α- and β-chains (TCRb). Our aim was to assess whether ultra-deep sequencing of the rearranged TCRb in DNA extracted from unfractionated clear cell renal cell carcinoma (ccRCC) samples can provide insights into the clonality and heterogeneity of intratumoural T cells in ccRCCs, a tumour type that can display extensive genetic intratumour heterogeneity (ITH). For this purpose, DNA was extracted from two to four tumour regions from each of four primary ccRCCs and was analysed by ultra-deep TCR sequencing. In parallel, tumour infiltration by CD4, CD8 and Foxp3 regulatory T cells was evaluated by immunohistochemistry and correlated with TCR-sequencing data. A polyclonal T cell repertoire with 367-16 289 (median 2394) unique TCRb sequences was identified per tumour region. The frequencies of the 100 most abundant T cell clones/tumour were poorly correlated between most regions (Pearson correlation coefficient, -0.218 to 0.465). 3-93% of these T cell clones were not detectable across all regions. Thus, the clonal composition of T cell populations can be heterogeneous across different regions of the same ccRCC. T cell ITH was higher in tumours pretreated with an mTOR inhibitor, which could suggest that therapy can influence adaptive tumour immunity. These data show that ultra-deep TCR-sequencing technology can be applied directly to DNA extracted from unfractionated tumour samples, allowing novel insights into the clonality of T cell populations in cancers. These were polyclonal and displayed ITH in ccRCC. TCRb sequencing may shed light on mechanisms of cancer immunity and the efficacy of immunotherapy approaches. PMID:24122851

  16. Construction and analysis of full-lengh and normalized cDNA libraries from citrus

    OpenAIRE

    Marqués, M.Carmen; Pérez-Amador, Miguel A.

    2012-01-01

    We have developed an integrated method to generate a normalized cDNA collection enriched in full-length and rare transcripts from citrus, using different species and multiple tissues and developmental stages. Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. In this regard, the availability of full-length cDNA clones facilitates functional an...

  17. Ultra-deep T cell receptor sequencing reveals the complexity and intratumour heterogeneity of T cell clones in renal cell carcinomas

    OpenAIRE

    Gerlinger, Marco; Quezada, Sergio A; Peggs, Karl S.; Furness, Andrew JS; Fisher, Rosalie; Marafioti, Teresa; Shende, Vishvesh H.; McGranahan, Nicholas; Rowan, Andrew J.; Hazell, Steven; Hamm, David; Robins, Harlan S; Pickering, Lisa; Gore, Martin; Nicol, David L.

    2013-01-01

    The recognition of cancer cells by T cells can impact upon prognosis and be exploited for immunotherapeutic approaches. This recognition depends on the specific interaction between antigens displayed on the surface of cancer cells and the T cell receptor (TCR), which is generated by somatic rearrangements of TCR α- and β-chains (TCRb). Our aim was to assess whether ultra-deep sequencing of the rearranged TCRb in DNA extracted from unfractionated clear cell renal cell carcinoma (ccRCC) samples...

  18. Complete Genome Sequence of the d-Amino Acid Catabolism Bacterium Phaeobacter sp. Strain JL2886, Isolated from Deep Seawater of the South China Sea.

    Science.gov (United States)

    Fu, Yingnan; Wang, Rui; Zhang, Zilian; Jiao, Nianzhi

    2016-01-01

    Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea, can catabolize d-amino acids. Here, we report the complete genome sequence of Phaeobacter sp. JL2886. It comprises ~4.06 Mbp, with a G+C content of 61.52%. A total of 3,913 protein-coding genes and 10 genes related to d-amino acid catabolism were obtained. PMID:27587825

  19. Complete Genome Sequence of the Hyperthermophilic Archaeon Pyrococcus sp. Strain ST04, Isolated from a Deep-Sea Hydrothermal Sulfide Chimney on the Juan de Fuca Ridge

    OpenAIRE

    Jung, Jong-Hyun; Lee, Ju-Hoon; Holden, James F.; Seo, Dong-Ho; Shin, Hakdong; Kim, Hae-Yeong; Kim, Wooki; Ryu, Sangryeol; Park, Cheon-Seok

    2012-01-01

    Pyrococcus sp. strain ST04 is a hyperthermophilic, anaerobic, and heterotrophic archaeon isolated from a deep-sea hydrothermal sulfide chimney on the Endeavour Segment of the Juan de Fuca Ridge in the northeastern Pacific Ocean. To further understand the distinct characteristics of this archaeon at the genome level (polysaccharide utilization at high temperature and ATP generation by a Na+ gradient), the genome of strain ST04 was completely sequenced and analyzed. Here, we present the complet...

  20. Analysis of TSC cortical tubers by deep sequencing of TSC1, TSC2, and KRAS demonstrates that small second hit mutations in these genes are rare events

    OpenAIRE

    Qin, Wei; Chan, Jennifer A.; Vinters, Harry V.; Mathern, Gary W.; Franz, David Neal; Taillon, Bruce E.; Bouffard, Pascal; Kwiatkowski, David J.

    2010-01-01

    Tuberous sclerosis complex (TSC) is an often severe neurocutaneous syndrome. Cortical tubers are the predominant neuropathological finding in TSC, and their number and location has been shown to correlate roughly with the severity of neurologic features in TSC. Past studies have shown that genomic deletion events in TSC1 or TSC2 are very rare in tubers, and suggested the potential involvement of the MAPK pathway in their pathogenesis. We used deep sequencing to assess all coding exons of TSC1...

  1. deepBase v2.0: identification, expression, evolution and function of small RNAs, LncRNAs and circular RNAs from deep-sequencing data.

    Science.gov (United States)

    Zheng, Ling-Ling; Li, Jun-Hao; Wu, Jie; Sun, Wen-Ju; Liu, Shun; Wang, Ze-Lin; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2016-01-01

    Small non-coding RNAs (e.g. miRNAs) and long non-coding RNAs (e.g. lincRNAs and circRNAs) are emerging as key regulators of various cellular processes. However, only a very small fraction of these enigmatic RNAs have been well functionally characterized. In this study, we describe deepBase v2.0 (http://biocenter.sysu.edu.cn/deepBase/), an updated platform, to decode evolution, expression patterns and functions of diverse ncRNAs across 19 species. deepBase v2.0 has been updated to provide the most comprehensive collection of ncRNA-derived small RNAs generated from 588 sRNA-Seq datasets. Moreover, we developed a pipeline named lncSeeker to identify 176 680 high-confidence lncRNAs from 14 species. Temporal and spatial expression patterns of various ncRNAs were profiled. We identified approximately 24 280 primate-specific, 5193 rodent-specific lncRNAs, and 55 highly conserved lncRNA orthologs between human and zebrafish. We annotated 14 867 human circRNAs, 1260 of which are orthologous to mouse circRNAs. By combining expression profiles and functional genomic annotations, we developed lncFunction web-server to predict the function of lncRNAs based on protein-lncRNA co-expression networks. This study is expected to provide considerable resources to facilitate future experimental studies and to uncover ncRNA functions. PMID:26590255

  2. Isolation of full-length putative rat lysophospholipase cDNA using improved methods for mRNA isolation and cDNA cloning

    International Nuclear Information System (INIS)

    The authors have cloned a full-length putative rat pancreatic lysophospholipase cDNA by an improved mRNA isolation method and cDNA cloning strategy using [32P]-labelled nucleotides. These new methods allow the construction of a cDNA library from the adult rat pancreas in which the majority of recombinant clones contained complete sequences for the corresponding mRNAs. A previously recognized but unidentified long and relatively rare cDNA clone containing the entire sequence from the cap site at the 5' end to the poly(A) tail at the 3' end of the mRNA was isolated by single-step screening of the library. The size, amino acid composition, and the activity of the protein expressed in heterologous cells strongly suggest this mRNA codes for lysophospholipase

  3. Construction and characterization of a normalized whole-life-cycle cDNA library of rice

    Institute of Scientific and Technical Information of China (English)

    2003-01-01

    A cDNA library with genomic complete coverage is a powerful tool for functional genomic studies. For studying the functions of rice genes on a large scale, a normalized whole-life-cycle cDNA library is constructed based on the strategy of saturation hybridization with genomic DNA using rice cultivar Minghui 63, an elite restorer line for a number of rice hybrids that are widely cultivated in China. This library consists of cDNA from 15 directionally cloned cDNA libraries constructed with different tissues from 9 developmental stages. For normalization, the denatured plasmids purified from the 15 directionally cloned libraries are mixed and hybridized with saturated genomic DNA labeled with magnetic beads in two complementary systems. Well-matched plasmids are captured from the hybridized genomic DNA and electroporated into competent DH10B E. coli for construction of the normalized whole-life-cycle cDNA library. This library consists of 62000 clones with an average insert length about 1.4 kb. Inverse Northern blotting shows that this cDNA library included many rarely expressed genes and tissue-specific genes. Sequencing of 10750 cDNA clones of this library reveals 6399 unique ESTs (expressed sequence tags), indicating that the non-redundancy of the library is about 59.5%. This library has been used to make cDNA microarrays for functional genomic studies.

  4. Cloning and sequence analysis of para sodium channel cDNA fragment from silkworm, Bombyx mori%家蚕Para钠通道cDNA片段克隆与序列分析

    Institute of Scientific and Technical Information of China (English)

    何琳; 刘丽花; 汪洋

    2011-01-01

    Previous studies have revealed that a point mutation of a target gene is related to insecticide resistance to pyrethroids. The para sodium channel in the insect central nervous system is the target of pyrethroid insecticides. We used the RT-PCR method to clone the para sodium ion channel in the silkworm, Bombyx mori L. (GenBank No. EF521818).The full length of this cDNA fragment is 4 882 base pairs and its partial ORF is 3 986 bp translated into 1 328 amino acids. BLAST analysis demonstrated that the cloned cDNA fragment is virtually identical to the para sodium channel a subunit gene amplified from other insects. Amino acid homology of the cloned fragment with para sodium channel a subunit genes from Heliothis virescens Fabricius, Aedes aegypti L. , Blattella germanica L. , Drosophila melanogaster Meigen and Musca domestica L. was 95%, 82%, 80%, 79% and 77% respectively.%昆虫神经系统para型钠离子通道是拟除虫菊酯类杀虫剂的主要靶标,已有的研究表明钠离子通道基因发生点突变与昆虫对菊酯类杀虫剂的抗性密切相关.本文通过RT-PCR方法克隆获得了编码家蚕Bombyx mori L.钠离子通道的cDNA片段(GenBank No.EF521818),该片段全长4 882 bp,部分ORF包含3 986 bp核苷酸,翻译成1 328个氨基酸.蛋白序列分析表明,PCR扩增获得的家蚕钠离子通道eDNA片段所编码的氨基酸与其他昆虫的para型钠离子通道α亚基的氨基酸具有很高的同源相似性,与棉铃虫Heliothis virescens Fabricius、埃及伊蚊Aedes aegypti L.、德国小蠊Blattella germanica L.、果蝇Drosophila melanogaster Meigen和家蝇Musca domestica L.的相似性分别为95%、82%、80%、79%、77%.

  5. 酵母双杂交筛选胎肾上腺cDNA文库中HNP-1结合蛋白%SCREENING THE GENE SEQUENCES OF THE INTERACTION PROTEINS OF HNP-1 FROM HUMAN FETAL ADRENAL CDNA LIBRARY BY YEAST TWO HYBRID SYSTEM

    Institute of Scientific and Technical Information of China (English)

    杜润滋; 邓璐霞; 黄宁; 罗朝志

    2011-01-01

    [Objective] To screen proteins binding with α-defensin (HNP-1) mature peptide from placenta cDNA libraty by yeast two hybrid technique. [Methods] The cDNA fragment encoding HNP-1 mature peptide was amplified by polymera-sechain reaction (PCR) and constructed into pGBK-T7 vector as the bait plasmid in yeast two hybrid system 3. Subsequently , the RNA from fetal adrenal gland was obtained and then transformed into cDNA library using SMART technology. The fetal adrenal cDNA library was screened with pGBKT7-HNP-1 as bait plasmid by yeast-two hybrid system Matchmaker Lexa. Finally, the positive clone was obtained by PCR and then identified by sequence. Then the interaction between them was determined by GST pull down in vitro and coimmunoprecipitates experiments in vivo. [Results] Bait and cDNA library have been constructed successfully and transformed into yeast. Then the interaction protein was found-melanocortin 2 receptor (ACTHR), CCAAT-enhancer-binding proteins (C/EBP), Tramembrane trafficking protein (TMP21), low density lipoprotein receptor-related protein 6 (LRP6). Therefore, melanocortin 2 receptor (ACTH-R) was determined into the major subjects. And bands which can demonstrate the relationship between HNP-1 and ACTH-R was obtained in GST pull down and coimmunoprecipitates experiments. [Conclusion] ACTH-R can bind to HNP-1 we obtained from fetal adrenal cDNA library and it may play important roles in the function of HNP-1 mature peptide.%[目的]筛选胎肾上腺cDNA文库中与α防御素HNP-1成熟肽具有相互作用的蛋白分子.[方法]通过聚合酶链反应(PCR)成功获得HNP-1成熟肽基因插入酵母表达载体pGBK-T7中构建诱饵质粒,同时提取胎肾上腺RNA,SMART技术制备人胎肾上腺cDNA文库,并采用Matchmaker LexA酵母双杂交系统从胎肾上腺cDNA文库中筛选与HNP-1成熟肽相互作用的蛋白.最后通过PCR筛选获得阳性克隆并测序,而后经回转实验,GST pull down以及免疫共沉淀再次验证

  6. Cloning and sequence analysis of HMG-CoA reductase full-length cDNA from tea (Camellia sinensis)%茶树 HMG-CoA 还原酶基因全长 cDNA 克隆及序列分析

    Institute of Scientific and Technical Information of China (English)

    韩兴杰; 徐玲玲; 廖亮; 李同建; 邓辉胜; 樊启水; 徐小青

    2015-01-01

    香气是茶叶的重要品质之一,萜类物质不仅香气好,而且沸点普遍较高,是构成茶叶香气的重要物质基础,决定着茶叶的香气品质,也可作为茶叶香型划分的依据.在植物中,倍半萜、多萜醇等通过胞质中的甲瓦龙酸(MVA)途径合成.HMG-CoA 还原酶(HMGR)催化 HMG-CoA(3-羟基-3-甲基戊二酸单酰辅酶 A)生成甲瓦龙酸,是依赖 MVA 萜类合成途径的关键限速反应.为了有助于理解茶树萜类合成的分子遗传机制,通过 RACE-PCR 方法从茶树中克隆了一个编码 HMG-CoA 还原酶的 cDNA 全长序列(命名为 CsHMGR 1),该序列由1979 bp 组成,包含一个1722 bp 的完整开放阅读框,编码573个氨基酸.其推定的编码蛋白与橡胶树、旱莲木、人参、荔枝、西洋参、丹参、罗汉果及龙眼的同源蛋白具有80%~82%的序列一致性.利用 CsH-MGR1和其它物种 HMGR 同源蛋白的催化区域构建系统发育树,表明其属于真核生物 I 类 HMGR 家族.结构分析表明,CsHMGR1含有两个跨膜区,推测其与其它真核生物同源蛋白类似地定位于内质网上;含有两个HMG-CoA 结合位点、两个 NADPH 结合位点、四个保守的催化活性残基及一个磷酸化位点,说明磷酸化/去磷酸化很可能也是其活性调节的重要方式.表达分析表明,CsHMGR 1在“大叶龙”叶芽、母株叶芽及花芽都有较强的表达.其表达调控及生理活性对茶叶品质可能有重要影响,并在其功能解析的基础上,有可能作为茶叶品质鉴定及育种的一个依据.%Aroma is an important character of tea,and terpenoids,with preferable fragrance and relatively high boiling point,are substantial substances contributing to tea aroma,which not only determine the aroma quality of tea,but also provide a classification basis of scent types.In plants,sesquiterpenes and dolichol and so forth are synthesize via the cytoplasmic mevalonate-dependent pathway.The enzyme HMG-CoA reductase (HMGR

  7. 茶树 HMG-CoA 还原酶基因全长 cDNA 克隆及序列分析%Cloning and sequence analysis of HMG-CoA reductase full-length cDNA from tea (Camellia sinensis)

    Institute of Scientific and Technical Information of China (English)

    韩兴杰; 徐玲玲; 廖亮; 李同建; 邓辉胜; 樊启水; 徐小青

    2015-01-01

    香气是茶叶的重要品质之一,萜类物质不仅香气好,而且沸点普遍较高,是构成茶叶香气的重要物质基础,决定着茶叶的香气品质,也可作为茶叶香型划分的依据.在植物中,倍半萜、多萜醇等通过胞质中的甲瓦龙酸(MVA)途径合成.HMG-CoA 还原酶(HMGR)催化 HMG-CoA(3-羟基-3-甲基戊二酸单酰辅酶 A)生成甲瓦龙酸,是依赖 MVA 萜类合成途径的关键限速反应.为了有助于理解茶树萜类合成的分子遗传机制,通过 RACE-PCR 方法从茶树中克隆了一个编码 HMG-CoA 还原酶的 cDNA 全长序列(命名为 CsHMGR 1),该序列由1979 bp 组成,包含一个1722 bp 的完整开放阅读框,编码573个氨基酸.其推定的编码蛋白与橡胶树、旱莲木、人参、荔枝、西洋参、丹参、罗汉果及龙眼的同源蛋白具有80%~82%的序列一致性.利用 CsH-MGR1和其它物种 HMGR 同源蛋白的催化区域构建系统发育树,表明其属于真核生物 I 类 HMGR 家族.结构分析表明,CsHMGR1含有两个跨膜区,推测其与其它真核生物同源蛋白类似地定位于内质网上;含有两个HMG-CoA 结合位点、两个 NADPH 结合位点、四个保守的催化活性残基及一个磷酸化位点,说明磷酸化/去磷酸化很可能也是其活性调节的重要方式.表达分析表明,CsHMGR 1在“大叶龙”叶芽、母株叶芽及花芽都有较强的表达.其表达调控及生理活性对茶叶品质可能有重要影响,并在其功能解析的基础上,有可能作为茶叶品质鉴定及育种的一个依据.%Aroma is an important character of tea,and terpenoids,with preferable fragrance and relatively high boiling point,are substantial substances contributing to tea aroma,which not only determine the aroma quality of tea,but also provide a classification basis of scent types.In plants,sesquiterpenes and dolichol and so forth are synthesize via the cytoplasmic mevalonate-dependent pathway.The enzyme HMG-CoA reductase (HMGR

  8. Deep Sequencing Reveals the Complete Genome and Evidence for Transcriptional Activity of the First Virus-Like Sequences Identified in Aristotelia chilensis (Maqui Berry)

    OpenAIRE

    Javier Villacreses; Marcelo Rojas-Herrera; Carolina Sánchez; Nicole Hewstone; Undurraga, Soledad F.; Juan F Alzate; Patricio Manque; Vinicius Maracaja-Coutinho; Victor Polanco

    2015-01-01

    Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence as Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs): ORFs 1 and 2 shares 66%–73% amino ac...

  9. Random rapid amplification of cDNA ends (RRACE) allows for cloning of multiple novel human cDNA fragments containing (CAG)n repeats.

    Science.gov (United States)

    Carney, J P; McKnight, C; VanEpps, S; Kelley, M R

    1995-04-01

    We describe a new technique for isolating cDNA fragments in which (i) either a partial sequence of the cDNA is known or (ii) a repeat sequence is utilized. We have used this technique, termed random rapid amplification of cDNA ends (random RACE), to isolate a number of trinucleotide repeat (CAG)n-containing genes. Using the random RACE (RRACE) technique, we have isolated over a hundred (CAG)n-containing genes. The results of our initial analysis of ten clones indicate that three are identical to previously cloned (CAG)n-containing genes. Three of our clones matched with expressed sequence tags, one of which contained a CA repeat. The remaining four clones did not match with any sequence in GenBank. These results indicate that this approach provides a rapid and efficient method for isolating trinucleotide repeat-containing cDNA fragments. Finally, this technique may be used for purposes other than cloning repeat-containing cDNA fragments. If only a partial sequence of a gene is known, our system, described here, provides a rapid and efficient method for isolating a fragment of the gene of interest. PMID:7536696

  10. Genetic instability of Japanese encephalitis virus cDNA clones propagated in Escherichia coli.

    Science.gov (United States)

    Zheng, Xuchen; Tong, Wu; Liu, Fei; Liang, Chao; Gao, Fei; Li, Guoxin; Tong, Guangzhi; Zheng, Hao

    2016-04-01

    The genetic instability of Flavivirus cDNA clones in transformed bacteria is a common phenomenon. Herein, a cDNA fragment of the nucleotide (nt) 1-2913 of the genome of a flavivirus, Japanese encephalitis virus (JEV), was used to investigate factors that caused the instability of cDNA clones. Several cDNA fragments with different 5'- or 3'-termini of the 2913-nt cDNA were obtained by PCR amplification or restriction enzyme digestion and cloned into a pCR-Blunt II-TOPO vector. All the cDNA fragments were stably propagated at 25 °C. However, the 5'-untranslated region and half of the 3'-E gene could cause the instability of the 2913-nt cDNA at 37 °C. The 5'-terminus sequences of the 2913-nt fragment were subjected to testing of the prokaryotic promoter activity by luciferase assay and Western blot. The sequences of 54-120 nt of the JEV genome exhibited high prokaryotic promoter activity at 37 °C, and the activity declined markedly at 25 °C. These findings revealed that the high prokaryotic promoter activity of the 54-120 nt sequences of the JEV genome together with expression of JEV structural genes determined the instability of a JEV cDNA clone. Growth at room temperature may reduce the prokaryotic promoter activity of 5'-sequences of the JEV genome and could represent an effective way to improve the stability of flavivirus cDNA clones in host bacteria. PMID:26888374

  11. Transcriptional Slippage and RNA Editing Increase the Diversity of Transcripts in Chloroplasts: Insight from Deep Sequencing of Vigna radiata Genome and Transcriptome.

    Directory of Open Access Journals (Sweden)

    Ching-Ping Lin

    Full Text Available We performed deep sequencing of the nuclear and organellar genomes of three mungbean genotypes: Vigna radiata ssp. sublobata TC1966, V. radiata var. radiata NM92 and the recombinant inbred line RIL59 derived from a cross between TC1966 and NM92. Moreover, we performed deep sequencing of the RIL59 transcriptome to investigate transcript variability. The mungbean chloroplast genome has a quadripartite structure including a pair of inverted repeats separated by two single copy regions. A total of 213 simple sequence repeats were identified in the chloroplast genomes of NM92 and RIL59; 78 single nucleotide variants and nine indels were discovered in comparing the chloroplast genomes of TC1966 and NM92. Analysis of the mungbean chloroplast transcriptome revealed mRNAs that were affected by transcriptional slippage and RNA editing. Transcriptional slippage frequency was positively correlated with the length of simple sequence repeats of the mungbean chloroplast genome (R2=0.9911. In total, 41 C-to-U editing sites were found in 23 chloroplast genes and in one intergenic spacer. No editing site that swapped U to C was found. A combination of bioinformatics and experimental methods revealed that the plastid-encoded RNA polymerase-transcribed genes psbF and ndhA are affected by transcriptional slippage in mungbean and in main lineages of land plants, including three dicots (Glycine max, Brassica rapa, and Nicotiana tabacum, two monocots (Oryza sativa and Zea mays, two gymnosperms (Pinus taeda and Ginkgo biloba and one moss (Physcomitrella patens. Transcript analysis of the rps2 gene showed that transcriptional slippage could affect transcripts at single sequence repeat regions with poly-A runs. It showed that transcriptional slippage together with incomplete RNA editing may cause sequence diversity of transcripts in chloroplasts of land plants.

  12. Deep-sequencing method for quantifying background abundances of symbiodinium types: exploring the rare symbiodinium biosphere in reef-building corals.

    Directory of Open Access Journals (Sweden)

    Kate M Quigley

    Full Text Available The capacity of reef-building corals to associate with environmentally-appropriate types of endosymbionts from the dinoflagellate genus Symbiodinium contributes significantly to their success at local scales. Additionally, some corals are able to acclimatize to environmental perturbations by shuffling the relative proportions of different Symbiodinium types hosted. Understanding the dynamics of these symbioses requires a sensitive and quantitative method of Symbiodinium genotyping. Electrophoresis methods, still widely utilized for this purpose, are predominantly qualitative and cannot guarantee detection of a background type below 10% of the total Symbiodinium population. Here, the relative abundances of four Symbiodinium types (A13, C1, C3, and D1 in mixed samples of known composition were quantified using deep sequencing of the internal transcribed spacer of the ribosomal RNA gene (ITS-2 by means of Next Generation Sequencing (NGS using Roche 454. In samples dominated by each of the four Symbiodinium types tested, background levels of the other three types were detected when present at 5%, 1%, and 0.1% levels, and their relative abundances were quantified with high (A13, C1, D1 to variable (C3 accuracy. The potential of this deep sequencing method for resolving fine-scale genetic diversity within a symbiont type was further demonstrated in a natural symbiosis using ITS-1, and uncovered reef-specific differences in the composition of Symbiodinium microadriaticum in two species of acroporid corals (Acropora digitifera and A. hyacinthus from Palau. The ability of deep sequencing of the ITS locus (1 and 2 to detect and quantify low-abundant Symbiodinium types, as well as finer-scale diversity below the type level, will enable more robust quantification of local genetic diversity in Symbiodinium populations. This method will help to elucidate the role that background types have in maximizing coral fitness across diverse environments and in

  13. Quantitative identification of mutant alleles derived from lung cancer in plasma cell-free DNA via anomaly detection using deep sequencing data.

    Directory of Open Access Journals (Sweden)

    Yoji Kukita

    Full Text Available The detection of rare mutants using next generation sequencing has considerable potential for diagnostic applications. Detecting circulating tumor DNA is the foremost application of this approach. The major obstacle to its use is the high read error rate of next-generation sequencers. Rather than increasing the accuracy of final sequences, we detected rare mutations using a semiconductor sequencer and a set of anomaly detection criteria based on a statistical model of the read error rate at each error position. Statistical models were deduced from sequence data from normal samples. We detected epidermal growth factor receptor (EGFR mutations in the plasma DNA of lung cancer patients. Single-pass deep sequencing (>100,000 reads was able to detect one activating mutant allele in 10,000 normal alleles. We confirmed the method using 22 prospective and 155 retrospective samples, mostly consisting of DNA purified from plasma. A temporal analysis suggested potential applications for disease management and for therapeutic decision making to select epidermal growth factor receptor tyrosine kinase inhibitors (EGFR-TKI.

  14. Post-capture immune gene expression studies in the deep-sea hydrothermal vent mussel Bathymodiolus azoricus acclimatized to atmospheric pressure

    Digital Repository Service at National Institute of Oceanography (India)

    Barros, I.; Divya, B.; Martins, I.; Vandeperre, F.; Santos, R.S.; Bettencourt, R.

    ) following the manufacturer's specifi- cations and re-suspended in nuclease-free, DEPC-treated water. Total RNA quality preparations and concentrations were assessed by the A260/280 and A260/230 spectrophotometric ratios using the NanoVue spectrophotometer... in all cDNA synthesis. The cDNA concentration was measured using the NanoVue spectrophotometer (A260/280 and A260/230 spectrophotometric ratios). 2.5. Real-time PCR (qPCR) analyses Based on sequences retrieved from the DeepSeaVent database (http...

  15. Amplification of 3' cDNA end of coxsackievirus B by RACE and its sequence analysis%柯萨奇B组病毒的RACE扩增与序列分析

    Institute of Scientific and Technical Information of China (English)

    李小光; 张凤民; 马培林; 陈小贝; 杨爱英

    2005-01-01

    目的探讨3′RACE(rapid amplification of cDNA ends)技术检测柯萨奇B组病毒(CVB)的可行性,并建立相应的实验检测方法.方法首先自行设计柯萨奇B组病毒6个型别病毒基因组的一条通用引物spl;然后从柯萨奇病毒(CVB1~CVB6)感染的Hela细胞,提取总RNA,用引物sp1结合Oligo dT进行3′RACE扩增,将产物克隆到pGEM-T载体上,挑取阳性菌落进行序列测定,与标准株进行同源性比对.结果获得了柯萨奇B组病毒6个型别的3′末端扩增产物及其核苷酸序列,同源性分析的结果显示扩增毒株与标准株之间的核苷酸同源性在95%~99%之间,氨基酸同源性在98%~100%之间.结论设计的引物"sp1"是柯萨奇B组病毒6个型别病毒基因组的一条通用引物;该引物结合Oligo dT进行的3′RACE扩增可用于柯萨奇B组病毒的检测.

  16. Structure and Genome Organization of Cherry Virus A (Capillovirus, Betaflexiviridae) from China Using Small RNA Sequencing.

    Science.gov (United States)

    Wang, Jiawei; Zhai, Ying; Liu, Weizhen; Dhingra, Amit; Pappu, Hanu R; Liu, Qingzhong

    2016-01-01

    Cherry virus A (CVA) (Capillovirus, Betaflexiviridae) is widely present in cherry-growing areas. We obtained the complete genome of a CVA isolate (CVA-TA) using small RNA deep sequencing, followed by overlapping reverse transcription-PCR (RT-PCR) and rapid amplification of cDNA ends (RACE). The newly identified 5'-untranslated region (5'-UTR) from CVA-TA may form additional hairpin and loop structures to stabilize the CVA genome. PMID:27174277

  17. Identification of novel exons and transcribed regions by chimpanzee transcriptome sequencing

    OpenAIRE

    Wetterbom, Anna; Ameur, Adam; Feuk, Lars; Gyllensten, Ulf; Cavelier, Lucia

    2010-01-01

    Background We profile the chimpanzee transcriptome by using deep sequencing of cDNA from brain and liver, aiming to quantify expression of known genes and to identify novel transcribed regions. Results Using stringent criteria for transcription, we identify 12,843 expressed genes, with a majority being found in both tissues. We further identify 9,826 novel transcribed regions that are not overlapping with annotated exons, mRNAs or ESTs. Over 80% of the novel transcribed regions map within or ...

  18. Structure and Genome Organization of Cherry Virus A (Capillovirus, Betaflexiviridae) from China Using Small RNA Sequencing

    Science.gov (United States)

    Wang, Jiawei; Zhai, Ying; Liu, Weizhen; Dhingra, Amit

    2016-01-01

    Cherry virus A (CVA) (Capillovirus, Betaflexiviridae) is widely present in cherry-growing areas. We obtained the complete genome of a CVA isolate (CVA-TA) using small RNA deep sequencing, followed by overlapping reverse transcription-PCR (RT-PCR) and rapid amplification of cDNA ends (RACE). The newly identified 5′-untranslated region (5′-UTR) from CVA-TA may form additional hairpin and loop structures to stabilize the CVA genome. PMID:27174277

  19. Ultra-deep sequencing analysis of the hepatitis A virus 5'-untranslated region among cases of the same outbreak from a single source.

    Science.gov (United States)

    Wu, Shuang; Nakamoto, Shingo; Kanda, Tatsuo; Jiang, Xia; Nakamura, Masato; Miyamura, Tatsuo; Shirasawa, Hiroshi; Sugiura, Nobuyuki; Takahashi-Nakaguchi, Azusa; Gonoi, Tohru; Yokosuka, Osamu

    2014-01-01

    Hepatitis A virus (HAV) is a causative agent of acute viral hepatitis for which an effective vaccine has been developed. Here we describe ultra-deep pyrosequences (UDPSs) of HAV 5'-untranslated region (5'UTR) among cases of the same outbreak, which arose from a single source, associated with a revolving sushi bar. We determined the reference sequence from HAV-derived clone from an attendant by the Sanger method. Sixteen UDPSs from this outbreak and one from another sporadic case were compared with this reference. Nucleotide errors yielded a UDPS error rate of < 1%. This study confirmed that nucleotide substitutions of this region are transition mutations in outbreak cases, that insertion was observed only in non-severe cases, and that these nucleotide substitutions were different from those of the sporadic case. Analysis of UDPSs detected low-prevalence HAV variations in 5'UTR, but no specific mutations associated with severity in these outbreak cases. To our surprise, HAV strains in this outbreak conserved HAV IRES sequence even if we performed analysis of UDPSs. UDPS analysis of HAV 5'UTR gave us no association between the disease severity of hepatitis A and HAV 5'UTR substitutions. It might be more interesting to perform ultra-deep sequencing of full length HAV genome in order to reveal possible unknown genomic determinants associated with disease severity. Further studies will be needed. PMID:24396287

  20. Cloning and functional expression of a human pancreatic islet glucose-transporter cDNA

    International Nuclear Information System (INIS)

    Previous studies have suggested that pancreatic islet glucose transport is mediated by a high-Km, low-affinity facilitated transporter similar to that expressed in liver. To determine the relationship between islet and liver glucose transporters, liver-type glucose-transporter cDNA clones were isolated from a human liver cDNA library. The liver-type glucose-transporter cDNA clone hybridized to mRNA transcripts of the same size in human liver and pancreatic islet RNA. A cDNA library was prepared from purified human pancreatic islet tissue and screened with human liver-type glucose-transporter cDNA. The authors isolated two overlapping cDNA clones encompassing 2600 base pairs, which encode a pancreatic islet protein identical in sequence to that of the putative liver-type glucose-transporter protein. Xenopus oocytes injected with synthetic mRNA transcribed from a full-length cDNA construct exhibited increased uptake of 2-deoxyglucose, confirming the functional identity of the clone. These cDNA clones can now be used to study regulation of expression of the gene and to assess the role of inherited defects in this gene as a candidate for inherited susceptibility to non-insulin-dependent diabetes mellitus

  1. Deep sequencing of ESTs from nacreous and prismatic layer producing tissues and a screen for novel shell formation-related genes in the pearl oyster.

    Directory of Open Access Journals (Sweden)

    Shigeharu Kinoshita

    Full Text Available BACKGROUND: Despite its economic importance, we have a limited understanding of the molecular mechanisms underlying shell formation in pearl oysters, wherein the calcium carbonate crystals, nacre and prism, are formed in a highly controlled manner. We constructed comprehensive expressed gene profiles in the shell-forming tissues of the pearl oyster Pinctada fucata and identified novel shell formation-related genes candidates. PRINCIPAL FINDINGS: We employed the GS FLX 454 system and constructed transcriptome data sets from pallial mantle and pearl sac, which form the nacreous layer, and from the mantle edge, which forms the prismatic layer in P. fucata. We sequenced 260477 reads and obtained 29682 unique sequences. We also screened novel nacreous and prismatic gene candidates by a combined analysis of sequence and expression data sets, and identified various genes encoding lectin, protease, protease inhibitors, lysine-rich matrix protein, and secreting calcium-binding proteins. We also examined the expression of known nacreous and prismatic genes in our EST library and identified novel isoforms with tissue-specific expressions. CONCLUSIONS: We constructed EST data sets from the nacre- and prism-producing tissues in P. fucata and found 29682 unique sequences containing novel gene candidates for nacreous and prismatic layer formation. This is the first report of deep sequencing of ESTs in the shell-forming tissues of P. fucata and our data provide a powerful tool for a comprehensive understanding of the molecular mechanisms of molluscan biomineralization.

  2. RICD: A rice indica cDNA database resource for rice functional genomics

    Directory of Open Access Journals (Sweden)

    Zhang Qifa

    2008-11-01

    Full Text Available Abstract Background The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones. Results Rice Indica cDNA Database (RICD is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA. Conclusion The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.

  3. Genome sequence of Halorhabdus tiamatea, the first archaeon isolated from a deep-sea anoxic brine lake.

    KAUST Repository

    Antunes, Andre

    2011-09-01

    We present the draft genome of Halorhabdus tiamatea, the first member of the Archaea ever isolated from a deep-sea anoxic brine. Genome comparison with Halorhabdus utahensis revealed some striking differences, including a marked increase in genes associated with transmembrane transport and putative genes for a trehalose synthase and a lactate dehydrogenase.

  4. Genome sequence of Haloplasma contractile, an unusual contractile bacterium from a deep-sea anoxic brine lake.

    KAUST Repository

    Antunes, Andre

    2011-09-01

    We present the draft genome of Haloplasma contractile, isolated from a deep-sea brine and representing a new order between Firmicutes and Mollicutes. Its complex morphology with contractile protrusions might be strongly influenced by the presence of seven MreB/Mbl homologs, which appears to be the highest copy number ever reported.

  5. Expression cloning of a cDNA encoding the bovine histamine H1 receptor.

    OpenAIRE

    Yamashita, M; Fukui, H; Sugama, K; Horio, Y; Ito, S.; Mizuguchi, H.; Wada, H

    1991-01-01

    A functional cDNA clone for the histamine H1 receptor was isolated from a cDNA library of bovine adrenal medulla by a combination of molecular cloning in an expression vector and electrophysiological assay in Xenopus oocytes. The H1 receptor cDNA encodes a protein of 491 amino acids (Mr 55,954) with seven putative transmembrane domains, illustrating the similarity to other receptors that couple with guanine nucleotide-binding regulatory proteins (G protein-coupled receptors). The sequence hom...

  6. Molecular cloning of human interleukin 2 cDNA and its expression in E. coli.

    OpenAIRE

    Devos, René; Plaetinck, Geert; Cheroutre, Hilde; Simons, Guus; Degrave, Wim,; Tavernier, Jan; Remaut, Erik; Fiers, Walter

    1983-01-01

    A recombinant plasmid containing human interleukin 2 (IL2) cDNA was identified in a cDNA library constructed from mRNA derived from PHA-TPA induced splenocytes. Using this cDNA as a hybridization probe, a DNA fragment containing the IL2 gene was isolated from a collection of hybrid phages derived from human genomic DNA. A unique reading frame was identified from the nucleotide sequence derived from these plasmids coding for a polypeptide of 153 amino acids and containing a putative signal seq...

  7. Characterization of antigen-expressing Plasmodium falciparum cDNA clones that are reactive with parasite inhibitory antibodies.

    Science.gov (United States)

    Horii, T; Bzik, D J; Inselburg, J

    1988-07-01

    A Plasmodium falciparum (FCR3 strain) lambda gt11 cDNA expression library was constructed from trophozoite and schizont poly(A) RNA and was screened immunologically with a pooled human immune serum from Nigeria to form a gene bank of 288 positive clones. The gene bank was subsequently screened with parasite inhibitory mouse monoclonal antibodies (mMAb) and with individual human Liberian sera. Two mMAb, 43E5 and 5H10, strongly reacted with 8 and 3 cDNA clones, respectively. Several of those clones also weakly cross-reacted with the other mMAb. Two of those weakly cross-reactive clones, cDNA#366 and cDNA#22, were shown to be located in different chromosomal regions of the parasite by Southern hybridization and so appeared to represent two different parasite genes. The genomic organization of both cDNA#366 and cDNA#22 sequences were identical in the FCR3 and the Honduras-1 strain. The nucleotide sequence of cDNA#366 and the amino acid sequence it coded for were homologous to a partial DNA and amino acid sequence previously reported for a P. falciparum (Camp strain) exoantigen designated p126. The mRNA for cDNA#366 appeared to represent an abundant message in blood stage trophozoites and schizonts. PMID:2456465

  8. 杜仲1-羟基-2-甲基-2-(E)-丁烯基-4-二磷酸合酶基因cDNA全长克隆与序列分析%Cloning and Sequence Analysis of 1-Hydroxy-2-methyl-2-(E)-butenyl-4-diphosphate Synthase Gene cDNA from Eucommia ulmoides

    Institute of Scientific and Technical Information of China (English)

    刘攀峰; 杜红岩; 杜兰英; 乌云塔娜; 黄海燕

    2012-01-01

    植物萜类生物合成MEP途径中1-羟基-2-甲基-2-(E)-丁烯基4-二磷酸合酶(HDS)催化ME-2,4cPP生成HMBPP.以杜仲叶片cDNA为模板,采用反转录RCR及RACE技术分离出HDS基因的cDNA全长克隆.测序及序列分析结果表明该基因全长2786 bp,基因内部含有完整的开放阅读框,共编码743个氨基酸,推导的蛋白质分子量为82.25 kD,理论等电点为5.89,编码序列含有2个保守的结构域PSN和PSI以及3个绝对保守的半胱氨酸位点.系统进化树分析表明EuHDS蛋白与葡萄HDS蛋白的进化距离最为接近(0.049),其次为番茄(0.052)和橡胶(0.052).%l-Hydroxy-2-methyl-2-(E)-butenyl-4-diphosphate synthase (HDS) catalyses 2C-methyl-D-erythritol-2,4-cyclodiphosphate into l-hydroxy-2-methyl-2-(E)-butenyl-4-diphosphate in the penultimate step of 2C-meth-yl-D-erythritoI 4-phosphate (MEP) pathway, which is an alternative plant terpenoids biosynthetic route that has been recently discovered. Homologous HDS cDNA (EuHDS) was isolated from leaves of Eucommia ulmoides by the methods of reverse transcription polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends ( RACE ) technique. The full-length cDNA of EuHDS was 2 786 bp and encoded 743 amino acids with a predicted molecular mass of 82. 25 kD, and a theoretical isoelectric point of 5. 89. Two conserved motifs and three absolutely conserved cysteines (residues 644, 647 and 678) were discovered in the deduced coding sequence. Phylogenetic analysis revealed that EuDXR was more similar to Vitis vinifera(evolutionary distance, 0.049) than that of other species, followed by Solanum lycopersicum(0.052) and Hevea brasiliensis(0. 052).

  9. Two human cDNA molecules coding for the Duchenne muscular dystrophy (DMD) locus are highly homologous

    Energy Technology Data Exchange (ETDEWEB)

    Rosenthal, A.; Speer, A.; Billwitz, H. (Zentralinstitut fuer Molekularbiologie, Berlin-Buch (Germany Democratic Republic)); Cross, G.S.; Forrest, S.M.; Davies, K.E. (Univ. of Oxford (England))

    1989-07-11

    Recently the complete sequence of the human fetal cDNA coding for the Duchenne muscular dystrophy (DMD) locus was reported and a 3,685 amino acid long, rod-shaped cytoskeletal protein (dystrophin) was predicted as the protein product. Independently, the authors have isolated and sequenced different DMD cDNA molecules from human adult and fetal muscle. The complete 12.5 kb long sequence of all their cDNA clones has now been determined and they report here the nucleotide (nt) and amino acid (aa) differences between the sequences of both groups. The cDNA sequence comprises the whole coding region but lacks the first 110 nt from the 5{prime}-untranslated region and the last 1,417 nt of the 3{prime}-untranslated region. They have found 11 nt differences (approximately 99.9% homology) from which 7 occurred at the aa level.

  10. First characterization of infectious cDNA clones of Olive mild mosaic virus

    OpenAIRE

    Cardoso, Joana M.S.; Félix, M. Rosário; Clara, M.Ivone; Oliveira, Solange

    2012-01-01

    Full-length cDNA clones of an Olive mild mosaic virus (OMMV) isolate were constructed in order to find infectious cDNA clones. The sequencing of three individual full-length clones revealed some differences between them. In vitro transcription of these clones was performed and the effect of spontaneous mutations in the biological behaviour of the in vitro transcripts was evaluated by symptomatology, RNA accumulation and virus replication in inoculated plants. In vitro synthesized RNA from one...

  11. Genome Sequence of the Psychrophilic Bacterium Tenacibaculum ovolyticum Strain da5A-8 Isolated from Deep Seawater.

    Science.gov (United States)

    Teramoto, Maki; Zhai, Zhenyu; Komatsu, Ayumi; Shibayama, Keigo; Suzuki, Masato

    2016-01-01

    Some bacterial species of the genus Tenacibaculum, including Tenacibaculum ovolyticum, have been known as fish pathogens in the sea. So far, the only published genome sequence for this genus is for Tenacibaculum dicentrarchi, which could also be a fish pathogen. Strain da5A-8, showing 100% identity to the 16S rRNA gene sequence of T. ovolyticum DSM 18103(T), was isolated from seawater at a depth of 344 m in Kochi, Japan, and grew optimally at 10 to 20°C. The genome sequence of strain da5A-8 revealed the possible virulence genes commonly observed in the genus Tenacibaculum. PMID:27365358

  12. Genome Sequence of the Psychrophilic Bacterium Tenacibaculum ovolyticum Strain da5A-8 Isolated from Deep Seawater

    Science.gov (United States)

    Zhai, Zhenyu; Komatsu, Ayumi; Shibayama, Keigo

    2016-01-01

    Some bacterial species of the genus Tenacibaculum, including Tenacibaculum ovolyticum, have been known as fish pathogens in the sea. So far, the only published genome sequence for this genus is for Tenacibaculum dicentrarchi, which could also be a fish pathogen. Strain da5A-8, showing 100% identity to the 16S rRNA gene sequence of T. ovolyticum DSM 18103T, was isolated from seawater at a depth of 344 m in Kochi, Japan, and grew optimally at 10 to 20°C. The genome sequence of strain da5A-8 revealed the possible virulence genes commonly observed in the genus Tenacibaculum. PMID:27365358

  13. Identification of microRNAs from Amur grape (vitis amurensis Rupr. by deep sequencing and analysis of microRNA variations with bioinformatics

    Directory of Open Access Journals (Sweden)

    Wang Chen

    2012-03-01

    Full Text Available Abstract Background MicroRNA (miRNA is a class of functional non-coding small RNA with 19-25 nucleotides in length while Amur grape (Vitis amurensis Rupr. is an important wild fruit crop with the strongest cold resistance among the Vitis species, is used as an excellent breeding parent for grapevine, and has elicited growing interest in wine production. To date, there is a relatively large number of grapevine miRNAs (vv-miRNAs from cultivated grapevine varieties such as Vitis vinifera L. and hybrids of V. vinifera and V. labrusca, but there is no report on miRNAs from Vitis amurensis Rupr, a wild grapevine species. Results A small RNA library from Amur grape was constructed and Solexa technology used to perform deep sequencing of the library followed by subsequent bioinformatics analysis to identify new miRNAs. In total, 126 conserved miRNAs belonging to 27 miRNA families were identified, and 34 known but non-conserved miRNAs were also found. Significantly, 72 new potential Amur grape-specific miRNAs were discovered. The sequences of these new potential va-miRNAs were further validated through miR-RACE, and accumulation of 18 new va-miRNAs in seven tissues of grapevines confirmed by real time RT-PCR (qRT-PCR analysis. The expression levels of va-miRNAs in flowers and berries were found to be basically consistent in identity to those from deep sequenced sRNAs libraries of combined corresponding tissues. We also describe the conservation and variation of va-miRNAs using miR-SNPs and miR-LDs during plant evolution based on comparison of orthologous sequences, and further reveal that the number and sites of miR-SNP in diverse miRNA families exhibit distinct divergence. Finally, 346 target genes for the new miRNAs were predicted and they include a number of Amur grape stress tolerance genes and many genes regulating anthocyanin synthesis and sugar metabolism. Conclusions Deep sequencing of short RNAs from Amur grape flowers and berries identified 72

  14. The construction of a recombinant cDNA library representative of the poly(A)+ mRNA population from normal human lymphocytes.

    OpenAIRE

    Woods, D.; Crampton, J.; Clarke, B.; Williamson, R

    1980-01-01

    A recombinant library has been constructed using the plasmid pAT153 and double stranded cDNA prepared from normal human lymphocyte poly(A)+ RNA. Transformation conditions were optimized to yield approximately 200,000 recombinants per microgram of double stranded cDNA. Statistical analysis as well as sequence complexity analysis of the inserted sequences indicates that the cDNA library is representative of > 99% of the poly(A)+ RNA present in the normal human lymphocyte.

  15. Deep sequencing analysis of small noncoding RNA and mRNA targets of the global post-transcriptional regulator, Hfq

    DEFF Research Database (Denmark)

    Sittka, A; Lucchini, S; Papenfort, K;

    2008-01-01

    Recent advances in high-throughput pyrosequencing (HTPS) technology now allow a thorough analysis of RNA bound to cellular proteins, and, therefore, of post-transcriptional regulons. We used HTPS to discover the Salmonella RNAs that are targeted by the common bacterial Sm-like protein, Hfq. Initial...... transcriptomic analysis revealed that Hfq controls the expression of almost a fifth of all Salmonella genes, including several horizontally acquired pathogenicity islands (SPI-1, -2, -4, -5), two sigma factor regulons, and the flagellar gene cascade. Subsequent HTPS analysis of 350,000 cDNAs, derived from RNA co......-immunoprecipitation (coIP) with epitope-tagged Hfq or control coIP, identified 727 mRNAs that are Hfq-bound in vivo. The cDNA analysis discovered new, small noncoding RNAs (sRNAs) and more than doubled the number of sRNAs known to be expressed in Salmonella to 64; about half of these are associated with Hfq. Our analysis...

  16. Identification of SNPs in barley(Hordeum vulgare L.)by deep sequencing of six reduced representation libraries

    Institute of Scientific and Technical Information of China (English)

    Ganggang; Guo; Dawa; Dondup; Lisha; Zhang; Sha; Hu; Xingmiao; Yuan; Jing; Zhang

    2014-01-01

    High-density genetic markers are required for genotyping and linkage mapping in identifying genes from crops with complex genomes, such as barley. As the most common variation, single nucleotide polymorphisms(SNPs) are suitable for accurate genotyping by using the next-generation sequencing(NGS) technology. Reduced representation libraries(RRLs) of five barley accessions and one mutant were sequenced using NGS technology for SNP discovery. Twenty million short reads were generated and the proportion of repetitive sequences was reduced by more than 56%. A total of 6061 SNPs were identified, and 451 were mapped to the draft sequence of the barley genome with pairing reads. Eleven SNPs were validated using length polymorphic allele-specific PCR markers.

  17. Congruent Deep Relationships in the Grape Family (Vitaceae) Based on Sequences of Chloroplast Genomes and Mitochondrial Genes via Genome Skimming

    OpenAIRE

    Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A.

    2015-01-01

    Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape ...

  18. Identification of microRNA-like RNAs from Curvularia lunata associated with maize leaf spot by bioinformation analysis and deep sequencing.

    Science.gov (United States)

    Liu, Tong; Hu, John; Zuo, Yuhu; Jin, Yazhong; Hou, Jumei

    2016-04-01

    Deep sequencing of small RNAs is a useful tool to identify novel small RNAs that may be involved in fungal growth and pathogenesis. In this study, we used HiSeq deep sequencing to identify 747,487 unique small RNAs from Curvularia lunata. Among these small RNAs were 1012 microRNA-like RNAs (milRNAs), which are similar to other known microRNAs, and 48 potential novel milRNAs without homologs in other organisms have been identified using the miRBase© database. We used quantitative PCR to analyze the expression of four of these milRNAs from C. lunata at different developmental stages. The analysis revealed several changes associated with germinating conidia and mycelial growth, suggesting that these milRNAs may play a role in pathogen infection and mycelial growth. A total of 8334 target mRNAs for the 1012 milRNAs that were identified, and 256 target mRNAs for the 48 novel milRNAs were predicted by computational analysis. These target mRNAs of milRNAs were also performed by gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway analysis. To our knowledge, this study is the first report of C. lunata's milRNA profiles. This information will provide a better understanding of pathogen development and infection mechanism. PMID:26481645

  19. Whole transcriptome RNA sequencing data from blood leukocytes derived from Parkinson's disease patients prior to and following deep brain stimulation treatment

    Directory of Open Access Journals (Sweden)

    Lilach Soreq

    2015-03-01

    Full Text Available Recent evidence demonstrates the power of RNA sequencing (RNA-Seq for identifying valuable and urgently needed blood biomarkers and advancing both early and accurate detection of neurological diseases, and in particular Parkinson's disease (PD. RNA sequencing technology enables non-biased, high throughput, probe-independent inspection of expression data and high coverage and both quantification of global transcript levels as well as the detection of expressed exons and junctions given a sufficient sequencing depth (coverage. However, the analysis of sequencing data frequently presents a bottleneck. Tools for quantification of alternative splicing from sequenced libraries hardly exist at the present time, and methods that support multiple sequencing platforms are especially lacking. Here, we describe in details a whole RNA-Seq transcriptome dataset produced from PD patient's blood leukocytes. The samples were taken prior to, and following deep brain stimulation (DBS treatment while being on stimulation and following 1 h of complete electrical stimulation cessation and from healthy control volunteers. We describe in detail the methodology applied for analyzing the RNA-Seq data including differential expression of long noncoding RNAs (lncRNAs. We also provide details of the corresponding analysis of in-depth splice isoform data from junction and exon reads, with the use of the software AltAnalyze. Both the RNA-Seq raw (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE42608 and analyzed data (https://www.synapse.org/#!Synapse:syn2805267 may be found valuable towards detection of novel blood biomarkers for PD.

  20. Complete mitochondrial genome DNA sequence for two ophiuroids and a holothuroid: the utility of protein gene sequence and gene maps in the analyses of deep deuterostome phylogeny.

    Science.gov (United States)

    Scouras, Andrea; Beckenbach, Karen; Arndt, Allan; Smith, Michael J

    2004-04-01

    The complete mitochondrial genome sequences have been determined for the holothuroid Cucumaria miniata and two ophiuroid species Ophiopholis aculeata and Ophiura lütkeni. In addition, the nucleotide sequence of the mitochondrial protein-coding genes for the asteroid Pisaster ochraceus has been completed. Maximum-likelihood and LogDet distance analyses of concatenated protein-coding sequences produced a series of trees that did not conclusively support generally accepted models of echinoderm phylogeny. The ophiuroid data consistently demonstrated accelerated nucleotide divergence rates and lack of stationarity. This confounds the phylogenetic analyses. Molecular investigations using individual protein-coding gene alignments demonstrated that the cytochrome b gene exhibits the least deviation in rate and stationarity and generated some trees consistent with proposed echinoderm phylogenies. Phylogenies based on echinoderm mitochondrial gene rearrangements also proved problematic because of extensive variation in gene order between and within classes. A comparison of the two distinctive ophiuroid mitochondrial gene orders supports the hypothesis that O. lütkeni has a more derived mitochondrial gene order versus O. aculeata. The variation in the echinoderm mitochondrial gene maps reinforces the limitations of the application of mitochondrial gene rearrangements as a global phylogenetic tool. PMID:15019608

  1. cDNA2Genome: A tool for mapping and annotating cDNAs

    Directory of Open Access Journals (Sweden)

    Suhai Sandor

    2003-09-01

    Full Text Available Abstract Background In the last years several high-throughput cDNA sequencing projects have been funded worldwide with the aim of identifying and characterizing the structure of complete novel human transcripts. However some of these cDNAs are error prone due to frameshifts and stop codon errors caused by low sequence quality, or to cloning of truncated inserts, among other reasons. Therefore, accurate CDS prediction from these sequences first require the identification of potentially problematic cDNAs in order to speed up the posterior annotation process. Results cDNA2Genome is an application for the automatic high-throughput mapping and characterization of cDNAs. It utilizes current annotation data and the most up to date databases, especially in the case of ESTs and mRNAs in conjunction with a vast number of approaches to gene prediction in order to perform a comprehensive assessment of the cDNA exon-intron structure. The final result of cDNA2Genome is an XML file containing all relevant information obtained in the process. This XML output can easily be used for further analysis such us program pipelines, or the integration of results into databases. The web interface to cDNA2Genome also presents this data in HTML, where the annotation is additionally shown in a graphical form. cDNA2Genome has been implemented under the W3H task framework which allows the combination of bioinformatics tools in tailor-made analysis task flows as well as the sequential or parallel computation of many sequences for large-scale analysis. Conclusions cDNA2Genome represents a new versatile and easily extensible approach to the automated mapping and annotation of human cDNAs. The underlying approach allows sequential or parallel computation of sequences for high-throughput analysis of cDNAs.

  2. A Method for Amplicon Deep Sequencing of Drug Resistance Genes in Plasmodium falciparum Clinical Isolates from India.

    Science.gov (United States)

    Rao, Pavitra N; Uplekar, Swapna; Kayal, Sriti; Mallick, Prashant K; Bandyopadhyay, Nabamita; Kale, Sonal; Singh, Om P; Mohanty, Akshaya; Mohanty, Sanjib; Wassmer, Samuel C; Carlton, Jane M

    2016-06-01

    A major challenge to global malaria control and elimination is early detection and containment of emerging drug resistance. Next-generation sequencing (NGS) methods provide the resolution, scalability, and sensitivity required for high-throughput surveillance of molecular markers of drug resistance. We have developed an amplicon sequencing method on the Ion Torrent PGM platform for targeted resequencing of a panel of six Plasmodium falciparum genes implicated in resistance to first-line antimalarial therapy, including artemisinin combination therapy, chloroquine, and sulfadoxine-pyrimethamine. The protocol was optimized using 12 geographically diverse P. falciparum reference strains and successfully applied to multiplexed sequencing of 16 clinical isolates from India. The sequencing results from the reference strains showed 100% concordance with previously reported drug resistance-associated mutations. Single-nucleotide polymorphisms (SNPs) in clinical isolates revealed a number of known resistance-associated mutations and other nonsynonymous mutations that have not been implicated in drug resistance. SNP positions containing multiple allelic variants were used to identify three clinical samples containing mixed genotypes indicative of multiclonal infections. The amplicon sequencing protocol has been designed for the benchtop Ion Torrent PGM platform and can be operated with minimal bioinformatics infrastructure, making it ideal for use in countries that are endemic for the disease to facilitate routine large-scale surveillance of the emergence of drug resistance and to ensure continued success of the malaria treatment policy. PMID:27008882

  3. A Method for Amplicon Deep Sequencing of Drug Resistance Genes in Plasmodium falciparum Clinical Isolates from India

    Science.gov (United States)

    Rao, Pavitra N.; Uplekar, Swapna; Kayal, Sriti; Mallick, Prashant K.; Bandyopadhyay, Nabamita; Kale, Sonal; Singh, Om P.; Mohanty, Akshaya; Mohanty, Sanjib; Wassmer, Samuel C.

    2016-01-01

    A major challenge to global malaria control and elimination is early detection and containment of emerging drug resistance. Next-generation sequencing (NGS) methods provide the resolution, scalability, and sensitivity required for high-throughput surveillance of molecular markers of drug resistance. We have developed an amplicon sequencing method on the Ion Torrent PGM platform for targeted resequencing of a panel of six Plasmodium falciparum genes implicated in resistance to first-line antimalarial therapy, including artemisinin combination therapy, chloroquine, and sulfadoxine-pyrimethamine. The protocol was optimized using 12 geographically diverse P. falciparum reference strains and successfully applied to multiplexed sequencing of 16 clinical isolates from India. The sequencing results from the reference strains showed 100% concordance with previously reported drug resistance-associated mutations. Single-nucleotide polymorphisms (SNPs) in clinical isolates revealed a number of known resistance-associated mutations and other nonsynonymous mutations that have not been implicated in drug resistance. SNP positions containing multiple allelic variants were used to identify three clinical samples containing mixed genotypes indicative of multiclonal infections. The amplicon sequencing protocol has been designed for the benchtop Ion Torrent PGM platform and can be operated with minimal bioinformatics infrastructure, making it ideal for use in countries that are endemic for the disease to facilitate routine large-scale surveillance of the emergence of drug resistance and to ensure continued success of the malaria treatment policy. PMID:27008882

  4. Deep Illumina-based shotgun sequencing reveals dietary effects on the structure and function of the fecal microbiome of growing kittens.

    Directory of Open Access Journals (Sweden)

    Oliver Deusch

    Full Text Available Previously, we demonstrated that dietary protein:carbohydrate ratio dramatically affects the fecal microbial taxonomic structure of kittens using targeted 16S gene sequencing. The present study, using the same fecal samples, applied deep Illumina shotgun sequencing to identify the diet-associated functional potential and analyze taxonomic changes of the feline fecal microbiome.Fecal samples from kittens fed one of two diets differing in protein and carbohydrate content (high-protein, low-carbohydrate, HPLC; and moderate-protein, moderate-carbohydrate, MPMC were collected at 8, 12 and 16 weeks of age (n = 6 per group. A total of 345.3 gigabases of sequence were generated from 36 samples, with 99.75% of annotated sequences identified as bacterial. At the genus level, 26% and 39% of reads were annotated for HPLC- and MPMC-fed kittens, with HPLC-fed cats showing greater species richness and microbial diversity. Two phyla, ten families and fifteen genera were responsible for more than 80% of the sequences at each taxonomic level for both diet groups, consistent with the previous taxonomic study. Significantly different abundances between diet groups were observed for 324 genera (56% of all genera identified demonstrating widespread diet-induced changes in microbial taxonomic structure. Diversity was not affected over time. Functional analysis identified 2,013 putative enzyme function groups were different (p<0.000007 between the two dietary groups and were associated to 194 pathways, which formed five discrete clusters based on average relative abundance. Of those, ten contained more (p<0.022 enzyme functions with significant diet effects than expected by chance. Six pathways were related to amino acid biosynthesis and metabolism linking changes in dietary protein with functional differences of the gut microbiome.These data indicate that feline feces-derived microbiomes have large structural and functional differences relating to the dietary

  5. Identification and Analysis of Differentially-Expressed microRNAs in Japanese Encephalitis Virus-Infected PK-15 Cells with Deep Sequencing

    Directory of Open Access Journals (Sweden)

    Yuhan Cai

    2015-01-01

    Full Text Available Japanese encephalitis virus (JEV, a mosquito-borne Flavivirus, causes acute viral encephalitis with high morbidity and mortality in humans and animals. MicroRNAs (miRNAs are small noncoding RNAs that are important modulators of the intricate host-pathogen interaction networks. However, our knowledge of the changes that occur in miRNAs in host cells after JEV infection is still limited. To understand the molecular pathogenesis of JEV at the level of posttranscriptional regulation, we used Illumina deep sequencing to sequence two small RNA libraries prepared from PK-15 cells before and after JEV infection. We identified 522 and 427 miRNAs in the infected and uninfected cells, respectively. Overall, 132 miRNAs were expressed significantly differently after challenge with JEV: 78 were upregulated and 54 downregulated. The sequencing results for selected miRNAs were confirmed with RT-qPCR. GO analysis of the host target genes revealed that these dysregulated miRNAs are involved in complex cellular pathways, including the metabolic pathway, inflammatory response and immune response. To our knowledge, this is the first report of the comparative expression of miRNAs in PK-15 cells after JEV infection. Our findings will underpin further studies of miRNAs’ roles in JEV replication and identify potential candidates for antiviral therapies against JEV.

  6. The utility of diversity profiling using Illumina 18S rRNA gene amplicon deep sequencing to detect and discriminate Toxoplasma gondii among the cyst-forming coccidia.

    Science.gov (United States)

    Cooper, Madalyn K; Phalen, David N; Donahoe, Shannon L; Rose, Karrie; Šlapeta, Jan

    2016-01-30

    Next-generation sequencing (NGS) has the capacity to screen a single DNA sample and detect pathogen DNA from thousands of host DNA sequence reads, making it a versatile and informative tool for investigation of pathogens in diseased animals. The technique is effective and labor saving in the initial identification of pathogens, and will complement conventional diagnostic tests to associate the candidate pathogen with a disease process. In this report, we investigated the utility of the diversity profiling NGS approach using Illumina small subunit ribosomal RNA (18S rRNA) gene amplicon deep sequencing to detect Toxoplasma gondii in previously confirmed cases of toxoplasmosis. We then tested the diagnostic approach with species-specific PCR genotyping, histopathology and immunohistochemistry of toxoplasmosis in a Risso's dolphin (Grampus griseus) to systematically characterise the disease and associate causality. We show that the Euk7A/Euk570R primer set targeting the V1-V3 hypervariable region of the 18S rRNA gene can be used as a species-specific assay for cyst-forming coccidia and discriminate T. gondii. Overall, the approach is cost-effective and improves diagnostic decision support by narrowing the differential diagnosis list with more certainty than was previously possible. Furthermore, it supplements the limitations of cryptic protozoan morphology and surpasses the need for species-specific PCR primer combinations. PMID:26801593

  7. Construction and sequence sampling of deep-coverage, large-insert BAC libraries for three model lepidopteran species

    Directory of Open Access Journals (Sweden)

    Zhao Shaying

    2009-06-01

    Full Text Available Abstract Background Manduca sexta, Heliothis virescens, and Heliconius erato represent three widely-used insect model species for genomic and fundamental studies in Lepidoptera. Large-insert BAC libraries of these insects are critical resources for many molecular studies, including physical mapping and genome sequencing, but not available to date. Results We report the construction and characterization of six large-insert BAC libraries for the three species and sampling sequence analysis of the genomes. The six BAC libraries were constructed with two restriction enzymes, two libraries for each species, and each has an average clone insert size ranging from 152–175 kb. We estimated that the genome coverage of each library ranged from 6–9 ×, with the two combined libraries of each species being equivalent to 13.0–16.3 × haploid genomes. The genome coverage, quality and utility of the libraries were further confirmed by library screening using 6~8 putative single-copy probes. To provide a first glimpse into these genomes, we sequenced and analyzed the BAC ends of ~200 clones randomly selected from the libraries of each species. The data revealed that the genomes are AT-rich, contain relatively small fractions of repeat elements with a majority belonging to the category of low complexity repeats, and are more abundant in retro-elements than DNA transposons. Among the species, the H. erato genome is somewhat more abundant in repeat elements and simple repeats than those of M. sexta and H. virescens. The BLAST analysis of the BAC end sequences suggested that the evolution of the three genomes is widely varied, with the genome of H. virescens being the most conserved as a typical lepidopteran, whereas both genomes of H. erato and M. sexta appear to have evolved significantly, resulting in a higher level of species- or evolutionary lineage-specific sequences. Conclusion The high-quality and large-insert BAC libraries of the insects, together

  8. cDNA cloning and gene expression of ascorbate oxidase in tobacco.

    Science.gov (United States)

    Kato, N; Esaka, M

    1996-02-01

    A cDNA clone for ascorbate oxidase (AAO) has been isolated from a cDNA library of tobacco (Nicotiana tabacum) cells. The identity of the amino acid sequence deduced from tobacco AAO cDNA to that from pumpkin AAO cDNA was 68%, which was much lower than the identity (80%) between pumpkin and cucumber AAO. AAO activity in tobacco cells was much lower than that in pumpkin cells, whereas the immunoreactive protein in tobacco cells was more abundant than that in pumpkin cells. We suppose that AAO protein in tobacco cells may be less active than that in pumpkin cells. Genomic Southern blotting suggested that AAO in tobacco was encoded by a single-copy gene. Nothern blotting revealed that mRNA of AAO was highly expressed in young and growing tissues of tobacco plant. PMID:8624413

  9. Quantitative Transcript Analysis in Plants: Improved First-strand cDNA Synthesis

    Institute of Scientific and Technical Information of China (English)

    Nai-Zhong XIAO; Lei BA; Preben Bach HOLM; Xing-Zhi WANG; Steve BOWRA

    2005-01-01

    The quantity and quality of first-strand cDNA directly influence the accuracy of transcriptional analysis and quantification. Using a plant-derived α-tubulin as a model system, the effect of oligo sequence and DTT on the quality and quantity of first-strand cDNA synthesis was assessed via a combination of semi-quantitative PCR and real-time PCR. The results indicated that anchored oligo dT significantly improved the quantity and quality of α-tubulin cDNA compared to the conventional oligo dT. Similarly, omitting DTT from the first-strand cDNA synthesis also enhanced the levels of transcript. This is the first time that a comparative analysis has been undertaken for a plant system and it shows conclusively that small changes to current protocols can have very significant impact on transcript analysis.

  10. Rapid and Deep Proteomes by Faster Sequencing on a Benchtop Quadrupole Ultra-High-Field Orbitrap Mass Spectrometer

    DEFF Research Database (Denmark)

    Kelstrup, Christian D; Jersie-Christensen, Rosa R; Batth, Tanveer Singh;

    2014-01-01

    Shotgun proteomics is a powerful technology for global analysis of proteins and their post-translational modifications. Here, we investigate faster sequencing speed of the latest Q Exactive HF mass spectrometer, which features an ultra-high-field Orbitrap mass analyzer. Proteome coverage is...... evaluated by four different acquisition methods and benchmarked across three generations of Q Exactive instruments (ProteomeXchange dataset PXD001305). We find the ultra-high-field Orbitrap mass analyzer capable of attaining a sequencing speed above 20 Hz and it routinely exceeds 10 peptide spectrum matches......-shot phosphoproteomics, where we identify 7600 unique HeLa phospho-peptides in one gradient hour and find the quality of fragmentation spectra to be more important than quantity for accurate site assignment....

  11. Independent studies using deep sequencing resolve the same set of core bacterial species dominating gut communities of honey bees.

    Science.gov (United States)

    Sabree, Zakee L; Hansen, Allison K; Moran, Nancy A

    2012-01-01

    Starting in 2003, numerous studies using culture-independent methodologies to characterize the gut microbiota of honey bees have retrieved a consistent and distinctive set of eight bacterial species, based on near identity of the 16S rRNA gene sequences. A recent study [Mattila HR, Rios D, Walker-Sperling VE, Roeselers G, Newton ILG (2012) Characterization of the active microbiotas associated with honey bees reveals healthier and broader communities when colonies are genetically diverse. PLoS ONE 7(3): e32962], using pyrosequencing of the V1-V2 hypervariable region of the 16S rRNA gene, reported finding entirely novel bacterial species in honey bee guts, and used taxonomic assignments from these reads to predict metabolic activities based on known metabolisms of cultivable species. To better understand this discrepancy, we analyzed the Mattila et al. pyrotag dataset. In contrast to the conclusions of Mattila et al., we found that the large majority of pyrotag sequences belonged to clusters for which representative sequences were identical to sequences from previously identified core species of the bee microbiota. On average, they represent 95% of the bacteria in each worker bee in the Mattila et al. dataset, a slightly lower value than that found in other studies. Some colonies contain small proportions of other bacteria, mostly species of Enterobacteriaceae. Reanalysis of the Mattila et al. dataset also did not support a relationship between abundances of Bifidobacterium and of putative pathogens or a significant difference in gut communities between colonies from queens that were singly or multiply mated. Additionally, consistent with previous studies, the dataset supports the occurrence of considerable strain variation within core species, even within single colonies. The roles of these bacteria within bees, or the implications of the strain variation, are not yet clear. PMID:22829932

  12. Independent studies using deep sequencing resolve the same set of core bacterial species dominating gut communities of honey bees.

    Directory of Open Access Journals (Sweden)

    Zakee L Sabree

    Full Text Available Starting in 2003, numerous studies using culture-independent methodologies to characterize the gut microbiota of honey bees have retrieved a consistent and distinctive set of eight bacterial species, based on near identity of the 16S rRNA gene sequences. A recent study [Mattila HR, Rios D, Walker-Sperling VE, Roeselers G, Newton ILG (2012 Characterization of the active microbiotas associated with honey bees reveals healthier and broader communities when colonies are genetically diverse. PLoS ONE 7(3: e32962], using pyrosequencing of the V1-V2 hypervariable region of the 16S rRNA gene, reported finding entirely novel bacterial species in honey bee guts, and used taxonomic assignments from these reads to predict metabolic activities based on known metabolisms of cultivable species. To better understand this discrepancy, we analyzed the Mattila et al. pyrotag dataset. In contrast to the conclusions of Mattila et al., we found that the large majority of pyrotag sequences belonged to clusters for which representative sequences were identical to sequences from previously identified core species of the bee microbiota. On average, they represent 95% of the bacteria in each worker bee in the Mattila et al. dataset, a slightly lower value than that found in other studies. Some colonies contain small proportions of other bacteria, mostly species of Enterobacteriaceae. Reanalysis of the Mattila et al. dataset also did not support a relationship between abundances of Bifidobacterium and of putative pathogens or a significant difference in gut communities between colonies from queens that were singly or multiply mated. Additionally, consistent with previous studies, the dataset supports the occurrence of considerable strain variation within core species, even within single colonies. The roles of these bacteria within bees, or the implications of the strain variation, are not yet clear.

  13. Importance of minor variants and their detection (ultra-deep sequencing) in the management of HIV infection

    OpenAIRE

    Paredes R.

    2012-01-01

    Sanger genotypic drug resistance assays currently used for routine HIV management often miss low-frequency drug-resistant HIV variants, potentially posing patients at risk of antiretroviral treatment failure. Next-generation sequencing (NGS) technologies allow detection of such minority variants down to approximately 0.5%–1% frequency. They also facilitate interrogating “other” HIV genomic regions like Gag or the RT connection domain, which modulate HIV resistance and are...

  14. Identification of Novel MicroRNAs in Primates by Using the Synteny Information and Small RNA Deep Sequencing Data

    OpenAIRE

    Yuan, Zhidong; Liu, Hongde; Nie, Yumin; Ding, Suping; Yan, Mingli; Tan, Shuhua; Jin, Yuanchang; Sun, Xiao

    2013-01-01

    Current technologies that are used for genome-wide microRNA (miRNA) prediction are mainly based on BLAST tool. They often produce a large number of false positives. Here, we describe an effective approach for identifying orthologous pre-miRNAs in several primates based on syntenic information. Some of them have been validated by small RNA high throughput sequencing data. This approach uses the synteny information and experimentally validated miRNAs of human, and incorporates currently availab...

  15. Contrasted seismogenic and rheological behaviours from shallow and deep earthquake sequences in the North Tanzanian Divergence, East Africa

    Science.gov (United States)

    Albaric, J.; Perrot, J.; Déverchère, J.; Deschamps, A.; Le Gall, B.; Ferdinand, R. W.; Petit, C.; Tiberi, C.; Sue, C.; Songo, M.

    2010-12-01

    We report preliminary results of a seismological experiment, SEISMO-TANZ' 07, which consisted in the deployment of a local network (35 stations) in the East African Rift System (EARS), North Tanzania, during 6 months in 2007. We compare two earthquake sequences (Gelai and Manyara) occurring, respectively, in the southern end of the Kenya rift and in the North Tanzanian Divergence (NTD). Only distant of ˜150 km, their triggering mechanisms are different. None of the sequences depicts typical swarm or mainshock-aftershock patterns. They highlight the change in the magmatic/tectonic nature of the rift where the eastern branch of the EARS enters the Tanzanian craton. The similar shape and long-axis of the elongate sequences emphasize the preferred locus of active strain release along NE-SW discontinuities which probably root at depth into steep Proterozoic shear zones. At Gelai, the deformation is dominated by aseismic process involving slow slip on normal fault and dyke intrusion within the upper crust (Calais et al., 2008). The spatial and temporal earthquake distribution indicates a possible correlation between the Gelai crisis and the eruption of the nearby Oldoinyo Lengai volcano. At Manyara, the sequence is more uncommon, revealing a long-lasting seismic activity deeply rooted (˜20-35 km depth) possibly related to stress loading transmitted laterally. The yield strength envelope modelled from the depth frequency distribution of earthquakes in the NTD is consistent with the presence of a mafic lower crust and further supports the strength increase of the rifted crust from south Kenya to the NTD.

  16. Genomic Analysis by Deep Sequencing of the Probiotic Lactobacillus brevis KB290 Harboring Nine Plasmids Reveals Genomic Stability

    OpenAIRE

    Fukao, Masanori; Oshima, Kenshiro; Morita, Hidetoshi; Toh, Hidehiro; Suda, Wataru; Kim, Seok-Won; Suzuki, Shigenori; Yakabe, Takafumi; Hattori, Masahira; Yajima, Nobuhiro

    2013-01-01

    We determined the complete genome sequence of Lactobacillus brevis KB290, a probiotic lactic acid bacterium isolated from a traditional Japanese fermented vegetable. The genome contained a 2,395,134-bp chromosome that housed 2,391 protein-coding genes and nine plasmids that together accounted for 191 protein-coding genes. KB290 contained no virulence factor genes, and several genes related to presumptive cell wall-associated polysaccharide biosynthesis and the stress response were present in ...

  17. Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids

    OpenAIRE

    Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

    2015-01-01

    Background Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. Results The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of thre...

  18. Bioinformatic prediction, deep sequencing of microRNAs and expression analysis during phenotypic plasticity in the pea aphid, Acyrthosiphon pisum

    Directory of Open Access Journals (Sweden)

    Leterme Nathalie

    2010-05-01

    Full Text Available Abstract Background Post-transcriptional regulation in eukaryotes can be operated through microRNA (miRNAs mediated gene silencing. MiRNAs are small (18-25 nucleotides non-coding RNAs that play crucial role in regulation of gene expression in eukaryotes. In insects, miRNAs have been shown to be involved in multiple mechanisms such as embryonic development, tissue differentiation, metamorphosis or circadian rhythm. Insect miRNAs have been identified in different species belonging to five orders: Coleoptera, Diptera, Hymenoptera, Lepidoptera and Orthoptera. Results We developed high throughput Solexa sequencing and bioinformatic analyses of the genome of the pea aphid Acyrthosiphon pisum in order to identify the first miRNAs from a hemipteran insect. By combining these methods we identified 149 miRNAs including 55 conserved and 94 new miRNAs. Moreover, we investigated the regulation of these miRNAs in different alternative morphs of the pea aphid by analysing the expression of miRNAs across the switch of reproduction mode. Pea aphid microRNA sequences have been posted to miRBase: http://microrna.sanger.ac.uk/sequences/ Conclusions Our study has identified candidates as putative regulators involved in reproductive polyphenism in aphids and opens new avenues for further functional analyses.

  19. Deep Sequencing Reveals Novel Genetic Variants in Children with Acute Liver Failure and Tissue Evidence of Impaired Energy Metabolism

    Science.gov (United States)

    Valencia, C. Alexander; Wang, Xinjian; Wang, Jin; Peters, Anna; Simmons, Julia R.; Moran, Molly C.; Mathur, Abhinav; Husami, Ammar; Qian, Yaping; Sheridan, Rachel; Bove, Kevin E.; Witte, David; Huang, Taosheng; Miethke, Alexander G.

    2016-01-01

    Background & Aims The etiology of acute liver failure (ALF) remains elusive in almost half of affected children. We hypothesized that inherited mitochondrial and fatty acid oxidation disorders were occult etiological factors in patients with idiopathic ALF and impaired energy metabolism. Methods Twelve patients with elevated blood molar lactate/pyruvate ratio and indeterminate etiology were selected from a retrospective cohort of 74 subjects with ALF because their fixed and frozen liver samples were available for histological, ultrastructural, molecular and biochemical analysis. Results A customized next-generation sequencing panel for 26 genes associated with mitochondrial and fatty acid oxidation defects revealed mutations and sequence variants in five subjects. Variants involved the genes ACAD9, POLG, POLG2, DGUOK, and RRM2B; the latter not previously reported in subjects with ALF. The explanted livers of the patients with heterozygous, truncating insertion mutations in RRM2B showed patchy micro- and macrovesicular steatosis, decreased mitochondrial DNA (mtDNA) content acidosis was found to carry two heterozygous variants in ACAD9, which was associated with isolated complex I deficiency and diffuse hypergranular hepatocytes. The two subjects with heterozygous variants of unknown clinical significance in POLG and DGUOK developed ALF following drug exposure. Their hepatocytes displayed abnormal mitochondria by electron microscopy. Conclusion Targeted next generation sequencing and correlation with histological, ultrastructural and functional studies on liver tissue in children with elevated lactate/pyruvate ratio expand the spectrum of genes associated with pediatric ALF. PMID:27483465

  20. Genome re-sequencing of semi-wild soybean reveals a complex Soja population structure and deep introgression.

    Directory of Open Access Journals (Sweden)

    Jie Qiu

    Full Text Available Semi-wild soybean is a unique type of soybean that retains both wild and domesticated characteristics, which provides an important intermediate type for understanding the evolution of the subgenus Soja population in the Glycine genus. In this study, a semi-wild soybean line (Maliaodou and a wild line (Lanxi 1 collected from the lower Yangtze regions were deeply sequenced while nine other semi-wild lines were sequenced to a 3-fold genome coverage. Sequence analysis revealed that (1 no independent phylogenetic branch covering all 10 semi-wild lines was observed in the Soja phylogenetic tree; (2 besides two distinct subpopulations of wild and cultivated soybean in the Soja population structure, all semi-wild lines were mixed with some wild lines into a subpopulation rather than an independent one or an intermediate transition type of soybean domestication; (3 high heterozygous rates (0.19-0.49 were observed in several semi-wild lines; and (4 over 100 putative selective regions were identified by selective sweep analysis, including those related to the development of seed size. Our results suggested a hybridization origin for the semi-wild soybean, which makes a complex Soja population structure.