WorldWideScience

Sample records for contigs reveals interspersed

  1. Identification and insertion polymorphisms of short interspersed nuclear elements (SINEs) in Brassica genomes

    International Nuclear Information System (INIS)

    Nouroz, F.; Naveed, M.

    2018-01-01

    The non-LTR retrotransposons (retroposons) are abundant in plant genomes including members of Brassicaceae. Of the retroposons, long interspersed nuclear elements (LINEs) are more copious followed by short interspersed nuclear elements (SINEs) in sequenced eukaryotic genomes. The SINEs are short elements and ranged from 100-500 bps flanked by variable sized target site duplications, 5' tRNA region with polymerase III promoter, internal tRNA unrelated region, 3' LINEs derived region and a poly adenosine tail. Different computational approaches were used for the identification and characterization of SINEs, while PCR was used to detect the SINEs insertion polymorphisms in various Brassica genotypes. Ten previously unidentified families of SINEs were identified and characterized from Brassica genomes. The structural features of these SINEs were studied in detail, which showed typical SINE features displaying small sizes, target site duplications, head regions, internal regions (body) of variable sizes and a poly (A) tail at the 3' terminus. The elements from various families ranged from 206-558 bp, where BoSINE2 family displayed smallest SINE element (206 bp), while larger members belonged to BoSINE9 family (524-558 bp). The distribution and abundance of SINEs in various Brassica species and genotypes (40) at a particular site/locus were investigated by SINEs based PCR markers. Various SINE insertion polymorphisms were detected from different genotypes, where higher PCR bands amplified the SINE insertions, while lower bands amplified the pre-insertion sites (flanking regions). The analysis of Brassica SINEs copy numbers from 10 identified families revealed that around 860 and 1712 copies of SINEs were calculated from B. rapa and B. oleracea Whole-genome shotgun contigs (WGS) respectively. Analysis of insertion sites of Brassica SINEs revealed that the members from all 10 SINE families had shown an insertion preference in AT rich regions. The present

  2. Dicty_cDB: Contig-U12086-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12086-1 gap included 1101 3 5710254 5711336 PLUS 1 2 U12086 0 0 0 0 0 0 0 1 0 0 0 0 0 0 Show Contig...-U12086-1 Contig ID Contig-U12086-1 Contig update 2002.12.18 Contig sequence >Contig-U12086-1 (Contig-U12086-1Q) /CSM_Contig/Contig-U12086...ATCGGATTA Gap gap included Contig length 1101 Chromosome number (1..6, M) 3 Chromosome length 6358359 Start ...te 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U12086-1 (Contig-U12086-1Q) /CSM_Contig/Contig...Sequences producing significant alignments: (bits) Value Contig-U12086-1 (Contig-U12086-1Q) /CSM_Contig/Conti... 404 e-113 Contig

  3. Dicty_cDB: Contig-U09640-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09640-1 gap included 1368 2 219988 218635 MINUS 4 5 U09640 0 0 2 0 0 0 0 0 0 0 0 0 1 1 Show Contig...-U09640-1 Contig ID Contig-U09640-1 Contig update 2002. 9.13 Contig sequence >Contig-U09640-1 (Contig...-U09640-1Q) /CSM_Contig/Contig-U09640-1Q.Seq.d ACTGTTGGCCTACTGGNAAAAAATAGTGTAATAATAACCAACAAT...AACAACAACAACAAAAACAAAAACAAATTTTAATT AAATAAAATAATAATATAAAATATAATA Gap gap included Contig...ate 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U09640-1 (Contig-U09640-1Q) /CSM_Contig/Contig

  4. Dicty_cDB: Contig-U09694-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09694-1 gap included 1129 1 4027135 4026071 MINUS 3 4 U09694 2 0 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U09694-1 Contig ID Contig-U09694-1 Contig update 2002. 9.13 Contig sequence >Contig-U09694-1 (Contig-U09694-1Q) /CSM_Contig/Contig-U0969...TTAAATTAAAACAACAACAATTTCATAATATAAATAAT Gap gap included Contig length 1129 Chromosome number (1..6, M) 1 Chr...iklkqqqfklkqqqfhninn own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U09694-1 (Contig-U09694-1Q) /CSM_Contig/Contig...E Sequences producing significant alignments: (bits) Value Contig-U09694-1 (Contig-U09694-1Q) /CSM_Contig

  5. AcEST: CL1889Contig1 [AcEST

    Lifescience Database Archive (English)

    Full Text Available CL1889Contig1 491 2 Adiantum capillus-veneris contig: CL1889contig1 sequence. Link ...apillus-veneris contig: CL1889contig1 sequence. Link to clone list Link to clone list Clone ID BP919609 BP91

  6. Dicty_cDB: Contig-U15058-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15058-1 no gap 1987 4 4423139 4424727 PLUS 2 4 U15058 0 0 0 0 0 0 0 0 1 0 1 0 0 0 Show Contig...-U15058-1 Contig ID Contig-U15058-1 Contig update 2004. 6.11 Contig sequence >Contig-U15058-1 (Contig...-U15058-1Q) /CSM_Contig/Contig-U15058-1Q.Seq.d AAAAAAGGTTACTCACAAAGTTAAAGAAATCAATGAAAGATTTACCACCC...ACTCAAGGGGGTAGGAGAATAAAATCAACCGATTATCCAGGCNTTAAG CGACCTTTTTCCCAAAAAAAAAAGATGTTCAGAAAAT Gap no gap Contig len...srx*atffpkkkdvq k own update 2004. 6.23 Homology vs CSM-cDNA Query= Contig-U15058-1 (Contig-U15058-1Q) /CSM_Contig/Contig

  7. Dicty_cDB: Contig-U14745-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U14745-1 no gap 1780 6 3063854 3065579 PLUS 2 4 U14745 1 0 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U14745-1 Contig ID Contig-U14745-1 Contig update 2002.12.18 Contig sequence >Contig-U14745-1 (Contig...-U14745-1Q) /CSM_Contig/Contig-U14745-1Q.Seq.d GCGTCCGGACAATTTCAATAAAACAAATTTAAAAATAAATAATTTTTAAT...AATAAAATA ATTTAAATAAAAAAATATTTATTTTATTTTAAGATTAACAAAATAAAATA ATTTAAATAAAAAAATATTTATTTTAAAGA Gap no gap Contig...k*kniyfk own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U14745-1 (Contig-U14745-1Q) /CSM_Contig/Contig

  8. Dicty_cDB: Contig-U03367-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U03367-1 no gap 323 - - - - 2 1 U03367 0 0 0 0 0 0 0 0 0 0 0 0 1 1 Show Contig-U03367-1 Contig... ID Contig-U03367-1 Contig update 2001. 8.29 Contig sequence >Contig-U03367-1 (Contig-U03367-1Q) /CSM_Contig/Contig...TTGCGGGTTGGCAGGACTGTNGGNAGGCATGGNCATCGGTATNNTTGGAG ATGCTNGTGTGAGGGCGAATGCT Gap no gap Contig length 323 Chro...HLXXGLXCGLAGLXXGMXIGXXGDAXVRANA own update 2004. 6. 9 Homology vs CSM-cDNA Query= Contig-U03367-1 (Contig...-U03367-1Q) /CSM_Contig/Contig-U03367-1Q.Seq.d (323 letters) Database: CSM 6905 sequ

  9. Dicty_cDB: Contig-U16086-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U16086-1 gap included 1018 - - - - 3 4 U16086 0 0 0 0 0 1 1 0 0 0 1 0 0 0 Show Contig-U16086-1 Contig... ID Contig-U16086-1 Contig update 2004. 6.11 Contig sequence >Contig-U16086-1 (Contig-U16086-1Q) /CSM_Contig.../Contig-U16086-1Q.Seq.d AATTTGATGAAGTAGTAGTAGAGGTAAAACATGTATCAAAACATTATAAG ATTGCAGG...ACTTGGATATAAATGAAG GTAGCTCATCAAATTTTTCAAATAATGATAATTTTAAATCGGTAGATCAA ATTACCAATGACCTTAGCCGTATTTTAT Gap gap included Contig...KSVDQI TNDLSRIL own update 2004. 6.23 Homology vs CSM-cDNA Query= Contig-U16086-1 (Contig-U16086-1Q) /CSM_Contig/Contig

  10. Dicty_cDB: Contig-U13737-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13737-1 no gap 672 6 1762420 1761754 MINUS 1 1 U13737 0 1 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U13737-1 Contig ID Contig-U13737-1 Contig update 2002.12.18 Contig sequence >Contig-U13737-1 (Contig...-U13737-1Q) /CSM_Contig/Contig-U13737-1Q.Seq.d NNNNNNNNNNAAAATTAGAAAATGGTACAATTGTTTTTAGAGATATTTCA...AGAATAGAAGGAAAATAT AGATCAATGGGGTGGCACAACA Gap no gap Contig length 672 Chromosome...gwhn own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U13737-1 (Contig-U13737-1Q) /CSM_Contig/Contig

  11. Dicty_cDB: Contig-U06307-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U06307-1 no gap 637 6 29174 29801 PLUS 4 5 U06307 4 0 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U06307-1 Contig ID Contig-U06307-1 Contig update 2002. 9.13 Contig sequence >Contig-U06307-1 (Contig...-U06307-1Q) /CSM_Contig/Contig-U06307-1Q.Seq.d CCCGCGTCCGAATGCCTCGTATTTTACACACTATGCTCCGTGTGGGTAAT TTAG...ATAGTATTTTTATTTTATT CTTTTTCTTTTAAAAATTTTTTATATTGTCAACAATATAATCAAATAAAT GTATTTAATTATCGGGTATTAAAAAAAAAAAAAAAAA Gap no gap Contig...own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U06307-1 (Contig-U06307-1Q) /CSM_Contig/Contig-U063

  12. Dicty_cDB: Contig-U15541-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15541-1 gap included 2750 - - - - 634 1127 U15541 1 129 1 375 19 0 2 32 4 69 1 0 1 0 Show Contig...-U15541-1 Contig ID Contig-U15541-1 Contig update 2004. 6.11 Contig sequence >Contig-U15541-1 (Contig...-U15541-1Q) /CSM_Contig/Contig-U15541-1Q.Seq.d ATAATAAACGGTGAATACCTCGACTCCTAAATCGATGAAGACCGTAG...AAAAAT AAAAATAAAAATAAATAAATAATCATTTCATATTAATATTTTTTTTTATT TTTAAAAAAA Gap gap included Contig...ffyf*k own update 2004. 6.23 Homology vs CSM-cDNA Query= Contig-U15541-1 (Contig-U15541-1Q) /CSM_Contig/Contig

  13. Dicty_cDB: Contig-U06822-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U06822-1 no gap 468 3 438742 439211 PLUS 1 1 U06822 1 0 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U06822-1 Contig ID Contig-U06822-1 Contig update 2001. 8.30 Contig sequence >Contig-U06822-1 (Contig...-U06822-1Q) /CSM_Contig/Contig-U06822-1Q.Seq.d ATATTATTCTATTCACTCGTAATAATACATATAAATTGATATCAATCAGA AA...TGCTATTAAGACTTTGGAGCAAAAAAC TAACAAATCAATTCAAAA Gap no gap Contig length 468 Chromosome number (1..6, M) 3 Ch...*mmlklkeikllvllrlwskkltnqfk own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U06822-1 (Contig-U06822-1Q) /CSM_Contig/Contig

  14. Dicty_cDB: Contig-U01997-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U01997-1 gap included 886 2 1683026 1682230 MINUS 3 4 U01997 1 0 0 0 0 0 2 0 0 0 0 0 0 0 Show Contig...-U01997-1 Contig ID Contig-U01997-1 Contig update 2001. 8.29 Contig sequence >Contig-U01997-1 (Contig-U01997-1Q) /CSM_Contig/Contig-U01997...ATTGAAATAATATTTATTTATTTTTTTAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA Gap gap included Contig...nfkvfgieiifiyffkkkkkkkkkkkkkkkkkkk own update 2004. 6. 9 Homology vs CSM-cDNA Query= Contig-U01997-1 (Contig-U01997-1Q) /CSM_Contig.../Contig-U01997-1Q.Seq.d (896 letters) Database: CSM 6905 sequences; 5,674,871 total l

  15. Dicty_cDB: Contig-U13254-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13254-1 no gap 575 5 203798 203233 MINUS 1 1 U13254 0 0 0 0 0 0 0 1 0 0 0 0 0 0 Show Contig...-U13254-1 Contig ID Contig-U13254-1 Contig update 2002.12.18 Contig sequence >Contig-U13254-1 (Contig...-U13254-1Q) /CSM_Contig/Contig-U13254-1Q.Seq.d AAATAATTTATTTAATTTTAAAATTAATAGATAAAAAGATGGAAATGATA A...CATTTTAACATTATTGGATAAT GTCAATGATTGGCCAANNNNNNNNN Gap no gap Contig length 575 Chromosome number (1..6, M) 5 ...2004. 6.10 Homology vs CSM-cDNA Query= Contig-U13254-1 (Contig-U13254-1Q) /CSM_Contig/Contig-U13254-1Q.Seq.d

  16. Dicty_cDB: Contig-U13891-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13891-1 no gap 1355 6 799802 798446 MINUS 4 4 U13891 0 0 0 0 1 1 1 0 0 0 1 0 0 0 Show Contig...-U13891-1 Contig ID Contig-U13891-1 Contig update 2002.12.18 Contig sequence >Contig-U13891-1 (Contig...-U13891-1Q) /CSM_Contig/Contig-U13891-1Q.Seq.d TTTTAAAATATTTCAAAATTAGCGAGCACGCATTCGCATATAAATATATT ...ACAAATAAAAAAAAAAAATAAAAAAAATA ATTTA Gap no gap Contig length 1355 Chromosome numb...own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U13891-1 (Contig-U13891-1Q) /CSM_Contig/Contig-U138

  17. Dicty_cDB: Contig-U16093-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U16093-1 gap included 1020 2 4899973 4899063 MINUS 29 31 U16093 7 0 0 0 0 2 ...18 0 0 0 0 0 2 0 Show Contig-U16093-1 Contig ID Contig-U16093-1 Contig update 2004. 6.11 Contig sequence >Contig-U16093-1 (Contig...-U16093-1Q) /CSM_Contig/Contig-U16093-1Q.Seq.d TTTTTTTTTTTTTTTTTAATTTTTTTTTTTCATAAAACTT...AAAATTAAATT Gap gap included Contig length 1020 Chromosome number (1..6, M) 2 Chr...pdate 2004. 6.23 Homology vs CSM-cDNA Query= Contig-U16093-1 (Contig-U16093-1Q) /CSM_Contig/Contig-U16093-1Q

  18. Dicty_cDB: Contig-U06384-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U06384-1 no gap 660 5 3008439 3007779 MINUS 2 2 U06384 2 0 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U06384-1 Contig ID Contig-U06384-1 Contig update 2001. 8.30 Contig sequence >Contig-U06384-1 (Contig...-U06384-1Q) /CSM_Contig/Contig-U06384-1Q.Seq.d TGAAAAAATTAGAGACAACAAGTGGATCAGCACGTAAAGTATGGCGTTTA...AAATAAAAATTAATTTCC AAAAATAAAA Gap no gap Contig length 660 Chromosome number (1.....own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U06384-1 (Contig-U06384-1Q) /CSM_Contig/Contig-U063

  19. Dicty_cDB: Contig-U12545-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12545-1 gap included 1165 3 3275272 3276395 PLUS 1 2 U12545 0 1 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U12545-1 Contig ID Contig-U12545-1 Contig update 2002.12.18 Contig sequence >Contig-U12545-1 (Contig-U12545-1Q) /CSM_Contig/Contig-U12545...CGTTCTAAATCACTCATTAAAAGATTAAAAATTAAANAAGGTAATATC TCACGACNGCTNNCTCATACACACN Gap gap included Contig length 11...vliknlskrkerkis*klyqlkriqlsl vknwlklvlnhslkd*klxkvishdxxliht own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig...-U12545-1 (Contig-U12545-1Q) /CSM_Contig/Contig-U12545-1Q.Seq.d (1175 letters) Database: CSM 6905 s

  20. Dicty_cDB: Contig-U10823-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10823-1 gap included 1750 1 3559501 3561234 PLUS 85 124 U10823 0 5 0 30 1 0... 0 20 0 29 0 0 0 0 Show Contig-U10823-1 Contig ID Contig-U10823-1 Contig update 2002.12.18 Contig sequence >Contig-U10823-1 (Contig...-U10823-1Q) /CSM_Contig/Contig-U10823-1Q.Seq.d ACTGTTGGCCTACTGGTATTTTTGGTAGTGTGTTAAAA...CAACAAATAAAATTAAAATTA GTTATATTTTTTTTAAATTAAAAAAAAAAATAAAAAAAATAAATTATTTA TTAAATTTTT Gap gap included Contig ...4. 6.10 Homology vs CSM-cDNA Query= Contig-U10823-1 (Contig-U10823-1Q) /CSM_Contig/Contig-U10823-1Q.Seq.d (1

  1. Dicty_cDB: Contig-U13065-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13065-1 no gap 718 1 3561021 3561729 PLUS 1 1 U13065 0 0 0 0 0 0 0 0 0 1 0 0 0 0 Show Contig...-U13065-1 Contig ID Contig-U13065-1 Contig update 2002.12.18 Contig sequence >Contig-U13065-1 (Contig...-U13065-1Q) /CSM_Contig/Contig-U13065-1Q.Seq.d NNNNNNNNNNCAATCAAAGCAATCAATGGTAAATTAACTTTGTTACCATT ...TGATTCAACTCTCTCTG TTTCAAATTTACAACTTGCTTTAGATGAATCCTTTGAAGTTGATTTTGTA TTATATTAAAAATTATCA Gap no gap Contig...kny own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U13065-1 (Contig-U13065-1Q) /CSM_Contig

  2. Dicty_cDB: Contig-U08861-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U08861-1 gap included 1295 5 2877914 2879217 PLUS 1 2 U08861 0 0 0 0 1 0 0 0 0 0 0 0 0 0 Show Contig...-U08861-1 Contig ID Contig-U08861-1 Contig update 2002. 9.13 Contig sequence >Contig-U08861-1 (Contig-U08861-1Q) /CSM_Contig/Contig-U08861...CACATTATAAAGTACCAAATAAGTTATTAATTTTAGAAAATA AATTCCAAAGAATGCAATGTCTAAAGTTAATAAAAAAGAATACTAAAATA TTTTC Gap gap included Contig...k**iwsryccnhcl*kkqkttnef*r i*nql*tkistl*stk*vinfrk*ipknamskvnkkey*nif own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig...-U08861-1 (Contig-U08861-1Q) /CSM_Contig/Contig-U08861-1Q.Seq.d (1305 letters) Database: C

  3. Dicty_cDB: Contig-U06829-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U06829-1 no gap 449 5 4394444 4394893 PLUS 1 1 U06829 1 0 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U06829-1 Contig ID Contig-U06829-1 Contig update 2001. 8.30 Contig sequence >Contig-U06829-1 (Contig...-U06829-1Q) /CSM_Contig/Contig-U06829-1Q.Seq.d GTAAAAGAATGTAATGAAAATGAAAAAATTAATTTTATAATAAAATTATT ...ATGATTTAGAATTGGTACAATTAGTTTA Gap no gap Contig length 449 Chromosome number (1..6, M) 5 Chromosome length 50...04. 6.10 Homology vs CSM-cDNA Query= Contig-U06829-1 (Contig-U06829-1Q) /CSM_Contig/Contig-U06829-1Q.Seq.d (

  4. Dicty_cDB: Contig-U12073-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12073-1 gap included 912 2 2118980 2119867 PLUS 4 5 U12073 0 0 0 2 0 0 0 1 0 1 0 0 0 0 Show Contig...-U12073-1 Contig ID Contig-U12073-1 Contig update 2002.12.18 Contig sequence >Contig-U12073-1 (Contig...-U12073-1Q) /CSM_Contig/Contig-U12073-1Q.Seq.d CTGTTGGCCTACTGGNAATTGAAACAATTGTTTCAGCAAATATTA...AAGA Gap gap included Contig length 912 Chromosome number (1..6, M) 2 Chromosome length 8467578 Start point ...GPXSXDY*r own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U12073-1 (Contig-U12073-1Q) /CSM_Contig/Contig

  5. Dicty_cDB: Contig-U09615-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09615-1 gap included 1134 3 4459395 4458259 MINUS 1 2 U09615 0 0 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U09615-1 Contig ID Contig-U09615-1 Contig update 2002. 9.13 Contig sequence >Contig-U09615-1 (Contig-U09615-1Q) /CSM_Contig/Contig-U0961...TGCAAGATTAGAAAGATTAGAAAAAGATGCTATGCTAAAAATA Gap gap included Contig length 1134 Chromosome number (1..6, M) ...*wcnlyfrcre*emgkcn iefhiintrfkiwphrcidtighnvgicw**fnfecsfisleiqyrv**mgirfkyw*ww s*c*irpyfnnhafqyydyiwwskfwh*...4. 6.10 Homology vs CSM-cDNA Query= Contig-U09615-1 (Contig-U09615-1Q) /CSM_Contig/Contig-U09615-1Q.Seq.d (1

  6. Dicty_cDB: Contig-U04432-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U04432-1 no gap 600 1 1520578 1521098 PLUS 1 1 U04432 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U04432-1 Contig ID Contig-U04432-1 Contig update 2001. 8.29 Contig sequence >Contig-U04432-1 (Contig...-U04432-1Q) /CSM_Contig/Contig-U04432-1Q.Seq.d AATTATAATCAAAACAAATTAATAAAAAAAATGATTAATAGTTTTGTCTC ...TCAACAATATGAAATTGCAAGAT TAAATGGTTATGATAATGCCCATAATTTACCAAGAGATATTAGTCAAATA Gap no gap Contig length 600 Chro...ni**fkgrnsnknyfsrymgtiessti*n ckikwl**cp*ftkry*sn own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U04432-1 (Contig

  7. Dicty_cDB: Contig-U15828-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15828-1 gap included 1593 1 4184040 4182448 MINUS 12 19 U15828 0 0 6 0 0 0 ...0 0 2 0 4 0 0 0 Show Contig-U15828-1 Contig ID Contig-U15828-1 Contig update 2004. 6.11 Contig sequence >Contig-U15828-1 (Contig...-U15828-1Q) /CSM_Contig/Contig-U15828-1Q.Seq.d ATAAAAAAAATTAAAAAATTAAAAAAGTTATCCACCCAAGT...ACA AATATTATAACTGGTACTGCTACTGTTTCAATCCCTCAAAAAAATTTAAT TTATATTTTACCAAATTCAAATACAATTAATCAATCAACAATTACAATTA CAA Gap gap included Contig...SFNPANSDFSFSYNINTTITQPTQIYLNQDIYYPNGFTTNIITGTATVSIPQ KNLIYILPNSNTINQSTITIT own update 2004. 6.23 Homology vs CSM-cDNA Query= Contig

  8. Dicty_cDB: Contig-U01750-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U01750-1 no gap 811 3 3337090 3336279 MINUS 2 2 U01750 1 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U01750-1 Contig ID Contig-U01750-1 Contig update 2001. 8.29 Contig sequence >Contig-U01750-1 (Contig...-U01750-1Q) /CSM_Contig/Contig-U01750-1Q.Seq.d GGAAGTTGTAATAATAAAAAAATAAAAATAAAAATAAAAAAATAAAAAAA...GAATACCAAGGTGAAAGAATTTTTCAAAAACTTCCTCAA ATCAACACAAATTTCGAAAAATTAACAATTTGGGAAAAGAAAATCGTTTC AAATCTTTATT Gap no gap Contig...crncnciwsktl*tywiyskiinpi**i*ipr *knfsktssnqhkfrkinnlgkenrfksl own update 2004. 6. 7 Homology vs CSM-cDNA Query= Contig

  9. Dicty_cDB: Contig-U07021-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U07021-1 no gap 601 2 3862699 3862098 MINUS 1 2 U07021 1 0 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U07021-1 Contig ID Contig-U07021-1 Contig update 2001. 8.30 Contig sequence >Contig-U07021-1 (Contig...-U07021-1Q) /CSM_Contig/Contig-U07021-1Q.Seq.d AAAAAAACAAAATGAATAAATTTAATATTACATCATTATTTATTATTTTA...TTTAATATATTCAGAAGGAAATTC TTATTTACAACAAAATTTCCCATTACTTTCTTANTTAAANTCCGTTAAAA T Gap no gap Contig length 601 C...QACCRTTQLFINYADNSFLDSAGFSPFGKVISGFNNTLNFYGGYGEEPDQSLIYSE GNSYLQQNFPLLSXLXSVK own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig

  10. Dicty_cDB: Contig-U16108-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U16108-1 gap included 1456 4 1889609 1888449 MINUS 4 6 U16108 0 0 2 0 1 0 1 0 0 0 0 0 0 0 Show Contig...-U16108-1 Contig ID Contig-U16108-1 Contig update 2004. 6.11 Contig sequence >Contig-U16108-1 (Contig-U16108-1Q) /CSM_Contig/Contig-U1610...AAAATCA TAAAATCAAAAATTGTATAATTAAAATAAAAATAAAAAAAAAAACAAAAA TAAAAAAAAAAAACAA Gap gap included Contig length 1...DFLSQFYGELN QPSLNNLTENIITIDQSSFIPIGYTTITAGLNNFAYAYIPTSCKNDKSLCSIHVAFHGCL QTVATIGDNFYTKTGYNEIAETNNIIILYPQALET...---NYVNNDNIKTMFDIQSEHAFITNSFGNNCTYLGPDYINNCNFNAPWDFLSQFYGELN QPSLNNLTENIITIDQSSFIPIGYTTITAGLNNFAYAYIPTSCKNDKSLCSIHVAFHGCL QTVATIG

  11. Dicty_cDB: Contig-U13974-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13974-1 no gap 1782 1 1265322 1267105 PLUS 29 32 U13974 0 0 0 1 2 0 22 0 4 0 0 0 0 0 Show Contig...-U13974-1 Contig ID Contig-U13974-1 Contig update 2002.12.18 Contig sequence >Contig-U13974-1 (Contig...-U13974-1Q) /CSM_Contig/Contig-U13974-1Q.Seq.d AAGAGTTAAAACAAAAATAAAAAAATAAAATAAAAAAAAAAAATTAA...TAAAACAAATAA ACATTAAAATGATATTTAGGTTTTAAATTTAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA Gap no gap Contig...TTRKIYVYDNQNFFPIDNQGFD VDPAKRIYLNEKKTYHNYHFCMKMNTVFTYKGYEVFNFRGDDDVWVFINNKLVIDLGGLH SPIGTSVDTMTLGLTIG

  12. Dicty_cDB: Contig-U16008-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U16008-1 gap included 1557 5 1711154 1712676 PLUS 5 8 U16008 0 0 0 0 1 1 1 0 1 0 0 0 1 0 Show Contig...-U16008-1 Contig ID Contig-U16008-1 Contig update 2004. 6.11 Contig sequence >Contig-U16008-1 (Contig-U16008-1Q) /CSM_Contig/Contig-U16008... TAAGGTTTATGATTTTTGATTTTAGATTTTATATTTTATTTATTTTAATA AAAAAAAAAAAAAAAAA Gap gap included Contig length 1557 Ch...F LIFVHGSSTIIVLGIAIINFSISRIFERSKMLPAVTWIFNLIILWTCY--- ---PFGGFGARGPPSTIGYSRHTIGGMYGGHSPGPRLHLTGYLGIEPMNGKFLN...SSTIIVLGIAIINFSISRIFERSKMLPAVTWIFNLIILWTCY--- ---PFGGFGARGPPSTIGYSRHTIGGMYGGHSPGPRLHLTGYLGIEPMNGKFLNIGRTFR L

  13. Dicty_cDB: Contig-U11342-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U11342-1 gap included 2051 2 611517 609465 MINUS 4 7 U11342 0 2 1 1 0 0 0 0 0 0 0 0 0 0 Show Contig...-U11342-1 Contig ID Contig-U11342-1 Contig update 2002.12.18 Contig sequence >Contig-U11342-1 (Contig...-U11342-1Q) /CSM_Contig/Contig-U11342-1Q.Seq.d GTCAACATTAACATCATCATCATCATCATCACCATCTAGTAATAA...GAATTTGGTAATTTTAAAATCACTNATTAATATATTAAACAAAATTA TAAAAATAAAA Gap gap included Contig...EFFFIDRKSLLVNFP RGSICAQILKLIGNLYGSNDIIFKINTNNVSFFDGTIGANNSTNNSNSNQPMTPQQVVIK YLNPTARWKRREISNFEYLMTLNTIAGRTYN

  14. Dicty_cDB: Contig-U11195-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U11195-1 gap included 2858 2 4308456 4311316 PLUS 16 27 U11195 0 2 0 8 1 0 0... 3 0 2 0 0 0 0 Show Contig-U11195-1 Contig ID Contig-U11195-1 Contig update 2002.12.18 Contig sequence >Contig-U11195-1 (Contig...-U11195-1Q) /CSM_Contig/Contig-U11195-1Q.Seq.d AGCATTGGAACAAATCGAATTACGTGAAAAGATACCATTGTT...TATCACCTGCTCTTTATCCTTCAAATTTAAGT AATTCAACATTGGCCCAAAGAGTTACATGGATAAATAAATTATAAATAAT GTATAAAATCATTCTCTC Gap gap included Contig... EYREKIPLLDLPWGASKPWTLVDLRDDYDEDLMVRFYNELMLPNFPVKNELEPLSNFISA LSEERRESFNPHLSEVHVLLALRWPTDSSDLQPTIGAGIIFEYFSN

  15. Dicty_cDB: Contig-U01791-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U01791-1 no gap 527 2 7629792 7630319 PLUS 1 1 U01791 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U01791-1 Contig ID Contig-U01791-1 Contig update 2001. 8.29 Contig sequence >Contig-U01791-1 (Contig...-U01791-1Q) /CSM_Contig/Contig-U01791-1Q.Seq.d GTTTGATTATAATTTATATGAATGTGAAATTAGACAAGCATTATCAAATA ...TCGTTCCCTTATGATTTAAGAACAACTTT GAATAGTTACAGAAATGGTGAATTTAGTATTTATCAATAAATTTTTTTTT AAAGATTTATAATTAAAATAAAAAAAA Gap no gap Contig...SILWSIESIGSLIVSAQINDDRETMELLHRYQIPQKFLIPLF QILALIDQLEKDLSHQIELDKFTINRDYYFLKSFSNLIEPPLNCLGILKTSRPHFRIFKL VGKNMISQVLETIG

  16. Dicty_cDB: Contig-U09412-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09412-1 gap included 873 3 3953072 3953946 PLUS 1 2 U09412 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U09412-1 Contig ID Contig-U09412-1 Contig update 2002. 9.13 Contig sequence >Contig-U09412-1 (Contig...-U09412-1Q) /CSM_Contig/Contig-U09412-1Q.Seq.d ATTATCACAACTATTTTATAATAAACCAATTTTAAAGATTAAAGT...TGGTTCAATAAAAGAAATTAAATATAATTATCAATAAT AATAATAAATTAATTAATAAATTTAAATCAAAA Gap gap included Contig length 873 ...DCQCGFVSVVENNNNNNNNSDNENNENNENNENNE NNEDLEDFIPRKLLKKSSSTLQSRTYLVIYLGRRGILEIWGLKHRSREYFKTIG

  17. Dicty_cDB: Contig-U12357-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12357-1 gap included 1333 1 2827305 2828232 PLUS 5 6 U12357 0 1 1 2 0 0 1 0 0 0 0 0 0 0 Show Contig...-U12357-1 Contig ID Contig-U12357-1 Contig update 2002.12.18 Contig sequence >Contig-U12357-1 (Contig-U12357-1Q) /CSM_Contig/Contig-U12357...ATAAAATAAAATTTATTAATTTTCCAACT Gap gap included Contig length 1333 Chromosome numb...RYXEKKKXXXXDSXNXXXXXPXX XXLXXXXPXX--- ---QYEKMKLSGEKVDPTLDASIILGNRYLEKKKVTIGDSENYTITVPFSQILKNQKPLI IQRKTKGTL...-QYEKMKLSGEKVDPTLDASIILGNRYLEKKKVTIGDSENYTITVPFSQILKNQKPLI IQRKTKGTLYYSINLSYASLNPISKAIFNRGLNIKRTYYPVSNSNDVIY

  18. Dicty_cDB: Contig-U10996-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10996-1 gap included 3017 2 5488454 5485454 MINUS 41 76 U10996 0 3 0 24 1 0... 0 8 0 5 0 0 0 0 Show Contig-U10996-1 Contig ID Contig-U10996-1 Contig update 2002.12.18 Contig sequence >Contig-U10996-1 (Contig...-U10996-1Q) /CSM_Contig/Contig-U10996-1Q.Seq.d TGGCCTACTGGTAAAAAAAATTCTAATTTTATTAAAACCC...CTATTTATAATGTATTGTTAAG GCAAAAATAAAAAAAAAAGNAAAAAAA Gap gap included Contig length...LTTTA SSSQQQQQELGLAVLTIRQGYEFENIVKELLDEKKKIEIWSMKPNSKQQWELIKKGSPGN TQMFEDVLLNGNCEGSVMMALKVTREKGSIVFGISFGDATFKTIG

  19. Dicty_cDB: Contig-U12049-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12049-1 gap included 2563 4 3071598 3069091 MINUS 9 17 U12049 0 0 0 0 2 0 0... 1 4 1 1 0 0 0 Show Contig-U12049-1 Contig ID Contig-U12049-1 Contig update 2002.12.18 Contig sequence >Contig-U12049-1 (Contig...-U12049-1Q) /CSM_Contig/Contig-U12049-1Q.Seq.d TAATGAAGGTAGTAATAATAATATAGTTGAAGCATCAAAAGA...TATCATTTAAACTGAAAAAAGTC CAAAAGATTTATGCAATGATTGCTGCGAATATGCTGCAACTTGTTCTCAT TAAAAATAAACAAAAAAATAATA Gap gap included Contig...disngqcvyseiidcgsssienss nqesssdidittastlgstiastigstigltstttttttsqttgtpttppqtvseipisl astistspvsdegtiastiatt

  20. Dicty_cDB: Contig-U13894-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13894-1 no gap 1550 2 2081463 2079913 MINUS 30 31 U13894 1 0 15 0 9 1 1 0 1 1 1 0 0 0 Show Contig...-U13894-1 Contig ID Contig-U13894-1 Contig update 2002.12.18 Contig sequence >Contig-U13894-1 (Contig...-U13894-1Q) /CSM_Contig/Contig-U13894-1Q.Seq.d CTTTTTGATTGTATAATTGAAAAAAAAAAAAAAAAAAAAAAAAAAA...TAAATTAAATAATTAAAAAAAACAAAAAAATTAAGTGAAAATCAAAAAA Gap no gap Contig length 1550 Chromosome number (1..6, M) ...V*kkkkikk*k*sk*fklnn*kkqkn*vkikk own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U13894-1 (Contig

  1. Dicty_cDB: Contig-U15462-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15462-1 no gap 546 4 3384206 3383661 MINUS 2 2 U15462 0 0 2 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U15462-1 Contig ID Contig-U15462-1 Contig update 2004. 6.11 Contig sequence >Contig-U15462-1 (Contig...-U15462-1Q) /CSM_Contig/Contig-U15462-1Q.Seq.d CTTTAGATTGGGGNTCAAGAAAAATATTGAAGTATTTGGTGGTGATAAGA...ATTCGATTCACTATCTTATA Gap no gap Contig length 546 Chromosome number (1..6, M) 4 Chromosome length 5430582 St...VMKLGFEVKDLITNDPKCDLFDSLS Y own update 2004. 6.23 Homology vs CSM-cDNA Query= Contig-U15462-1 (Contig

  2. Dicty_cDB: Contig-U09720-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09720-1 gap included 1323 2 5906974 5908260 PLUS 1 2 U09720 0 0 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U09720-1 Contig ID Contig-U09720-1 Contig update 2002. 9.13 Contig sequence >Contig-U09720-1 (Contig-U09720-1Q) /CSM_Contig/Contig-U09720...ATNATTATTATAAAAATTT Gap gap included Contig length 1323 Chromosome number (1..6, ...QLEAEDIVKQSQLVRNTLLSILNKLFSNY NNSNETTATTTIGQDQEKLSTLKNQREIIAQSLKIXKKL*linqxll*kf ...AEMFDIDSRNNHAIENDGRLDDA LVCSVGIALAPQSIFQSWKSMSEHKREKYFEQLEAEDIVKQSQLVRNTLLSILNKLFSNY NNSNETTATTTIGQDQEKLSTLK

  3. Dicty_cDB: Contig-U09379-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09379-1 gap included 899 2 1392012 1392912 PLUS 1 2 U09379 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U09379-1 Contig ID Contig-U09379-1 Contig update 2002. 9.13 Contig sequence >Contig-U09379-1 (Contig...-U09379-1Q) /CSM_Contig/Contig-U09379-1Q.Seq.d AAAAATTTTTTAAACTAAAAAATAAAAAAAATAAATAAAAAAAAA...TTTAAAAATAATAATAAAAGTGAATATTATAATATTAT AATCTTTTTGGTATAATTGAAAAAGATCAATAATATATTAAAATTTCCAA AAAAAAAAA Gap gap included Contig...VSVCRAYATETATIENKTQIMGKMSGAQGAGFVLGPGIGFLLNFCNFTIG--- ---INNK******sn*finykl***f*kikqphfknlkiiikvniiil*sfwyn

  4. Dicty_cDB: Contig-U15566-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15566-1 gap included 1830 4 3730704 3729599 MINUS 4 8 U15566 0 0 1 0 1 0 0 0 2 0 0 0 0 0 Show Contig...-U15566-1 Contig ID Contig-U15566-1 Contig update 2004. 6.11 Contig sequence >Contig-U15566-1 (Contig-U15566-1Q) /CSM_Contig/Contig-U1556...CAAGATCCAA TGGAATTTTAATAATAAATAAGAATAATAAAAAAAAAAAA Gap gap included Contig length 1830 Chromosome number (1...ITLTPSEDIEKKLKEI QDENLSNSEIWFAVKSYLEDNNLKEHLYNLVFHYTMPRIDEPVTIGLDHLGNVLVSNR*c tflvvvvvytfgcriephni*qerivlqf*...asilnhirvelsqnqipilkrsfdqillphfekc iieeqqiftnekqrknflsllpisykrqdrkipltpsediekklkeiqdenlsnseiwfa vksylednnlkehlynlvfhytmpridepvtig

  5. Dicty_cDB: Contig-U04334-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U04334-1 no gap 399 4 3746420 3746021 MINUS 3 3 U04334 0 0 0 0 0 0 3 0 0 0 0 0 0 0 Show Contig...-U04334-1 Contig ID Contig-U04334-1 Contig update 2001. 8.29 Contig sequence >Contig-U04334-1 (Contig...-U04334-1Q) /CSM_Contig/Contig-U04334-1Q.Seq.d CAAAAAAAAAAAAGTAAAACAATAAATTATATAAAAAAAATAAAAAAAAT...CTAATTTCA AACAATATCAATAAAATGTTATATAATTACTATTAAAATGAAAAAAAAA Gap no gap Contig len...ce QKKKSKTINYIKKIKKMSIINTISKLSLSNSLKSNITIGNLNGTTVNNYTHNETSSKFTE FFYKII*qnkrwf*kvkelnkkkrkkdyiissfcklysiyfvfs

  6. Dicty_cDB: Contig-U10335-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10335-1 no gap 1353 2 2769724 2768368 MINUS 3 6 U10335 0 0 2 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U10335-1 Contig ID Contig-U10335-1 Contig update 2002. 9.13 Contig sequence >Contig-U10335-1 (Contig...-U10335-1Q) /CSM_Contig/Contig-U10335-1Q.Seq.d ATTTTTTTTCTAAATATATAAAAAATAATAATAATAATAATAATATAAT...AAACATAATAAAACAAAAGATAAAAATAAAA ACA Gap no gap Contig length 1353 Chromosome numb...SSLATNNNINNNKRITIPDNH SNNPDKLLEIQLINKIFDISKAFDGKSNNLVSSFQNCTNNNNNNNNNTDNNNNNNISNNN NNNNVPTLQPLSFNNRNNLVNGNISSSSSSNSSNNNIGSSNSNNVTIG

  7. Dicty_cDB: Contig-U12399-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12399-1 gap included 1358 3 4712677 4711450 MINUS 1 2 U12399 0 1 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U12399-1 Contig ID Contig-U12399-1 Contig update 2002.12.18 Contig sequence >Contig-U12399-1 (Contig-U12399-1Q) /CSM_Contig/Contig-U1239...GAAGATGATATTAGTCTGAGGAAGATATTCTTAAAGA ATTTAACAAATGTTAACA Gap gap included Contig ...*e iekkklnyl*eqkvkyqknhqkimiq*enxmks*LQIYHXFAXLIGEPIPNNDXXX--- ---XXXRHVIWKLYEEITIGLKRTISITXKRESCKSHYLANCIMH...kkklnyl*eqkvkyqknhqkimiq*enxmks*LQIYHXFAXLIGEPIPNNDXXX--- ---XXXRHVIWKLYEEITIGLKRTISITXKRESCKSHYLANCIMHVYWRL

  8. Dicty_cDB: Contig-U11404-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U11404-1 gap included 1618 6 1729583 1727965 MINUS 11 19 U11404 0 6 1 1 0 2 ...0 1 0 0 0 0 0 0 Show Contig-U11404-1 Contig ID Contig-U11404-1 Contig update 2002.12.18 Contig sequence >Contig-U11404-1 (Contig...-U11404-1Q) /CSM_Contig/Contig-U11404-1Q.Seq.d ATTTTAAGAGTTTTAATTTTAATAACTATACTTTTAATAAA...TTTTTCTTTTGAACCAGAAAAAAAAA Gap gap included Contig length 1618 Chromosome number ...AGARMLASLATDKLSNVIYLDVSENDFGDEGVSVICDGFVGNSTIKKLILNGNFKQ SK--- ---YEKITIGLDSVFKDLILEESQAQNEASGATPIPDSPVPTRSP

  9. Dicty_cDB: Contig-U09569-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09569-1 gap included 1424 5 3658944 3660352 PLUS 8 14 U09569 0 0 8 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U09569-1 Contig ID Contig-U09569-1 Contig update 2002. 9.13 Contig sequence >Contig-U09569-1 (Contig-U09569-1Q) /CSM_Contig/Contig-U0956...TTAAAAA TAAAATAAATATAAAATAAAATAAAAATTAACAA Gap gap included Contig length 1424 Chromosome number (1..6, M) 5...NQTFQQKYYVNDQYYNYKNGGPIILYINGEGPVSSPPYSSDDGVVIYAQA LNCMIVTLEHRFYGESSPFSELTIENLQYLSHQQALEDLATFVVDFQSKLVGAGHIVTIG...YLSHQQALEDLATFVVDFQSKLVGAGHIVTIG GSYSGALSAWFRIKYPHITVGSIASLGVVHSILDFTAFDAYVSYA---

  10. Dicty_cDB: Contig-U15306-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15306-1 no gap 2452 3 3887051 3889342 PLUS 54 91 U15306 0 0 0 49 4 1 0 0 0 0 0 0 0 0 Show Contig...-U15306-1 Contig ID Contig-U15306-1 Contig update 2004. 6.11 Contig sequence >Contig-U15306-1 (Contig...-U15306-1Q) /CSM_Contig/Contig-U15306-1Q.Seq.d AAGCATAAACGGTGAATACCTCGACTCCTAAATCGATGAAGACCGTA...TTTTAGAACTTCAAAAAATAGTAC AAATTTTTTCAAATTAAGATAAAAAAAATAAAACAAAAATTAATTTAAAA CA Gap no gap Contig length 2452...*naagtgkgeegrt*hkslpywlapqvkgsvmprggqghygasrggrkhmgidfssivg qdivapisgkvvnfkgartkypmlqlypskkftefdylqmlyvhppvginmgasyqvsvg dtig

  11. Dicty_cDB: Contig-U14772-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U14772-1 no gap 665 1 1988279 1987624 MINUS 1 1 U14772 0 1 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U14772-1 Contig ID Contig-U14772-1 Contig update 2002.12.18 Contig sequence >Contig-U14772-1 (Contig...-U14772-1Q) /CSM_Contig/Contig-U14772-1Q.Seq.d AAAAACAATAACCATCGTTTTTTATTTTTATTTTCAAAATATGGATTTAA...AAATTAATGAAGAAAAAA AAGTAANNNNNNNNN Gap no gap Contig length 665 Chromosome number...DADTTISFLSSQNLSQLSIIKNLVNGKTIG DKKVIVDFYDFKKVIPTPTPIPTPTPPTKTQEESNKKIKLTNEKPKEKKP

  12. Dicty_cDB: Contig-U11141-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U11141-1 gap included 2122 2 1113359 1111236 MINUS 6 12 U11141 0 1 0 2 0 0 0... 1 0 2 0 0 0 0 Show Contig-U11141-1 Contig ID Contig-U11141-1 Contig update 2002.12.18 Contig sequence >Contig-U11141-1 (Contig...-U11141-1Q) /CSM_Contig/Contig-U11141-1Q.Seq.d AAAAAACAATCTTAAAACACACACACACTCAACACACTATCA...AAATCAAAATCAAAATCAAA ATAATAATAATTATAATAATAGCTATAATAAT Gap gap included Contig length 2122 Chromosome number ...HNYFGKVSRGIVSLSDYKYYGYLRSVHLIGYE QHEEELIKTIKSLPVGVSTLELSGHLNKIIFKEGSL--- ---DDSTIGAILNSFSSSSSRETFPRSVESLHLNI

  13. Dicty_cDB: Contig-U13202-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13202-1 no gap 1083 4 1301578 1302630 PLUS 41 45 U13202 8 0 13 0 0 2 16 0 2 0 0 0 0 0 Show Contig...-U13202-1 Contig ID Contig-U13202-1 Contig update 2002.12.18 Contig sequence >Contig-U13202-1 (Contig...-U13202-1Q) /CSM_Contig/Contig-U13202-1Q.Seq.d ACTGTTGGCCTACTGGGATTTTCTGCAGTAATAATAAAATCAAATA...TTTGTAATTTTAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA Gap no gap Contig len...kvgqfirvprgaqpaqtskftlmih*gvkshffsmlqpnwpncttigpvq nqarcgsllgfwvlqnqlltvlcihnnekcsikfygygyl**nlitvvkvvmpslhg

  14. Dicty_cDB: Contig-U15062-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15062-1 no gap 1282 3 4759691 4758480 MINUS 5 6 U15062 0 0 0 0 0 0 1 0 1 1 1 0 1 0 Show Contig...-U15062-1 Contig ID Contig-U15062-1 Contig update 2004. 6.11 Contig sequence >Contig-U15062-1 (Contig...-U15062-1Q) /CSM_Contig/Contig-U15062-1Q.Seq.d CAAATATTTAAATAAATTTAACATTATAAAAACAAAAATTAATAAAGTA...TTTTCAATAGATAATAATAAAAAAAAAAAAAAAAAAA AAAAAAAAATTATTTTAAAAATAAAAAAAAAA Gap no gap Contig length 1282 Chromos...KMSHNHNSNNNKTTTTTTNDSGSAIANGINLEKILADVKECN YNLVNSITATEAIQKEKESLENELSTKGTIGDGKRIKKLQYNISLQTETLMKTLMKLDSL SITG

  15. Dicty_cDB: Contig-U09432-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09432-1 gap included 993 5 741953 740957 MINUS 1 2 U09432 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U09432-1 Contig ID Contig-U09432-1 Contig update 2002. 9.13 Contig sequence >Contig-U09432-1 (Contig...-U09432-1Q) /CSM_Contig/Contig-U09432-1Q.Seq.d AGGAAATATTTTAATATTTTATTTTTTTTATTTTTTTTATTTATTA...TTTTGGTGGTAAATATAGATATGAAAATAAA CAAATCCAAATTTTAGTTGAATTAAATTTCACTGATACCACTCAAAAAAA AAA Gap gap included Contig...iy*sni*SVKFGICYNYAKYHLSICNHTIYPGSDNQSLYFKLSSIFDS PTILSGYAVIYNSLDQIITNGTYNLILDEDVPTIG

  16. Dicty_cDB: Contig-U15005-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15005-1 no gap 2023 1 1509217 1507616 MINUS 2 4 U15005 0 0 0 0 1 0 1 0 0 0 0 0 0 0 Show Contig...-U15005-1 Contig ID Contig-U15005-1 Contig update 2004. 6.11 Contig sequence >Contig-U15005-1 (Contig...-U15005-1Q) /CSM_Contig/Contig-U15005-1Q.Seq.d AATTTTCTTTTCTTTTTAAAACTTAAGTACCATATGGCAGAATATACAC...ATAATAACGATATTAA Gap no gap Contig length 2023 Chromosome number (1..6, M) 1 Chro...HMAEYTHYFIQYNLTDIFYEDVNIEKYSCSICYESVYKKEIYQCKEIHWF CKTCWAESLFKKKECMICRCIVKSISELSRNRFIEQDFLNIKVNCPNSFKYIDENKNNNN KIKDLENGCKDIITIG

  17. Dicty_cDB: Contig-U10406-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10406-1 no gap 661 4 1621526 1620875 MINUS 1 1 U10406 0 0 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U10406-1 Contig ID Contig-U10406-1 Contig update 2002. 9.13 Contig sequence >Contig-U10406-1 (Contig...-U10406-1Q) /CSM_Contig/Contig-U10406-1Q.Seq.d NNNNNNNNNNATAAGTAAAAGAGTTATTGGTCCAAGATTAGATGATGACA...TACAAATAAGTAAAGTTG ATAAAGAACAT Gap no gap Contig length 661 Chromosome number (1....cid sequence XXXISKRVIGPRLDDDNNNNDNDKFNNNNKKAIGPSRIGPTIGPSIGPSRYNTNNNDSNH NSNNDDDDDSSEEDEEDTKSEWERVRNMIENNKN

  18. Dicty_cDB: Contig-U16457-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U16457-1 no gap 1065 3 996438 997502 PLUS 6 5 U16457 0 0 1 0 1 0 2 0 0 0 0 0 1 1 Show Contig...-U16457-1 Contig ID Contig-U16457-1 Contig update 2004. 6.11 Contig sequence >Contig-U16457-1 (Contig...-U16457-1Q) /CSM_Contig/Contig-U16457-1Q.Seq.d ACAATTGGTGTTGCTGCTCTATTCGGTCTTCCAGCTATGGCACGTTCCGC A...TTTAACAAGATTGGAAGAC CAAAAAGAAAAAAAA Gap no gap Contig length 1065 Chromosome numb... Translated Amino Acid sequence TIGVAALFGLPAMARSAAMSLVFLIPFMWIVFSVHYPINSVVADICMSYNNNTGSIEQQL ANYTNPIVSEIFGTC

  19. Dicty_cDB: Contig-U04768-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U04768-1 no gap 762 6 2607190 2606476 MINUS 3 3 U04768 1 0 0 0 0 0 2 0 0 0 0 0 0 0 Show Contig...-U04768-1 Contig ID Contig-U04768-1 Contig update 2001. 8.29 Contig sequence >Contig-U04768-1 (Contig...-U04768-1Q) /CSM_Contig/Contig-U04768-1Q.Seq.d AAAGTCTTATTTGTTTAAAAAAAAAAAAAAAAAATAAAAAACTTTATTCT...AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAA Gap no gap Contig length 762 Chromosome number (1..6, M...lknf*KMVMMHDEYISPTKLQFGFMIAVAFLG TIGVMGFCQNVFDILLGVISILSIYIGMRGVWKRKKRWLFVFMWLMMGMGFLHLVSFAVV VILHHKNPTKNTVF

  20. Dicty_cDB: Contig-U13326-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13326-1 no gap 240 6 1728259 1728019 MINUS 1 1 U13326 0 0 0 0 0 1 0 0 0 0 0 0 0 0 Show Contig...-U13326-1 Contig ID Contig-U13326-1 Contig update 2002.12.18 Contig sequence >Contig-U13326-1 (Contig...-U13326-1Q) /CSM_Contig/Contig-U13326-1Q.Seq.d AATGACTCAACAAATCTTGGAGAGTATGCAAAATACTTTCCAATCTATGG...CCTCGTTAAAGGTGCTGGTGC TGAATTAAGTTCTCGTGCTCATGAGTGTTTCATTAGTGCCTTGGATATTG CCTCTGATTATACCTACGAGAAAATTACCATTGGCTTGGA Gap no gap Contig...FQSMDGPTIKRLATTIQYGSKDVDEQQIHSTLVKGAGAELSSRAHECF ISALDIASDYTYEKITIGL Translated Amino Acid sequence (All Fra

  1. Dicty_cDB: Contig-U15036-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15036-1 no gap 3102 - - - - 16 24 U15036 0 5 1 2 0 1 1 2 3 1 0 0 0 0 Show Contig-U15036-1 Contig... ID Contig-U15036-1 Contig update 2004. 6.11 Contig sequence >Contig-U15036-1 (Contig-U15036-1Q) /CSM_Contig.../Contig-U15036-1Q.Seq.d ATCTTTTTAAAAAAAAAAAAAATAAAACAAATAAAGAAAGAAATTAAATA AATATTAATAAT...AATTTAAAATTAATTTTTAG AT Gap no gap Contig length 3102 Chromosome number (1..6, M) - Chromosome length - Star...RKKQTDAVAEIPVD NPTSTSTTTTTTTTSNATSILSAIHTSTINSNTSSHNNNQQQQQQQQTILPTQPTIINTP TPVRSSVSRSQSPLPSGNGSSIISQEKTPLSTFVLSTCRPSALVLPPGSTIG

  2. Dicty_cDB: Contig-U13455-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13455-1 no gap 750 2 945431 946181 PLUS 2 2 U13455 0 0 0 0 0 0 1 0 0 1 0 0 0 0 Show Contig...-U13455-1 Contig ID Contig-U13455-1 Contig update 2002.12.18 Contig sequence >Contig-U13455-1 (Contig...-U13455-1Q) /CSM_Contig/Contig-U13455-1Q.Seq.d TAATTCCAACAACATCAACAAATTCAACAACAATTACAAATGCAACAACA TA...CAATAATAATAATAATAACAATAACAATAATAATAA Gap no gap Contig length 750 Chromosome number (1..6, M) 2 Chromosome l...KMLEYIQKNPSATRPSCIQVVQQPSSKVVWKNRRLDTPFKVKVDLKAASAMA GTNLTTASVITIGIVTDHKGKLQIDSVENFTEAFNGQGLAVFQGLKMTKGTWGKE

  3. Dicty_cDB: Contig-U14400-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U14400-1 no gap 1939 4 4053811 4055750 PLUS 5 7 U14400 0 0 2 0 0 1 0 0 1 1 0 0 0 0 Show Contig...-U14400-1 Contig ID Contig-U14400-1 Contig update 2002.12.18 Contig sequence >Contig-U14400-1 (Contig...-U14400-1Q) /CSM_Contig/Contig-U14400-1Q.Seq.d CATTACCAATAAATTTATCTGCTTCAACACCTATACCAATGACATCACCA...AGGTTTATAAAATATATTGAATCAATTTTTGATTAAA Gap no gap Contig length 1939 Chromosome number (1..6, M) 4 Chromosome...HQQQQSKTVTSSTTSTETTTTVESSTTSTTITTSTSTPIPTTITTTPTTPI NSDNSWTFTSFSPKVFKEIRRYYGVDEEFLKSQENSSGIVKFLEVQTIGRSGSFFY

  4. Dicty_cDB: Contig-U10709-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10709-1 gap included 1228 4 757921 759149 PLUS 2 3 U10709 0 0 0 1 1 0 0 0 0 0 0 0 0 0 Show Contig...-U10709-1 Contig ID Contig-U10709-1 Contig update 2002.12.18 Contig sequence >Contig-U10709-1 (Contig...-U10709-1Q) /CSM_Contig/Contig-U10709-1Q.Seq.d ATTAGTAACACAGACATTGGTAACACGAATTTATTACCACCATCAC...ATGTTTAGGTGATAATACTCATAGTCAA Gap gap included Contig length 1228 Chromosome number (1..6, M) 4 Chromosome le...LDIFLIQIGAAIMGSNQFIQHAINIYNLEDWFEIEPFNG SLNKSTEGTPTTTSSQPPSTPSKQTSLRNSAGTVPTTPSQSSSTIVPTLDTIGETTTTTT TTATTTT

  5. Dicty_cDB: Contig-U10837-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10837-1 gap included 1996 2 5280203 5282199 PLUS 8 9 U10837 0 3 0 3 1 0 0 1 0 0 0 0 0 0 Show Contig...-U10837-1 Contig ID Contig-U10837-1 Contig update 2002.12.18 Contig sequence >Contig-U10837-1 (Contig-U10837-1Q) /CSM_Contig/Contig-U10837...TCNT Gap gap included Contig length 1996 Chromosome number (1..6, M) 2 Chromosome...YSSKGYFKHLDSFLSEISVP LCESVSKSSTLVFSLLFNMLEYSTADYRYPILKILTALVKCGVNPAETKSSRVPEWFDTV TQFLNDHKTPHYIVSQAIRFIEITSGNSPTSLITIDNASLKPSKNTIG...SSRVPEWFDTV TQFLNDHKTPHYIVSQAIRFIEITSGNSPTSLITIDNASLKPSKNTIGTKKFSNKVDRGT LLAGNYFNKVLVDTVPGVRSSVNSLTKSIYSTTQI

  6. Dicty_cDB: Contig-U12765-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12765-1 no gap 1256 6 1467819 1466563 MINUS 3 3 U12765 0 0 0 2 0 0 0 0 0 0 1 0 0 0 Show Contig...-U12765-1 Contig ID Contig-U12765-1 Contig update 2002.12.18 Contig sequence >Contig-U12765-1 (Contig...-U12765-1Q) /CSM_Contig/Contig-U12765-1Q.Seq.d CAAAAAGGAAACACTAGTCCAGTTAGAACCCCAAATACTACTACTACTA...TATCGATTGTTCAAAGGTTTCAATGGTTGATACTAAT TTCTTA Gap no gap Contig length 1256 Chromosome number (1..6, M) 6 Chr...EYQEDLTPIFEPIFLDLIKIL STTTLTGNVFPYYKVFSRLVQFKAVSDLVGTLQCWNSPNFNGKEMERNTILGSLFSPSSA SDDGSTIKQYFSNASTMNKNTIGDA

  7. Dicty_cDB: Contig-U09480-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09480-1 gap included 705 5 4277527 4276817 MINUS 1 2 U09480 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U09480-1 Contig ID Contig-U09480-1 Contig update 2002. 9.13 Contig sequence >Contig-U09480-1 (Contig-U09480-1Q) /CSM_Contig/Contig-U09480...AAAAAAAAAA Gap gap included Contig length 705 Chromosome number (1..6, M) 5 Chromosome length 5062330 Start ...**********imaeinienpfhvntkidvntfvnqirgipngsrcdftnsvvkhf sslgynvfvchpnhavtgpyaklhcefrntkfstig...srcdftnsvvkhf sslgynvfvchpnhavtgpyaklhcefrntkfstigydvyiiargrkvtatnfgdggydn wasggh

  8. Dicty_cDB: Contig-U09345-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09345-1 gap included 1216 4 3361857 3360637 MINUS 4 5 U09345 1 0 1 0 0 0 2 0 0 0 0 0 0 0 Show Contig...-U09345-1 Contig ID Contig-U09345-1 Contig update 2002. 9.13 Contig sequence >Contig-U09345-1 (Contig-U09345-1Q) /CSM_Contig/Contig-U0934...AATGGTATTTTAAAAATAA Gap gap included Contig length 1216 Chromosome number (1..6, M) 4 Chromosome length 5430...ALFTSSNPKYGCSGCVQLKNQIESFSLSYEPYL NSAGFLEKPIFIVILEVDYNMEVFQTIGLNTIPHLLFIPSGSKPITQKGYAYTGFEQTSS QSISDFIYSHSKI...LLALFTSSNPKYGCSGCVQLKNQIESFSLSYEPYL NSAGFLEKPIFIVILEVDYNMEVFQTIGLNTIPHLLFIPSGSKPI

  9. Dicty_cDB: Contig-U15323-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15323-1 no gap 1230 2 3760829 3759661 MINUS 76 108 U15323 2 0 21 0 9 4 0 0 22 4 13 0 1 0 Show Contig...-U15323-1 Contig ID Contig-U15323-1 Contig update 2004. 6.11 Contig sequence >Contig-U15323-1 (Contig-U15323-1Q) /CSM_Contig/Contig-U1532...TAAAATTTAAGCAATCATTCCAT Gap no gap Contig length 1230 Chromosome number (1..6, M) 2 Chromosome length 846757...VGLLVFFNILYCTPLYYILFFFKMNSKFADELIATAKAIVAPGKGILAADESTNTIGAR FKKINLENNEENRRAYRELLIGTGNGVNEFIGGIILYEETLYQKMADG...MNSKFADELIATAKAIVAPGKGILAADESTNTIGAR FKKINLENNEENRRAYRELLIGTGNGVNEFIGGIILYEETLYQK

  10. Dicty_cDB: Contig-U14236-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U14236-1 no gap 660 2 5626866 5627517 PLUS 1 1 U14236 0 0 0 0 0 0 0 0 0 1 0 0 0 0 Show Contig...-U14236-1 Contig ID Contig-U14236-1 Contig update 2002.12.18 Contig sequence >Contig-U14236-1 (Contig...-U14236-1Q) /CSM_Contig/Contig-U14236-1Q.Seq.d NNNNNNNNNNGAAAATCAAAAATTAAAAAGTAACATTACTCTATTATATG ...CAATCACTCCAATTAAA CCATAGTTTT Gap no gap Contig length 660 Chromosome number (1..6...MGSEKSPFNLKQYPSLVKIDDVS QCPKYKCLKRKSLNEWTIGLNIPAFCRESRYDCSLCYKYIECSFSDEF*tnlsalfv

  11. Dicty_cDB: Contig-U10291-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U10291-1 no gap 932 4 3203354 3204286 PLUS 2 2 U10291 0 0 1 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U10291-1 Contig ID Contig-U10291-1 Contig update 2002. 9.13 Contig sequence >Contig-U10291-1 (Contig...-U10291-1Q) /CSM_Contig/Contig-U10291-1Q.Seq.d GTAAAGGTTTTATGTGTATATTTTTTAATGACCTTTTCGAATTAGTTTCA ...CAAAATAGATTAAATCTTAGTTACTCTCATGC TAATCAATATGTTGAGAGTTTTCCATCACAAATGTTATCAACAATTGCAA AATTCATTAGTTTCTTATTTGGTT...SLMYSL FNYIFDENGIIKSEFQDPTQRKRLSRGLSRRFMTIGILGLFTTPFIFFFLLINFFFEYAE ELKNRPGSLFSREWSPLARWEFRELNELPHYFQNRLNLSY

  12. Dicty_cDB: Contig-U07545-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U07545-1 no gap 439 3 4955441 4955098 MINUS 1 1 U07545 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U07545-1 Contig ID Contig-U07545-1 Contig update 2002. 5. 9 Contig sequence >Contig-U07545-1 (Contig...-U07545-1Q) /CSM_Contig/Contig-U07545-1Q.Seq.d ATATGAAATACTTAATACTTTTAATTTTCCTTTTAATAAATTCAACTTTT...ATGTTTCAGAGTCTGGTTG Gap no gap Contig length 439 Chromosome number (1..6, M) 3 Chromosome length 6358359 Sta...e MKYLILLIFLLINSTFGNIQFSKYISNSGNDNNSCGSFTSPCKTIGYSIQQIKSYEYNQY SIEILLDSGNYYSQNPINLYGLNISISAQNSNDLVQFLVPNINGT

  13. Dicty_cDB: Contig-U15359-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15359-1 no gap 1420 6 1334613 1333192 MINUS 3 3 U15359 0 1 0 0 1 0 0 0 0 1 0 0 0 0 Show Contig...-U15359-1 Contig ID Contig-U15359-1 Contig update 2004. 6.11 Contig sequence >Contig-U15359-1 (Contig...-U15359-1Q) /CSM_Contig/Contig-U15359-1Q.Seq.d TATAGCATCATTTGCAAAGTTTAGTTTAAAGAAAAAAGAGAAAGCGGAA...A AAAAAAACTGGAAAAATTAA Gap no gap Contig length 1420 Chromosome number (1..6, M) 6 Chromosome length 3595308...SSGF DEPSLAVMYVDRALKGASAVQTIGRLSRVSKGKNACYIVDFVNTRREISDAFGQYWRETC LKGETRKTVLELKLNRVLGKLSAIEPLANGRLEESVEYILRD

  14. Dicty_cDB: Contig-U09581-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09581-1 gap included 1235 1 2575525 2576764 PLUS 1 2 U09581 0 0 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U09581-1 Contig ID Contig-U09581-1 Contig update 2002. 9.13 Contig sequence >Contig-U09581-1 (Contig-U09581-1Q) /CSM_Contig/Contig-U09581...ATCAAAATAAATTTTTGTAACATTAATAATAAATAAN Gap gap included Contig length 1235 Chromosome number (1..6, M) 1 Chro... VFD420Z ,579,1237 Translated Amino Acid sequence KKPGVVTIKGSSFCSQPTITIGDDSCSQPILSVGNDYDSLTCNFQSNAGLSNSTLLVS...ames) Frame A: KKPGVVTIKGSSFCSQPTITIGDDSCSQPILSVGNDYDSLTCNFQSNAGLSNSTLLVSII CDTIQ

  15. Dicty_cDB: Contig-U04729-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U04729-1 no gap 251 5 1037629 1037880 PLUS 1 1 U04729 0 0 0 0 0 0 1 0 0 0 0 0 0 0 Show Contig...-U04729-1 Contig ID Contig-U04729-1 Contig update 2001. 8.29 Contig sequence >Contig-U04729-1 (Contig...-U04729-1Q) /CSM_Contig/Contig-U04729-1Q.Seq.d TGGATTTATAACAGAGGTTATTGTAGGTGGTAAAACTTTTAGAGGAATCG ...CATTATCTAATGGG T Gap no gap Contig length 251 Chromosome number (1..6, M) 5 Chromosome length 5062330 Start ...ITEVIVGGKTFRGIVFEDLKSSNQTNNHSQNFSPNQSGTNLNNSNSNIPSSKKIKDKN ISPSSFLPTIGSTTSTSNPLSNG Translated Amino Acid seq

  16. Dicty_cDB: Contig-U06929-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U06929-1 no gap 726 5 4252576 4251850 MINUS 1 1 U06929 1 0 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U06929-1 Contig ID Contig-U06929-1 Contig update 2001. 8.30 Contig sequence >Contig-U06929-1 (Contig...-U06929-1Q) /CSM_Contig/Contig-U06929-1Q.Seq.d AGTTCATTCATTTAGTCGTATGATAGTATCACCATTTATAAATCCAAAAT...TAAATTAAATAAATA Gap no gap Contig length 726 Chromosome number (1..6, M) 5 Chromosome length 5062330 Start p...PSAISNNSNNS NNNDDNRPPILGLPFLFDYKNRITRGSRFFETIHYKIVHVTSATEFGIRRISKLYGTKWQ LEIGLKHQITQSGALQCLFTHTIGQTTIFGLSFGF

  17. Dicty_cDB: Contig-U15525-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15525-1 gap included 3361 6 202399 204109 PLUS 34 57 U15525 0 0 7 0 7 6 0 0 4 3 7 0 0 0 Show Contig...-U15525-1 Contig ID Contig-U15525-1 Contig update 2004. 6.11 Contig sequence >Contig-U15525-1 (Contig-U15525-1Q) /CSM_Contig/Contig-U15525...ATTTAATTAAATAATAATA Gap gap included Contig length 3361 Chromosome number (1..6, M) 6 Chromosome length 3595...TEATCLILSVD ETVQNNQAEQAQAGPQINNQTRQALSRVEVFKQ--- ---LDTIGIKKESGGGLGDSQFIAGAAFKRTFFYAGFEQQPKHIKNPKVLCLNIELELK...lslnsiqslpqlkqlv*ssll mkpfkiiklnklklvhklitkhvklyhg*rcss--- ---LDTIGIKKESGGGLGDSQFIAGAAFKRTFFYAGFEQQPKHIKNPKV

  18. Dicty_cDB: Contig-U09822-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U09822-1 gap included 1255 3 5930658 5929418 MINUS 5 6 U09822 3 0 2 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U09822-1 Contig ID Contig-U09822-1 Contig update 2002. 9.13 Contig sequence >Contig-U09822-1 (Contig-U09822-1Q) /CSM_Contig/Contig-U0982...AAAAGAAAAAAAAAAAAAAAAGATTTAATTAAATAAAAAAAAA AAAAAAAAAAAAAAA Gap gap included Contig length 1255 Chromosome n...,975 est6= VSA519Z ,780,1257 Translated Amino Acid sequence QPFYLVQSMFEPIQDSSFTSIGEIISYDTIG...rfn*ikkkkkk k Frame C: QPFYLVQSMFEPIQDSSFTSIGEIISYDTIGFDGKINTAVMSSLSPSTMYFYCVGDKS

  19. Dicty_cDB: Contig-U11883-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U11883-1 gap included 599 2 1457179 1457762 PLUS 1 2 U11883 0 0 0 0 0 0 0 0 0 1 0 0 0 0 Show Contig...-U11883-1 Contig ID Contig-U11883-1 Contig update 2002.12.18 Contig sequence >Contig-U11883-1 (Contig...-U11883-1Q) /CSM_Contig/Contig-U11883-1Q.Seq.d TACAAAATTTATATATATATATAATATTTTTAAATAATTATATTT...ATTTAGATGTATTTGGTATTCAAACATTA ACCGAACAACAAGCCTCTACAAAATTATTAACTTTTGTCATTTCAAAATC AGGTGAAAA Gap gap included Contig...ffkixn*kikkgfhvkxksflwfkxxx--- ---xxxx******************yprkyiniti*rn*kdil*ii*rne*rergtksc* nifs*kestpl*fnsxfktniilfstvfnttnvstig

  20. Dicty_cDB: Contig-U13680-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U13680-1 no gap 822 5 2371965 2372786 PLUS 2 2 U13680 0 2 0 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U13680-1 Contig ID Contig-U13680-1 Contig update 2002.12.18 Contig sequence >Contig-U13680-1 (Contig...-U13680-1Q) /CSM_Contig/Contig-U13680-1Q.Seq.d AAAAAGATTCTCAAGGAATTCACCGTGTTTATACTTCTTATGGTAGAACT ...GGGAATCAATGATTTAAATATCTACCAAATTCAAAAGG AAGGTGATGTCGAGTCACATTCATTACAATCACCATCGAAATTATTATTT CATGGTTCAAGAGCATCGAATT Gap no gap Contig...**sirtinkdig*kslc*snhsidk*ffsynh*twy*ntigclingt s*kw*tcfeknqylfewynqsiisrvgeikfrifhnyst*tw*rfrcclkeyh*kfgsie

  1. Dicty_cDB: Contig-U15718-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15718-1 gap included 3735 6 2645446 2642451 MINUS 153 276 U15718 0 0 0 118 ...1 0 0 20 3 10 1 0 0 0 Show Contig-U15718-1 Contig ID Contig-U15718-1 Contig update 2004. 6.11 Contig sequence >Contig...-U15718-1 (Contig-U15718-1Q) /CSM_Contig/Contig-U15718-1Q.Seq.d AAATTATTAAATTGTTTATTAATTTTTTTTTTTAC...CCTG Gap gap included Contig length 3735 Chromosome number (1..6, M) 6 Chromosome length 3595308 Start point...ptqtppptqtpt nhsigvnecdccpegqycllifghercfiandggdgipeetigcpgvttgtptstdggtg hytesgtgnphlcdrhhcrsgmechvingipecl

  2. Dicty_cDB: Contig-U15573-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15573-1 gap included 2005 4 5020093 5018210 MINUS 13 13 U15573 0 5 0 1 1 0 ...0 0 0 1 1 0 2 2 Show Contig-U15573-1 Contig ID Contig-U15573-1 Contig update 2004. 6.11 Contig sequence >Contig-U15573-1 (Contig...-U15573-1Q) /CSM_Contig/Contig-U15573-1Q.Seq.d AGTCTTGAGCTTTTATTGGGTCAACCATTGGGTGAATATAC... AGCNTTAACNGGNAA Gap gap included Contig length 2005 Chromosome number (1..6, M) ...xxlfrsnxslxxxxxxsxnxx Frame C: s*afigstig*iyiylkrfhlfl*skryyqskw*fkifpilkqttiiyen

  3. Dicty_cDB: Contig-U01204-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U01204-1 gap included 918 2 1928287 1927368 MINUS 2 3 U01204 0 0 0 0 0 2 0 0 0 0 0 0 0 0 Show Contig...-U01204-1 Contig ID Contig-U01204-1 Contig update 2001. 8.29 Contig sequence >Contig-U01204-1 (Contig-U01204-1Q) /CSM_Contig/Contig-U01204...AAAAATAATAA Gap gap included Contig length 918 Chromosome number (1..6, M) 2 Chromosome length 8467578 Start...LAWEVFWVGTPLFVLMASAFNQIHWALAWVLMVIILQSGFMN--- ---QHSHTIGNETIIIVMDSWVVDQIPDQVSWMEQ...fgwvlhyly*whqhsikfighwhgy*w*sfynlvl*--- ---QHSHTIGNETIIIVMDSWVVDQIPDQVSWMEQVLSDNN

  4. Dicty_cDB: Contig-U12043-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12043-1 gap included 1898 6 2694437 2692539 MINUS 7 13 U12043 0 6 0 0 0 0 0... 1 0 0 0 0 0 0 Show Contig-U12043-1 Contig ID Contig-U12043-1 Contig update 2002.12.18 Contig sequence >Contig-U12043-1 (Contig...-U12043-1Q) /CSM_Contig/Contig-U12043-1Q.Seq.d GAAACCATTCGTTTAAAGAAATGAAATATTTATATATATTAA...ATAAA AATAAATT Gap gap included Contig length 1898 Chromosome number (1..6, M) 6 Chromosome length 3595308 S...VPDIVSGILASKYASITLLNSGEM DLTNGITIGLLENSTSDQLFQINPILNTSLTNILVGQRFSIPFEISIKDSTISNQL

  5. Dicty_cDB: Contig-U16467-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U16467-1 no gap 1261 2 7818565 7817305 MINUS 17 18 U16467 0 0 5 0 1 2 1 0 6 0 0 1 1 0 Show Contig...-U16467-1 Contig ID Contig-U16467-1 Contig update 2004. 6.11 Contig sequence >Contig-U16467-1 (Contig...-U16467-1Q) /CSM_Contig/Contig-U16467-1Q.Seq.d CAACAATTAACATTACTTAAATATAATATTATTATATTTTTTTTTTT...TTCAAATAAATAATTGTTTAGAAATTTCTAGAAAAAAAA AAAAAAAAAAA Gap no gap Contig length 1261 Chromosome number (1..6, M...LK833Z ,1005,1249 Translated Amino Acid sequence qqltllkyniiiffffyllplhlyhy**LKKKTLTIIKYFFQKMNKIALLFTIFFALFAI SFACDEFNPNTSTIG

  6. Dicty_cDB: Contig-U12682-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12682-1 no gap 1408 4 4961739 4963050 PLUS 47 48 U12682 0 0 0 5 0 0 2 30 0 10 0 0 0 0 Show Contig...-U12682-1 Contig ID Contig-U12682-1 Contig update 2002.12.18 Contig sequence >Contig-U12682-1 (Contig...-U12682-1Q) /CSM_Contig/Contig-U12682-1Q.Seq.d AAACACATCATCCCGTTCGATCTGATAAGTAAATCGACCTCAGGCC...ATGA AACTACTG Gap no gap Contig length 1408 Chromosome number (1..6, M) 4 Chromosome length 5430582 Start po... kwniikwysyinwykswyn**fihsiklqwsy*qcke*si*yiir*ny own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig

  7. Dicty_cDB: Contig-U03323-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U03323-1 no gap 533 2 4820223 4820756 PLUS 2 1 U03323 0 0 0 0 0 0 0 0 0 0 0 0 1 1 Show Contig...-U03323-1 Contig ID Contig-U03323-1 Contig update 2001. 8.29 Contig sequence >Contig-U03323-1 (Contig...-U03323-1Q) /CSM_Contig/Contig-U03323-1Q.Seq.d ACATGTGACATTACTATTGGTAAATGTCAATGTTTAAAAAATACATGGTC ...TCAATAATGGTGGTGGTGGTGGTTTAGGT GAAACCCCCAATAGTAATAGTAATAGTGGTGAACTAGTTATCCCACCAAA ATCAAATACTACATTAAATGAAGAAACAGGTGG Gap no gap Contig... Link to clone list U03323 List of clone(s) est1= FC-IC0176F ,1,534 Translated Amino Acid sequence TCDITIGKC

  8. Dicty_cDB: Contig-U15069-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U15069-1 no gap 1241 1 2719927 2720886 PLUS 37 43 U15069 16 2 0 0 0 5 8 0 0 1 1 0 4 0 Show Contig...-U15069-1 Contig ID Contig-U15069-1 Contig update 2004. 6.11 Contig sequence >Contig-U15069-1 (Contig...-U15069-1Q) /CSM_Contig/Contig-U15069-1Q.Seq.d TTTCAAACCAAAACATAAAATAATTAAAAATGACAACTGTTAAACCA...AAAAATAAAATAAATAAAAATAGTTTTAAA Gap no gap Contig length 1241 Chromosome number (1..6, M) 1 Chromosome length...07Z ,263,623 est42= VSJ431Z ,390,646 est43= CHB363Z ,460,1187 Translated Amino Acid sequence snqnik*lkmttvkptspenprvffditig

  9. Dicty_cDB: Contig-U12316-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available Contig-U12316-1 gap included 1238 4 1925901 1927143 PLUS 5 6 U12316 0 4 1 0 0 0 0 0 0 0 0 0 0 0 Show Contig...-U12316-1 Contig ID Contig-U12316-1 Contig update 2002.12.18 Contig sequence >Contig-U12316-1 (Contig-U12316-1Q) /CSM_Contig/Contig-U12316...GAGTTGAAGATTTAGTTTTATCAGNANGAANAAATAAGAT Gap gap included Contig length 1238 Chromosome number (1..6, M) 4 C...,915,1174 Translated Amino Acid sequence lvqhhyh*liscvivllksmv*isqvhivvhlfmfvn*qyileih*iptlknlskiftig...lip*r*rtrkttn*kiknny*itketkiqs*t*rvmmmi*vedlvls xxxnk Frame B: lvqhhyh*liscvivllksmv*isqvhivvhlfmfvn*qyileih*iptlknlskiftig

  10. ESTminer: a Web interface for mining EST contig and cluster databases.

    Science.gov (United States)

    Huang, Yecheng; Pumphrey, Janie; Gingle, Alan R

    2005-03-01

    ESTminer is a Web application and database schema for interactive mining of expressed sequence tag (EST) contig and cluster datasets. The Web interface contains a query frame that allows the selection of contigs/clusters with specific cDNA library makeup or a threshold number of members. The results are displayed as color-coded tree nodes, where the color indicates the fractional size of each cDNA library component. The nodes are expandable, revealing library statistics as well as EST or contig members, with links to sequence data, GenBank records or user configurable links. Also, the interface allows 'queries within queries' where the result set of a query is further filtered by the subsequent query. ESTminer is implemented in Java/JSP and the package, including MySQL and Oracle schema creation scripts, is available from http://cggc.agtec.uga.edu/Data/download.asp agingle@uga.edu.

  11. Short interspersed elements (SINEs) of the Geomyoidea superfamily rodents.

    Science.gov (United States)

    Gogolevsky, Konstantin P; Kramerov, Dmitri A

    2006-05-24

    A new short interspersed element (SINE) was isolated from the genome of desert kangaroo rat (Dipodomys deserti) using single-primer PCR. This SINE consists of two monomers: the left monomer (IDL) resembles rodent ID element and other tRNAAla(CGC)-derived SINEs, whereas the right one (Geo) shows no similarity with known SINE sequences. PCR and hybridization analyses demonstrated that IDL-Geo SINE is restricted to the rodent superfamily Geomyoidea (families Geomyidea and Heteromyidea). Isolation and analysis of IDL-Geo from California pocket mouse (Chaetodipus californicus) and Botta's pocket gopher (Thomomys bottae) revealed some species-specific features of this SINE family. The structure and evolution of known dimeric SINEs are discussed.

  12. Evaluation of Mammalian Interspersed Repeats to investigate the goat genome

    Directory of Open Access Journals (Sweden)

    P. Mariani

    2010-01-01

    Full Text Available Among the repeated sequences present in most eukaryotic genomes, SINEs (Short Interspersed Nuclear Elements are widely used to investigate evolution in the mammalian order (Buchanan et al., 1999. One family of these repetitive sequences, the MIR (Mammalian Interspersed Repeats; Jurka et al., 1995, is ubiquitous in all mammals.MIR elements are tRNA-derived SINEs and are identifiable by a conserved core region of about 70 nucleotides.

  13. CORE-SINEs: eukaryotic short interspersed retroposing elements with common sequence motifs.

    Science.gov (United States)

    Gilbert, N; Labuda, D

    1999-03-16

    A 65-bp "core" sequence is dispersed in hundreds of thousands copies in the human genome. This sequence was found to constitute the central segment of a group of short interspersed elements (SINEs), referred to as mammalian-wide interspersed repeats, that proliferated before the radiation of placental mammals. Here, we propose that the core identifies an ancient tRNA-like SINE element, which survived in different lineages such as mammals, reptiles, birds, and fish, as well as mollusks, presumably for >550 million years. This element gave rise to a number of sequence families (CORE-SINEs), including mammalian-wide interspersed repeats, whose distinct 3' ends are shared with different families of long interspersed elements (LINEs). The evolutionary success of the generic CORE-SINE element can be related to the recruitment of the internal promoter from highly transcribed host RNA as well as to its capacity to adapt to changing retropositional opportunities by sequence exchange with actively amplifying LINEs. It reinforces the notion that the very existence of SINEs depends on the cohabitation with both LINEs and the host genome.

  14. Projector : automatic contig mapping for gap closure purposes

    NARCIS (Netherlands)

    van Hijum, SAFT; Zomer, AL; Kuipers, OP; Kok, J

    2003-01-01

    Projector was designed for automatic positioning of contigs from an unfinished prokaryotic genome onto a template genome of a closely related strain or species. Projector mapped 84 contigs of Lactococcus lactis MG1363 (corresponding to 81% of the assembly nucleotides) against the genome of L.lactis

  15. Characterization of short interspersed elements (SINEs) in a red alga, Porphyra yezoensis.

    Science.gov (United States)

    Zhang, Wenbo; Lin, Xiaofei; Peddigari, Suresh; Takechi, Katsuaki; Takano, Hiroyoshi; Takio, Susumu

    2007-02-01

    Short interspersed element (SINE)-like sequences referred to as PySN1 and PySN2 were identified in a red alga, Porphyra yezoensis. Both elements contained an internal promoter with motifs (A box and B box) recognized by RNA polymerase III, and target site duplications at both ends. Genomic Southern blot analysis revealed that both elements were widely and abundantly distributed on the genome. 3' and 5' RACE suggested that PySN1 was expressed as a chimera transcript with flanking SINE-unrelated sequences and possessed the poly-A tail at the same position near the 3' end of PySN1.

  16. Epigenetic regulation of transcription and possible functions of mammalian short interspersed elements, SINEs.

    Science.gov (United States)

    Ichiyanagi, Kenji

    2013-01-01

    Short interspersed elements (SINEs) are a class of retrotransposons, which amplify their copy numbers in their host genomes by retrotransposition. More than a million copies of SINEs are present in a mammalian genome, constituting over 10% of the total genomic sequence. In contrast to the other two classes of retrotransposons, long interspersed elements (LINEs) and long terminal repeat (LTR) elements, SINEs are transcribed by RNA polymerase III. However, like LINEs and LTR elements, the SINE transcription is likely regulated by epigenetic mechanisms such as DNA methylation, at least for human Alu and mouse B1. Whereas SINEs and other transposable elements have long been thought as selfish or junk DNA, recent studies have revealed that they play functional roles at their genomic locations, for example, as distal enhancers, chromatin boundaries and binding sites of many transcription factors. These activities imply that SINE retrotransposition has shaped the regulatory network and chromatin landscape of their hosts. Whereas it is thought that the epigenetic mechanisms were originated as a host defense system against proliferation of parasitic elements, this review discusses a possibility that the same mechanisms are also used to regulate the SINE-derived functions.

  17. YAC contig information - RGP physicalmap | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available 8908/lsdba.nbdc00318-06-001 Description of data contents YAC contigs on the rice chromosomes Data file File name: rgp_physical...map_yac_contigs.zip File URL: ftp://ftp.biosciencedbc.jp/archive/rgp-physicalmap/LATEST/rgp_physical...sciencedbc.jp/togodb/view/rgp_physicalmap_yac_contigs#en Data acquisition method The range including YAC con...m Description Chrom. No. Chromosome number Region Region number Physical map image The file name of rice physical...n Download License Update History of This Database Site Policy | Contact Us YAC contig information - RGP physicalmap | LSDB Archive ...

  18. Contig-Layout-Authenticator (CLA): A Combinatorial Approach to Ordering and Scaffolding of Bacterial Contigs for Comparative Genomics and Molecular Epidemiology.

    Science.gov (United States)

    Shaik, Sabiha; Kumar, Narender; Lankapalli, Aditya K; Tiwari, Sumeet K; Baddam, Ramani; Ahmed, Niyaz

    2016-01-01

    A wide variety of genome sequencing platforms have emerged in the recent past. High-throughput platforms like Illumina and 454 are essentially adaptations of the shotgun approach generating millions of fragmented single or paired sequencing reads. To reconstruct whole genomes, the reads have to be assembled into contigs, which often require further downstream processing. The contigs can be directly ordered according to a reference, scaffolded based on paired read information, or assembled using a combination of the two approaches. While the reference-based approach appears to mask strain-specific information, scaffolding based on paired-end information suffers when repetitive elements longer than the size of the sequencing reads are present in the genome. Sequencing technologies that produce long reads can solve the problems associated with repetitive elements but are not necessarily easily available to researchers. The most common high-throughput technology currently used is the Illumina short read platform. To improve upon the shortcomings associated with the construction of draft genomes with Illumina paired-end sequencing, we developed Contig-Layout-Authenticator (CLA). The CLA pipeline can scaffold reference-sorted contigs based on paired reads, resulting in better assembled genomes. Moreover, CLA also hints at probable misassemblies and contaminations, for the users to cross-check before constructing the consensus draft. The CLA pipeline was designed and trained extensively on various bacterial genome datasets for the ordering and scaffolding of large repetitive contigs. The tool has been validated and compared favorably with other widely-used scaffolding and ordering tools using both simulated and real sequence datasets. CLA is a user friendly tool that requires a single command line input to generate ordered scaffolds.

  19. Dicty_cDB: Contig-U16279-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available ( AB254080 |pid:none) Streptomyces kanamyceticus kanam... 47 0.002 CP000964_3229( CP000964 |pid:none) Klebsiella pneumoni...nkkmtkpvasyeldekrfltllgkligetenlqnrppalipiednag rhviealtpylkanggvleleqvhcdpvnypkrgniiie... letters Score E Sequences producing significant alignments: (bits) Value Contig-U16279-1 (Contig-U16279-1Q....................................................done Score E Sequences producing significant alignments: (bits) Val....................................done Score E Sequences producing significant alignments: (bits) Val

  20. Phylogenetic relationships among East African haplochromine fish as revealed by short interspersed elements (SINEs).

    Science.gov (United States)

    Terai, Yohey; Takezaki, Naoko; Mayer, Werner E; Tichy, Herbert; Takahata, Naoyuki; Klein, Jan; Okada, Norihiro

    2004-01-01

    Genomic DNA libraries were prepared from two endemic species of Lake Victoria haplochromine (cichlid) fish and used to isolate and characterize a set of short interspersed elements (SINEs). The distribution and sequences of the SINEs were used to infer phylogenetic relationships among East African haplochromines. The SINE-based classification divides the fish into four groups, which, in order of their divergence from a stem lineage, are the endemic Lake Tanganyika flock (group 1); fish of the nonendemic, monotypic, widely distributed genus Astatoreochromis (group 2); the endemic Lake Malawi flock (group 3); and group 4, which contains fish from widely dispersed East African localities including Lakes Victoria, Edward, George, Albert, and Rukwa, as well as many rivers. The group 4 haplochromines are characterized by a subset of polymorphic SINEs, each of which is present in some individuals and absent in others of the same population at a given locality, the same morphologically defined species, and the same mtDNA-defined haplogroup. SINE-defined group 4 contains six of the seven previously described mtDNA haplogroups. One of the polymorphic SINEs appears to be fixed in the endemic Lake Victoria flock; four others display the presence-or-absence polymorphism within the species of this flock. These findings have implications for the origin of Lake Victoria cichlids and for their founding population sizes.

  1. A Blumeria graminis f.sp. hordei BAC library - contig building and microsynteny studies

    DEFF Research Database (Denmark)

    Pedersen, C.; Wu, B.; Giese, H.

    2002-01-01

    A bacterial artificial chromosome (BAC) library of Blumeria graminis f.sp. hordei, containing 12,000 clones with an average insert size of 41 kb, was constructed. The library represents about three genome equivalents and BAC-end sequencing showed a high content of repetitive sequences, making...... contigs, at or close to avirulence loci, were constructed. Single nucleotide polymorphism (SNP) markers were developed from BAC-end sequences to link the contigs to the genetic maps. Two other BAC contigs were used to study microsynteny between B. graminis and two other ascomycetes, Neurospora crassa...

  2. Prediction and phylogenetic analysis of mammalian short interspersed elements (SINEs).

    Science.gov (United States)

    Rogozin, I B; Mayorov, V I; Lavrentieva, M V; Milanesi, L; Adkison, L R

    2000-09-01

    The presence of repetitive elements can create serious problems for sequence analysis, especially in the case of homology searches in nucleotide sequence databases. Repetitive elements should be treated carefully by using special programs and databases. In this paper, various aspects of SINE (short interspersed repetitive element) identification, analysis and evolution are discussed.

  3. Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

    Energy Technology Data Exchange (ETDEWEB)

    Kass, D.H. [Louisiana State Univ. Medical Center, New Orleans, LA (United States). Dept. of Biochemistry and Molecular Biology; Batzer, M.A. [Lawrence Livermore National Lab., CA (United States); Deininger, P.L. [Louisiana State Univ. Medical Center, New Orleans, LA (United States). Dept. of Biochemistry and Molecular Biology]|[Alton Ochsner Medical Foundation, New Orleans, LA (United States). Lab. of Molecular Genetics

    1995-01-01

    The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome. However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.

  4. Dicty_cDB: Contig-U03802-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available T-2KB Trichosurus... 48 3e-11 4 ( DY894715 ) CeleSEQ14351 Cunninghamella elegans pBluescript (... 58 4e-11 3... letters Score E Sequences producing significant alignments: (bits) Value Contig-U03802-1 (Contig-U... letters Searching..................................................done Score E Sequences producing significant al...1... 62 4e-05 1 ( EJ306703 ) 1095390099376 Global-Ocean-Sampling_GS-27-01-01-1... 62 4e-05 1 ( CP000238 ) Baumannia cicadellinicola... AY241394 |pid:none) Melopsittacus undulatus Mn superox... 244 2e-63 AF329270_1( AF329270 |pid:none) Gallus gallus manganes

  5. LTC: a novel algorithm to improve the efficiency of contig assembly for physical mapping in complex genomes

    Directory of Open Access Journals (Sweden)

    Feuillet Catherine

    2010-11-01

    Full Text Available Abstract Background Physical maps are the substrate of genome sequencing and map-based cloning and their construction relies on the accurate assembly of BAC clones into large contigs that are then anchored to genetic maps with molecular markers. High Information Content Fingerprinting has become the method of choice for large and repetitive genomes such as those of maize, barley, and wheat. However, the high level of repeated DNA present in these genomes requires the application of very stringent criteria to ensure a reliable assembly with the FingerPrinted Contig (FPC software, which often results in short contig lengths (of 3-5 clones before merging as well as an unreliable assembly in some difficult regions. Difficulties can originate from a non-linear topological structure of clone overlaps, low power of clone ordering algorithms, and the absence of tools to identify sources of gaps in Minimal Tiling Paths (MTPs. Results To address these problems, we propose a novel approach that: (i reduces the rate of false connections and Q-clones by using a new cutoff calculation method; (ii obtains reliable clusters robust to the exclusion of single clone or clone overlap; (iii explores the topological contig structure by considering contigs as networks of clones connected by significant overlaps; (iv performs iterative clone clustering combined with ordering and order verification using re-sampling methods; and (v uses global optimization methods for clone ordering and Band Map construction. The elements of this new analytical framework called Linear Topological Contig (LTC were applied on datasets used previously for the construction of the physical map of wheat chromosome 3B with FPC. The performance of LTC vs. FPC was compared also on the simulated BAC libraries based on the known genome sequences for chromosome 1 of rice and chromosome 1 of maize. Conclusions The results show that compared to other methods, LTC enables the construction of highly

  6. Evolutionary dynamics and sites of illegitimate recombination revealed in the interspersion and sequence junctions of two nonhomologous satellite DNAs in cactophilic Drosophila species.

    Science.gov (United States)

    Kuhn, G C S; Teo, C H; Schwarzacher, T; Heslop-Harrison, J S

    2009-05-01

    Satellite DNA (satDNA) is a major component of genomes but relatively little is known about the fine-scale organization of unrelated satDNAs residing at the same chromosome location, and the sequence structure and dynamics of satDNA junctions. We studied the organization and sequence junctions of two nonhomologous satDNAs, pBuM and DBC-150, in three species from the neotropical Drosophila buzzatii cluster (repleta group). In situ hybridization to microchromosomes, interphase nuclei and extended DNA fibers showed frequent interspersion of the two satellites in D. gouveai, D. antonietae and, to a lesser extent, D. seriema. We isolated by PCR six pBuM x DBC-150 junctions: four are exclusive to D. gouveai and two are exclusive to D. antonietae. The six junction breakpoints occur at different positions within monomers, suggesting independent origin. Four junctions showed abrupt transitions between the two satellites, whereas two junctions showed a distinct 10 bp tandem duplication before the junction. Unlike pBuM, DBC-150 junction repeats are more variable than randomly cloned monomers and showed diagnostic features in common to a 3-monomer higher-order repeat seen in the sister species D. serido. The high levels of interspersion between pBuM and DBC-150 repeats suggest extensive rearrangements between the two satellites, maybe favored by specific features of the microchromosomes. Our interpretation is that the junctions evolved by multiples events of illegitimate recombination between nonhomologous satDNA repeats, with subsequent rounds of unequal crossing-over expanding the copy number of some of the junctions.

  7. Construction of a yeast artifical chromosome contig spanning the spinal muscular atrophy disease gene region

    Energy Technology Data Exchange (ETDEWEB)

    Kleyn, P.W.; Wang, C.H.; Vitale, E.; Pan, J.; Ross, B.M.; Grunn, A.; Palmer, D.A.; Warburton, D.; Brzustowicz, L.M.; Gilliam, T.G. (New York State Psychiatric Institute, NY (United States)); Lien, L.L.; Kunkel, L.M. (Howard Hughes Medical Institute, Boston, MA (United States))

    1993-07-15

    The childhood spinal muscular atrophies (SMAs) are the most common, serious neuromuscular disorders of childhood second to Duchenne muscular dystrophy. A single locus for these disorders has been mapped by recombination events to a region of 0.7 centimorgan (range, 0.1-2.1 centimorgans) between loci D5S435 and MAP1B on chromosome 5q11.2-13.3. By using PCR amplification to screen yeast artificial chromosome (YAC) DNA pools and the PCR-vectorette method to amplify YAC ends, a YAC contig was constructed across the disease gene region. Nine walk steps identified 32 YACs, including a minimum of seven overlapping YAC clones (average size, 460 kb) that span the SMA region. The contig is characterized by a collection of 30 YAC-end sequence tag sites together with seven genetic markers. The entire YAC contig spans a minimum of 3.2 Mb; the SMA locus is confined to roughly half of this region. Microsatellite markers generated along the YAC contig segregate with the SMA locus in all families where the flanking markers (D5S435 and MAP1B) recombine. Construction of a YAC contig across the disease gene region is an essential step in isolation of the SMA-encoding gene. 26 refs., 3 figs., 1 tab.

  8. Dicty_cDB: Contig-U14477-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available vmvvvivfylqviynlriivilmvqlivivf*mglvmvtviiivivii i*rinnnnsnnnnnsnnnkikmif*yqiinrlnnyf*shyqkfiiiqrldfwdyqrler* *hhlyqrlvnqvvivq*fhwisl...amvlaimxxx own update 2004. 6.10 Homology vs CSM-cDNA Query= Contig-U14477-1 (Conti

  9. Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome.

    Science.gov (United States)

    Greally, John M

    2002-01-08

    To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.

  10. The binning of metagenomic contigs for microbial physiology of mixed cultures.

    Science.gov (United States)

    Strous, Marc; Kraft, Beate; Bisdorf, Regina; Tegetmeyer, Halina E

    2012-01-01

    So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck and required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population. In the present study eight metagenomic samples from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13-23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods but they were found to produce poor results, were slow, dependent on non-standard platforms or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparison of the results between samples with BLAST and in comparison to existing algorithms for four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable or higher than existing methods. Further, it was up to a 100 times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt).

  11. The binning of metagenomic contigs for microbial physiology of mixed cultures

    Directory of Open Access Journals (Sweden)

    Marc eStrous

    2012-12-01

    Full Text Available So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck and required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population.In the present study eight metagenomic samples originating from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13-23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods but they were found to produce poor results, were slow, dependent on non-standard platforms or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparison of the results between samples with BLAST and in comparison to exisiting algorithms for four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable or higher than existing methods. Further, it was up to a hunderd times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt.

  12. CSAR-web: a web server of contig scaffolding using algebraic rearrangements.

    Science.gov (United States)

    Chen, Kun-Tze; Lu, Chin Lung

    2018-05-04

    CSAR-web is a web-based tool that allows the users to efficiently and accurately scaffold (i.e. order and orient) the contigs of a target draft genome based on a complete or incomplete reference genome from a related organism. It takes as input a target genome in multi-FASTA format and a reference genome in FASTA or multi-FASTA format, depending on whether the reference genome is complete or incomplete, respectively. In addition, it requires the users to choose either 'NUCmer on nucleotides' or 'PROmer on translated amino acids' for CSAR-web to identify conserved genomic markers (i.e. matched sequence regions) between the target and reference genomes, which are used by the rearrangement-based scaffolding algorithm in CSAR-web to order and orient the contigs of the target genome based on the reference genome. In the output page, CSAR-web displays its scaffolding result in a graphical mode (i.e. scalable dotplot) allowing the users to visually validate the correctness of scaffolded contigs and in a tabular mode allowing the users to view the details of scaffolds. CSAR-web is available online at http://genome.cs.nthu.edu.tw/CSAR-web.

  13. CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

    Science.gov (United States)

    Li, Pei; Ji, Guoli; Dong, Min; Schmidt, Emily; Lenox, Douglas; Chen, Liangliang; Liu, Qi; Liu, Lin; Zhang, Jie; Liang, Chun

    2012-09-15

    To address the impending need for exploring rapidly increased transcriptomics data generated for non-model organisms, we developed CBrowse, an AJAX-based web browser for visualizing and analyzing transcriptome assemblies and contigs. Designed in a standard three-tier architecture with a data pre-processing pipeline, CBrowse is essentially a Rich Internet Application that offers many seamlessly integrated web interfaces and allows users to navigate, sort, filter, search and visualize data smoothly. The pre-processing pipeline takes the contig sequence file in FASTA format and its relevant SAM/BAM file as the input; detects putative polymorphisms, simple sequence repeats and sequencing errors in contigs and generates image, JSON and database-compatible CSV text files that are directly utilized by different web interfaces. CBowse is a generic visualization and analysis tool that facilitates close examination of assembly quality, genetic polymorphisms, sequence repeats and/or sequencing errors in transcriptome sequencing projects. CBrowse is distributed under the GNU General Public License, available at http://bioinfolab.muohio.edu/CBrowse/ liangc@muohio.edu or liangc.mu@gmail.com; glji@xmu.edu.cn Supplementary data are available at Bioinformatics online.

  14. [Short interspersed repetitive sequences (SINEs) and their use as a phylogenetic tool].

    Science.gov (United States)

    Kramerov, D A; Vasetskiĭ, N S

    2009-01-01

    The data on one of the most common repetitive elements of eukaryotic genomes, short interspersed elements (SINEs), are reviewed. Their structure, origin, and functioning in the genome are discussed. The variation and abundance of these neutral genomic markers makes them a convenient and reliable tool for phylogenetic analysis. The main methods of such analysis are presented, and the potential and limitations of this approach are discussed using specific examples.

  15. Similarities between long interspersed element-1 (LINE-1) reverse transcriptase and telomerase.

    Science.gov (United States)

    Kopera, Huira C; Moldovan, John B; Morrish, Tammy A; Garcia-Perez, Jose Luis; Moran, John V

    2011-12-20

    Long interspersed element-1 (LINE-1 or L1) retrotransposons encode two proteins (ORF1p and ORF2p) that contain activities required for conventional retrotransposition by a mechanism termed target-site primed reverse transcription. Previous experiments in XRCC4 or DNA protein kinase catalytic subunit-deficient CHO cell lines, which are defective for the nonhomologous end-joining DNA repair pathway, revealed an alternative endonuclease-independent (ENi) pathway for L1 retrotransposition. Interestingly, some ENi retrotransposition events in DNA protein kinase catalytic subunit-deficient cells are targeted to dysfunctional telomeres. Here we used an in vitro assay to detect L1 reverse transcriptase activity to demonstrate that wild-type or endonuclease-defective L1 ribonucleoprotein particles can use oligonucleotide adapters that mimic telomeric ends as primers to initiate the reverse transcription of L1 mRNA. Importantly, these ribonucleoprotein particles also contain a nuclease activity that can process the oligonucleotide adapters before the initiation of reverse transcription. Finally, we demonstrate that ORF1p is not strictly required for ENi retrotransposition at dysfunctional telomeres. Thus, these data further highlight similarities between the mechanism of ENi L1 retrotransposition and telomerase.

  16. Effects of Interspersed versus Summary Feedback on the Quality of Students' Case Report Revisions

    Science.gov (United States)

    Phillips, Fred; Wolcott, Susan

    2014-01-01

    This study examines whether students show greater improvement in written case analyses when given feedback that is either interspersed throughout their written case analyses or presented only as a summary, and whether the benefits of these placements vary across differing levels of student performance in the course. Results from an exploratory…

  17. A specific family of interspersed repeats (SINEs facilitates meiotic synapsis in mammals

    Directory of Open Access Journals (Sweden)

    Johnson Matthew E

    2013-01-01

    Full Text Available Abstract Background Errors during meiosis that affect synapsis and recombination between homologous chromosomes contribute to aneuploidy and infertility in humans. Despite the clinical relevance of these defects, we know very little about the mechanisms by which homologous chromosomes interact with one another during mammalian meiotic prophase. Further, we remain ignorant of the way in which chromosomal DNA complexes with the meiosis-specific structure that tethers homologs, the synaptonemal complex (SC, and whether specific DNA elements are necessary for this interaction. Results In the present study we utilized chromatin immunoprecipitation (ChIP and DNA sequencing to demonstrate that the axial elements of the mammalian SC are markedly enriched for a specific family of interspersed repeats, short interspersed elements (SINEs. Further, we refine the role of the repeats to specific sub-families of SINEs, B1 in mouse and AluY in old world monkey (Macaca mulatta. Conclusions Because B1 and AluY elements are the most actively retrotransposing SINEs in mice and rhesus monkeys, respectively, our observations imply that they may serve a dual function in axial element binding; i.e., as the anchoring point for the SC but possibly also as a suppressor/regulator of retrotransposition.

  18. Dicty_cDB: Contig-U05126-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CP000939 ) Clostridium botulinum B1 str. Okra, complete genome. 34 2.5 18 ( GE803619 ) EST_scau_evk_893885 ...scauevk mixed_tissue Sebastes... 32 2.6 3 ( AM462416 ) Vitis vinifera contig VV78X219254.19, whole genom...

  19. Distribution and characterization of staphylococcal interspersed repeat units (SIRUs) and potential use for strain differentiation

    DEFF Research Database (Denmark)

    Hardy, K.J.; Ussery, David; Oppenheim, B.A.

    2004-01-01

    are similar to those used in the study of the evolution of Mycobacterium tuberculosis clades. Seven VNTRs, termed staphylococcal interspersed repeat units (SIRUs), distributed around the genome are described, occurring in both unique and multiple sites, and varying in length from 48 to 159 bp. Variations...

  20. Dicty_cDB: Contig-U11311-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available A from... 58 6e-07 2 ( AM481444 ) Vitis vinifera contig VV78X144362.4, whole genome... 68 1e-06 1 ( AY604469 ) Prodonto...117 4e-27 AY604469_1( AY604469 |pid:none) Prodontorhabditis wirthi strain DF... 125 5e-27 ( P25202 ) RecName

  1. Dicty_cDB: Contig-U16102-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available 0 6 ( BJ408668 ) Dictyostelium discoideum cDNA clone:dds46g14, 3' ... 44 3.0 2 ( CV162186 ) CS_hyp_01d11_M13Reverse Blue crab hypoder...08_M13Reverse Blue crab hypodermis, nor... 42 3.6 2 ( AM474408 ) Vitis vinifera contig VV78X173370.5, whole

  2. Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

    Science.gov (United States)

    Kanhayuwa, Lakkhana; Coutts, Robert H A

    2016-01-01

    Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.

  3. Enrichment of short interspersed transposable elements to embryonic stem cell-specific hypomethylated gene regions.

    Science.gov (United States)

    Muramoto, Hiroki; Yagi, Shintaro; Hirabayashi, Keiji; Sato, Shinya; Ohgane, Jun; Tanaka, Satoshi; Shiota, Kunio

    2010-08-01

    Embryonic stem cells (ESCs) have a distinctive epigenome, which includes their genome-wide DNA methylation modification status, as represented by the ESC-specific hypomethylation of tissue-dependent and differentially methylated regions (T-DMRs) of Pou5f1 and Nanog. Here, we conducted a genome-wide investigation of sequence characteristics associated with T-DMRs that were differentially methylated between ESCs and somatic cells, by focusing on transposable elements including short interspersed elements (SINEs), long interspersed elements (LINEs) and long terminal repeats (LTRs). We found that hypomethylated T-DMRs were predominantly present in SINE-rich/LINE-poor genomic loci. The enrichment for SINEs spread over 300 kb in cis and there existed SINE-rich genomic domains spreading continuously over 1 Mb, which contained multiple hypomethylated T-DMRs. The characterization of sequence information showed that the enriched SINEs were relatively CpG rich and belonged to specific subfamilies. A subset of the enriched SINEs were hypomethylated T-DMRs in ESCs at Dppa3 gene locus, although SINEs are overall methylated in both ESCs and the liver. In conclusion, we propose that SINE enrichment is the genomic property of regions harboring hypomethylated T-DMRs in ESCs, which is a novel aspect of the ESC-specific epigenomic information.

  4. ParTIES: a toolbox for Paramecium interspersed DNA elimination studies.

    Science.gov (United States)

    Denby Wilkes, Cyril; Arnaiz, Olivier; Sperling, Linda

    2016-02-15

    Developmental DNA elimination occurs in a wide variety of multicellular organisms, but ciliates are the only single-celled eukaryotes in which this phenomenon has been reported. Despite considerable interest in ciliates as models for DNA elimination, no standard methods for identification and characterization of the eliminated sequences are currently available. We present the Paramecium Toolbox for Interspersed DNA Elimination Studies (ParTIES), designed for Paramecium species, that (i) identifies eliminated sequences, (ii) measures their presence in a sequencing sample and (iii) detects rare elimination polymorphisms. ParTIES is multi-threaded Perl software available at https://github.com/oarnaiz/ParTIES. ParTIES is distributed under the GNU General Public Licence v3. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. Short interspersed elements (SINEs) in plants: origin, classification, and use as phylogenetic markers.

    Science.gov (United States)

    Deragon, Jean-Marc; Zhang, Xiaoyu

    2006-12-01

    Short interspersed elements (SINEs) are a class of dispersed mobile sequences that use RNA as an intermediate in an amplification process called retroposition. The presence-absence of a SINE at a given locus has been used as a meaningful classification criterion to evaluate phylogenetic relations among species. We review here recent developments in the characterisation of plant SINEs and their use as molecular makers to retrace phylogenetic relations among wild and cultivated Oryza and Brassica species. In Brassicaceae, further use of SINE markers is limited by our partial knowledge of endogenous SINE families (their origin and evolution histories) and by the absence of a clear classification. To solve this problem, phylogenetic relations among all known Brassicaceae SINEs were analyzed and a new classification, grouping SINEs in 15 different families, is proposed. The relative age and size of each Brassicaceae SINE family was evaluated and new phylogenetically supported subfamilies were described. We also present evidence suggesting that new potentially active SINEs recently emerged in Brassica oleracea from the shuffling of preexisting SINE portions. Finally, the comparative evolution history of SINE families present in Arabidopsis thaliana and Brassica oleracea revealed that SINEs were in general more active in the Brassica lineage. The importance of these new data for the use of Brassicaceae SINEs as molecular markers in future applications is discussed.

  6. The use of mycobacterial interspersed repetitive unit typing and whole genome sequencing to inform tuberculosis prevention and control activities.

    Science.gov (United States)

    Gilbert, Gwendolyn L; Sintchenko, Vitali

    2013-07-01

    Molecular strain typing of Mycobacterium tuberculosis has been possible for only about 20 years; it has significantly improved our understanding of the evolution and epidemiology of Mycobacterium tuberculosis and tuberculosis disease. Mycobacterial interspersed repetitive unit typing, based on 24 variable number tandem repeat unit loci, is highly discriminatory, relatively easy to perform and interpret and is currently the most widely used molecular typing system for tuberculosis surveillance. Nevertheless, clusters identified by mycobacterial interspersed repetitive unit typing sometimes cannot be confirmed or adequately defined by contact tracing and additional methods are needed. Recently, whole genome sequencing has been used to identify single nucleotide polymorphisms and other mutations, between genotypically indistinguishable isolates from the same cluster, to more accurately trace transmission pathways. Rapidly increasing speed and quality and reduced costs will soon make large scale whole genome sequencing feasible, combined with the use of sophisticated bioinformatics tools, for epidemiological surveillance of tuberculosis.

  7. Sauria SINEs: Novel short interspersed retroposable elements that are widespread in reptile genomes.

    Science.gov (United States)

    Piskurek, Oliver; Austin, Christopher C; Okada, Norihiro

    2006-05-01

    SINEs are short interspersed retrotransposable elements that invade new genomic sites. Their retrotransposition depends on reverse transcriptase and endonuclease activities encoded by partner LINEs (long interspersed elements). Recent genomic research has demonstrated that retroposons account for at least 40% of the human genome. Hitherto, more than 30 families of SINEs have been characterized in mammalian genomes, comprising approximately 4600 extant species; the distribution and extent of SINEs in reptilian genomes, however, are poorly documented. With more than 7400 species of lizards and snakes, Squamata constitutes the largest and most diverse group of living reptiles. We have discovered and characterized a novel SINE family, Sauria SINEs, whose members are widely distributed among genomes of lizards, snakes, and tuataras. Sauria SINEs comprise a 5' tRNA-related region, a tRNA-unrelated region, and a 3' tail region (containing short tandem repeats) derived from LINEs. We distinguished eight Sauria SINE subfamilies in genomes of four major squamate lineages and investigated their evolutionary relationships. Our data illustrate the overall efficacy of Sauria SINEs as novel retrotransposable markers for elucidation of squamate evolutionary history. We show that all Sauria SINEs share an identical 3' sequence with Bov-B LINEs and propose that they utilize the enzymatic machinery of Bov-B LINEs for their own retrotransposition. This finding, along with the ubiquity of Bov-B LINEs previously demonstrated in squamate genomes, suggests that these LINEs have been an active partner of Sauria SINEs since this SINE family was generated more than 200 million years ago.

  8. Short Interspersed Nuclear Element (SINE Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

    Directory of Open Access Journals (Sweden)

    Lakkhana Kanhayuwa

    Full Text Available Novel families of short interspersed nuclear element (SINE sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.

  9. Ultrathin Carbon with Interspersed Graphene/Fullerene-like Nanostructures: A Durable Protective Overcoat for High Density Magnetic Storage.

    Science.gov (United States)

    Dwivedi, Neeraj; Satyanarayana, Nalam; Yeo, Reuben J; Xu, Hai; Ping Loh, Kian; Tripathy, Sudhiranjan; Bhatia, Charanjit S

    2015-06-25

    One of the key issues for future hard disk drive technology is to design and develop ultrathin (Forming carbon overcoats (COCs) having interspersed nanostructures by the filtered cathodic vacuum arc (FCVA) process can be an effective approach to achieve the desired target. In this work, by employing a novel bi-level surface modification approach using FCVA, the formation of a high sp(3) bonded ultrathin (~1.7 nm) amorphous carbon overcoat with interspersed graphene/fullerene-like nanostructures, grown on magnetic hard disk media, is reported. The in-depth spectroscopic and microscopic analyses by high resolution transmission electron microscopy, scanning tunneling microscopy, time-of-flight secondary ion mass spectrometry, and Raman spectroscopy support the observed findings. Despite a reduction of ~37% in COC thickness, the FCVA-processed thinner COC (~1.7 nm) shows promising functional performance in terms of lower coefficient of friction (~0.25), higher wear resistance, lower surface energy, excellent hydrophobicity and similar/better oxidation corrosion resistance than current commercial COCs of thickness ~2.7 nm. The surface and tribological properties of FCVA-deposited COC was further improved after deposition of lubricant layer.

  10. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat.

    Science.gov (United States)

    Ben-David, Smadar; Yaakov, Beery; Kashkush, Khalil

    2013-10-01

    Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation. © 2013 Ben-Gurion University The Plant Journal © 2013 John Wiley & Sons Ltd.

  11. Construction of a YAC contig and STS map spanning 2.5 Mbp in Xq25, the critical region for the X-linked lymphoproliferative (XLP) gene

    Energy Technology Data Exchange (ETDEWEB)

    Lanyi, A.; Li, B.F.; Li, S. [Univ. of Nebraska Medical Center, Omaha, NE (United States)] [and others

    1994-09-01

    X-linked lymphoproliferative disease (XLP) is characterized by a marked vulnerability in Epstein-Barr virus (EBV) infection. Infection of XLP patients with EBV invariably results in fatal mononucleosis, agammaglobulinemia or B-cell lymphoma. The XLP gene lies within a 10 cM region in Xq25 between DXS42 and DXS10. Initial chromosome studies revealed an interstitial, cytogenetically visible deletion in Xq25 in one XLP family (43-004). We estimated the size of the Xq25 deletion by dual laser flow karyotyping to involve 2% of the X chromosome, or approximately 3 Mbp of DNA sequences. To further delineate the deletion we performed a series of pulsed field gel electrophoresis (PFGE) analyses which showed that DXS6 and DXS100, two Xq25-specific markers, are missing from 45-004 DNA. Five yeast artificial chromosomes (YACs) from a chromosome X specific YAC library containing sequences deleted in patient`s 43-004 DNA were isolated. These five YACs did not overlap, and their end fragments were used to screen the CEPH MegaYAC library. Seven YACs were isolated from the CEPH MegaYAC library. They could be arranged into a contig which spans between DXS6 and DXS100. The contig contains a minimum of 2.5 Mbp of human DNA. A total of 12 YAC end clone, lambda subclones and STS probes have been used to order clones within the contig. These reagents were also used in Southern blot and patients showed interstitial deletions in Xq25. The size of these deletions range between 0.5 and 2.5 Mbp. The shortest deletion probably represents the critical region for the XLP gene.

  12. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses.

    Science.gov (United States)

    Kögler, Anja; Schmidt, Thomas; Wenke, Torsten

    2017-11-01

    Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  13. A 2-megabase physical contig incorporating 43 DNA markers on the human X chromosome at p11.23-p11.22 from ZNF21 to DXS255

    Energy Technology Data Exchange (ETDEWEB)

    Boycott, K.M.; Bech-Hansen, N.T. [Univ. of Calgary, Alberta (Canada); Halley, G.R.; Schlessinger, D. [Washington Univ. School of Medicine, St. Louis, MO (United States)

    1996-05-01

    A comprehensive physical contig of yeast artificial chromosomes (YACs) and cosmid clones between ZNF21 and DXS255 has been constructed, spanning 2 Mb within the region Xp11.23-p11.22. As a portion of the region was found to be particularly unstable in yeast, the integrity of the contig is dependent on additional information provided by the sequence-tagged site (STS) content of cosmid clones and DNA marker retention in conventional and radiation hybrids. The contig was formatted with 43 DNA markers, including 19 new STSs from YAC insert ends and an internal Alu-PCR product. The density of STSs across the contig ranges from one marker every 20 kb to one every 60 kb, with an average density of one marker every 50 kb. The relative order of previously known gene and expressed sequence tags in this region is predicted to be Xpter-ZNF21-DXS7465E (MG66)-DXS7927E (MG81)-WASP, DXS1011E, DXS7467E (MG21)-DXS-7466E (MG44)-GATA1-DXS7469E (Xp664)-TFE3-SYP (DXS1007E)-Xcen. This contig extends the coverage in Xp11 and provides a framework for the future identification and mapping of new genes, as well as the resources for developing DNA sequencing templates. 47 refs., 1 fig., 4 tabs.

  14. High Potential Source for Biomass Degradation Enzyme Discovery and Environmental Aspects Revealed through Metagenomics of Indian Buffalo Rumen

    Directory of Open Access Journals (Sweden)

    K. M. Singh

    2014-01-01

    Full Text Available The complex microbiomes of the rumen functions as an effective system for plant cell wall degradation, and biomass utilization provide genetic resource for degrading microbial enzymes that could be used in the production of biofuel. Therefore the buffalo rumen microbiota was surveyed using shot gun sequencing. This metagenomic sequencing generated 3.9 GB of sequences and data were assembled into 137270 contiguous sequences (contigs. We identified potential 2614 contigs encoding biomass degrading enzymes including glycoside hydrolases (GH: 1943 contigs, carbohydrate binding module (CBM: 23 contigs, glycosyl transferase (GT: 373 contigs, carbohydrate esterases (CE: 259 contigs, and polysaccharide lyases (PE: 16 contigs. The hierarchical clustering of buffalo metagenomes demonstrated the similarities and dissimilarity in microbial community structures and functional capacity. This demonstrates that buffalo rumen microbiome was considerably enriched in functional genes involved in polysaccharide degradation with great prospects to obtain new molecules that may be applied in the biofuel industry.

  15. D20S16 is a complex interspersed repeated sequence: Genetic and physical analysis of the locus

    Energy Technology Data Exchange (ETDEWEB)

    Bowden, D.W.; Krawchuk, M.D.; Howard, T.D. [Wake Forest Univ., Winston-Salem, NC (United States)] [and others

    1995-01-20

    The genomic structure of the D20S16 locus has been evaluated using genetic and physical methods. D20S16, originally detected with the probe CRI-L1214, is a highly informative, complex restriction fragment length polymorphism consisting of two separate allelic systems. The allelic systems have the characteristics of conventional VNTR polymorphisms and are separated by recombination ({theta} = 0.02, Z{sub max} = 74.82), as demonstrated in family studies. Most of these recombination events are meiotic crossovers and are maternal in origin, but two, including deletion of the locus in a cell line from a CEPH family member, occur without evidence for exchange of flanking markers. DNA sequence analysis suggests that the basis of the polymorphism is variable numbers of a 98-bp sequence tandemly repeated with 87 to 90% sequence similarity between repeats. The 98-bp repeat is a dimer of 49 bp sequence with 45 to 98% identity between the elements. In addition, nonpolymorphic genomic sequences adjacent to the polymorphic 98-bp repeat tracts are also repeated but are not polymorphic, i.e., show no individual to individual variation. Restriction enzyme mapping of cosmids containing the CRI-L1214 sequence suggests that there are multiple interspersed repeats of the CRI-L1214 sequence on chromosome 20. The results of dual-color fluorescence in situ hybridization experiments with interphase nuclei are also consistent with multiple repeats of an interspersed sequence on chromosome 20. 23 refs., 6 figs.

  16. Transcriptome analysis reveals the time of the fourth round of genome duplication in common carp (Cyprinus carpio)

    Science.gov (United States)

    2012-01-01

    Background Common carp (Cyprinus carpio) is thought to have undergone one extra round of genome duplication compared to zebrafish. Transcriptome analysis has been used to study the existence and timing of genome duplication in species for which genome sequences are incomplete. Large-scale transcriptome data for the common carp genome should help reveal the timing of the additional duplication event. Results We have sequenced the transcriptome of common carp using 454 pyrosequencing. After assembling the 454 contigs and the published common carp sequences together, we obtained 49,669 contigs and identified genes using homology searches and an ab initio method. We identified 4,651 orthologous pairs between common carp and zebrafish and found 129,984 paralogous pairs within the common carp. An estimation of the synonymous substitution rate in the orthologous pairs indicated that common carp and zebrafish diverged 120 million years ago (MYA). We identified one round of genome duplication in common carp and estimated that it had occurred 5.6 to 11.3 MYA. In zebrafish, no genome duplication event after speciation was observed, suggesting that, compared to zebrafish, common carp had undergone an additional genome duplication event. We annotated the common carp contigs with Gene Ontology terms and KEGG pathways. Compared with zebrafish gene annotations, we found that a set of biological processes and pathways were enriched in common carp. Conclusions The assembled contigs helped us to estimate the time of the fourth-round of genome duplication in common carp. The resource that we have built as part of this study will help advance functional genomics and genome annotation studies in the future. PMID:22424280

  17. Physical mapping in highly heterozygous genomes: a physical contig map of the Pinot Noir grapevine cultivar

    Directory of Open Access Journals (Sweden)

    Jurman Irena

    2010-03-01

    Full Text Available Abstract Background Most of the grapevine (Vitis vinifera L. cultivars grown today are those selected centuries ago, even though grapevine is one of the most important fruit crops in the world. Grapevine has therefore not benefited from the advances in modern plant breeding nor more recently from those in molecular genetics and genomics: genes controlling important agronomic traits are practically unknown. A physical map is essential to positionally clone such genes and instrumental in a genome sequencing project. Results We report on the first whole genome physical map of grapevine built using high information content fingerprinting of 49,104 BAC clones from the cultivar Pinot Noir. Pinot Noir, as most grape varieties, is highly heterozygous at the sequence level. This resulted in the two allelic haplotypes sometimes assembling into separate contigs that had to be accommodated in the map framework or in local expansions of contig maps. We performed computer simulations to assess the effects of increasing levels of sequence heterozygosity on BAC fingerprint assembly and showed that the experimental assembly results are in full agreement with the theoretical expectations, given the heterozygosity levels reported for grape. The map is anchored to a dense linkage map consisting of 994 markers. 436 contigs are anchored to the genetic map, covering 342 of the 475 Mb that make up the grape haploid genome. Conclusions We have developed a resource that makes it possible to access the grapevine genome, opening the way to a new era both in grape genetics and breeding and in wine making. The effects of heterozygosity on the assembly have been analyzed and characterized by using several complementary approaches which could be easily transferred to the study of other genomes which present the same features.

  18. Dicty_cDB: Contig-U01541-1 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available ula chromosome 7 BAC clone mth2-7... 36 3.7 5 ( FG283242 ) 1108457714276 New World Screwworm Egg 9261 ESTs C...1-1... 40 3.8 2 ( AM448784 ) Vitis vinifera contig VV78X077229.13, whole genom... 40 3.8 5 ( FG299281 ) 1108793334783 New World...malized cDNA li... 34 3.8 3 ( FG298782 ) 1108793320683 New World Screwworm Larvae 9387 EST... 40 3.8 2 ( AC2...) Populus trichocarpa clone POP011-A24, complete se... 38 3.9 5 ( FG298363 ) 1108793311332 New World Screwwo...rm Larvae 9387 EST... 40 3.9 2 ( AE017263 ) Mesoplasma florum L1 complete genome. 34 3.9 11 ( FG290177 ) 1108793315292 New World

  19. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes.

    Science.gov (United States)

    Wenke, Torsten; Döbel, Thomas; Sörensen, Thomas Rosleff; Junghans, Holger; Weisshaar, Bernd; Schmidt, Thomas

    2011-09-01

    Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5' truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3' end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families.

  20. Alginate nanobeads interspersed fibrin network as in situ forming hydrogel for soft tissue engineering.

    Science.gov (United States)

    Deepthi, S; Jayakumar, R

    2018-06-01

    Hydrogels are a class of materials that has the property of injectability and in situ gel formation. This property of hydrogels is manipulated in this study to develop a biomimetic bioresorbable injectable system of alginate nanobeads interspersed in fibrin network. Alginate nanobeads developed by calcium cross-linking yielded a size of 200-500 nm. The alginate nanobeads fibrin hydrogel was formed using dual syringe apparatus. Characterization of the in situ injectable hydrogel was done by SEM, FTIR and Rheometer. The developed hydrogel showed mechanical strength of 19 kPa which provides the suitable compliance for soft tissue engineering. Cytocompatibility studies using human umbilical cord blood derived mesenchymal stem cells showed good attachment, proliferation and infiltration within the hydrogel similar to fibrin gel. The developed in situ forming hydrogel could be a suitable delivery carrier of stem cells for soft tissue regeneration.

  1. COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.

    Science.gov (United States)

    Lu, Yang Young; Chen, Ting; Fuhrman, Jed A; Sun, Fengzhu

    2017-03-15

    The advent of next-generation sequencing technologies enables researchers to sequence complex microbial communities directly from the environment. Because assembly typically produces only genome fragments, also known as contigs, instead of an entire genome, it is crucial to group them into operational taxonomic units (OTUs) for further taxonomic profiling and down-streaming functional analysis. OTU clustering is also referred to as binning. We present COCACOLA, a general framework automatically bin contigs into OTUs based on sequence composition and coverage across multiple samples. The effectiveness of COCACOLA is demonstrated in both simulated and real datasets in comparison with state-of-art binning approaches such as CONCOCT, GroopM, MaxBin and MetaBAT. The superior performance of COCACOLA relies on two aspects. One is using L 1 distance instead of Euclidean distance for better taxonomic identification during initialization. More importantly, COCACOLA takes advantage of both hard clustering and soft clustering by sparsity regularization. In addition, the COCACOLA framework seamlessly embraces customized knowledge to facilitate binning accuracy. In our study, we have investigated two types of additional knowledge, the co-alignment to reference genomes and linkage of contigs provided by paired-end reads, as well as the ensemble of both. We find that both co-alignment and linkage information further improve binning in the majority of cases. COCACOLA is scalable and faster than CONCOCT, GroopM, MaxBin and MetaBAT. The software is available at https://github.com/younglululu/COCACOLA . fsun@usc.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  2. Targeted Identification of Short Interspersed Nuclear Element Families Shows Their Widespread Existence and Extreme Heterogeneity in Plant Genomes[W

    Science.gov (United States)

    Wenke, Torsten; Döbel, Thomas; Sörensen, Thomas Rosleff; Junghans, Holger; Weisshaar, Bernd; Schmidt, Thomas

    2011-01-01

    Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5′ truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3′ end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families. PMID:21908723

  3. Alginate nanobeads interspersed fibrin network as in situ forming hydrogel for soft tissue engineering

    Directory of Open Access Journals (Sweden)

    S. Deepthi

    2018-06-01

    Full Text Available Hydrogels are a class of materials that has the property of injectability and in situ gel formation. This property of hydrogels is manipulated in this study to develop a biomimetic bioresorbable injectable system of alginate nanobeads interspersed in fibrin network. Alginate nanobeads developed by calcium cross-linking yielded a size of 200–500 nm. The alginate nanobeads fibrin hydrogel was formed using dual syringe apparatus. Characterization of the in situ injectable hydrogel was done by SEM, FTIR and Rheometer. The developed hydrogel showed mechanical strength of 19 kPa which provides the suitable compliance for soft tissue engineering. Cytocompatibility studies using human umbilical cord blood derived mesenchymal stem cells showed good attachment, proliferation and infiltration within the hydrogel similar to fibrin gel. The developed in situ forming hydrogel could be a suitable delivery carrier of stem cells for soft tissue regeneration.

  4. Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export.

    Science.gov (United States)

    Karijolich, John; Zhao, Yang; Alla, Ravi; Glaunsinger, Britt

    2017-06-02

    Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA-RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Long interspersed element-1 protein expression is a hallmark of many human cancers.

    Science.gov (United States)

    Rodić, Nemanja; Sharma, Reema; Sharma, Rajni; Zampella, John; Dai, Lixin; Taylor, Martin S; Hruban, Ralph H; Iacobuzio-Donahue, Christine A; Maitra, Anirban; Torbenson, Michael S; Goggins, Michael; Shih, Ie-Ming; Duffield, Amy S; Montgomery, Elizabeth A; Gabrielson, Edward; Netto, George J; Lotan, Tamara L; De Marzo, Angelo M; Westra, William; Binder, Zev A; Orr, Brent A; Gallia, Gary L; Eberhart, Charles G; Boeke, Jef D; Harris, Chris R; Burns, Kathleen H

    2014-05-01

    Cancers comprise a heterogeneous group of human diseases. Unifying characteristics include unchecked abilities of tumor cells to proliferate and spread anatomically, and the presence of clonal advantageous genetic changes. However, universal and highly specific tumor markers are unknown. Herein, we report widespread long interspersed element-1 (LINE-1) repeat expression in human cancers. We show that nearly half of all human cancers are immunoreactive for a LINE-1-encoded protein. LINE-1 protein expression is a common feature of many types of high-grade malignant cancers, is rarely detected in early stages of tumorigenesis, and is absent from normal somatic tissues. Studies have shown that LINE-1 contributes to genetic changes in cancers, with somatic LINE-1 insertions seen in selected types of human cancers, particularly colon cancer. We sought to correlate this observation with expression of the LINE-1-encoded protein, open reading frame 1 protein, and found that LINE-1 open reading frame 1 protein is a surprisingly broad, yet highly tumor-specific, antigen. Copyright © 2014 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.

  6. L’Anaphore associative: contigüité métonymique

    Directory of Open Access Journals (Sweden)

    Gemma Peña Martínez

    2008-04-01

    Full Text Available Cet article porte sur les rapports de contigüité exigés lors de la résolution des anaphores associatives. Il s’agit en général de relations métonymiques, car les différents rapports, à caractère notamment socioculturel, entre référent et marque anaphorique convergent dans un même cadre conceptuel, se faisant écho d’éléments ou caractéristiques du même domaine cognitif. L’anaphore associative reprenant ainsi un attribut concret du référent, nous envisageons donc une classification de ces marques anaphoriques d’après des rapports métonymiques, tels que partie à tout, objet à matière et caractéristique ou propriété à objet.

  7. Giant panda BAC library construction and assembly of a 650-kb contig spanning major histocompatibility complex class II region

    Directory of Open Access Journals (Sweden)

    Pan Hui-Juan

    2007-09-01

    Full Text Available Abstract Background Giant panda is rare and endangered species endemic to China. The low rates of reproductive success and infectious disease resistance have severely hampered the development of captive and wild populations of the giant panda. The major histocompatibility complex (MHC plays important roles in immune response and reproductive system such as mate choice and mother-fetus bio-compatibility. It is thus essential to understand genetic details of the giant panda MHC. Construction of a bacterial artificial chromosome (BAC library will provide a new tool for panda genome physical mapping and thus facilitate understanding of panda MHC genes. Results A giant panda BAC library consisting of 205,800 clones has been constructed. The average insert size was calculated to be 97 kb based on the examination of 174 randomly selected clones, indicating that the giant panda library contained 6.8-fold genome equivalents. Screening of the library with 16 giant panda PCR primer pairs revealed 6.4 positive clones per locus, in good agreement with an expected 6.8-fold genomic coverage of the library. Based on this BAC library, we constructed a contig map of the giant panda MHC class II region from BTNL2 to DAXX spanning about 650 kb by a three-step method: (1 PCR-based screening of the BAC library with primers from homologous MHC class II gene loci, end sequences and BAC clone shotgun sequences, (2 DNA sequencing validation of positive clones, and (3 restriction digest fingerprinting verification of inter-clone overlapping. Conclusion The identifications of genes and genomic regions of interest are greatly favored by the availability of this giant panda BAC library. The giant panda BAC library thus provides a useful platform for physical mapping, genome sequencing or complex analysis of targeted genomic regions. The 650 kb sequence-ready BAC contig map of the giant panda MHC class II region from BTNL2 to DAXX, verified by the three-step method, offers a

  8. Gene content and organization of a 281-kbp contig from the genome of the extremely thermophilic archaeon, Sulfolobus solfataricus P2

    NARCIS (Netherlands)

    Charlebois, R.; Confalonieri, F.; Curtis, B.; Doolittle, W.F.; Duguet, M.; Erauso, G.; Faguy, D.; Gaasterland, T.; Garrett, R.A.; Gordon, P.; Kozera, C.; Medina, N.; Oost, van der J.; Peng, X.; Ragan, M.; She, Q.; Singh, R.K.

    2000-01-01

    The sequence of a 281-kbp contig from the crenarchaeote Sulfolobus solfataricus P2 was determined and analysed. Notable features in this region include 29 ribosomal protein genes, 12 tRNA genes (four of which contain archaeal-type introns), operons encoding enzymes of histidine biosynthesis,

  9. Genome wide analysis of acute myeloid leukemia reveal leukemia specific methylome and subtype specific hypomethylation of repeats.

    Directory of Open Access Journals (Sweden)

    Marwa H Saied

    Full Text Available Methylated DNA immunoprecipitation followed by high-throughput sequencing (MeDIP-seq has the potential to identify changes in DNA methylation important in cancer development. In order to understand the role of epigenetic modulation in the development of acute myeloid leukemia (AML we have applied MeDIP-seq to the DNA of 12 AML patients and 4 normal bone marrows. This analysis revealed leukemia-associated differentially methylated regions that included gene promoters, gene bodies, CpG islands and CpG island shores. Two genes (SPHKAP and DPP6 with significantly methylated promoters were of interest and further analysis of their expression showed them to be repressed in AML. We also demonstrated considerable cytogenetic subtype specificity in the methylomes affecting different genomic features. Significantly distinct patterns of hypomethylation of certain interspersed repeat elements were associated with cytogenetic subtypes. The methylation patterns of members of the SINE family tightly clustered all leukemic patients with an enrichment of Alu repeats with a high CpG density (P<0.0001. We were able to demonstrate significant inverse correlation between intragenic interspersed repeat sequence methylation and gene expression with SINEs showing the strongest inverse correlation (R(2 = 0.7. We conclude that the alterations in DNA methylation that accompany the development of AML affect not only the promoters, but also the non-promoter genomic features, with significant demethylation of certain interspersed repeat DNA elements being associated with AML cytogenetic subtypes. MeDIP-seq data were validated using bisulfite pyrosequencing and the Infinium array.

  10. Diversity in a Polymicrobial Community Revealed by Analysis of Viromes, Endolysins and CRISPR Spacers.

    Directory of Open Access Journals (Sweden)

    Michelle Davison

    Full Text Available The polymicrobial biofilm communities in Mushroom and Octopus Spring in Yellowstone National Park (YNP are well characterized, yet little is known about the phage populations. Dominant species, Synechococcus sp. JA-2-3B'a(2-13, Synechococcus sp. JA-3-3Ab, Chloroflexus sp. Y-400-fl, and Roseiflexus sp. RS-1, contain multiple CRISPR-Cas arrays, suggesting complex interactions with phage predators. To analyze phage populations from Octopus Spring biofilms, we sequenced a viral enriched fraction. To assemble and analyze phage metagenomic data, we developed a custom module, VIRITAS, implemented within the MetAMOS framework. This module bins contigs into groups based on tetranucleotide frequencies and CRISPR spacer-protospacer matching and ORF calling. Using this pipeline we were able to assemble phage sequences into contigs and bin them into three clusters that corroborated with their potential host range. The virome contained 52,348 predicted ORFs; some were clearly phage-like; 9319 ORFs had a recognizable Pfam domain while the rest were hypothetical. Of the recognized domains with CRISPR spacer matches, was the phage endolysin used by lytic phage to disrupt cells. Analysis of the endolysins present in the thermophilic cyanophage contigs revealed a subset of characterized endolysins as well as a Glyco_hydro_108 (PF05838 domain not previously associated with sequenced cyanophages. A search for CRISPR spacer matches to all identified phage endolysins demonstrated that a majority of endolysin domains were targets. This strategy provides a general way to link host and phage as endolysins are known to be widely distributed in bacteriophage. Endolysins can also provide information about host cell wall composition and have the additional potential to be used as targets for novel therapeutics.

  11. Contig Maps and Genomic Sequencing Identify Candidate Genes in the Usher 1C Locus

    Science.gov (United States)

    Higgins, Michael J.; Day, Colleen D.; Smilinich, Nancy J.; Ni, L.; Cooper, Paul R.; Nowak, Norma J.; Davies, Chris; de Jong, Pieter J.; Hejtmancik, Fielding; Evans, Glen A.; Smith, Richard J.H.; Shows, Thomas B.

    1998-01-01

    Usher syndrome 1C (USH1C) is a congenital condition manifesting profound hearing loss, the absence of vestibular function, and eventual retinal degeneration. The USH1C locus has been mapped genetically to a 2- to 3-cM interval in 11p14–15.1 between D11S899 and D11S861. In an effort to identify the USH1C disease gene we have isolated the region between these markers in yeast artificial chromosomes (YACs) using a combination of STS content mapping and Alu–PCR hybridization. The YAC contig is ∼3.5 Mb and has located several other loci within this interval, resulting in the order CEN-LDHA-SAA1-TPH-D11S1310-(D11S1888/KCNC1)-MYOD1-D11S902D11S921-D11S1890-TEL. Subsequent haplotyping and homozygosity analysis refined the location of the disease gene to a 400-kb interval between D11S902 and D11S1890 with all affected individuals being homozygous for the internal marker D11S921. To facilitate gene identification, the critical region has been converted into P1 artificial chromosome (PAC) clones using sequence-tagged sites (STSs) mapped to the YAC contig, Alu–PCR products generated from the YACs, and PAC end probes. A contig of >50 PAC clones has been assembled between D11S1310 and D11S1890, confirming the order of markers used in haplotyping. Three PAC clones representing nearly two-thirds of the USH1C critical region have been sequenced. PowerBLAST analysis identified six clusters of expressed sequence tags (ESTs), two known genes (BIR,SUR1) mapped previously to this region, and a previously characterized but unmapped gene NEFA (DNA binding/EF hand/acidic amino-acid-rich). GRAIL analysis identified 11 CpG islands and 73 exons of excellent quality. These data allowed the construction of a transcription map for the USH1C critical region, consisting of three known genes and six or more novel transcripts. Based on their map location, these loci represent candidate disease loci for USH1C. The NEFA gene was assessed as the USH1C locus by the sequencing of an amplified NEFA

  12. Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures.

    Science.gov (United States)

    Pride, David T; Schoenfeld, Thomas

    2008-09-17

    Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs

  13. A YAC contig and an EST map in the pericentromeric region of chromosome 13 surrounding the loci for neurosensory nonsyndromic deafness (DFNB1 and DFNA3) and Limb-Girdle muscular dystrophy type 2C (LGMD2C)

    Energy Technology Data Exchange (ETDEWEB)

    Guilford, P.; Crozet, F.; Blanchard, S. [Institut Pasteur, Paris (France)] [and others

    1995-09-01

    Two forms of inherited childhood nonsyndromic deafness (DFNB1 and DFNA3) and a Duchenne-like form of progressive muscular dystrophy (LGMD2C) have been mapped to the pericentromeric region of chromosome 13. To clone the genes responsible for these diseases we constructed a yeast artificial chromosome (YAC) contig spanning an 8-cM region between the polymorphic markers D13S221. The contig comprises 24 sequence-tagged sites, among which 15 were newly obtained. This contig allowed us to order the polymorphic markers centromere- D13S175-D13S141-D13S143-D13S115-AFM128yc1-D13S292-D13S283-AFM323vh5-D13S221-telomere. Eight expressed sequence tags, previously assigned to 13q11-q12 (D13S182E, D13S183E, D13S502E, D13S504E, D13S505E, D13S837E, TUBA2, ATP1AL1), were localized on the YAC contig. YAC screening of a cDNA library derived from mouse cochlea allowed us to identify an {alpha}-tubulin gene (TUBA2) that was subsequently precisely mapped within the candidate region. 36 refs., 2 figs., 2 tabs.

  14. Long interspersed element-1 is differentially regulated by food-borne carcinogens via the aryl hydrocarbon receptor.

    Science.gov (United States)

    Okudaira, N; Okamura, T; Tamura, M; Iijma, K; Goto, M; Matsunaga, A; Ochiai, M; Nakagama, H; Kano, S; Fujii-Kuriyama, Y; Ishizaka, Y

    2013-10-10

    A single human cell contains more than 5.0 × 10(5) copies of long interspersed element-1 (L1), 80-100 of which are competent for retrotransposition (L1-RTP). Recent observations have revealed the presence of de novo L1 insertions in various tumors, but little is known about its mechanism. Here, we found that 2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine (PhIP) and 2-amino-3,8-dimethyl-imidazo[4,5-f]quinoxaline (MeIQx), food-borne carcinogens that are present in broiled meats, induced L1-RTP. This induction was dependent on a cellular cascade comprising the aryl hydrocarbon receptor (AhR), a mitogen-activated protein kinase, and CCAAT/enhancer-binding protein β. Notably, these compounds exhibited differential induction of L1-RTP. MeIQx-induced L1-RTP was dependent on AhR nuclear translocator 1 (ARNT1), a counterpart of AhR required for gene expression in response to environmental pollutants. By contrast, PhIP-induced L1-RTP did not require ARNT1 but was dependent on estrogen receptor α (ERα) and AhR repressor. In vivo studies using transgenic mice harboring the human L1 gene indicated that PhIP-induced L1-RTP was reproducibly detected in the mammary gland, which is a target organ of PhIP-induced carcinoma. Moreover, picomolar levels of each compound induced L1-RTP, which is comparable to the PhIP concentration detected in human breast milk. Data suggest that somatic cells possess machineries that induce L1-RTP in response to the carcinogenic compounds. Together with data showing that micromolar levels of heterocyclic amines (HCAs) were non-genotoxic, our observations indicate that L1-RTP by environmental compounds is a novel type of genomic instability, further suggesting that analysis of L1-RTP by HCAs is a novel approach to clarification of modes of carcinogenesis.

  15. Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures

    Directory of Open Access Journals (Sweden)

    Pride David T

    2008-09-01

    Full Text Available Abstract Background Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC, where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses. Results From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of

  16. RUDI, a short interspersed element of the V-SINE superfamily widespread in molluscan genomes.

    Science.gov (United States)

    Luchetti, Andrea; Šatović, Eva; Mantovani, Barbara; Plohl, Miroslav

    2016-06-01

    Short interspersed elements (SINEs) are non-autonomous retrotransposons that are widespread in eukaryotic genomes. They exhibit a chimeric sequence structure consisting of a small RNA-related head, an anonymous body and an AT-rich tail. Although their turnover and de novo emergence is rapid, some SINE elements found in distantly related species retain similarity in certain core segments (or highly conserved domains, HCD). We have characterized a new SINE element named RUDI in the bivalve molluscs Ruditapes decussatus and R. philippinarum and found this element to be widely distributed in the genomes of a number of mollusc species. An unexpected structural feature of RUDI is the HCD domain type V, which was first found in non-amniote vertebrate SINEs and in the SINE from one cnidarian species. In addition to the V domain, the overall sequence conservation pattern of RUDI elements resembles that found in ancient AmnSINE (~310 Myr old) and Au SINE (~320 Myr old) families, suggesting that RUDI might be among the most ancient SINE families. Sequence conservation suggests a monophyletic origin of RUDI. Nucleotide variability and phylogenetic analyses suggest long-term vertical inheritance combined with at least one horizontal transfer event as the most parsimonious explanation for the observed taxonomic distribution.

  17. The human MCP-2 gene (SCYA8): Cloning, sequence analysis, tissue expression, and assignment to the CC chemokine gene contig on chromosome 17q11.2

    Energy Technology Data Exchange (ETDEWEB)

    Van Coillie, E.; Fiten, P.; Van Damme, J.; Opdenakker, G. [Univ. of Leuven (Belgium)] [and others

    1997-03-01

    Monocyte chemotactic proteins (MCPs) form a subfamily of chemokines that recruit leukocytes to sites of inflammation and that may contribute to tumor-associated leukocyte infiltration and to the antiviral state against HIV infection. With the use of degenerate primers that were based on CC chemokine consensus sequences, the known MIP-1{alpha}/LD78{alpha}, MCP-1, and MCP-3 genes and the previously unidentified eotaxin and MCP-2 genes were isolated from a YAC contig from human chromosome 17q11.2. The amplified genomic MCP-2 fragment was used to isolate an MCP-2 cosmid from which the gene sequence was determined. The MCP-2 gene shares with the MCP-1 and MCP-3 genes a conserved intron-exon structure and a coding nucleotide sequence homology of 77%. By Northern blot analysis the 1.0-kb MCP-2 mRNA was predominantly detectable in the small intestine, peripheral blood, heart, placenta, lung, skeletal muscle, ovary, colon, spinal cord, pancreas, and thymus. Transcripts of 1.5 and 2.4 kb were found in the testis, the small intestine, and the colon. The isolation of the MCP-2 gene from the chemokine contig localized it on YAC clones of chromosome 17q11.2, which also contain the eotaxin, MCP-1, MCP-3, and NCC-1/MCP-4 genes. The combination of using degenerate primer PCR and YACs illustrates that novel genes can efficiently be isolated from gene cluster contigs with less redundancy and effort than the isolation of novel ESTs. 42 refs., 5 figs., 2 tabs.

  18. Plasmodium falciparum Plasmodium helical interspersed subtelomeric proteins contribute to cytoadherence and anchor P. falciparum erythrocyte membrane protein 1 to the host cell cytoskeleton

    DEFF Research Database (Denmark)

    Oberli, Alexander; Zurbrügg, Laura; Rusch, Sebastian

    2016-01-01

    is anchored to the cytoskeleton, and the Plasmodium helical interspersed subtelomeric (PHIST) gene family plays a role in many host cell modifications including binding the intracellular domain of PfEMP1. Here, we show that conditional reduction of the PHIST protein PFE1605w strongly reduces adhesion...... interacts with both the intracellular segment of PfEMP1 and with cytoskeletal components. This is the first report of a PHIST protein interacting with key molecules of the cytoadherence complex and the host cytoskeleton, and this functional role seems to play an essential role in the pathology of P...

  19. BoS: a large and diverse family of short interspersed elements (SINEs) in Brassica oleracea.

    Science.gov (United States)

    Zhang, Xiaoyu; Wessler, Susan R

    2005-05-01

    Short interspersed elements (SINEs) are nonautonomous non-LTR retrotransposons that populate eukaryotic genomes. Numerous SINE families have been identified in animals, whereas only a few have been described in plants. Here we describe a new family of SINEs, named BoS, that is widespread in Brassicaceae and present at approximately 2000 copies in Brassica oleracea. In addition to sharing a modular structure and target site preference with previously described SINEs, BoS elements have several unusual features. First, the head regions of BoS RNAs can adopt a distinct hairpin-like secondary structure. Second, with 15 distinct subfamilies, BoS represents one of the most diverse SINE families described to date. Third, several of the subfamilies have a mosaic structure that has arisen through the exchange of sequences between existing subfamilies, possibly during retrotransposition. Analysis of BoS subfamilies indicate that they were active during various time periods through the evolution of Brassicaceae and that active elements may still reside in some Brassica species. As such, BoS elements may be a valuable tool as phylogenetic makers for resolving outstanding issues in the evolution of species in the Brassicaceae family.

  20. ScaffoldScaffolder: solving contig orientation via bidirected to directed graph reduction.

    Science.gov (United States)

    Bodily, Paul M; Fujimoto, M Stanley; Snell, Quinn; Ventura, Dan; Clement, Mark J

    2016-01-01

    The contig orientation problem, which we formally define as the MAX-DIR problem, has at times been addressed cursorily and at times using various heuristics. In setting forth a linear-time reduction from the MAX-CUT problem to the MAX-DIR problem, we prove the latter is NP-complete. We compare the relative performance of a novel greedy approach with several other heuristic solutions. Our results suggest that our greedy heuristic algorithm not only works well but also outperforms the other algorithms due to the nature of scaffold graphs. Our results also demonstrate a novel method for identifying inverted repeats and inversion variants, both of which contradict the basic single-orientation assumption. Such inversions have previously been noted as being difficult to detect and are directly involved in the genetic mechanisms of several diseases. http://bioresearch.byu.edu/scaffoldscaffolder. paulmbodily@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Virome profiling of bats from Myanmar by metagenomic analysis of tissue samples reveals more novel Mammalian viruses.

    Directory of Open Access Journals (Sweden)

    Biao He

    Full Text Available Bats are reservoir animals harboring many important pathogenic viruses and with the capability of transmitting these to humans and other animals. To establish an effective surveillance to monitor transboundary spread of bat viruses between Myanmar and China, complete organs from the thorax and abdomen from 853 bats of six species from two Myanmar counties close to Yunnan province, China, were collected and tested for their virome through metagenomics by Solexa sequencing and bioinformatic analysis. In total, 3,742,314 reads of 114 bases were generated, and over 86% were assembled into 1,649,512 contigs with an average length of 114 bp, of which 26,698 (2% contigs were recognizable viral sequences belonging to 24 viral families. Of the viral contigs 45% (12,086/26,698 were related to vertebrate viruses, 28% (7,443/26,698 to insect viruses, 27% (7,074/26,698 to phages and 95 contigs to plant viruses. The metagenomic results were confirmed by PCR of selected viruses in all bat samples followed by phylogenetic analysis, which has led to the discovery of some novel bat viruses of the genera Mamastrovirus, Bocavirus, Circovirus, Iflavirus and Orthohepadnavirus and to their prevalence rates in two bat species. In conclusion, the present study aims to present the bat virome in Myanmar, and the results obtained further expand the spectrum of viruses harbored by bats.

  2. Comparative sequence analysis of Solanum and Arabidopsis in a hot spot for pathogen resistance on potato chromosome V reveals a patchwork of conserved and rapidly evolving genome segments

    Directory of Open Access Journals (Sweden)

    Bruggmann Rémy

    2007-05-01

    Full Text Available Abstract Background Quantitative phenotypic variation of agronomic characters in crop plants is controlled by environmental and genetic factors (quantitative trait loci = QTL. To understand the molecular basis of such QTL, the identification of the underlying genes is of primary interest and DNA sequence analysis of the genomic regions harboring QTL is a prerequisite for that. QTL mapping in potato (Solanum tuberosum has identified a region on chromosome V tagged by DNA markers GP21 and GP179, which contains a number of important QTL, among others QTL for resistance to late blight caused by the oomycete Phytophthora infestans and to root cyst nematodes. Results To obtain genomic sequence for the targeted region on chromosome V, two local BAC (bacterial artificial chromosome contigs were constructed and sequenced, which corresponded to parts of the homologous chromosomes of the diploid, heterozygous genotype P6/210. Two contiguous sequences of 417,445 and 202,781 base pairs were assembled and annotated. Gene-by-gene co-linearity was disrupted by non-allelic insertions of retrotransposon elements, stretches of diverged intergenic sequences, differences in gene content and gene order. The latter was caused by inversion of a 70 kbp genomic fragment. These features were also found in comparison to orthologous sequence contigs from three homeologous chromosomes of Solanum demissum, a wild tuber bearing species. Functional annotation of the sequence identified 48 putative open reading frames (ORF in one contig and 22 in the other, with an average of one ORF every 9 kbp. Ten ORFs were classified as resistance-gene-like, 11 as F-box-containing genes, 13 as transposable elements and three as transcription factors. Comparing potato to Arabidopsis thaliana annotated proteins revealed five micro-syntenic blocks of three to seven ORFs with A. thaliana chromosomes 1, 3 and 5. Conclusion Comparative sequence analysis revealed highly conserved collinear regions

  3. The Salmon Smai Family of Short Interspersed Repetitive Elements (Sines): Interspecific and Intraspecific Variation of the Insertion of Sines in the Genomes of Chum and Pink Salmon

    OpenAIRE

    Takasaki, N.; Yamaki, T.; Hamada, M.; Park, L.; Okada, N.

    1997-01-01

    The genomes of chum salmon and pink salmon contain a family of short interspersed repetitive elements (SINEs), designated the salmon SmaI family. It is restricted to these two species, a distribution that suggests that this SINE family might have been generated in their common ancestor. When insertions of the SmaI SINEs at 10 orthologous loci of these species were analyzed, however, it was found that there were no shared insertion sites between chum and pink salmon. Furthermore, at six loci w...

  4. Scaffold filling, contig fusion and comparative gene order inference

    Directory of Open Access Journals (Sweden)

    Rounsley Steve

    2010-06-01

    Full Text Available Abstract Background There has been a trend in increasing the phylogenetic scope of genome sequencing without finishing the sequence of the genome. Increasing numbers of genomes are being published in scaffold or contig form. Rearrangement algorithms, however, including gene order-based phylogenetic tools, require whole genome data on gene order or syntenic block order. How then can we use rearrangement algorithms to compare genomes available in scaffold form only? Can the comparative evidence predict the location of unsequenced genes? Results Our method involves optimally filling in genes missing from the scaffolds, while incorporating the augmented scaffolds directly into the rearrangement algorithms as if they were chromosomes. This is accomplished by an exact, polynomial-time algorithm. We then correct for the number of extra fusion/fission operations required to make scaffolds comparable to full assemblies. We model the relationship between the ratio of missing genes actually absent from the genome versus merely unsequenced ones, on one hand, and the increase of genomic distance after scaffold filling, on the other. We estimate the parameters of this model through simulations and by comparing the angiosperm genomes Ricinus communis and Vitis vinifera. Conclusions The algorithm solves the comparison of genomes with 18,300 genes, including 4500 missing from one genome, in less than a minute on a MacBook, putting virtually all genomes within range of the method.

  5. Scaffold filling, contig fusion and comparative gene order inference.

    Science.gov (United States)

    Muñoz, Adriana; Zheng, Chunfang; Zhu, Qian; Albert, Victor A; Rounsley, Steve; Sankoff, David

    2010-06-04

    There has been a trend in increasing the phylogenetic scope of genome sequencing without finishing the sequence of the genome. Increasing numbers of genomes are being published in scaffold or contig form. Rearrangement algorithms, however, including gene order-based phylogenetic tools, require whole genome data on gene order or syntenic block order. How then can we use rearrangement algorithms to compare genomes available in scaffold form only? Can the comparative evidence predict the location of unsequenced genes? Our method involves optimally filling in genes missing from the scaffolds, while incorporating the augmented scaffolds directly into the rearrangement algorithms as if they were chromosomes. This is accomplished by an exact, polynomial-time algorithm. We then correct for the number of extra fusion/fission operations required to make scaffolds comparable to full assemblies. We model the relationship between the ratio of missing genes actually absent from the genome versus merely unsequenced ones, on one hand, and the increase of genomic distance after scaffold filling, on the other. We estimate the parameters of this model through simulations and by comparing the angiosperm genomes Ricinus communis and Vitis vinifera. The algorithm solves the comparison of genomes with 18,300 genes, including 4500 missing from one genome, in less than a minute on a MacBook, putting virtually all genomes within range of the method.

  6. SINE_scan: an efficient tool to discover short interspersed nuclear elements (SINEs) in large-scale genomic datasets.

    Science.gov (United States)

    Mao, Hongliang; Wang, Hao

    2017-03-01

    Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences. Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DNA sequences. SINE_Scan integrates hallmark of SINE transposition, copy number and structural signals to identify a SINE element. SINE_Scan outperforms the previously published de novo SINE discovery program. It shows high sensitivity and specificity in 19 plant and animal genome assemblies, of which sizes vary from 120 Mb to 3.5 Gb. It identifies numerous new families and substantially increases the estimation of the abundance of SINEs in these genomes. The code of SINE_Scan is freely available at http://github.com/maohlzj/SINE_Scan , implemented in PERL and supported on Linux. wangh8@fudan.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  7. Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species.

    Science.gov (United States)

    Schwichtenberg, Katrin; Wenke, Torsten; Zakrzewski, Falk; Seibt, Kathrin M; Minoche, André; Dohm, Juliane C; Weisshaar, Bernd; Himmelbauer, Heinz; Schmidt, Thomas

    2016-01-01

    Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  8. Comparative genomic mapping of the bovine Fragile Histidine Triad (FHIT tumour suppressor gene: characterization of a 2 Mb BAC contig covering the locus, complete annotation of the gene, analysis of cDNA and of physiological expression profiles

    Directory of Open Access Journals (Sweden)

    Boussaha Mekki

    2006-05-01

    Full Text Available Abstract Background The Fragile Histidine Triad gene (FHIT is an oncosuppressor implicated in many human cancers, including vesical tumors. FHIT is frequently hit by deletions caused by fragility at FRA3B, the most active of human common fragile sites, where FHIT lays. Vesical tumors affect also cattle, including animals grazing in the wild on bracken fern; compounds released by the fern are known to induce chromosome fragility and may trigger cancer with the interplay of latent Papilloma virus. Results The bovine FHIT was characterized by assembling a contig of 78 BACs. Sequence tags were designed on human exons and introns and used directly to select bovine BACs, or compared with sequence data in the bovine genome database or in the trace archive of the bovine genome sequencing project, and adapted before use. FHIT is split in ten exons like in man, with exons 5 to 9 coding for a 149 amino acids protein. VISTA global alignments between bovine genomic contigs retrieved from the bovine genome database and the human FHIT region were performed. Conservation was extremely high over a 2 Mb region spanning the whole FHIT locus, including the size of introns. Thus, the bovine FHIT covers about 1.6 Mb compared to 1.5 Mb in man. Expression was analyzed by RT-PCR and Northern blot, and was found to be ubiquitous. Four cDNA isoforms were isolated and sequenced, that originate from an alternative usage of three variants of exon 4, revealing a size very close to the major human FHIT cDNAs. Conclusion A comparative genomic approach allowed to assemble a contig of 78 BACs and to completely annotate a 1.6 Mb region spanning the bovine FHIT gene. The findings confirmed the very high level of conservation between human and bovine genomes and the importance of comparative mapping to speed the annotation process of the recently sequenced bovine genome. The detailed knowledge of the genomic FHIT region will allow to study the role of FHIT in bovine cancerogenesis

  9. Comparative genomic mapping of the bovine Fragile Histidine Triad (FHIT) tumour suppressor gene: characterization of a 2 Mb BAC contig covering the locus, complete annotation of the gene, analysis of cDNA and of physiological expression profiles.

    Science.gov (United States)

    Uboldi, Cristina; Guidi, Elena; Roperto, Sante; Russo, Valeria; Roperto, Franco; Di Meo, Giulia Pia; Iannuzzi, Leopoldo; Floriot, Sandrine; Boussaha, Mekki; Eggen, André; Ferretti, Luca

    2006-05-23

    The Fragile Histidine Triad gene (FHIT) is an oncosuppressor implicated in many human cancers, including vesical tumors. FHIT is frequently hit by deletions caused by fragility at FRA3B, the most active of human common fragile sites, where FHIT lays. Vesical tumors affect also cattle, including animals grazing in the wild on bracken fern; compounds released by the fern are known to induce chromosome fragility and may trigger cancer with the interplay of latent Papilloma virus. The bovine FHIT was characterized by assembling a contig of 78 BACs. Sequence tags were designed on human exons and introns and used directly to select bovine BACs, or compared with sequence data in the bovine genome database or in the trace archive of the bovine genome sequencing project, and adapted before use. FHIT is split in ten exons like in man, with exons 5 to 9 coding for a 149 amino acids protein. VISTA global alignments between bovine genomic contigs retrieved from the bovine genome database and the human FHIT region were performed. Conservation was extremely high over a 2 Mb region spanning the whole FHIT locus, including the size of introns. Thus, the bovine FHIT covers about 1.6 Mb compared to 1.5 Mb in man. Expression was analyzed by RT-PCR and Northern blot, and was found to be ubiquitous. Four cDNA isoforms were isolated and sequenced, that originate from an alternative usage of three variants of exon 4, revealing a size very close to the major human FHIT cDNAs. A comparative genomic approach allowed to assemble a contig of 78 BACs and to completely annotate a 1.6 Mb region spanning the bovine FHIT gene. The findings confirmed the very high level of conservation between human and bovine genomes and the importance of comparative mapping to speed the annotation process of the recently sequenced bovine genome. The detailed knowledge of the genomic FHIT region will allow to study the role of FHIT in bovine cancerogenesis, especially of vesical papillomavirus-associated cancers of

  10. Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs).

    Science.gov (United States)

    Mao, Hongliang; Wang, Hao

    2017-08-01

    Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3' SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  11. A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.

    Science.gov (United States)

    Swain, Martin T; Tsai, Isheng J; Assefa, Samual A; Newbold, Chris; Berriman, Matthew; Otto, Thomas D

    2012-06-07

    Genome projects now produce draft assemblies within weeks owing to advanced high-throughput sequencing technologies. For milestone projects such as Escherichia coli or Homo sapiens, teams of scientists were employed to manually curate and finish these genomes to a high standard. Nowadays, this is not feasible for most projects, and the quality of genomes is generally of a much lower standard. This protocol describes software (PAGIT) that is used to improve the quality of draft genomes. It offers flexible functionality to close gaps in scaffolds, correct base errors in the consensus sequence and exploit reference genomes (if available) in order to improve scaffolding and generating annotations. The protocol is most accessible for bacterial and small eukaryotic genomes (up to 300 Mb), such as pathogenic bacteria, malaria and parasitic worms. Applying PAGIT to an E. coli assembly takes ∼24 h: it doubles the average contig size and annotates over 4,300 gene models.

  12. Re-annotation of the physical map of Glycine max for polyploid-like regions by BAC end sequence driven whole genome shotgun read assembly

    Directory of Open Access Journals (Sweden)

    Shultz Jeffry

    2008-07-01

    Full Text Available Abstract Background Many of the world's most important food crops have either polyploid genomes or homeologous regions derived from segmental shuffling following polyploid formation. The soybean (Glycine max genome has been shown to be composed of approximately four thousand short interspersed homeologous regions with 1, 2 or 4 copies per haploid genome by RFLP analysis, microsatellite anchors to BACs and by contigs formed from BAC fingerprints. Despite these similar regions,, the genome has been sequenced by whole genome shotgun sequence (WGS. Here the aim was to use BAC end sequences (BES derived from three minimum tile paths (MTP to examine the extent and homogeneity of polyploid-like regions within contigs and the extent of correlation between the polyploid-like regions inferred from fingerprinting and the polyploid-like sequences inferred from WGS matches. Results Results show that when sequence divergence was 1–10%, the copy number of homeologous regions could be identified from sequence variation in WGS reads overlapping BES. Homeolog sequence variants (HSVs were single nucleotide polymorphisms (SNPs; 89% and single nucleotide indels (SNIs 10%. Larger indels were rare but present (1%. Simulations that had predicted fingerprints of homeologous regions could be separated when divergence exceeded 2% were shown to be false. We show that a 5–10% sequence divergence is necessary to separate homeologs by fingerprinting. BES compared to WGS traces showed polyploid-like regions with less than 1% sequence divergence exist at 2.3% of the locations assayed. Conclusion The use of HSVs like SNPs and SNIs to characterize BACs wil improve contig building methods. The implications for bioinformatic and functional annotation of polyploid and paleopolyploid genomes show that a combined approach of BAC fingerprint based physical maps, WGS sequence and HSV-based partitioning of BAC clones from homeologous regions to separate contigs will allow reliable de

  13. RNA-Seq reveals genotype-specific molecular responses to water deficit in eucalyptus

    Science.gov (United States)

    2011-01-01

    Background In a context of climate change, phenotypic plasticity provides long-lived species, such as trees, with the means to adapt to environmental variations occurring within a single generation. In eucalyptus plantations, water availability is a key factor limiting productivity. However, the molecular mechanisms underlying the adaptation of eucalyptus to water shortage remain unclear. In this study, we compared the molecular responses of two commercial eucalyptus hybrids during the dry season. Both hybrids differ in productivity when grown under water deficit. Results Pyrosequencing of RNA extracted from shoot apices provided extensive transcriptome coverage - a catalog of 129,993 unigenes (49,748 contigs and 80,245 singletons) was generated from 398 million base pairs, or 1.14 million reads. The pyrosequencing data enriched considerably existing Eucalyptus EST collections, adding 36,985 unigenes not previously represented. Digital analysis of read abundance in 14,460 contigs identified 1,280 that were differentially expressed between the two genotypes, 155 contigs showing differential expression between treatments (irrigated vs. non irrigated conditions during the dry season), and 274 contigs with significant genotype-by-treatment interaction. The more productive genotype displayed a larger set of genes responding to water stress. Moreover, stress signal transduction seemed to involve different pathways in the two genotypes, suggesting that water shortage induces distinct cellular stress cascades. Similarly, the response of functional proteins also varied widely between genotypes: the most productive genotype decreased expression of genes related to photosystem, transport and secondary metabolism, whereas genes related to primary metabolism and cell organisation were over-expressed. Conclusions For the most productive genotype, the ability to express a broader set of genes in response to water availability appears to be a key characteristic in the maintenance

  14. Pyrosequencing the Midgut Transcriptome of the Banana Weevil Cosmopolites sordidus (Germar (Coleoptera: Curculionidae Reveals Multiple Protease-Like Transcripts.

    Directory of Open Access Journals (Sweden)

    Arnubio Valencia

    Full Text Available The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.

  15. Pyrosequencing the Midgut Transcriptome of the Banana Weevil Cosmopolites sordidus (Germar) (Coleoptera: Curculionidae) Reveals Multiple Protease-Like Transcripts.

    Science.gov (United States)

    Valencia, Arnubio; Wang, Haichuan; Soto, Alberto; Aristizabal, Manuel; Arboleda, Jorge W; Eyun, Seong-Il; Noriega, Daniel D; Siegfried, Blair

    2016-01-01

    The banana weevil Cosmopolites sordidus is an important and serious insect pest in most banana and plantain-growing areas of the world. In spite of the economic importance of this insect pest very little genomic and transcriptomic information exists for this species. In the present study, we characterized the midgut transcriptome of C. sordidus using massive 454-pyrosequencing. We generated over 590,000 sequencing reads that assembled into 30,840 contigs with more than 400 bp, representing a significant expansion of existing sequences available for this insect pest. Among them, 16,427 contigs contained one or more GO terms. In addition, 15,263 contigs were assigned an EC number. In-depth transcriptome analysis identified genes potentially involved in insecticide resistance, peritrophic membrane biosynthesis, immunity-related function and defense against pathogens, and Bacillus thuringiensis toxins binding proteins as well as multiple enzymes involved with protein digestion. This transcriptome will provide a valuable resource for understanding larval physiology and for identifying novel target sites and management approaches for this important insect pest.

  16. Male-biased genes in catfish as revealed by RNA-Seq analysis of the testis transcriptome.

    Directory of Open Access Journals (Sweden)

    Fanyue Sun

    Full Text Available BACKGROUND: Catfish has a male-heterogametic (XY sex determination system, but genes involved in gonadogenesis, spermatogenesis, testicular determination, and sex determination are poorly understood. As a first step of understanding the transcriptome of the testis, here, we conducted RNA-Seq analysis using high throughput Illumina sequencing. METHODOLOGY/PRINCIPAL FINDINGS: A total of 269.6 million high quality reads were assembled into 193,462 contigs with a N50 length of 806 bp. Of these contigs, 67,923 contigs had hits to a set of 25,307 unigenes, including 167 unique genes that had not been previously identified in catfish. A meta-analysis of expressed genes in the testis and in the gynogen (double haploid female allowed the identification of 5,450 genes that are preferentially expressed in the testis, providing a pool of putative male-biased genes. Gene ontology and annotation analysis suggested that many of these male-biased genes were involved in gonadogenesis, spermatogenesis, testicular determination, gametogenesis, gonad differentiation, and possibly sex determination. CONCLUSION/SIGNIFICANCE: We provide the first transcriptome-level analysis of the catfish testis. Our analysis would lay the basis for sequential follow-up studies of genes involved in sex determination and differentiation in catfish.

  17. A Short Interspersed Nuclear Element (SINE)-Based Real-Time PCR Approach to Detect and Quantify Porcine Component in Meat Products.

    Science.gov (United States)

    Zhang, Chi; Fang, Xin; Qiu, Haopu; Li, Ning

    2015-01-01

    Real-time PCR amplification of mitochondria gene could not be used for DNA quantification, and that of single copy DNA did not allow an ideal sensitivity. Moreover, cross-reactions among similar species were commonly observed in the published methods amplifying repetitive sequence, which hindered their further application. The purpose of this study was to establish a short interspersed nuclear element (SINE)-based real-time PCR approach having high specificity for species detection that could be used in DNA quantification. After massive screening of candidate Sus scrofa SINEs, one optimal combination of primers and probe was selected, which had no cross-reaction with other common meat species. LOD of the method was 44 fg DNA/reaction. Further, quantification tests showed this approach was practical in DNA estimation without tissue variance. Thus, this study provided a new tool for qualitative detection of porcine component, which could be promising in the QC of meat products.

  18. Retrotransposon long interspersed nucleotide element-1 (LINE-1) is activated during salamander limb regeneration

    Science.gov (United States)

    Zhu, Wei; Kuo, Dwight; Nathanson, Jason; Satoh, Akira; Pao, Gerald M.; Yeo, Gene W.; Bryant, Susan V.; Voss, S. Randal; Gardiner, David M.; Hunter, Tony

    2012-01-01

    Salamanders possess an extraordinary capacity for tissue and organ regeneration when compared to mammals. In our effort to characterize the unique transcriptional fingerprint emerging during the early phase of salamander limb regeneration, we identified transcriptional activation of some germline-specific genes within the Mexican axolotl (Ambystoma mexicanum) that is indicative of cellular reprogramming of differentiated cells into a germline-like state. In this work, we focus on one of these genes, the long interspersed nucleotide element-1 (LINE-1) retrotransposon, which is usually active in germ cells and silent in most of the somatic tissues in other organisms. LINE-1 was found to be dramatically upregulated during regeneration. In addition, higher genomic LINE-1 content was also detected in the limb regenerate when compared to that before amputation indicating that LINE-1 retrotransposition is indeed active during regeneration. Active LINE-1 retrotransposition has been suggested to have a potentially deleterious impact on genomic integrity. Silencing of activated LINE-1 by small RNAs has been reported to be part of the machinery aiming to maintain genomic integrity. Indeed, we were able to identify putative LINE-1-related piRNAs in the limb blastema. Transposable element-related piRNAs have been identified frequently in the germline in other organisms. Thus, we present here a scenario in which a unique germline-like state is established during axolotl limb regeneration, and the re-activation of LINE-1 may serve as a marker for cellular dedifferentiation in the early-stage of limb regeneration. PMID:22913491

  19. Carbon-wrapped MnO nanodendrites interspersed on reduced graphene oxide sheets as anode materials for lithium-ion batteries

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Boli; Li, Dan; Liu, Zhengjiao; Gu, Lili; Xie, Wenhe; Li, Qun; Guo, Pengqian; Liu, Dequan; He, Deyan, E-mail: hedy@lzu.edu.cn

    2017-02-01

    Highlights: • The C-MnO/rGO composites were anchored on nickel foam by a facile vacuum filtration and a subsequent thermal treatment. • The novel architecture of anodes effectively improved the electrochemical performance of lithium ion battery. • The active MnO nanodendrites became smaller nanoparticles still wrapped in graphene sheets after cycles. - Abstract: Carbon-wrapped MnO nanodendrites interspersed on reduced graphene oxide sheets (C-MnO/rGO) were prepared on nickel foam by a facile vacuum filtration and a subsequent thermal treatment. As a binder-free anode of lithium-ion battery, the nanodendritic structure of C-MnO accommodates the huge volume expansion and shortens the diffusion length for lithium ion and electron, rGO sheets prevent C-MnO nanodendites from aggregation and offer a good electronic conduction. As a result, the electrode with such a novel architecture delivers superior electrochemical properties including high reversible capacity, excellent rate capability and cycle stability. Moreover, MnO nanodendrites change to nanoparticles wrapped in graphene sheets during the lithiation/delithiation process, which is a more beneficial microstructure to further increase the specific capacity and cycle life of the electrode.

  20. Draft whole genome sequence of groundnut stem rot fungus Athelia rolfsii revealing genetic architect of its pathogenicity and virulence.

    Science.gov (United States)

    Iquebal, M A; Tomar, Rukam S; Parakhia, M V; Singla, Deepak; Jaiswal, Sarika; Rathod, V M; Padhiyar, S M; Kumar, Neeraj; Rai, Anil; Kumar, Dinesh

    2017-07-13

    Groundnut (Arachis hypogaea L.) is an important oil seed crop having major biotic constraint in production due to stem rot disease caused by fungus, Athelia rolfsii causing 25-80% loss in productivity. As chemical and biological combating strategies of this fungus are not very effective, thus genome sequencing can reveal virulence and pathogenicity related genes for better understanding of the host-parasite interaction. We report draft assembly of Athelia rolfsii genome of ~73 Mb having 8919 contigs. Annotation analysis revealed 16830 genes which are involved in fungicide resistance, virulence and pathogenicity along with putative effector and lethal genes. Secretome analysis revealed CAZY genes representing 1085 enzymatic genes, glycoside hydrolases, carbohydrate esterases, carbohydrate-binding modules, auxillary activities, glycosyl transferases and polysaccharide lyases. Repeat analysis revealed 11171 SSRs, LTR, GYPSY and COPIA elements. Comparative analysis with other existing ascomycotina genome predicted conserved domain family of WD40, CYP450, Pkinase and ABC transporter revealing insight of evolution of pathogenicity and virulence. This study would help in understanding pathogenicity and virulence at molecular level and development of new combating strategies. Such approach is imperative in endeavour of genome based solution in stem rot disease management leading to better productivity of groundnut crop in tropical region of world.

  1. Metagenomic insights into the rumen microbial fibrolytic enzymes in Indian crossbred cattle fed finger millet straw.

    Science.gov (United States)

    Jose, V Lyju; Appoothy, Thulasi; More, Ravi P; Arun, A Sha

    2017-12-01

    The rumen is a unique natural habitat, exhibiting an unparalleled genetic resource of fibrolytic enzymes of microbial origin that degrade plant polysaccharides. The objectives of this study were to identify the principal plant cell wall-degrading enzymes and the taxonomic profile of rumen microbial communities that are associated with it. The cattle rumen microflora and the carbohydrate-active enzymes were functionally classified through a whole metagenomic sequencing approach. Analysis of the assembled sequences by the Carbohydrate-active enzyme analysis Toolkit identified the candidate genes encoding fibrolytic enzymes belonging to different classes of glycoside hydrolases(11,010 contigs), glycosyltransferases (6366 contigs), carbohydrate esterases (4945 contigs), carbohydrate-binding modules (1975 contigs), polysaccharide lyases (480 contigs), and auxiliary activities (115 contigs). Phylogenetic analysis of CAZyme encoding contigs revealed that a significant proportion of CAZymes were contributed by bacteria belonging to genera Prevotella, Bacteroides, Fibrobacter, Clostridium, and Ruminococcus. The results indicated that the cattle rumen microbiome and the CAZymes are highly complex, structurally similar but compositionally distinct from other ruminants. The unique characteristics of rumen microbiota and the enzymes produced by resident microbes provide opportunities to improve the feed conversion efficiency in ruminants and serve as a reservoir of industrially important enzymes for cellulosic biofuel production.

  2. Short interspersed DNA elements and miRNAs: a novel hidden gene regulation layer in zebrafish?

    Science.gov (United States)

    Scarpato, Margherita; Angelini, Claudia; Cocca, Ennio; Pallotta, Maria M; Morescalchi, Maria A; Capriglione, Teresa

    2015-09-01

    In this study, we investigated by in silico analysis the possible correlation between microRNAs (miRNAs) and Anamnia V-SINEs (a superfamily of short interspersed nuclear elements), which belong to those retroposon families that have been preserved in vertebrate genomes for millions of years and are actively transcribed because they are embedded in the 3' untranslated region (UTR) of several genes. We report the results of the analysis of the genomic distribution of these mobile elements in zebrafish (Danio rerio) and discuss their involvement in generating miRNA gene loci. The computational study showed that the genes predicted to bear V-SINEs can be targeted by miRNAs with a very high hybridization E-value. Gene ontology analysis indicates that these genes are mainly involved in metabolic, membrane, and cytoplasmic signaling pathways. Nearly all the miRNAs that were predicted to target the V-SINEs of these genes, i.e., miR-338, miR-9, miR-181, miR-724, miR-735, and miR-204, have been validated in similar regulatory roles in mammals. The large number of genes bearing a V-SINE involved in metabolic and cellular processes suggests that V-SINEs may play a role in modulating cell responses to different stimuli and in preserving the metabolic balance during cell proliferation and differentiation. Although they need experimental validation, these preliminary results suggest that in the genome of D. rerio, as in other TE families in vertebrates, the preservation of V-SINE retroposons may also have been favored by their putative role in gene network modulation.

  3. Molecular characterization, genomic distribution and evolutionary dynamics of Short INterspersed Elements in the termite genome.

    Science.gov (United States)

    Luchetti, Andrea; Mantovani, Barbara

    2011-02-01

    Short INterspersed Elements (SINEs) in invertebrates, and especially in animal inbred genomes such that of termites, are poorly known; in this paper we characterize three new SINE families (Talub, Taluc and Talud) through the analyses of 341 sequences, either isolated from the Reticulitermes lucifugus genome or drawn from EST Genbank collection. We further add new data to the only isopteran element known so far, Talua. These SINEs are tRNA-derived elements, with an average length ranging from 258 to 372 bp. The tails are made up by poly(A) or microsatellite motifs. Their copy number varies from 7.9 × 10(3) to 10(5) copies, well within the range observed for other metazoan genomes. Species distribution, age and target site duplication analysis indicate Talud as the oldest, possibly inactive SINE originated before the onset of Isoptera (~150 Myr ago). Taluc underwent to substantial sequence changes throughout the evolution of termites and data suggest it was silenced and then re-activated in the R. lucifugus lineage. Moreover, Taluc shares a conserved sequence block with other unrelated SINEs, as observed for some vertebrate and cephalopod elements. The study of genomic environment showed that insertions are mainly surrounded by microsatellites and other SINEs, indicating a biased accumulation within non-coding regions. The evolutionary dynamics of Talu~ elements is explained through selective mechanisms acting in an inbred genome; in this respect, the study of termites' SINEs activity may provide an interesting framework to address the (co)evolution of mobile elements and the host genome.

  4. Amplification of an ancestral mammalian L1 family of long interspersed repeated DNA occurred just before the murine radiation

    International Nuclear Information System (INIS)

    Pascale, E.; Valle, E.; Furano, A.V.

    1990-01-01

    Each mammalian genus examined so far contains 50,000-100,000 members of an L1 (LINE 1) family of long interspersed repeated DNA elements. Current knowledge on the evolution of L1 families presents a paradox because, although L1 families have been in mammalian genomes since before the mammalian radiation ∼80 million years ago, most members of the L1 families are only a few million years old. Accordingly it has been suggested either that the extensive amplification that characterizes present-day L1 families did not occur in the past or that old members were removed as new one were generated. However, the authors show here that an ancestral rodent L1 family was extensively amplified ∼10 million years ago and that the relics of this amplification have persisted in modern murine genomes. This amplification occurred just before the divergence of modern murine genera from their common ancestor and identifies the murine node in the lineage of modern muroid rodents The results suggest that repeated amplification of L1 elements is a feature of the evaluation of mammalian genomes and that ancestral amplification events could provide a useful tool for determining mammalian lineages

  5. Automated High-Throughput Genotyping for Study of Global Epidemiology of Mycobacterium tuberculosis Based on Mycobacterial Interspersed Repetitive Units

    Science.gov (United States)

    Supply, Philip; Lesjean, Sarah; Savine, Evgueni; Kremer, Kristin; van Soolingen, Dick; Locht, Camille

    2001-01-01

    Large-scale genotyping of Mycobacterium tuberculosis is especially challenging, as the current typing methods are labor-intensive and the results are difficult to compare among laboratories. Here, automated typing based on variable-number tandem repeats (VNTRs) of genetic elements named mycobacterial interspersed repetitive units (MIRUs) in 12 mammalian minisatellite-like loci of M. tuberculosis is presented. This system combines analysis of multiplex PCRs on a fluorescence-based DNA analyzer with computerized automation of the genotyping. Analysis of a blinded reference set of 90 strains from 38 countries (K. Kremer et al., J. Clin. Microbiol. 37:2607–2618, 1999) demonstrated that it is 100% reproducible, sensitive, and specific for M. tuberculosis complex isolates, a performance that has not been achieved by any other typing method tested in the same conditions. MIRU-VNTRs can be used for analysis of the global genetic diversity of M. tuberculosis complex strains at different levels of evolutionary divergence. To fully exploit the portability of this typing system, a website was set up for the analysis of M. tuberculosis MIRU-VNTR genotypes via the Internet. This opens the way for global epidemiological surveillance of tuberculosis and should lead to novel insights into the evolutionary and population genetics of this major pathogen. PMID:11574573

  6. Conservation of gene cassettes among diverse viruses of the human gut.

    Directory of Open Access Journals (Sweden)

    Samuel Minot

    Full Text Available Viruses are a crucial component of the human microbiome, but large population sizes, high sequence diversity, and high frequencies of novel genes have hindered genomic analysis by high-throughput sequencing. Here we investigate approaches to metagenomic assembly to probe genome structure in a sample of 5.6 Gb of gut viral DNA sequence from six individuals. Tests showed that a new pipeline based on DeBruijn graph assembly yielded longer contigs that were able to recruit more reads than the equivalent non-optimized, single-pass approach. To characterize gene content, the database of viral RefSeq proteins was compared to the assembled viral contigs, generating a bipartite graph with functional cassettes linking together viral contigs, which revealed a high degree of connectivity between diverse genomes involving multiple genes of the same functional class. In a second step, open reading frames were grouped by their co-occurrence on contigs in a database-independent manner, revealing conserved cassettes of co-oriented ORFs. These methods reveal that free-living bacteriophages, while usually dissimilar at the nucleotide level, often have significant similarity at the level of encoded amino acid motifs, gene order, and gene orientation. These findings thus connect contemporary metagenomic analysis with classical studies of bacteriophage genomic cassettes. Software is available at https://sourceforge.net/projects/optitdba/.

  7. Tree ferns: monophyletic groups and their relationships as revealed by four protein-coding plastid loci.

    Science.gov (United States)

    Korall, Petra; Pryer, Kathleen M; Metzgar, Jordan S; Schneider, Harald; Conant, David S

    2006-06-01

    Tree ferns are a well-established clade within leptosporangiate ferns. Most of the 600 species (in seven families and 13 genera) are arborescent, but considerable morphological variability exists, spanning the giant scaly tree ferns (Cyatheaceae), the low, erect plants (Plagiogyriaceae), and the diminutive endemics of the Guayana Highlands (Hymenophyllopsidaceae). In this study, we investigate phylogenetic relationships within tree ferns based on analyses of four protein-coding, plastid loci (atpA, atpB, rbcL, and rps4). Our results reveal four well-supported clades, with genera of Dicksoniaceae (sensu ) interspersed among them: (A) (Loxomataceae, (Culcita, Plagiogyriaceae)), (B) (Calochlaena, (Dicksonia, Lophosoriaceae)), (C) Cibotium, and (D) Cyatheaceae, with Hymenophyllopsidaceae nested within. How these four groups are related to one other, to Thyrsopteris, or to Metaxyaceae is weakly supported. Our results show that Dicksoniaceae and Cyatheaceae, as currently recognised, are not monophyletic and new circumscriptions for these families are needed.

  8. Detection of a Usp-like gene in Calotropis procera plant from the de novo assembled genome contigs of the high-throughput sequencing dataset

    KAUST Repository

    Shokry, Ahmed M.

    2014-02-01

    The wild plant species Calotropis procera (C. procera) has many potential applications and beneficial uses in medicine, industry and ornamental field. It also represents an excellent source of genes for drought and salt tolerance. Genes encoding proteins that contain the conserved universal stress protein (USP) domain are known to provide organisms like bacteria, archaea, fungi, protozoa and plants with the ability to respond to a plethora of environmental stresses. However, information on the possible occurrence of Usp in C. procera is not available. In this study, we uncovered and characterized a one-class A Usp-like (UspA-like, NCBI accession No. KC954274) gene in this medicinal plant from the de novo assembled genome contigs of the high-throughput sequencing dataset. A number of GenBank accessions for Usp sequences were blasted with the recovered de novo assembled contigs. Homology modelling of the deduced amino acids (NCBI accession No. AGT02387) was further carried out using Swiss-Model, accessible via the EXPASY. Superimposition of C. procera USPA-like full sequence model on Thermus thermophilus USP UniProt protein (PDB accession No. Q5SJV7) was constructed using RasMol and Deep-View programs. The functional domains of the novel USPA-like amino acids sequence were identified from the NCBI conserved domain database (CDD) that provide insights into sequence structure/function relationships, as well as domain models imported from a number of external source databases (Pfam, SMART, COG, PRK, TIGRFAM). © 2014 Académie des sciences.

  9. Three vibrio-resistance related EST-SSR markers revealed by selective genotyping in the clam Meretrix meretrix.

    Science.gov (United States)

    Nie, Qing; Yue, Xin; Chai, Xueliang; Wang, Hongxia; Liu, Baozhong

    2013-08-01

    The clam Meretrix meretrix is an important commercial bivalve distributed in the coastal areas of South and Southeast Asia. In this study, marker-trait association analyses were performed based on the stock materials of M. meretrix with different vibrio-resistance profile obtained by selective breeding. Forty-eight EST-SSR markers were screened and 27 polymorphic SSRs of them were genotyped in the clam stocks with different resistance to Vibrio parahaemolyticus (11-R and 11-S) and to Vibrio harveyi (09-R and 09-C). Allele frequency distributions of the SSRs among different stocks were compared using Pearson's Chi-square test, and three functional EST-SSR markers (MM959, MM4765 and MM8364) were found to be associated with vibrio-resistance trait. The 140-bp allele of MM959 and 128-bp allele of MM4765 had significantly higher frequencies in resistant groups (11-R and 09-R) than in susceptive/control groups (11-S and 09-C) (P SSR markers were consistent with the three subgroups distinctions. The putative functions of contig959, contig4765 and contig8364 also suggested that the three SSR-involved genes might play important roles in immunity of M. meretrix. All these results supported that EST-SSR markers MM959, MM4765 and MM8364 were associated with vibrio-resistance and would be useful for marker-assisted selection (MAS) in M. meretrix genetic breeding. Copyright © 2013 Elsevier Ltd. All rights reserved.

  10. Mapping of Micro-Tom BAC-End Sequences to the Reference Tomato Genome Reveals Possible Genome Rearrangements and Polymorphisms

    Science.gov (United States)

    Asamizu, Erika; Shirasawa, Kenta; Hirakawa, Hideki; Sato, Shusei; Tabata, Satoshi; Yano, Kentaro; Ariizumi, Tohru; Shibata, Daisuke; Ezura, Hiroshi

    2012-01-01

    A total of 93,682 BAC-end sequences (BESs) were generated from a dwarf model tomato, cv. Micro-Tom. After removing repetitive sequences, the BESs were similarity searched against the reference tomato genome of a standard cultivar, “Heinz 1706.” By referring to the “Heinz 1706” physical map and by eliminating redundant or nonsignificant hits, 28,804 “unique pair ends” and 8,263 “unique ends” were selected to construct hypothetical BAC contigs. The total physical length of the BAC contigs was 495, 833, 423 bp, covering 65.3% of the entire genome. The average coverage of euchromatin and heterochromatin was 58.9% and 67.3%, respectively. From this analysis, two possible genome rearrangements were identified: one in chromosome 2 (inversion) and the other in chromosome 3 (inversion and translocation). Polymorphisms (SNPs and Indels) between the two cultivars were identified from the BLAST alignments. As a result, 171,792 polymorphisms were mapped on 12 chromosomes. Among these, 30,930 polymorphisms were found in euchromatin (1 per 3,565 bp) and 140,862 were found in heterochromatin (1 per 2,737 bp). The average polymorphism density in the genome was 1 polymorphism per 2,886 bp. To facilitate the use of these data in Micro-Tom research, the BAC contig and polymorphism information are available in the TOMATOMICS database. PMID:23227037

  11. PCR and magnetic bead-mediated target capture for the isolation of short interspersed nucleotide elements in fishes.

    Science.gov (United States)

    Liu, Dong; Zhu, Guoli; Tang, Wenqiao; Yang, Jinquan; Guo, Hongyi

    2012-01-01

    Short interspersed nucleotide elements (SINEs), a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR). Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR). The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180-360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNA(Leu)-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs.

  12. PCR and Magnetic Bead-Mediated Target Capture for the Isolation of Short Interspersed Nucleotide Elements in Fishes

    Directory of Open Access Journals (Sweden)

    Dong Liu

    2012-02-01

    Full Text Available Short interspersed nucleotide elements (SINEs, a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR. Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR. The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180–360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNALeu-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs.

  13. Analysis of the salivary gland transcriptome of Frankliniella occidentalis.

    Directory of Open Access Journals (Sweden)

    Candice A Stafford-Banks

    Full Text Available Saliva is known to play a crucial role in insect feeding behavior and virus transmission. Currently, little is known about the salivary glands and saliva of thrips, despite the fact that Frankliniella occidentalis (Pergande (the western flower thrips is a serious pest due to its destructive feeding, wide host range, and transmission of tospoviruses. As a first step towards characterizing thrips salivary gland functions, we sequenced the transcriptome of the primary salivary glands of F. occidentalis using short read sequencing (Illumina technology. A de novo-assembled transcriptome revealed 31,392 high quality contigs with an average size of 605 bp. A total of 12,166 contigs had significant BLASTx or tBLASTx hits (E≤1.0E-6 to known proteins, whereas a high percentage (61.24% of contigs had no apparent protein or nucleotide hits. Comparison of the F. occidentalis salivary gland transcriptome (sialotranscriptome against a published F. occidentalis full body transcriptome assembled from Roche-454 reads revealed several contigs with putative annotations associated with salivary gland functions. KEGG pathway analysis of the sialotranscriptome revealed that the majority (18 out of the top 20 predicted KEGG pathways of the salivary gland contig sequences match proteins involved in metabolism. We identified several genes likely to be involved in detoxification and inhibition of plant defense responses including aldehyde dehydrogenase, metalloprotease, glucose oxidase, glucose dehydrogenase, and regucalcin. We also identified several genes that may play a role in the extra-oral digestion of plant structural tissues including β-glucosidase and pectin lyase; and the extra-oral digestion of sugars, including α-amylase, maltase, sucrase, and α-glucosidase. This is the first analysis of a sialotranscriptome for any Thysanopteran species and it provides a foundational tool to further our understanding of how thrips interact with their plant hosts and the

  14. Long interspersed nuclear element-1 hypomethylation and oxidative stress: correlation and bladder cancer diagnostic potential.

    Directory of Open Access Journals (Sweden)

    Maturada Patchsung

    Full Text Available Although, increased oxidative stress and hypomethylation of long interspersed nuclear element-1 (LINE-1 associate with bladder cancer (BCa development, the relationship between these alterations is unknown. We evaluated the oxidative stress and hypomethylation of the LINE-1 in 61 BCa patients and 45 normal individuals. To measure the methylation levels and to differentiate the LINE-1 loci into hypermethylated, partially methylated and hypomethylated, peripheral blood cells, urinary exfoliated cells and cancerous tissues were evaluated by combined bisulfite restriction analysis PCR. The urinary total antioxidant status (TAS and plasma protein carbonyl content were determined. The LINE-1 methylation levels and patterns, especially hypomethylated loci, in the blood and urine cells of the BCa patients were different from the levels and patterns in the healthy controls. The urinary TAS was decreased, whereas the plasma protein carbonyl content was increased in the BCa patients relative to the controls. A positive correlation between the methylation of LINE-1 in the blood-derived DNA and urinary TAS was found in both the BCa and control groups. The urinary hypomethylated LINE-1 loci and the plasma protein carbonyl content provided the best diagnostic potential for BCa prediction. Based on post-diagnostic samples, the combination test improved the diagnostic power to a sensitivity of 96% and a specificity of 96%. In conclusion, decreased LINE-1 methylation is associated with increased oxidative stress both in healthy and BCa subjects across the various tissue types, implying a dose-response association. Increases in the LINE-1 hypomethylation levels and the number of hypomethylated loci in both the blood- and urine-derived cells and increase in the oxidative stress were found in the BCa patients. The combination test of the urinary hypomethylated LINE-1 loci and the plasma protein carbonyl content may be useful for BCa screening and monitoring of

  15. Polycaprolactone nanofiber interspersed collagen type-I scaffold for bone regeneration: a unique injectable osteogenic scaffold

    International Nuclear Information System (INIS)

    Baylan, Nuray; Ditto, Maggie; Lawrence, Joseph G; Yildirim-Ayan, Eda; Bhat, Samerna; Lecka-Czernik, Beata

    2013-01-01

    There is an increasing demand for an injectable cell coupled three-dimensional (3D) scaffold to be used as bone fracture augmentation material. To address this demand, a novel injectable osteogenic scaffold called PN-COL was developed using cells, a natural polymer (collagen type-I), and a synthetic polymer (polycaprolactone (PCL)). The injectable nanofibrous PN-COL is created by interspersing PCL nanofibers within pre-osteoblast cell embedded collagen type-I. This simple yet novel and powerful approach provides a great benefit as an injectable bone scaffold over other non-living bone fracture stabilization polymers, such as polymethylmethacrylate and calcium content resin-based materials. The advantages of injectability and the biomimicry of collagen was coupled with the structural support of PCL nanofibers, to create cell encapsulated injectable 3D bone scaffolds with intricate porous internal architecture and high osteoconductivity. The effects of PCL nanofiber inclusion within the cell encapsulated collagen matrix has been evaluated for scaffold size retention and osteocompatibility, as well as for MC3T3-E1 cells osteogenic activity. The structural analysis of novel bioactive material proved that the material is chemically stable enough in an aqueous solution for an extended period of time without using crosslinking reagents, but it is also viscous enough to be injected through a syringe needle. Data from long-term in vitro proliferation and differentiation data suggests that novel PN-COL scaffolds promote the osteoblast proliferation, phenotype expression, and formation of mineralized matrix. This study demonstrates for the first time the feasibility of creating a structurally competent, injectable, cell embedded bone tissue scaffold. Furthermore, the results demonstrate the advantages of mimicking the hierarchical architecture of native bone with nano- and micro-size formation through introducing PCL nanofibers within macron-size collagen fibers and in

  16. The Douglas-fir genome sequence reveals specialization of the photosynthetic apparatus in Pinaceae

    Science.gov (United States)

    David B. Neale; Patrick E. McGuire; Nicholas C. Wheeler; Kristian A. Stevens; Marc W. Crepeau; Charis Cardeno; Aleksey V. Zimin; Daniela Puiu; Geo M. Pertea; U. Uzay Sezen; Claudio Casola; Tomasz E. Koralewski; Robin Paul; Daniel Gonzalez-Ibeas; Sumaira Zaman; Richard Cronn; Mark Yandell; Carson Holt; Charles H. Langley; James A. Yorke; Steven L. Salzberg; Jill L. Wegrzyn

    2017-01-01

    A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb.) Franco (Coastal Douglas-fir) is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50...

  17. Short interspersed elements (SINEs) of squamate reptiles (Squam1 and Squam2): structure and phylogenetic significance.

    Science.gov (United States)

    Grechko, Vernata V; Kosushkin, Sergei A; Borodulina, Olga R; Butaeva, Fatima G; Darevsky, Ilya S

    2011-05-15

    Short interspersed elements (SINEs) are important nuclear molecular markers of the evolution of many eukaryotes. However, the SINEs of squamate reptile genomes have been little studied. We first identified two families of SINEs, termed Squam1 and Squam2, in the DNA of meadow lizard Darevskia praticola (Lacertidae) by performing DNA hybridization and PCR. Later, the same families of retrotransposons were found using the same methods in members of another 25 lizard families (from Iguania, Scincomorpha, Gekkota, Varanoidea, and Diploglossa infraorders) and two snake families, but their abundances in these taxa varied greatly. Both SINEs were Squamata-specific and were absent from mammals, birds, crocodiles, turtles, amphibians, and fish. Squam1 possessed some characteristics common to tRNA-related SINEs from fish and mammals, while Squam2 belonged to the tRNA(Ala) group of SINEs and had a more unusual and divergent structure. Squam2-related sequences were found in several unannotated GenBank sequences of squamate reptiles. Squam1 abundance in the Polychrotidae, Agamidae, Leiolepididae, Chamaeleonidae, Scincidae, Lacertidae, Gekkonidae, Varanidae, Helodermatidae, and two snake families were 10(2) -10(4) times higher than those in other taxa (Corytophanidae, Iguanidae, Anguidae, Cordylidae, Gerrhosauridae, Pygopodidae, and Eublepharidae). A less dramatic degree of copy number variation was observed for Squam2 in different taxa. Several Squam1 copies from Lacertidae, Chamaeleonidae, Gekkonidae, Varanidae, and Colubridae were sequenced and found to have evident orthologous features, as well as taxa-specific autapomorphies. Squam1 from Lacertidae and Chamaeleonidae could be divided into several subgroups based on sequence differences. Possible applications of these SINEs as Squamata phylogeny markers are discussed. Copyright © 2010 Wiley-Liss, Inc., A Wiley Company.

  18. The use of Open Reading frame ESTs (ORESTES for analysis of the honey bee transcriptome

    Directory of Open Access Journals (Sweden)

    Soares Ademilson EE

    2004-11-01

    Full Text Available Abstract Background The ongoing efforts to sequence the honey bee genome require additional initiatives to define its transcriptome. Towards this end, we employed the Open Reading frame ESTs (ORESTES strategy to generate profiles for the life cycle of Apis mellifera workers. Results Of the 5,021 ORESTES, 35.2% matched with previously deposited Apis ESTs. The analysis of the remaining sequences defined a set of putative orthologs whose majority had their best-match hits with Anopheles and Drosophila genes. CAP3 assembly of the Apis ORESTES with the already existing 15,500 Apis ESTs generated 3,408 contigs. BLASTX comparison of these contigs with protein sets of organisms representing distinct phylogenetic clades revealed a total of 1,629 contigs that Apis mellifera shares with different taxa. Most (41% represent genes that are in common to all taxa, another 21% are shared between metazoans (Bilateria, and 16% are shared only within the Insecta clade. A set of 23 putative genes presented a best match with human genes, many of which encode factors related to cell signaling/signal transduction. 1,779 contigs (52% did not match any known sequence. Applying a correction factor deduced from a parallel analysis performed with Drosophila melanogaster ORESTES, we estimate that approximately half of these no-match ESTs contigs (22% should represent Apis-specific genes. Conclusions The versatile and cost-efficient ORESTES approach produced minilibraries for honey bee life cycle stages. Such information on central gene regions contributes to genome annotation and also lends itself to cross-transcriptome comparisons to reveal evolutionary trends in insect genomes.

  19. High-throughput sequencing and pathway analysis reveal alteration of the pituitary transcriptome by 17α-ethynylestradiol (EE2) in female coho salmon, Oncorhynchus kisutch

    Energy Technology Data Exchange (ETDEWEB)

    Harding, Louisa B. [School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA 98195 (United States); Schultz, Irvin R. [Battelle, Marine Sciences Laboratory – Pacific Northwest National Laboratory, 1529 West Sequim Bay Road, Sequim, WA 98382 (United States); Goetz, Giles W. [School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA 98195 (United States); Luckenbach, J. Adam [Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 2725 Montlake Blvd E, Seattle, WA 98112 (United States); Center for Reproductive Biology, Washington State University, Pullman, WA 98164 (United States); Young, Graham [School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA 98195 (United States); Center for Reproductive Biology, Washington State University, Pullman, WA 98164 (United States); Goetz, Frederick W. [Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Manchester Research Station, P.O. Box 130, Manchester, WA 98353 (United States); Swanson, Penny, E-mail: penny.swanson@noaa.gov [Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 2725 Montlake Blvd E, Seattle, WA 98112 (United States); Center for Reproductive Biology, Washington State University, Pullman, WA 98164 (United States)

    2013-10-15

    Highlights: •Studied impacts of ethynylestradiol (EE2) exposure on salmon pituitary transcriptome. •High-throughput sequencing, RNAseq, and pathway analysis were performed. •EE2 altered mRNAs for genes in circadian rhythm, GnRH, and TGFβ signaling pathways. •LH and FSH beta subunit mRNAs were most highly up- and down-regulated by EE2, respectively. •Estrogens may alter processes associated with reproductive timing in salmon. -- Abstract: Considerable research has been done on the effects of endocrine disrupting chemicals (EDCs) on reproduction and gene expression in the brain, liver and gonads of teleost fish, but information on impacts to the pituitary gland are still limited despite its central role in regulating reproduction. The aim of this study was to further our understanding of the potential effects of natural and synthetic estrogens on the brain–pituitary–gonad axis in fish by determining the effects of 17α-ethynylestradiol (EE2) on the pituitary transcriptome. We exposed sub-adult coho salmon (Oncorhynchus kisutch) to 0 or 12 ng EE2/L for up to 6 weeks and effects on the pituitary transcriptome of females were assessed using high-throughput Illumina{sup ®} sequencing, RNA-Seq and pathway analysis. After 1 or 6 weeks, 218 and 670 contiguous sequences (contigs) respectively, were differentially expressed in pituitaries of EE2-exposed fish relative to control. Two of the most highly up- and down-regulated contigs were luteinizing hormone β subunit (241-fold and 395-fold at 1 and 6 weeks, respectively) and follicle-stimulating hormone β subunit (−3.4-fold at 6 weeks). Additional contigs related to gonadotropin synthesis and release were differentially expressed in EE2-exposed fish relative to controls. These included contigs involved in gonadotropin releasing hormone (GNRH) and transforming growth factor-β signaling. There was an over-representation of significantly affected contigs in 33 and 18 canonical pathways at 1 and 6 weeks

  20. High-throughput sequencing and pathway analysis reveal alteration of the pituitary transcriptome by 17α-ethynylestradiol (EE2) in female coho salmon, Oncorhynchus kisutch

    International Nuclear Information System (INIS)

    Harding, Louisa B.; Schultz, Irvin R.; Goetz, Giles W.; Luckenbach, J. Adam; Young, Graham; Goetz, Frederick W.; Swanson, Penny

    2013-01-01

    Highlights: •Studied impacts of ethynylestradiol (EE2) exposure on salmon pituitary transcriptome. •High-throughput sequencing, RNAseq, and pathway analysis were performed. •EE2 altered mRNAs for genes in circadian rhythm, GnRH, and TGFβ signaling pathways. •LH and FSH beta subunit mRNAs were most highly up- and down-regulated by EE2, respectively. •Estrogens may alter processes associated with reproductive timing in salmon. -- Abstract: Considerable research has been done on the effects of endocrine disrupting chemicals (EDCs) on reproduction and gene expression in the brain, liver and gonads of teleost fish, but information on impacts to the pituitary gland are still limited despite its central role in regulating reproduction. The aim of this study was to further our understanding of the potential effects of natural and synthetic estrogens on the brain–pituitary–gonad axis in fish by determining the effects of 17α-ethynylestradiol (EE2) on the pituitary transcriptome. We exposed sub-adult coho salmon (Oncorhynchus kisutch) to 0 or 12 ng EE2/L for up to 6 weeks and effects on the pituitary transcriptome of females were assessed using high-throughput Illumina ® sequencing, RNA-Seq and pathway analysis. After 1 or 6 weeks, 218 and 670 contiguous sequences (contigs) respectively, were differentially expressed in pituitaries of EE2-exposed fish relative to control. Two of the most highly up- and down-regulated contigs were luteinizing hormone β subunit (241-fold and 395-fold at 1 and 6 weeks, respectively) and follicle-stimulating hormone β subunit (−3.4-fold at 6 weeks). Additional contigs related to gonadotropin synthesis and release were differentially expressed in EE2-exposed fish relative to controls. These included contigs involved in gonadotropin releasing hormone (GNRH) and transforming growth factor-β signaling. There was an over-representation of significantly affected contigs in 33 and 18 canonical pathways at 1 and 6 weeks

  1. The genome and proteome of a Campylobacter coli bacteriophage vB_CcoM-IBB_35 reveal unusual features

    Directory of Open Access Journals (Sweden)

    Carvalho Carla M

    2012-01-01

    Full Text Available Abstract Background Campylobacter is the leading cause of foodborne diseases worldwide. Bacteriophages (phages are naturally occurring predators of bacteria, ubiquitous in the environment, with high host specificity and thus considered an appealing option to control bacterial pathogens. Nevertheless for an effective use of phages as antimicrobial agents, it is important to understand phage biology which renders crucial the analysis of phage genomes and proteomes. The lack of sequence data from Campylobacter phages adds further importance to these studies. Methods vB_CcoM-IBB_35 is a broad lytic spectrum Myoviridae Campylobacter phage with high potential for therapeutic use. The genome of this phage was obtained by pyrosequencing and the sequence data was further analyzed. The proteomic analysis was performed by SDS-PAGE and Mass spectrometry. Results and conclusions The DNA sequence data of vB_CcoM-IBB_35 consists of five contigs for a total of 172,065 bp with an average GC content of 27%. Attempts to close the gaps between contigs were unsuccessful since the DNA preparations appear to contain substances that inhibited Taq and ϕ29 polymerases. From the 210 identified ORFs, around 60% represent proteins that were not functionally assigned. Homology exists with members of the Teequatrovirinae namely for T4 proteins involved in morphogenesis, nucleotide metabolism, transcription, DNA replication and recombination. Tandem mass spectrometric analysis revealed 38 structural proteins as part of the mature phage particle. Conclusions Genes encoding proteins involved in the carbohydrate metabolism along with several incidences of gene duplications, split genes with inteins and introns have been rarely found in other phage genomes yet are found in this phage. We identified the genes encoding for tail fibres and for the lytic cassette, this later, expressing enzymes for bacterial capsular polysaccharides (CPS degradation, which has not been reported

  2. The Structure of Neurexin 1[alpha] Reveals Features Promoting a Role as Synaptic Organizer

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Fang; Venugopal, Vandavasi; Murray, Beverly; Rudenko, Gabby (Michigan)

    2014-10-02

    {alpha}-Neurexins are essential synaptic adhesion molecules implicated in autism spectrum disorder and schizophrenia. The {alpha}-neurexin extracellular domain consists of six LNS domains interspersed by three EGF-like repeats and interacts with many different proteins in the synaptic cleft. To understand how {alpha}-neurexins might function as synaptic organizers, we solved the structure of the neurexin 1{alpha} extracellular domain (n1{alpha}) to 2.65 {angstrom}. The L-shaped molecule can be divided into a flexible repeat I (LNS1-EGF-A-LNS2), a rigid horseshoe-shaped repeat II (LNS3-EGF-B-LNS4) with structural similarity to so-called reelin repeats, and an extended repeat III (LNS5-EGF-B-LNS6) with controlled flexibility. A 2.95 {angstrom} structure of n1{alpha} carrying splice insert SS3 in LNS4 reveals that SS3 protrudes as a loop and does not alter the rigid arrangement of repeat II. The global architecture imposed by conserved structural features enables {alpha}-neurexins to recruit and organize proteins in distinct and variable ways, influenced by splicing, thereby promoting synaptic function.

  3. Identification and characterisation of Short Interspersed Nuclear Elements in the olive tree (Olea europaea L.) genome.

    Science.gov (United States)

    Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Giordani, Tommaso; Cavallini, Andrea

    2017-02-01

    Short Interspersed Nuclear Elements (SINEs) are nonautonomous retrotransposons in the genome of most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, SINE identification has been carried out only in a limited number of plant species. This lack of information is apparent especially in non-model plants whose genome has not been sequenced yet. The aim of this work was to produce a specific bioinformatics pipeline for analysing second generation sequence reads of a non-model species and identifying SINEs. We have identified, for the first time, 227 putative SINEs of the olive tree (Olea europaea), that constitute one of the few sets of such sequences in dicotyledonous species. The identified SINEs ranged from 140 to 362 bp in length and were characterised with regard to the occurrence of the tRNA domain in their sequence. The majority of identified elements resulted in single copy or very lowly repeated, often in association with genic sequences. Analysis of sequence similarity allowed us to identify two major groups of SINEs showing different abundances in the olive tree genome, the former with sequence similarity to SINEs of Scrophulariaceae and Solanaceae and the latter to SINEs of Salicaceae. A comparison of sequence conservation between olive SINEs and LTR retrotransposon families suggested that SINE expansion in the genome occurred especially in very ancient times, before LTR retrotransposon expansion, and presumably before the separation of the rosids (to which Oleaceae belong) from the Asterids. Besides providing data on olive SINEs, our results demonstrate the suitability of the pipeline employed for SINE identification. Applying this pipeline will favour further structural and functional analyses on these relatively unknown elements to be performed also in other plant species, even in the absence of a reference genome, and will allow establishing general evolutionary patterns for this kind of repeats in

  4. Metatranscriptome analysis of fungal strains Penicillium camemberti and Geotrichum candidum reveal cheese matrix breakdown and potential development of sensory properties of ripened Camembert-type cheese.

    Science.gov (United States)

    Lessard, Marie-Hélène; Viel, Catherine; Boyle, Brian; St-Gelais, Daniel; Labrie, Steve

    2014-03-26

    Camembert-type cheese ripening is driven mainly by fungal microflora including Geotrichum candidum and Penicillium camemberti. These species are major contributors to the texture and flavour of typical bloomy rind cheeses. Biochemical studies showed that G. candidum reduces bitterness, enhances sulphur flavors through amino acid catabolism and has an impact on rind texture, firmness and thickness, while P. camemberti is responsible for the white and bloomy aspect of the rind, and produces enzymes involved in proteolysis and lipolysis activities. However, very little is known about the genetic determinants that code for these activities and their expression profile over time during the ripening process. The metatranscriptome of an industrial Canadian Camembert-type cheese was studied at seven different sampling days over 77 days of ripening. A database called CamemBank01 was generated, containing a total of 1,060,019 sequence tags (reads) assembled in 7916 contigs. Sequence analysis revealed that 57% of the contigs could be affiliated to molds, 16% originated from yeasts, and 27% could not be identified. According to the functional annotation performed, the predominant processes during Camembert ripening include gene expression, energy-, carbohydrate-, organic acid-, lipid- and protein- metabolic processes, cell growth, and response to different stresses. Relative expression data showed that these functions occurred mostly in the first two weeks of the ripening period. These data provide further advances in our knowledge about the biological activities of the dominant ripening microflora of Camembert cheese and will help select biological markers to improve cheese quality assessment.

  5. Comparative Transcriptome Analysis Identifies Candidate Genes Related to Skin Color Differentiation in Red Tilapia.

    Science.gov (United States)

    Zhu, Wenbin; Wang, Lanmei; Dong, Zaijie; Chen, Xingting; Song, Feibiao; Liu, Nian; Yang, Hui; Fu, Jianjun

    2016-08-11

    Red tilapia is becoming more popular for aquaculture production in China in recent years. However, the pigmentation differentiation in genetic breeding is the main problem limiting its development of commercial red tilapia culture and the genetic basis of skin color variation is still unknown. In this study, we conducted Illumina sequencing of transcriptome on three color variety red tilapia. A total of 224,895,758 reads were generated, resulting in 160,762 assembled contigs that were used as reference contigs. The contigs of red tilapia transcriptome had hits in the range of 53.4% to 86.7% of the unique proteins of zebrafish, fugu, medaka, three-spined stickleback and tilapia. And 44,723 contigs containing 77,423 simple sequence repeats (SSRs) were identified, with 16,646 contigs containing more than one SSR. Three skin transcriptomes were compared pairwise and the results revealed that there were 148 common significantly differentially expressed unigenes and several key genes related to pigment synthesis, i.e. tyr, tyrp1, silv, sox10, slc24a5, cbs and slc7a11, were included. The results will facilitate understanding the molecular mechanisms of skin pigmentation differentiation in red tilapia and accelerate the molecular selection of the specific strain with consistent skin colors.

  6. The Douglas-Fir Genome Sequence Reveals Specialization of the Photosynthetic Apparatus in Pinaceae

    Directory of Open Access Journals (Sweden)

    David B. Neale

    2017-09-01

    Full Text Available A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb. Franco (Coastal Douglas-fir is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50 = 340,704 bp. Incremental improvements in sequencing and assembly technologies are in part responsible for the higher quality reference genome, but it may also be due to a slightly lower exact repeat content in Douglas-fir vs. pine and spruce. Comparative genome annotation with angiosperm species reveals gene-family expansion and contraction in Douglas-fir and other conifers which may account for some of the major morphological and physiological differences between the two major plant groups. Notable differences in the size of the NDH-complex gene family and genes underlying the functional basis of shade tolerance/intolerance were observed. This reference genome sequence not only provides an important resource for Douglas-fir breeders and geneticists but also sheds additional light on the evolutionary processes that have led to the divergence of modern angiosperms from the more ancient gymnosperms.

  7. De novo assembly and comparison of the ovarian transcriptomes of the common Chinese cuttlefish (Sepiella japonica with different gonadal development

    Directory of Open Access Journals (Sweden)

    Zhenming Lü

    2016-03-01

    Full Text Available The common Chinese cuttlefish (Sepiella japonica has been considered one of the most economically important marine Cephalopod species in East Asia and seed breeding technology has been established for massive aquaculture and stock enhancement. In the present study, we used Illumina HiSeq2000 to sequence, assemble and annotate the transcriptome of the ovary tissues of S. japonica for the first time. A total of 53,116,650 and 53,446,640 reads were obtained from the immature and matured ovaries, respectively (NCBI SRA database SRX1409472 and SRX1409473, and 70,039 contigs (N50 = 1443 bp were obtained after de novo assembling with Trinity software. Digital gene expression analysis reveals 47,288 contigs show differential expression profile and 793 contigs are highly expressed in the immature ovary, while 38 contigs are highly expressed in the mature ovary with FPKM >100. We hope that the ovarian transcriptome and those stage-enriched transcripts of S. japonica can provide some insight into the understanding of genome-wide transcriptome profile of cuttlefish gonad tissue and give useful information in cuttlefish gonad development. Keywords: Cuttlefish, Gonad development, Transcriptome

  8. Analysis of the 227 bp short interspersed nuclear element (SINE insertion of the promoter of the myostatin (MSTN gene in different horse breeds

    Directory of Open Access Journals (Sweden)

    Stefania Dall'Olio

    2014-09-01

    Full Text Available The myostatin (MSTN gene encodes a protein known to be a negative regulator of muscle mass in mammalian species. Different polymorphisms of the horse (Equus caballus MSTN gene have been identified, including single nucleotide polymorphisms and a short interspersed nuclear element (SINE insertion of 227 bp within the promoter of the gene. The SINE insertion has been associated with performance traits in Thoroughbred racehorses and it was proposed as a predictor of optimum racing distance. The aims of this study were to perform in silico analysis to identify putative gains or abrogation of transcription-factor binding sites (TFBSs generated by the SINE allele of the promoter and to analyse the frequency of the SINE insertion in horses used for racing (gallop and trot and other purposes. The SINE insertion was genotyped in 227 horses from 10 breeds belonging to different morphological types (brachimorphic, mesomorphic, meso-dolichomorphic and dolichomorphic. The presence of the insertion was confirmed in the Quarter Horse (SINE allele frequency of 0.81 and in the Thoroughbred (0.51, whereas the SINE allele did not segregate in any of the other analysed breeds. As the SINE MSTN gene polymorphism may be population or breed specific, it is not a useful marker for association studies in all breeds.

  9. Analysis of the 227 bp short interspersed nuclear element (SINE) insertion of the promoter of the myostatin (MSTN) gene in different horse breeds.

    Science.gov (United States)

    Dall'Olio, Stefania; Scotti, Emilio; Fontanesi, Luca; Tassinari, Marco

    2014-01-01

    The myostatin (MSTN) gene encodes a protein known to be a negative regulator of muscle mass in mammalian species. Different polymorphisms of the horse (Equus caballus) MSTN gene have been identified, including single nucleotide polymorphisms and a short interspersed nuclear element (SINE) insertion of 227 bp within the promoter of the gene. The SINE insertion has been associated with performance traits in Thoroughbred racehorses and it was proposed as a predictor of optimum racing distance. The aims of this study were to perform in silico analysis to identify putative gains or abrogation of transcription-factor binding sites (TFBSs) generated by the SINE allele of the promoter and to analyse the frequency of the SINE insertion in horses used for racing (gallop and trot) and other purposes. The SINE insertion was genotyped in 227 horses from 10 breeds belonging to different morphological types (brachimorphic, mesomorphic, meso-dolichomorphic and dolichomorphic). The presence of the insertion was confirmed in the Quarter Horse (SINE allele frequency of 0.81) and in the Thoroughbred (0.51), whereas the SINE allele did not segregate in any of the other analysed breeds. As the SINE MSTN gene polymorphism may be population or breed specific, it is not a useful marker for association studies in all breeds.

  10. Ecology and evolution of viruses infecting uncultivated SUP05 bacteria as revealed by single-cell- and meta-genomics.

    Science.gov (United States)

    Roux, Simon; Hawley, Alyse K; Torres Beltran, Monica; Scofield, Melanie; Schwientek, Patrick; Stepanauskas, Ramunas; Woyke, Tanja; Hallam, Steven J; Sullivan, Matthew B

    2014-08-29

    Viruses modulate microbial communities and alter ecosystem functions. However, due to cultivation bottlenecks, specific virus-host interaction dynamics remain cryptic. In this study, we examined 127 single-cell amplified genomes (SAGs) from uncultivated SUP05 bacteria isolated from a model marine oxygen minimum zone (OMZ) to identify 69 viral contigs representing five new genera within dsDNA Caudovirales and ssDNA Microviridae. Infection frequencies suggest that ∼1/3 of SUP05 bacteria is viral-infected, with higher infection frequency where oxygen-deficiency was most severe. Observed Microviridae clonality suggests recovery of bloom-terminating viruses, while systematic co-infection between dsDNA and ssDNA viruses posits previously unrecognized cooperation modes. Analyses of 186 microbial and viral metagenomes revealed that SUP05 viruses persisted for years, but remained endemic to the OMZ. Finally, identification of virus-encoded dissimilatory sulfite reductase suggests SUP05 viruses reprogram their host's energy metabolism. Together, these results demonstrate closely coupled SUP05 virus-host co-evolutionary dynamics with the potential to modulate biogeochemical cycling in climate-critical and expanding OMZs.

  11. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    Science.gov (United States)

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  12. A contig-based strategy for the genome-wide discovery of microRNAs without complete genome resources.

    Directory of Open Access Journals (Sweden)

    Jun-Zhi Wen

    Full Text Available MicroRNAs (miRNAs are important regulators of many cellular processes and exist in a wide range of eukaryotes. High-throughput sequencing is a mainstream method of miRNA identification through which it is possible to obtain the complete small RNA profile of an organism. Currently, most approaches to miRNA identification rely on a reference genome for the prediction of hairpin structures. However, many species of economic and phylogenetic importance are non-model organisms without complete genome sequences, and this limits miRNA discovery. Here, to overcome this limitation, we have developed a contig-based miRNA identification strategy. We applied this method to a triploid species of edible banana (GCTCV-119, Musa spp. AAA group and identified 180 pre-miRNAs and 314 mature miRNAs, which is three times more than those were predicted by the available dataset-based methods (represented by EST+GSS. Based on the recently published miRNA data set of Musa acuminate, the recall rate and precision of our strategy are estimated to be 70.6% and 92.2%, respectively, significantly better than those of EST+GSS-based strategy (10.2% and 50.0%, respectively. Our novel, efficient and cost-effective strategy facilitates the study of the functional and evolutionary role of miRNAs, as well as miRNA-based molecular breeding, in non-model species of economic or evolutionary interest.

  13. Sequencing of BAC pools by different next generation sequencing platforms and strategies

    Directory of Open Access Journals (Sweden)

    Scholz Uwe

    2011-10-01

    Full Text Available Abstract Background Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. Results Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. Conclusion Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.

  14. DNA Methylation Status of the Interspersed Repetitive Sequences for LINE-1, Alu, HERV-E, and HERV-K in Trabeculectomy Specimens from Glaucoma Eyes

    Directory of Open Access Journals (Sweden)

    Sunee Chansangpetch

    2018-01-01

    Full Text Available Background/Aims. Epigenetic mechanisms via DNA methylation may be related to glaucoma pathogenesis. This study aimed to determine the global DNA methylation level of the trabeculectomy specimens among patients with different types of glaucoma and normal subjects. Methods. Trabeculectomy sections from 16 primary open-angle glaucoma (POAG, 12 primary angle-closure glaucoma (PACG, 16 secondary glaucoma patients, and 10 normal controls were assessed for DNA methylation using combined-bisulfite restriction analysis. The percentage of global methylation level of the interspersed repetitive sequences for LINE-1, Alu, HERV-E, and HERV-K were compared between the 4 groups. Results. There were no significant differences in the methylation for LINE-1 and HERV-E between patients and normal controls. For the Alu marker, the methylation was significantly lower in all types of glaucoma patients compared to controls (POAG 52.19% versus control 52.83%, p=0.021; PACG 51.50% versus control, p=0.005; secondary glaucoma 51.95% versus control, p=0.014, whereas the methylation level of HERV-K was statistically higher in POAG patients compared to controls (POAG 49.22% versus control 48.09%, p=0.017. Conclusions. The trabeculectomy sections had relative DNA hypomethylation of Alu in all glaucoma subtypes and relative DNA hypermethylation of HERV-K in POAG patients. These methylation changes may lead to the fibrotic phenotype in the trabecular meshwork.

  15. Fine typing of methicillin-resistant Staphylococcus aureus isolates using direct repeat unit and staphylococcal interspersed repeat unit typing methods.

    Science.gov (United States)

    Ho, Cheng-Mao; Ho, Mao-Wang; Li, Chi-Yuan; Lu, Jang-Jih

    2015-08-01

    Methicillin-resistant Staphylococcus aureus (MRSA) typing is an important epidemiologic tool for monitoring trends and preventing outbreaks. However, the efficiency of various MRSA typing methods for each SCCmec MRSA isolate is rarely evaluated. A total of 157 MRSA isolates from four different regions in Taiwan were typed with five different molecular methods, including SCCmec typing, multilocus sequence typing (MLST), spa typing, mec-associated direct repeat unit (dru) copy number determination, and staphylococcal interspersed repeat unit (SIRU) profiling. There were four SCCmec types, eight MLST types, 15 spa types, 11 dru types, and 31 SIRU profiles. The most common type determined by each molecular typing method was SCCmec III (115 isolates, 73.2%), ST239 (99 isolates, 63.1%), t037 (107 isolates, 68.2%), 14 dru copies (76 isolates, 48.4%), and SIRU profile 3013722 (102 isolates, 65%), respectively. When using the combination of MLST, spa typing, and dru copy number, ST5-t002-4 (n = 8), ST239-t037-14 (n = 68), ST59-t437-9 (n = 9), and ST59-t437-11 (n = 6) were found to be the most common types of SCCmec types II (n = 9), III (n = 115), IV (n = 21), and VT (n = 11) isolates, respectively. SCCmec type III isolates were further classified into 11 dru types. Of the 21 SCCmec type IV isolates, 14 SIRU profiles were found. Seven SIRU patterns were observed in the 11 SCCmec type VT isolates. Different typing methods showed a similar Hunter-Gaston discrimination index among the 157 MRSA isolates. However, dru and SIRU typing methods had a better discriminatory power for SCCmec type III and SCCmec types IV and VT isolates, respectively, suggesting that dru and SIRU can be used to further type these isolates. Copyright © 2013. Published by Elsevier B.V.

  16. A segment of the apospory-specific genomic region is highly microsyntenic not only between the apomicts Pennisetum squamulatum and buffelgrass, but also with a rice chromosome 11 centromeric-proximal genomic region.

    Science.gov (United States)

    Gualtieri, Gustavo; Conner, Joann A; Morishige, Daryl T; Moore, L David; Mullet, John E; Ozias-Akins, Peggy

    2006-03-01

    Bacterial artificial chromosome (BAC) clones from apomicts Pennisetum squamulatum and buffelgrass (Cenchrus ciliaris), isolated with the apospory-specific genomic region (ASGR) marker ugt197, were assembled into contigs that were extended by chromosome walking. Gene-like sequences from contigs were identified by shotgun sequencing and BLAST searches, and used to isolate orthologous rice contigs. Additional gene-like sequences in the apomicts' contigs were identified by bioinformatics using fully sequenced BACs from orthologous rice contigs as templates, as well as by interspecies, whole-contig cross-hybridizations. Hierarchical contig orthology was rapidly assessed by constructing detailed long-range contig molecular maps showing the distribution of gene-like sequences and markers, and searching for microsyntenic patterns of sequence identity and spatial distribution within and across species contigs. We found microsynteny between P. squamulatum and buffelgrass contigs. Importantly, this approach also enabled us to isolate from within the rice (Oryza sativa) genome contig Rice A, which shows the highest microsynteny and is most orthologous to the ugt197-containing C1C buffelgrass contig. Contig Rice A belongs to the rice genome database contig 77 (according to the current September 12, 2003, rice fingerprint contig build) that maps proximal to the chromosome 11 centromere, a feature that interestingly correlates with the mapping of ASGR-linked BACs proximal to the centromere or centromere-like sequences. Thus, relatedness between these two orthologous contigs is supported both by their molecular microstructure and by their centromeric-proximal location. Our discoveries promote the use of a microsynteny-based positional-cloning approach using the rice genome as a template to aid in constructing the ASGR toward the isolation of genes underlying apospory.

  17. A Segment of the Apospory-Specific Genomic Region Is Highly Microsyntenic Not Only between the Apomicts Pennisetum squamulatum and Buffelgrass, But Also with a Rice Chromosome 11 Centromeric-Proximal Genomic Region1[W

    Science.gov (United States)

    Gualtieri, Gustavo; Conner, Joann A.; Morishige, Daryl T.; Moore, L. David; Mullet, John E.; Ozias-Akins, Peggy

    2006-01-01

    Bacterial artificial chromosome (BAC) clones from apomicts Pennisetum squamulatum and buffelgrass (Cenchrus ciliaris), isolated with the apospory-specific genomic region (ASGR) marker ugt197, were assembled into contigs that were extended by chromosome walking. Gene-like sequences from contigs were identified by shotgun sequencing and BLAST searches, and used to isolate orthologous rice contigs. Additional gene-like sequences in the apomicts' contigs were identified by bioinformatics using fully sequenced BACs from orthologous rice contigs as templates, as well as by interspecies, whole-contig cross-hybridizations. Hierarchical contig orthology was rapidly assessed by constructing detailed long-range contig molecular maps showing the distribution of gene-like sequences and markers, and searching for microsyntenic patterns of sequence identity and spatial distribution within and across species contigs. We found microsynteny between P. squamulatum and buffelgrass contigs. Importantly, this approach also enabled us to isolate from within the rice (Oryza sativa) genome contig Rice A, which shows the highest microsynteny and is most orthologous to the ugt197-containing C1C buffelgrass contig. Contig Rice A belongs to the rice genome database contig 77 (according to the current September 12, 2003, rice fingerprint contig build) that maps proximal to the chromosome 11 centromere, a feature that interestingly correlates with the mapping of ASGR-linked BACs proximal to the centromere or centromere-like sequences. Thus, relatedness between these two orthologous contigs is supported both by their molecular microstructure and by their centromeric-proximal location. Our discoveries promote the use of a microsynteny-based positional-cloning approach using the rice genome as a template to aid in constructing the ASGR toward the isolation of genes underlying apospory. PMID:16415213

  18. The testis and ovary transcriptomes of the rock bream (Oplegnathus fasciatus: A bony fish with a unique neo Y chromosome

    Directory of Open Access Journals (Sweden)

    Dongdong Xu

    2016-03-01

    Full Text Available The rock bream (Oplegnathus fasciatus is considerably one of the most economically important marine fish in East Asia and has a unique neo-Y chromosome system that is a good model to study the sex determination and differentiation in fish. In the present study, we used Illumina sequencing technology (HiSeq2000 to sequence, assemble and annotate the transcriptome of the testis and ovary tissues of rock bream. A total of 40,004,378 (NCBI SRA database SRX1406649 and 53,108,992 (NCBI SRA database SRX1406648 high quality reads were obtained from testis and ovary RNA sequencing, respectively, and 60,421 contigs (with average length of 1301 bp were obtained after de novo assembling with Trinity software. Digital gene expression analysis reveals 14,036 contigs that show gender-enriched expressional profile with either testis-enriched (237 contigs or ovary-enriched (581 contigs with RPKM >100. There are 237 male- and 582 female-abundant expressed genes that show sex dimorphic expression. We hope that the gonad transcriptome and those gender-enriched transcripts of rock bream can provide some insight into the understanding of genome-wide transcriptome profile of teleost gonad tissue and give useful information in fish gonad development. Keywords: Gonad transcriptome, Testis, Ovary, Rock bream

  19. AcEST: [AcEST

    Lifescience Database Archive (English)

    Full Text Available CL2194Contig1 771 2 Adiantum capillus-veneris contig: CL2194contig1 sequence. Link to clone list...apillus-veneris contig: CL2194contig1 sequence. Link to clone list Link to clone list Clone ID BP915172 DK95

  20. A korarchaeal genome reveals insights into the evolution of the Archaea

    Energy Technology Data Exchange (ETDEWEB)

    Anderson, Iain J; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

    2008-06-05

    The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name,"Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

  1. A Korarchael Genome Reveals Insights into the Evolution of the Archaea

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

    2008-01-07

    The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, ?Candidatus Korarchaeum cryptofilum,? which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

  2. Exploiting a wheat EST database to assess genetic diversity.

    Science.gov (United States)

    Karakas, Ozge; Gurel, Filiz; Uncuoglu, Ahu Altinkut

    2010-10-01

    Expressed sequence tag (EST) markers have been used to assess variety and genetic diversity in wheat (Triticum aestivum). In this study, 1549 ESTs from wheat infested with yellow rust were used to examine the genetic diversity of six susceptible and resistant wheat cultivars. The aim of using these cultivars was to improve the competitiveness of public wheat breeding programs through the intensive use of modern, particularly marker-assisted, selection technologies. The F(2) individuals derived from cultivar crosses were screened for resistance to yellow rust at the seedling stage in greenhouses and adult stage in the field to identify DNA markers genetically linked to resistance. Five hundred and sixty ESTs were assembled into 136 contigs and 989 singletons. BlastX search results showed that 39 (29%) contigs and 96 (10%) singletons were homologous to wheat genes. The database-matched contigs and singletons were assigned to eight functional groups related to protein synthesis, photosynthesis, metabolism and energy, stress proteins, transporter proteins, protein breakdown and recycling, cell growth and division and reactive oxygen scavengers. PCR analyses with primers based on the contigs and singletons showed that the most polymorphic functional categories were photosynthesis (contigs) and metabolism and energy (singletons). EST analysis revealed considerable genetic variability among the Turkish wheat cultivars resistant and susceptible to yellow rust disease and allowed calculation of the mean genetic distance between cultivars, with the greatest similarity (0.725) being between Harmankaya99 and Sönmez2001, and the lowest (0.622) between Aytin98 and Izgi01.

  3. Exploiting a wheat EST database to assess genetic diversity

    Directory of Open Access Journals (Sweden)

    Ozge Karakas

    2010-01-01

    Full Text Available Expressed sequence tag (EST markers have been used to assess variety and genetic diversity in wheat (Triticum aestivum. In this study, 1549 ESTs from wheat infested with yellow rust were used to examine the genetic diversity of six susceptible and resistant wheat cultivars. The aim of using these cultivars was to improve the competitiveness of public wheat breeding programs through the intensive use of modern, particularly marker-assisted, selection technologies. The F2 individuals derived from cultivar crosses were screened for resistance to yellow rust at the seedling stage in greenhouses and adult stage in the field to identify DNA markers genetically linked to resistance. Five hundred and sixty ESTs were assembled into 136 contigs and 989 singletons. BlastX search results showed that 39 (29% contigs and 96 (10% singletons were homologous to wheat genes. The database-matched contigs and singletons were assigned to eight functional groups related to protein synthesis, photosynthesis, metabolism and energy, stress proteins, transporter proteins, protein breakdown and recycling, cell growth and division and reactive oxygen scavengers. PCR analyses with primers based on the contigs and singletons showed that the most polymorphic functional categories were photosynthesis (contigs and metabolism and energy (singletons. EST analysis revealed considerable genetic variability among the Turkish wheat cultivars resistant and susceptible to yellow rust disease and allowed calculation of the mean genetic distance between cultivars, with the greatest similarity (0.725 being between Harmankaya99 and Sönmez2001, and the lowest (0.622 between Aytin98 and Izgi01.

  4. Tissue-Specific Methylation of Long Interspersed Nucleotide Element-1 of Homo Sapiens (L1Hs) During Human Embryogenesis and Roles in Neural Tube Defects.

    Science.gov (United States)

    Wang, L; Chang, S; Guan, J; Shangguan, S; Lu, X; Wang, Z; Wu, L; Zou, J; Zhao, H; Bao, Y; Qiu, Z; Niu, B; Zhang, T

    2015-01-01

    Epigenetic regulation of long interspersed nucleotide element-1 (LINE-1) retrotransposition events plays crucial roles during early development. Previously we showed that LINE-1 hypomethylation in neuronal tissues is associated with pathogenesis of neural tube defect (NTD). Herein, we further evaluated LINE-1 Homo sapiens (L1Hs) methylation in tissues derived from three germ layers of stillborn NTD fetuses, to define patterns of tissue specific methylation and site-specific hypomethylation at CpG sites within an L1Hs promoter region. Stable, tissue-specific L1Hs methylation patterns throughout three germ layer lineages of the fetus, placenta, and maternal peripheral blood were observed. Samples from maternal peripheral blood exhibited the highest level of L1Hs methylation (64.95%) and that from placenta showed the lowest (26.82%). Between samples from NTDs and controls, decrease in L1Hs methylation was only significant in NTD-affected brain tissue at 7.35%, especially in females (8.98%). L1Hs hypomethylation in NTDs was also associated with a significant increase in expression level of an L1Hs-encoded transcript in females (r = -0.846, p = 0.004). This could be due to genomic DNA instability and alternation in chromatins accessibility resulted from abnormal L1Hs hypomethylation, as showed in this study with HCT-15 cells treated with methylation inhibitor 5-Aza.

  5. Effect of study design and setting on tuberculosis clustering estimates using Mycobacterial Interspersed Repetitive Units-Variable Number Tandem Repeats (MIRU-VNTR): a systematic review.

    Science.gov (United States)

    Mears, Jessica; Abubakar, Ibrahim; Cohen, Theodore; McHugh, Timothy D; Sonnenberg, Pam

    2015-01-21

    To systematically review the evidence for the impact of study design and setting on the interpretation of tuberculosis (TB) transmission using clustering derived from Mycobacterial Interspersed Repetitive Units-Variable Number Tandem Repeats (MIRU-VNTR) strain typing. MEDLINE, EMBASE, CINHAL, Web of Science and Scopus were searched for articles published before 21st October 2014. Studies in humans that reported the proportion of clustering of TB isolates by MIRU-VNTR were included in the analysis. Univariable meta-regression analyses were conducted to assess the influence of study design and setting on the proportion of clustering. The search identified 27 eligible articles reporting clustering between 0% and 63%. The number of MIRU-VNTR loci typed, requiring consent to type patient isolates (as a proxy for sampling fraction), the TB incidence and the maximum cluster size explained 14%, 14%, 27% and 48% of between-study variation, respectively, and had a significant association with the proportion of clustering. Although MIRU-VNTR typing is being adopted worldwide there is a paucity of data on how study design and setting may influence estimates of clustering. We have highlighted study design variables for consideration in the design and interpretation of future studies. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  6. Silencing of the PiAvr3a effector-encoding gene from Phytophthora infestans by transcriptional fusion to a short interspersed element.

    Science.gov (United States)

    Vetukuri, Ramesh R; Tian, Zhendong; Avrova, Anna O; Savenkov, Eugene I; Dixelius, Christina; Whisson, Stephen C

    2011-12-01

    Phytophthora infestans is the notorious oomycete causing late blight of potato and tomato. A large proportion of the P. infestans genome is composed of transposable elements, the activity of which may be controlled by RNA silencing. Accumulation of small RNAs is one of the hallmarks of RNA silencing. Here we demonstrate the presence of small RNAs corresponding to the sequence of a short interspersed retrotransposable element (SINE) suggesting that small RNAs might be involved in silencing of SINEs in P. infestans. This notion was exploited to develop novel tools for gene silencing in P. infestans by engineering transcriptional fusions of the PiAvr3a gene, encoding an RXLR avirulence effector, to the infSINEm retroelement. Transgenic P. infestans lines expressing either 5'-infSINEm::PiAvr3a-3' or 5'-PiAvr3a::SINEm-3' chimeric transcripts initially exhibited partial silencing of PiAvr3a. Over time, PiAvr3a either recovered wild type transcript levels in some lines, or became fully silenced in others. Introduction of an inverted repeat construct was also successful in yielding P. infestans transgenic lines silenced for PiAvr3a. In contrast, constructs expressing antisense or aberrant RNA transcripts failed to initiate silencing of PiAvr3a. Lines exhibiting the most effective silencing of PiAvr3a were either weakly or non-pathogenic on susceptible potato cv. Bintje. This study expands the repertoire of reverse genetics tools available for P. infestans research, and provides insights into a possible mode of variation in effector expression through spread of silencing from adjacent retroelements. Crown Copyright © 2011. Published by Elsevier Ltd. All rights reserved.

  7. Dicty_cDB: CHA362 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHA362 (Link to dictyBase) - - - Contig-U15579-1 | Contig-U156... library) Clone ID CHA362 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U155...79-1 | Contig-U15687-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/C...XACNAVDTCITNDLCFPRECNPRGNPPCLINPINCTSTDPCIFSYCENGVCI PTYICTPTPSVTPTVTPTVTPTVTPTVT...GNPPCLINPINCTSTDPCIFSYCENGVCI PTYICTPTPSVTPTVTPTVTPTVTPTVTPTVTPTVTPTPTTTPTPSPTTVP

  8. Analysis of differentially expressed genes in two immunologically distinct strains of Eimeria maxima using suppression subtractive hybridization and dot-blot hybridization

    Science.gov (United States)

    2014-01-01

    Background It is well known that different Eimeria maxima strains exhibit significant antigenic variation. However, the genetic basis of these phenotypes remains unclear. Methods Total RNA and mRNA were isolated from unsporulated oocysts of E. maxima strains SH and NT, which were found to have significant differences in immunogenicity in our previous research. Two subtractive cDNA libraries were constructed using suppression subtractive hybridization (SSH) and specific genes were further analyzed by dot-blot hybridization and qRT-PCR analysis. Results A total of 561 clones were selected from both cDNA libraries and the length of the inserted fragments was 0.25–1.0 kb. Dot-blot hybridization revealed a total of 86 differentially expressed clones (63 from strain SH and 23 from strain NT). Nucleotide sequencing analysis of these clones revealed ten specific contigs (six from strain SH and four from strain NT). Further analysis found that six contigs from strain SH and three from strain NT shared significant identities with previously reported proteins, and one contig was presumed to be novel. The specific differentially expressed genes were finally verified by RT-PCR and qRT-PCR analyses. Conclusions The data presented here suggest that specific genes identified between the two strains may be important molecules in the immunogenicity of E. maxima that may present potential new drug targets or vaccine candidates for coccidiosis. PMID:24894832

  9. Features of the organization of bread wheat chromosome 5BS based on physical mapping.

    Science.gov (United States)

    Salina, Elena A; Nesterov, Mikhail A; Frenkel, Zeev; Kiseleva, Antonina A; Timonova, Ekaterina M; Magni, Federica; Vrána, Jan; Šafář, Jan; Šimková, Hana; Doležel, Jaroslav; Korol, Abraham; Sergeeva, Ekaterina M

    2018-02-09

    The IWGSC strategy for construction of the reference sequence of the bread wheat genome is based on first obtaining physical maps of the individual chromosomes. Our aim is to develop and use the physical map for analysis of the organization of the short arm of wheat chromosome 5B (5BS) which bears a number of agronomically important genes, including genes conferring resistance to fungal diseases. A physical map of the 5BS arm (290 Mbp) was constructed using restriction fingerprinting and LTC software for contig assembly of 43,776 BAC clones. The resulting physical map covered ~ 99% of the 5BS chromosome arm (111 scaffolds, N50 = 3.078 Mb). SSR, ISBP and zipper markers were employed for anchoring the BAC clones, and from these 722 novel markers were developed based on previously obtained data from partial sequencing of 5BS. The markers were mapped using a set of Chinese Spring (CS) deletion lines, and F2 and RICL populations from a cross of CS and CS-5B dicoccoides. Three approaches have been used for anchoring BAC contigs on the 5BS chromosome, including clone-by-clone screening of BACs, GenomeZipper analysis, and comparison of BAC-fingerprints with in silico fingerprinting of 5B pseudomolecules of T. dicoccoides. These approaches allowed us to reach a high level of BAC contig anchoring: 96% of 5BS BAC contigs were located on 5BS. An interesting pattern was revealed in the distribution of contigs along the chromosome. Short contigs (200-999 kb) containing markers for the regions interrupted by tandem repeats, were mainly localized to the 5BS subtelomeric block; whereas the distribution of larger 1000-3500 kb contigs along the chromosome better correlated with the distribution of the regions syntenic to rice, Brachypodium, and sorghum, as detected by the Zipper approach. The high fingerprinting quality, LTC software and large number of BAC clones selected by the informative markers in screening of the 43,776 clones allowed us to significantly increase the

  10. Dicty_cDB: CHR241 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHR241 (Link to dictyBase) - - - Contig-U10843-1 | Contig-U131... library) Clone ID CHR241 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U108...43-1 | Contig-U13148-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/C...ilyhtht**KTMATQQQQQQQQQQQQQIKARKDIQIQQ AQSASDILGPPEISETEITTESILGDGSFGTVYKGRCRLKDVAVKVMLKQVDQKTLTDFR KEVAIMSKIFHPNIVLFLGACTSTPGKLMICT...PPEISETEITTESILGDGSFGTVYKGRCRLKDVAVKVMLKQVDQKTLTDFR KEVAIMSKIFHPNIVLFLGACTSTPGKLMICTELMKGNLVSLLLDPMVKLPLITRM

  11. Comparative analysis of transcriptomes in aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing

    Directory of Open Access Journals (Sweden)

    Taketo Okada

    2016-12-01

    Full Text Available Ephedra plants are taxonomically classified as gymnosperms, and are medicinally important as the botanical origin of crude drugs and as bioresources that contain pharmacologically active chemicals. Here we show a comparative analysis of the transcriptomes of aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing by RNA-Seq. De novo assembly of short cDNA sequence reads generated 23,358, 13,373, and 28,579 contigs longer than 200 bases from aerial stems, roots, or both aerial stems and roots, respectively. The presumed functions encoded by these contig sequences were annotated by BLAST (blastx. Subsequently, these contigs were classified based on gene ontology slims, Enzyme Commission numbers, and the InterPro database. Furthermore, comparative gene expression analysis was performed between aerial stems and roots. These transcriptome analyses revealed differences and similarities between the transcriptomes of aerial stems and roots in E. sinica. Deep transcriptome sequencing of Ephedra should open the door to molecular biological studies based on the entire transcriptome, tissue- or organ-specific transcriptomes, or targeted genes of interest.

  12. Transcriptome Analysis Reveals Molecular Signatures of Luteoloside Accumulation in Senescing Leaves of Lonicera macranthoides

    Directory of Open Access Journals (Sweden)

    Zexiong Chen

    2018-03-01

    Full Text Available Lonicera macranthoides is an important medicinal plant widely used in traditional Chinese medicine. Luteoloside is a critical bioactive compound in L. macranthoides. To date, the molecular mechanisms underlying luteoloside biosynthesis are still largely unknown. In this work, high performance liquid chromatography (HPLC was employed to determine the luteoloside contents in leaves, stems, and flowers at different developmental stages. Results showed that senescing leaves can accumulate large amounts of luteoloside, extremely higher than that in young and semi-lignified leaves and other tissues. RNA-Seq analysis identified that twenty-four differentially expressed unigenes (DEGs associated with luteoloside biosynthesis were significantly up-regulated in senescing leaves, which are positively correlated with luteoloside accumulation. These DEGs include phenylalanine ammonia lyase 2, cinnamate 4-hydroxylase 2, thirteen 4-coumarate-CoA ligases, chalcone synthase 2, six flavonoid 3′-monooxygenase (F3′H and two flavone 7-O-β-glucosyltransferase (UFGT genes. Further analysis demonstrated that two F3′Hs (CL11828.Contig1 and CL11828.Contig2 and two UFGTs (Unigene2918 and Unigene97915 might play vital roles in luteoloside generation. Furthermore, several transcription factors (TFs related to flavonoid biosynthesis including MYB, bHLH and WD40, were differentially expressed during leaf senescence. Among these TFs, MYB12, MYB75, bHLH113 and TTG1 were considered to be key factors involved in the regulation of luteoloside biosynthesis. These findings provide insights for elucidating the molecular signatures of luteoloside accumulation in L. macranthoides.

  13. Transcriptomic Analysis and the Expression of Disease-Resistant Genes in Oryza meyeriana under Native Condition.

    Directory of Open Access Journals (Sweden)

    Bin He

    Full Text Available Oryza meyeriana (O. meyeriana, with a GG genome type (2n = 24, accumulated plentiful excellent characteristics with respect to resistance to many diseases such as rice shade and blast, even immunity to bacterial blight. It is very important to know if the diseases-resistant genes exist and express in this wild rice under native conditions. However, limited genomic or transcriptomic data of O. meyeriana are currently available. In this study, we present the first comprehensive characterization of the O. meyeriana transcriptome using RNA-seq and obtained 185,323 contigs with an average length of 1,692 bp and an N50 of 2,391 bp. Through differential expression analysis, it was found that there were most tissue-specifically expressed genes in roots, and next to stems and leaves. By similarity search against protein databases, 146,450 had at least a significant alignment to existed gene models. Comparison with the Oryza sativa (japonica-type Nipponbare and indica-type 93-11 genomes revealed that 13% of the O. meyeriana contigs had not been detected in O. sativa. Many diseases-resistant genes, such as bacterial blight resistant, blast resistant, rust resistant, fusarium resistant, cyst nematode resistant and downy mildew gene, were mined from the transcriptomic database. There are two kinds of rice bacterial blight-resistant genes (Xa1 and Xa26 differentially or specifically expressed in O. meyeriana. The 4 Xa1 contigs were all only expressed in root, while three of Xa26 contigs have the highest expression level in leaves, two of Xa26 contigs have the highest expression profile in stems and one of Xa26 contigs was expressed dominantly in roots. The transcriptomic database of O. meyeriana has been constructed and many diseases-resistant genes were found to express under native condition, which provides a foundation for future discovery of a number of novel genes and provides a basis for studying the molecular mechanisms associated with disease

  14. A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing

    Directory of Open Access Journals (Sweden)

    Pan-Gyu Kim

    2008-01-01

    Full Text Available We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.

  15. Insight into Dominant Cellulolytic Bacteria from Two Biogas Digesters and Their Glycoside Hydrolase Genes

    Science.gov (United States)

    Zhang, Jun; Zhang, Lei; Geng, Alei; Liu, Fanghua; Zhao, Guoping; Wang, Shengyue; Zhou, Zhihua; Yan, Xing

    2015-01-01

    Diverse cellulolytic bacteria are essential for maintaining high lignocellulose degradation ability in biogas digesters. However, little was known about functional genes and gene clusters of dominant cellulolytic bacteria in biogas digesters. This is the foundation to understand lignocellulose degradation mechanisms of biogas digesters and apply these gene resource for optimizing biofuel production. A combination of metagenomic and 16S rRNA gene clone library methods was used to investigate the dominant cellulolytic bacteria and their glycoside hydrolase (GH) genes in two biogas digesters. The 16S rRNA gene analysis revealed that the dominant cellulolytic bacteria were strains closely related to Clostridium straminisolvens and an uncultured cellulolytic bacterium designated BG-1. To recover GH genes from cellulolytic bacteria in general, and BG-1 in particular, a refined assembly approach developed in this study was used to assemble GH genes from metagenomic reads; 163 GH-containing contigs ≥ 1 kb in length were obtained. Six recovered GH5 genes that were expressed in E. coli demonstrated multiple lignocellulase activities and one had high mannanase activity (1255 U/mg). Eleven fosmid clones harboring the recovered GH-containing contigs were sequenced and assembled into 10 fosmid contigs. The composition of GH genes in the 163 assembled metagenomic contigs and 10 fosmid contigs indicated that diverse GHs and lignocellulose degradation mechanisms were present in the biogas digesters. In particular, a small portion of BG-1 genome information was recovered by PhyloPythiaS analysis. The lignocellulase gene clusters in BG-1 suggested that it might use a possible novel lignocellulose degradation mechanism to efficiently degrade lignocellulose. Dominant cellulolytic bacteria of biogas digester possess diverse GH genes, not only in sequences but also in their functions, which may be applied for production of biofuel in the future. PMID:26070087

  16. Retrotransposition of long interspersed nucleotide element-1 is associated with colitis but not tumors in a murine colitic cancer model.

    Directory of Open Access Journals (Sweden)

    Takeshi Otsubo

    Full Text Available Long interspersed element-1 (L1 is a transposable element that can move within the genome, potentially leading to genome diversity and modified gene function. Although L1 activity in somatic cells is normally suppressed through DNA methylation, some L1s are activated in tumors including colorectal carcinoma. However, how L1-retrotransposition (L1-RTP is involved in gastrointestinal disorders remains to be elucidated. We hypothesized that L1-RTP in somatic cells might contribute to colitis-associated cancer (CAC. To address this, we employed an experimental model of CAC using transgenic L1-reporter mice carrying a human L1-EGFP reporter gene. Mice were subjected to repeated cycles of colitis induced by administration of dextran sodium sulfate (DSS in drinking water with injection of carcinogen azoxymethane (AOM. L1-RTP levels were measured by a quantitative polymerase chain reaction targeting the newly inserted reporter EGFP in various tissues and cell types, including samples obtained by laser microdissection and cell sorting with flow cytometry. DNA methylation levels of the human L1 promoter were analyzed by bisulfite pyrosequencing. AOM+DSS-treated mice exhibited significantly higher levels of L1-RTP in whole colon tissue during the acute phase of colitis when compared with control naïve mice. L1-RTP levels in whole colon tissue were positively correlated with the histological severity of colitis and degree of neutrophil infiltration into the lamina propria (LP, but not with tumor development in the colon. L1-RTP was enriched in LP mesenchymal cells rather than epithelial cells (ECs, myeloid, or lymphoid cells. DNA methylation levels of the human L1 promoter region showed a negative correlation with L1-RTP levels. L1-RTP was absent from most tumors found in 22-week-old mice. In conclusion, we demonstrated that L1-RTP was induced in the mouse CAC mucosa in accordance with the acute inflammatory response; however, retrotransposition appears

  17. How conserved are the conserved 16S-rRNA regions?

    Directory of Open Access Journals (Sweden)

    Marcel Martinez-Porchas

    2017-02-01

    Full Text Available The 16S rRNA gene has been used as master key for studying prokaryotic diversity in almost every environment. Despite the claim of several researchers to have the best universal primers, the reality is that no primer has been demonstrated to be truly universal. This suggests that conserved regions of the gene may not be as conserved as expected. The aim of this study was to evaluate the conservation degree of the so-called conserved regions flanking the hypervariable regions of the 16S rRNA gene. Data contained in SILVA database (release 123 were used for the study. Primers reported as matches of each conserved region were assembled to form contigs; sequences sizing 12 nucleotides (12-mers were extracted from these contigs and searched into the entire set of SILVA sequences. Frequency analysis shown that extreme regions, 1 and 10, registered the lowest frequencies. 12-mer frequencies revealed segments of contigs that were not as conserved as expected (≤90%. Fragments corresponding to the primer contigs 3, 4, 5b and 6a were recovered from all sequences in SILVA database. Nucleotide frequency analysis in each consensus demonstrated that only a small fraction of these so-called conserved regions is truly conserved in non-redundant sequences. It could be concluded that conserved regions of the 16S rRNA gene exhibit considerable variation that has to be considered when using this gene as biomarker.

  18. Retroposition of the AFC family of SINEs (short interspersed repetitive elements) before and during the adaptive radiation of cichlid fishes in Lake Malawi and related inferences about phylogeny.

    Science.gov (United States)

    Takahashi, K; Nishida, M; Yuma, M; Okada, N

    2001-01-01

    Lake Malawi is home to more than 450 species of endemic cichlids, which provide a spectacular example of adaptive radiation. To clarify the phylogenetic relationships among these fish, we examined the presence and absence of SINEs (short interspersed repetitive elements) at orthologous loci. We identified six loci at which a SINE sequence had apparently been specifically inserted by retroposition in the common ancestor of all the investigated species of endemic cichlids in Lake Malawi. At another locus, unique sharing of a SINE sequence was evident among all the investigated species of endemic non-Mbuna cichlids with the exception of Rhamphochromis sp. The relationships were in good agreement with those deduced in previous studies with various different markers, demonstrating that the SINE method is useful for the elucidation of phylogenetic relationships among cichlids in Lake Malawi. We also characterized a locus that exhibited transspecies polymorphism with respect to the presence or absence of the SINE sequence among non-Mbuna species. This result suggests that incomplete lineage sorting and/or interspecific hybridization might have occurred or be occurring among the species in this group, which might potentially cause misinterpretation of phylogenetic data, in particular when a single-locus marker, such as a sequence in the mitochondrial DNA, is used for analysis.

  19. Genomic library screening for viruses from the human dental plaque revealed pathogen-specific lytic phage sequences.

    Science.gov (United States)

    Al-Jarbou, Ahmed Nasser

    2012-01-01

    Bacterial pathogenesis presents an astounding arsenal of virulence factors that allow them to conquer many different niches throughout the course of infection. Principally fascinating is the fact that some bacterial species are able to induce different diseases by expression of different combinations of virulence factors. Nevertheless, studies aiming at screening for the presence of bacteriophages in humans have been limited. Such screening procedures would eventually lead to identification of phage-encoded properties that impart increased bacterial fitness and/or virulence in a particular niche, and hence, would potentially be used to reverse the course of bacterial infections. As the human oral cavity represents a rich and dynamic ecosystem for several upper respiratory tract pathogens. However, little is known about virus diversity in human dental plaque which is an important reservoir. We applied the culture-independent approach to characterize virus diversity in human dental plaque making a library from a virus DNA fraction amplified using a multiple displacement method and sequenced 80 clones. The resulting sequence showed 44% significant identities to GenBank databases by TBLASTX analysis. TBLAST homology comparisons showed that 66% was viral; 18% eukarya; 10% bacterial; 6% mobile elements. These sequences were sorted into 6 contigs and 45 single sequences in which 4 contigs and a single sequence showed significant identity to a small region of a putative prophage in the Corynebacterium diphtheria genome. These findings interestingly highlight the uniqueness of over half of the sequences, whilst the dominance of a pathogen-specific prophage sequences imply their role in virulence.

  20. A 1.7-Mb YAC contig around the human BDNF gene (11p13): integration of the physical, genetic, and cytogenetic maps in relation to WAGR syndrome

    Energy Technology Data Exchange (ETDEWEB)

    Rosier, M.F.; Martin, A.; Houlgatte, R. [Genetique Moleculaire et Biologie du Development, Villejuif (France)] [and others

    1994-11-01

    WAGR (Wilms tumor, aniridia, genito-urinary abnormalities, mental retardation) syndrome in humans is associated with deletions of the 11p13 region. The brain-derived neurotrophic factor (BDNF) gene maps to this region, and its deletion seems to contribute to the severity of the patient`s mental retardation. Yeast artificial chromosomes (YACs) carrying the BDNF gene have been isolated and characterized. Localization of two known exons of this gene leads to a minimal estimation of its size of about 40 kb. Chimerism of the BDNF YACs has been investigated by fluorescence in situ hybridization and chromosome assignment on somatic cell hybrids. Using the BDNF gene, YAC end sequence tagged sites (STS), and Genethon microsatellite markers, the authors constructed a 1.7-Mb contig and refined the cytogenetic map at 11p13. The resulting integrated physical, genetic, and cytogenetic map constitutes a resource for the characterization of genes that may be involved in the WAGR syndrome. 42 refs., 2 figs., 3 tabs.

  1. A parts list for fungal cellulosomes revealed by comparative genomics

    Energy Technology Data Exchange (ETDEWEB)

    Haitjema, Charles H.; Gilmore, Sean P.; Henske, John K.; Solomon, Kevin V.; de Groot, Randall; Kuo, Alan; Mondo, Stephen J.; Salamov, Asaf A.; LaButti, Kurt; Zhao, Zhiying; Chiniquy, Jennifer; Barry, Kerrie; Brewer, Heather M.; Purvine, Samuel O.; Wright, Aaron T.; Hainaut, Matthieu; Boxma, Brigitte; van Alen, Theo; Hackstein, Johannes H. P.; Henrissat, Bernard; Baker, Scott E.; Grigoriev, Igor V.; O' Malley, Michelle A.

    2017-05-26

    Cellulosomes are large, multi-protein complexes that tether plant biomass degrading enzymes together for improved hydrolysis1. These complexes were first described in anaerobic bacteria where species specific dockerin domains mediate assembly of enzymes onto complementary cohesin motifs interspersed within non-catalytic protein scaffolds1. The versatile protein assembly mechanism conferred by the bacterial cohesin-dockerin interaction is now a standard design principle for synthetic protein-scale pathways2,3. For decades, analogous structures have been reported in the early branching anaerobic fungi, which are known to assemble by sequence divergent non-catalytic dockerin domains (NCDD)4. However, the enzyme components, modular assembly mechanism, and functional role of fungal cellulosomes remain unknown5,6. Here, we describe the comprehensive set of proteins critical to fungal cellulosome assembly, including novel, conserved scaffolding proteins unique to the Neocallimastigomycota. High quality genomes of the anaerobic fungi Anaeromyces robustus, Neocallimastix californiae and Piromyces finnis were assembled with long-read, single molecule technology to overcome their repeat-richness and extremely low GC content. Genomic analysis coupled with proteomic validation revealed an average 320 NCDD-containing proteins per fungal strain that were overwhelmingly carbohydrate active enzymes (CAZymes), with 95 large fungal scaffoldins identified across 4 genera that contain a conserved amino acid sequence repeat that binds to NCDDs. Fungal dockerin and scaffoldin domains have no similarity to their bacterial counterparts, yet several catalytic domains originated via horizontal gene transfer with gut bacteria. Though many catalytic domains are shared with bacteria, the biocatalytic activity of anaerobic fungi is expanded by the inclusion of GH3, GH6, and GH45 enzymes in the enzyme complexes. Collectively, these findings suggest that the fungal cellulosome is an evolutionarily

  2. Transcriptome of the dead: characterisation of immune genes and marker development from necropsy samples in a free-ranging marine mammal

    Directory of Open Access Journals (Sweden)

    Hoffman Joseph I

    2013-01-01

    Full Text Available Abstract Background Transcriptomes are powerful resources, providing a window on the expressed portion of the genome that can be generated rapidly and at low cost for virtually any organism. However, because many genes have tissue-specific expression patterns, developing a complete transcriptome usually requires a 'discovery pool' of individuals to be sacrificed in order to harvest mRNA from as many different types of tissue as possible. This hinders transcriptome development in large, charismatic and endangered species, many of which stand the most to gain from such approaches. To circumvent this problem in a model pinniped species, we 454 sequenced cDNA from testis, heart, spleen, intestine, kidney and lung tissues obtained from nine adult male Antarctic fur seals (Arctocephalus gazella that died of natural causes at Bird Island, South Georgia. Results After applying stringent quality control criteria based on length and annotation, we obtained 12,397 contigs which, in combination with 454 data previously obtained from skin, gave a total of 23,096 unique contigs. Homology was found to 77.0% of dog (Canis lupus familiaris transcripts, suggesting that the combined assembly represents a substantial proportion of this species' transcriptome. Moreover, only 0.5% of transcripts revealed sequence similarity to bacteria, implying minimal contamination, and the percentage of transcripts involved in cell death was low at 2.6%. Transcripts with immune-related annotations were almost five-fold enriched relative to skin and represented 13.2% of all spleen-specific contigs. By reference to the dog, we also identified transcripts revealing homology to five class I, ten class II and three class III genes of the Major Histocompatibility Complex and derived the putative genomic distribution of 17,121 contigs, 2,119 in silico mined microsatellites and 9,382 single nucleotide polymorphisms. Conclusions Our findings suggest that transcriptome development based on

  3. Candidate genes that have facilitated freshwater adaptation by palaemonid prawns in the genus Macrobrachium: identification and expression validation in a model species (M. koombooloomba).

    Science.gov (United States)

    Rahi, Md Lifat; Amin, Shorash; Mather, Peter B; Hurwood, David A

    2017-01-01

    The endemic Australian freshwater prawn, Macrobrachium koombooloomba , provides a model for exploring genes involved with freshwater adaptation because it is one of the relatively few Macrobrachium species that can complete its entire life cycle in freshwater. The present study was conducted to identify potential candidate genes that are likely to contribute to effective freshwater adaptation by M. koombooloomba using a transcriptomics approach. De novo assembly of 75 bp paired end 227,564,643 high quality Illumina raw reads from 6 different cDNA libraries revealed 125,917 contigs of variable lengths (200-18,050 bp) with an N50 value of 1597. In total, 31,272 (24.83%) of the assembled contigs received significant blast hits, of which 27,686 and 22,560 contigs were mapped and functionally annotated, respectively. CEGMA (Core Eukaryotic Genes Mapping Approach) based transcriptome quality assessment revealed 96.37% completeness. We identified 43 different potential genes that are likely to be involved with freshwater adaptation in M. koombooloomba . Identified candidate genes included: 25 genes for osmoregulation, five for cell volume regulation, seven for stress tolerance, three for body fluid (haemolymph) maintenance, eight for epithelial permeability and water channel regulation, nine for egg size control and three for larval development. RSEM (RNA-Seq Expectation Maximization) based abundance estimation revealed that 6,253, 5,753 and 3,795 transcripts were expressed (at TPM value ≥10) in post larvae, juveniles and adults, respectively. Differential gene expression (DGE) analysis showed that 15 genes were expressed differentially in different individuals but these genes apparently were not involved with freshwater adaptation but rather were involved in growth, development and reproductive maturation. The genomic resources developed here will be useful for better understanding the molecular basis of freshwater adaptation in Macrobrachium prawns and other

  4. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB568 (Link to dictyBase) - G23244 DDB0217993 Contig-U02062-1...ary VF (Link to library) Clone ID VFB568 (Link to dictyBase) Atlas ID - NBRP ID G23244 dictyBase ID DDB02179...93 Link to Contig Contig-U02062-1 | Contig-U11831-1 Original site URL http://dict...TELLY NNFYNTAEPITDEIIGFDLRKYLNFELFTYDYILGRELNQYNNSGCSGCSGCSSSSSSSS SSGGDDGRKNNKKYITQFDTILKELIKQLSFNICTNFSDRD...STELLY NNFYNTAEPITDEIIGFDLRKYLNFELFTYDYILGRELNQYNNSGCSGCSGCSSSSSSSS SSGGDDGRKNNKKYITQFDTILKELIKQLSFNICTNFSDR

  5. Dicty_cDB: SHE365 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHE365 (Link to dictyBase) - G23163 DDB0185626 Contig-U11298-1...ary SH (Link to library) Clone ID SHE365 (Link to dictyBase) Atlas ID - NBRP ID G23163 dictyBase ID DDB01856...26 Link to Contig Contig-U11298-1 | Contig-U13252-1 Original site URL http://dict...VEIAFSDKTPSIAVEAIRLVSIMSTL DLLDESDVQKICTLYLVDQPEISKAAGGLIFSKYLLSTQTKIDTTLSNIINSTSAASTTS GQHARKKSKTTNLTSADLDE...PSIAVEAIRLVSIMSTL DLLDESDVQKICTLYLVDQPEISKAAGGLIFSKYLLSTQTKIDTTLSNIINSTSAASTTS GQHARKKSKTTNLTSADLDEQQQQQISSQ

  6. Revealing hidden covariation detection: evidence for implicit abstraction at study.

    Science.gov (United States)

    Rossnagel, C S

    2001-09-01

    Four experiments in the brain scans paradigm (P. Lewicki, T. Hill, & I. Sasaki, 1989) investigated hidden covariation detection (HCD). In Experiment 1 HCD was found in an implicit- but not in an explicit-instruction group. In Experiment 2 HCD was impaired by nonholistic perception of stimuli but not by divided attention. In Experiment 3 HCD was eliminated by interspersing stimuli that deviated from the critical covariation. In Experiment 4 a transfer procedure was used. HCD was found with dissimilar test stimuli that preserved the covariation but was almost eliminated with similar stimuli that were neutral as to the covariation. Awareness was assessed both by objective and subjective tests in all experiments. Results suggest that HCD is an effect of implicit rule abstraction and that similarity processing plays only a minor role. HCD might be suppressed by intentional search strategies that induce inappropriate aggregation of stimulus information.

  7. Optimizing Transcriptome Assemblies for Eleusine indica Leaf and Seedling by Combining Multiple Assemblies from Three De Novo Assemblers

    Directory of Open Access Journals (Sweden)

    Shu Chen

    2015-03-01

    Full Text Available Due to rapid advances in sequencing technology, increasing amounts of genomic and transcriptomic data are available for plant species, presenting enormous challenges for biocomputing analysis. A crucial first step for a successful transcriptomics-based study is the building of a high-quality assembly. Here, we utilized three different de novo assemblers (Trinity, Velvet, and CLC and the EvidentialGene pipeline tr2aacds to assemble two optimized transcript sets for the notorious weed species, . Two RNA sequencing (RNA-seq datasets from leaf and aboveground seedlings were processed using three assemblers, which resulted in 20 assemblies for each dataset. The contig numbers and N50 values of each assembly were compared to study the effect of read number, k-mer size, and in silico normalization on assembly output. The 20 assemblies were then processed through the tr2aacds pipeline to remove redundant transcripts and to select the transcript set with the best coding potential. Each assembly contributed a considerable proportion to the final transcript combination with the exception of the CLC-k14. Thus each assembler and parameter set did assemble better contigs for certain transcripts. The redundancy, total contig number, N50, fully assembled contig number, and transcripts related to target-site herbicide resistance were evaluated for the EvidentialGene and Trinity assemblies. Comparing the EvidentialGene set with the Trinity assembly revealed improved quality and reduced redundancy in both leaf and seedling EvidentialGene sets. The optimized transcriptome references will be useful for studying herbicide resistance in and the evolutionary process in the three allotetraploid offspring.

  8. Dicty_cDB: SSE149 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSE149 (Link to dictyBase) - - - Contig-U01658-1 | Contig-U165... library) Clone ID SSE149 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U016...58-1 | Contig-U16509-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/S...g significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoideum mRNA. 64 1e-17 2 AC100496...egans clone T13G4, complete sequence. 36 5.0 2 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ictalurus p

  9. Dicty_cDB: VFB330 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB330 (Link to dictyBase) - G22107 DDB0216429 Contig-U02054-1...ary VF (Link to library) Clone ID VFB330 (Link to dictyBase) Atlas ID - NBRP ID G22107 dictyBase ID DDB02164...29 Link to Contig Contig-U02054-1 | Contig-U16357-1 Original site URL http://dict...d Amino Acid sequence *k*k*NKMKLDPKALRYLSKDDFRTLVAVEMGMKNHELVPVSLICTIANLKYGGTKKSIQ TLHKFKLLFHDGRNYDGYKLTYLGY...LRYLSKDDFRTLVAVEMGMKNHELVPVSLICTIANLKYGGTKKSIQ TLHKFKLLFHDGRNYDGYKLTYLGYDFLALKTLVSRGVCSYVGNQIGVGKESDIYIVAND

  10. Draft Genome Sequence of Halostagnicola sp. A56, an Extremely Halophilic Archaeon Isolated from the Andaman Islands

    Science.gov (United States)

    Kanekar, Sagar P.; Saxena, Neha; Pore, Soham D.; Arora, Preeti; Kanekar, P. P.

    2015-01-01

    The first draft genome of Halostagnicola sp. A56, isolated from the Andaman Islands is reported here. The A56 genome comprises 3,178,490 bp in 26 contigs with a G+C content of 60.8%. The genome annotation revealed that A56 could have potential applications for the production of polyhydroxyalkanoate or bioplastics. PMID:26564049

  11. Comprehensive analyses of imprinted differentially methylated regions reveal epigenetic and genetic characteristics in hepatoblastoma

    International Nuclear Information System (INIS)

    Rumbajan, Janette Mareska; Aoki, Shigehisa; Kohashi, Kenichi; Oda, Yoshinao; Hata, Kenichiro; Saji, Tsutomu; Taguchi, Tomoaki; Tajiri, Tatsuro; Soejima, Hidenobu; Joh, Keiichiro; Maeda, Toshiyuki; Souzaki, Ryota; Mitsui, Kazumasa; Higashimoto, Ken; Nakabayashi, Kazuhiko; Yatsuki, Hitomi; Nishioka, Kenichi; Harada, Ryoko

    2013-01-01

    Aberrant methylation at imprinted differentially methylated regions (DMRs) in human 11p15.5 has been reported in many tumors including hepatoblastoma. However, the methylation status of imprinted DMRs in imprinted loci scattered through the human genome has not been analyzed yet in any tumors. The methylation statuses of 33 imprinted DMRs were analyzed in 12 hepatoblastomas and adjacent normal liver tissue by MALDI-TOF MS and pyrosequencing. Uniparental disomy (UPD) and copy number abnormalities were investigated with DNA polymorphisms. Among 33 DMRs analyzed, 18 showed aberrant methylation in at least 1 tumor. There was large deviation in the incidence of aberrant methylation among the DMRs. KvDMR1 and IGF2-DMR0 were the most frequently hypomethylated DMRs. INPP5Fv2-DMR and RB1-DMR were hypermethylated with high frequencies. Hypomethylation was observed at certain DMRs not only in tumors but also in a small number of adjacent histologically normal liver tissue, whereas hypermethylation was observed only in tumor samples. The methylation levels of long interspersed nuclear element-1 (LINE-1) did not show large differences between tumor tissue and normal liver controls. Chromosomal abnormalities were also found in some tumors. 11p15.5 and 20q13.3 loci showed the frequent occurrence of both genetic and epigenetic alterations. Our analyses revealed tumor-specific aberrant hypermethylation at some imprinted DMRs in 12 hepatoblastomas with additional suggestion for the possibility of hypomethylation prior to tumor development. Some loci showed both genetic and epigenetic alterations with high frequencies. These findings will aid in understanding the development of hepatoblastoma

  12. Pyrosequencing the transcriptome of the greenhouse whitefly, Trialeurodes vaporariorum reveals multiple transcripts encoding insecticide targets and detoxifying enzymes

    Directory of Open Access Journals (Sweden)

    Gorman Kevin

    2011-01-01

    Full Text Available Abstract Background The whitefly Trialeurodes vaporariorum is an economically important crop pest in temperate regions that has developed resistance to most classes of insecticides. However, the molecular mechanisms underlying resistance have not been characterised and, to date, progress has been hampered by a lack of nucleotide sequence data for this species. Here, we use pyrosequencing on the Roche 454-FLX platform to produce a substantial and annotated EST dataset. This 'unigene set' will form a critical reference point for quantitation of over-expressed messages via digital transcriptomics. Results Pyrosequencing produced around a million sequencing reads that assembled into 54,748 contigs, with an average length of 965 bp, representing a dramatic expansion of existing cDNA sequences available for T. vaporariorum (only 43 entries in GenBank at the time of this publication. BLAST searching of non-redundant databases returned 20,333 significant matches and those gene families potentially encoding gene products involved in insecticide resistance were manually curated and annotated. These include, enzymes potentially involved in the detoxification of xenobiotics and those encoding the targets of the major chemical classes of insecticides. A total of 57 P450s, 17 GSTs and 27 CCEs were identified along with 30 contigs encoding the target proteins of six different insecticide classes. Conclusion Here, we have developed new transcriptomic resources for T. vaporariorum. These include a substantial and annotated EST dataset that will serve the community studying this important crop pest and will elucidate further the molecular mechanisms underlying insecticide resistance.

  13. Antarctic krill 454 pyrosequencing reveals chaperone and stress transcriptome.

    Directory of Open Access Journals (Sweden)

    Melody S Clark

    Full Text Available BACKGROUND: The Antarctic krill Euphausia superba is a keystone species in the Antarctic food chain. Not only is it a significant grazer of phytoplankton, but it is also a major food item for charismatic megafauna such as whales and seals and an important Southern Ocean fisheries crop. Ecological data suggest that this species is being affected by climate change and this will have considerable consequences for the balance of the Southern Ocean ecosystem. Hence, understanding how this organism functions is a priority area and will provide fundamental data for life history studies, energy budget calculations and food web models. METHODOLOGY/PRINCIPAL FINDINGS: The assembly of the 454 transcriptome of E. superba resulted in 22,177 contigs with an average size of 492bp (ranging between 137 and 8515bp. In depth analysis of the data revealed an extensive catalogue of the cellular chaperone systems and the major antioxidant proteins. Full length sequences were characterised for the chaperones HSP70, HSP90 and the super-oxide dismutase antioxidants, with the discovery of potentially novel duplications of these genes. The sequence data contained 41,470 microsatellites and 17,776 Single Nucleotide Polymorphisms (SNPs/INDELS, providing a resource for population and also gene function studies. CONCLUSIONS: This paper details the first 454 generated data for a pelagic Antarctic species or any pelagic crustacean globally. The classical "stress proteins", such as HSP70, HSP90, ferritin and GST were all highly expressed. These genes were shown to be over expressed in the transcriptomes of Antarctic notothenioid fish and hypothesized as adaptations to living in the cold, with the associated problems of decreased protein folding efficiency and increased vulnerability to damage by reactive oxygen species. Hence, these data will provide a major resource for future physiological work on krill, but in particular a suite of "stress" genes for studies understanding

  14. The first Chameleon transcriptome: comparative genomic analysis of the OXPHOS system reveals loss of COX8 in Iguanian lizards.

    Science.gov (United States)

    Bar-Yaacov, Dan; Bouskila, Amos; Mishmar, Dan

    2013-01-01

    Recently, we found dramatic mitochondrial DNA divergence of Israeli Chamaeleo chamaeleon populations into two geographically distinct groups. We aimed to examine whether the same pattern of divergence could be found in nuclear genes. However, no genomic resource is available for any chameleon species. Here we present the first chameleon transcriptome, obtained using deep sequencing (SOLiD). Our analysis identified 164,000 sequence contigs of which 19,000 yielded unique BlastX hits. To test the efficacy of our sequencing effort, we examined whether the chameleon and other available reptilian transcriptomes harbored complete sets of genes comprising known biochemical pathways, focusing on the nDNA-encoded oxidative phosphorylation (OXPHOS) genes as a model. As a reference for the screen, we used the human 86 (including isoforms) known structural nDNA-encoded OXPHOS subunits. Analysis of 34 publicly available vertebrate transcriptomes revealed orthologs for most human OXPHOS genes. However, OXPHOS subunit COX8 (Cytochrome C oxidase subunit 8), including all its known isoforms, was consistently absent in transcriptomes of iguanian lizards, implying loss of this subunit during the radiation of this suborder. The lack of COX8 in the suborder Iguania is intriguing, since it is important for cellular respiration and ATP production. Our sequencing effort added a new resource for comparative genomic studies, and shed new light on the evolutionary dynamics of the OXPHOS system.

  15. Dicty_cDB: SSB171 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSB171 (Link to dictyBase) ssb171 - - Contig-U02783-1 SSB171F ...nk to dictyBase) Atlas ID ssb171 NBRP ID - dictyBase ID - Link to Contig Contig-U02783-1 Original site URL h

  16. An integrated metagenome and -proteome analysis of the microbial community residing in a biogas production plant.

    Science.gov (United States)

    Ortseifen, Vera; Stolze, Yvonne; Maus, Irena; Sczyrba, Alexander; Bremges, Andreas; Albaum, Stefan P; Jaenicke, Sebastian; Fracowiak, Jochen; Pühler, Alfred; Schlüter, Andreas

    2016-08-10

    To study the metaproteome of a biogas-producing microbial community, fermentation samples were taken from an agricultural biogas plant for microbial cell and protein extraction and corresponding metagenome analyses. Based on metagenome sequence data, taxonomic community profiling was performed to elucidate the composition of bacterial and archaeal sub-communities. The community's cytosolic metaproteome was represented in a 2D-PAGE approach. Metaproteome databases for protein identification were compiled based on the assembled metagenome sequence dataset for the biogas plant analyzed and non-corresponding biogas metagenomes. Protein identification results revealed that the corresponding biogas protein database facilitated the highest identification rate followed by other biogas-specific databases, whereas common public databases yielded insufficient identification rates. Proteins of the biogas microbiome identified as highly abundant were assigned to the pathways involved in methanogenesis, transport and carbon metabolism. Moreover, the integrated metagenome/-proteome approach enabled the examination of genetic-context information for genes encoding identified proteins by studying neighboring genes on the corresponding contig. Exemplarily, this approach led to the identification of a Methanoculleus sp. contig encoding 16 methanogenesis-related gene products, three of which were also detected as abundant proteins within the community's metaproteome. Thus, metagenome contigs provide additional information on the genetic environment of identified abundant proteins. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. The genome of the Antarctic-endemic copepod, Tigriopus kingsejongensis.

    Science.gov (United States)

    Kang, Seunghyun; Ahn, Do-Hwan; Lee, Jun Hyuck; Lee, Sung Gu; Shin, Seung Chul; Lee, Jungeun; Min, Gi-Sik; Lee, Hyoungseok; Kim, Hyun-Woo; Kim, Sanghee; Park, Hyun

    2017-01-01

    The Antarctic intertidal zone is continuously subjected to extremely fluctuating biotic and abiotic stressors. The West Antarctic Peninsula is the most rapidly warming region on Earth. Organisms living in Antarctic intertidal pools are therefore interesting for research into evolutionary adaptation to extreme environments and the effects of climate change. We report the whole genome sequence of the Antarctic-endemic harpacticoid copepod Tigriopus kingsejongensi . The 37 Gb raw DNA sequence was generated using the Illumina Miseq platform. Libraries were prepared with 65-fold coverage and a total length of 295 Mb. The final assembly consists of 48 368 contigs with an N50 contig length of 17.5 kb, and 27 823 scaffolds with an N50 contig length of 159.2 kb. A total of 12 772 coding genes were inferred using the MAKER annotation pipeline. Comparative genome analysis revealed that T. kingsejongensis -specific genes are enriched in transport and metabolism processes. Furthermore, rapidly evolving genes related to energy metabolism showed positive selection signatures. The T. kingsejongensis genome provides an interesting example of an evolutionary strategy for Antarctic cold adaptation, and offers new genetic insights into Antarctic intertidal biota. © The Author 2017. Published by Oxford University Press.

  18. The Epidemiological Significance and Temporal Stability of Mycobacterial Interspersed Repetitive Units-Variable Number of Tandem Repeats-Based Method Applied to Mycobacterium tuberculosis in China

    Directory of Open Access Journals (Sweden)

    Yang Li

    2018-04-01

    Full Text Available This study aimed to validate the epidemiological significance and temporal stability of Mycobacterial Interspersed Repetitive Units-Variable Number of Tandem Repeats (MIRU-VNTR typing in a genetically and geographically diverse set of clinical isolates from patients diagnosed with pulmonary tuberculosis in China. Between 2010 and 2013, a total of 982 Mycobacterium tuberculosis isolates were collected from four population-based investigations in China. Apart from the currently applied 24-locus MIRU-VNTR, six additional hypervariable loci were analyzed in order to validate the MIRU-VNTR combinations in terms of their epidemiological links, clustering time span, and paired geographic distance. In vitro temporal stability was analyzed for both individual MIRU-VNTR loci, and for several combinations of loci. In the present study, four MIRU-VNTR combinations, including the hypervariable loci 3820, 3232, 2163a, and 4120, were evaluated. All of these combinations obtained a Hunter-Gaston discriminatory index (HGDI value over 0.9900 with a reduced clustering proportion (from 32.0% to 25.6%. By comparing epidemiological links, clustering time span, and paired geographic distance, we found that the performances of the four MIRU-VNTR combinations were comparable to the insertion sequence 6110 restriction fragment length polymorphism (IS6110-RFLP, and significantly better than that of 24-locus MIRU-VNTR genotyping alone. The proportion of temporally stable loci ranged from 90.5% to 92.5% within the combined MIRU-VNTR genotyping, which is higher than IS6110-RFLP (85.4%. By adding four hypervariable loci to the standard 24-locus MIRU-VNTR genotyping, we obtained a high discriminatory power, stability and epidemiological significance. This algorithm could therefore be used to improve tuberculosis transmission surveillance and outbreak investigation in China.

  19. Draft Genome Sequence of Zobellia sp. Strain OII3, Isolated from the Coastal Zone of the Baltic Sea.

    Science.gov (United States)

    Harms, Henrik; Poehlein, Anja; Thürmer, Andrea; König, Gabriele M; Schäberle, Till F

    2017-09-07

    Zobellia sp. strain OII3 was isolated from a marine environmental sample due to its heterotrophic lifestyle, i.e., using Escherichia coli cells as prey. It shows strong agar-lytic activity. The genome was assembled into 41 contigs with a total size of 5.4 Mb, revealing the genetic basis for natural product biosynthesis. Copyright © 2017 Harms et al.

  20. SNP markers retrieval for a non-model species: a practical approach

    Directory of Open Access Journals (Sweden)

    Shahin Arwa

    2012-01-01

    Full Text Available Abstract Background SNP (Single Nucleotide Polymorphism markers are rapidly becoming the markers of choice for applications in breeding because of next generation sequencing technology developments. For SNP development by NGS technologies, correct assembly of the huge amounts of sequence data generated is essential. Little is known about assembler's performance, especially when dealing with highly heterogeneous species that show a high genome complexity and what the possible consequences are of differences in assemblies on SNP retrieval. This study tested two assemblers (CAP3 and CLC on 454 data from four lily genotypes and compared results with respect to SNP retrieval. Results CAP3 assembly resulted in higher numbers of contigs, lower numbers of reads per contig, and shorter average read lengths compared to CLC. Blast comparisons showed that CAP3 contigs were highly redundant. Contrastingly, CLC in rare cases combined paralogs in one contig. Redundant and chimeric contigs may lead to erroneous SNPs. Filtering for redundancy can be done by blasting selected SNP markers to the contigs and discarding all the SNP markers that show more than one blast hit. Results on chimeric contigs showed that only four out of 2,421 SNP markers were selected from chimeric contigs. Conclusion In practice, CLC performs better in assembling highly heterogeneous genome sequences compared to CAP3, and consequently SNP retrieval is more efficient. Additionally a simple flow scheme is suggested for SNP marker retrieval that can be valid for all non-model species.

  1. Integrated physical map of bread wheat chromosome arm 7DS to facilitate gene cloning and comparative studies.

    Science.gov (United States)

    Tulpová, Zuzana; Luo, Ming-Cheng; Toegelová, Helena; Visendi, Paul; Hayashi, Satomi; Vojta, Petr; Paux, Etienne; Kilian, Andrzej; Abrouk, Michaël; Bartoš, Jan; Hajdúch, Marián; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana

    2018-03-08

    Bread wheat (Triticum aestivum L.) is a staple food for a significant part of the world's population. The growing demand on its production can be satisfied by improving yield and resistance to biotic and abiotic stress. Knowledge of the genome sequence would aid in discovering genes and QTLs underlying these traits and provide a basis for genomics-assisted breeding. Physical maps and BAC clones associated with them have been valuable resources from which to generate a reference genome of bread wheat and to assist map-based gene cloning. As a part of a joint effort coordinated by the International Wheat Genome Sequencing Consortium, we have constructed a BAC-based physical map of bread wheat chromosome arm 7DS consisting of 895 contigs and covering 94% of its estimated length. By anchoring BAC contigs to one radiation hybrid map and three high resolution genetic maps, we assigned 73% of the assembly to a distinct genomic position. This map integration, interconnecting a total of 1713 markers with ordered and sequenced BAC clones from a minimal tiling path, provides a tool to speed up gene cloning in wheat. The process of physical map assembly included the integration of the 7DS physical map with a whole-genome physical map of Aegilops tauschii and a 7DS Bionano genome map, which together enabled efficient scaffolding of physical-map contigs, even in the non-recombining region of the genetic centromere. Moreover, this approach facilitated a comparison of bread wheat and its ancestor at BAC-contig level and revealed a reconstructed region in the 7DS pericentromere. Copyright © 2018. Published by Elsevier B.V.

  2. Molecular characterization and combined genotype association study of bovine cluster of differentiation 14 gene with clinical mastitis in crossbred dairy cattle.

    Science.gov (United States)

    Selvan, A Sakthivel; Gupta, I D; Verma, A; Chaudhari, M V; Magotra, A

    2016-07-01

    The present study was undertaken with the objectives to characterize and to analyze combined genotypes of cluster of differentiation 14 (CD14) gene to explore its association with clinical mastitis in Karan Fries (KF) cows maintained in the National Dairy Research Institute herd, Karnal. Genomic DNA was extracted using blood of randomly selected 94 KF lactating cattle by phenol-chloroform method. After checking its quality and quantity, polymerase chain reaction (PCR) was carried out using six sets of reported gene-specific primers to amplify complete KF CD14 gene. The forward and reverse sequences for each PCR fragments were assembled to form complete sequence for the respective region of KF CD14 gene. The multiple sequence alignments of the edited sequence with the corresponding reference with reported Bos taurus sequence (EU148610.1) were performed with ClustalW software to identify single nucleotide polymorphisms (SNPs). Basic Local Alignment Search Tool analysis was performed to compare the sequence identity of KF CD14 gene with other species. The restriction fragment length polymorphism (RFLP) analysis was carried out in all KF cows using Helicobacter pylori 188I (Hpy188I) (contig 2) and Haemophilus influenzae I (HinfI) (contig 4) restriction enzyme (RE). Cows were assigned genotypes obtained by PCR-RFLP analysis, and association study was done using Chi-square (χ (2)) test. The genotypes of both contigs (loci) number 2 and 4 were combined with respect to each animal to construct combined genotype patterns. Two types of sequences of KF were obtained: One with 2630 bp having one insertion at 616 nucleotide (nt) position and one deletion at 1117 nt position, and the another sequence was of 2629 bp having only one deletion at 615 nt position. ClustalW, multiple alignments of KF CD14 gene sequence with B. taurus cattle sequence (EU148610.1), revealed 24 nt changes (SNPs). Cows were also screened using PCR-RFLP with Hpy188I (contig 2) and HinfI (contig 4) RE

  3. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds

    Directory of Open Access Journals (Sweden)

    Sugantham Priyanka Annabel

    2010-10-01

    Full Text Available Abstract Background Jatropha curcas L. is promoted as an important non-edible biodiesel crop worldwide. Jatropha oil, which is a triacylglycerol, can be directly blended with petro-diesel or transesterified with methanol and used as biodiesel. Genetic improvement in jatropha is needed to increase the seed yield, oil content, drought and pest resistance, and to modify oil composition so that it becomes a technically and economically preferred source for biodiesel production. However, genetic improvement efforts in jatropha could not take advantage of genetic engineering methods due to lack of cloned genes from this species. To overcome this hurdle, the current gene discovery project was initiated with an objective of isolating as many functional genes as possible from J. curcas by large scale sequencing of expressed sequence tags (ESTs. Results A normalized and full-length enriched cDNA library was constructed from developing seeds of J. curcas. The cDNA library contained about 1 × 106 clones and average insert size of the clones was 2.1 kb. Totally 12,084 ESTs were sequenced to average high quality read length of 576 bp. Contig analysis revealed 2258 contigs and 4751 singletons. Contig size ranged from 2-23 and there were 7333 ESTs in the contigs. This resulted in 7009 unigenes which were annotated by BLASTX. It showed 3982 unigenes with significant similarity to known genes and 2836 unigenes with significant similarity to genes of unknown, hypothetical and putative proteins. The remaining 191 unigenes which did not show similarity with any genes in the public database may encode for unique genes. Functional classification revealed unigenes related to broad range of cellular, molecular and biological functions. Among the 7009 unigenes, 6233 unigenes were identified to be potential full-length genes. Conclusions The high quality normalized cDNA library was constructed from developing seeds of J. curcas for the first time and 7009 unigenes coding

  4. The canine sarcoglycan delta gene: BAC clone contig assembly, chromosome assignment and interrogation as a candidate gene for dilated cardiomyopathy in Dobermann dogs.

    Science.gov (United States)

    Stabej, P; Leegwater, P A J; Imholz, S; Versteeg, S A; Zijlstra, C; Stokhof, A A; Domanjko-Petriè, A; van Oost, B A

    2005-01-01

    Dilated cardiomyopathy (DCM) is a common disease of the myocardium recognized in human, dog and experimental animals. Genetic factors are responsible for a large proportion of cases in humans, and 17 genes with DCM causing mutations have been identified. The genetic origin of DCM in the Dobermann dogs has been suggested, but no disease genes have been identified to date. In this paper, we describe the characterization and evaluation of the canine sarcoglycan delta (SGCD), a gene implicated in DCM in human and hamster. Bacterial artificial chromosomes (BACs) containing the canine SGCD gene were isolated with probes for exon 3 and exons 4-8 and were characterized by Southern blot analysis. BAC end sequences were obtained for four BACs. Three of the BACs overlapped and could be ordered relative to each other and the end sequences of all four BACs could be anchored on the preliminary assembly of the dog genome sequence (www. ensembl.org). One of the BACs of the partial contig was localized by fluorescent in situ hybridization to canine chromosome 4q22, in agreement with the dog genome sequence. Two highly informative polymorphic microsatellite markers in intron 7 of the SGCD gene were identified. In 25 DCM-affected and 13 non DCM-affected dogs seven different haplotypes could be distinguished. However, no association between any of the SGCD variants and the disease locus was apparent.

  5. Distribution of the ore elements in ore bodies of the Zapolyarnoe field (Kola Peninsula

    Directory of Open Access Journals (Sweden)

    D. V. Taymasov

    2017-12-01

    Full Text Available The author conducted studies of the chemical zonality of ore deposits of all types for the sulfide-copper-nickel deposit Zapolyarnoe. Within the deposit of sulfide-copper-nickel ores Zapolyarnoe are the following industrial types of ores: rich interspersed ores, which are interspersed mineralization in peridotites; brecciated ores formed by detrital material; poor interspersed ores characterized by vein-interspersed mineralization; solid ores in a form of massive sulphide emissions. This work is based on operational and detailed exploration data. The author sorted samples according to the types of ores. Using the results, the author derived regression lines and their formulas to rectilinear dependencies. Analysis of the graphs showed that for all types of ores the dependences of copper and cobalt content on nickel content are direct, but at that, in thick-interspersed and brecciated types they are similar, and in scattered-interspersed they differ significantly. It becomes clear that correlation of copper and nickel in ore body depends on the scattered-interspersed ores, whereas the correlation of cobalt and nickel is determined mainly by thick-interspersed and brecciated ores. Dependencies between nickel, copper and cobalt in ores change when concentration of nickel in the ore changes. The graphs also show that the ratio of Ni/Cu in breccias and scattered-interspersed ores, as well as in near-ore metaperidotites, is almost constant at different nickel concentrations, on average 2:1.5 and 1, respectively. Thick- interspersed ores show the tendency towards an increase in the ratio with an increase in nickel content. The Ni/Co ratio increases with an increase of Ni concentration in all types of ores. In rich ores, growth occurs along a stepped curve, and in breccias – rectilinearly. Thus, the analysis of distribution of ore elements in ore bodies indicates a complex heterogeneous composition of sulphide mineralizations in different types of ores

  6. RNA-seq de novo Assembly Reveals Differential Gene Expression in Glossina palpalis gambiensis Infected with Trypanosoma brucei gambiense vs. Non-Infected and Self-Cured Flies.

    Science.gov (United States)

    Hamidou Soumana, Illiassou; Klopp, Christophe; Ravel, Sophie; Nabihoudine, Ibouniyamine; Tchicaya, Bernadette; Parrinello, Hugues; Abate, Luc; Rialle, Stéphanie; Geiger, Anne

    2015-01-01

    Trypanosoma brucei gambiense (Tbg), causing the sleeping sickness chronic form, completes its developmental cycle within the tsetse fly vector Glossina palpalis gambiensis (Gpg) before its transmission to humans. Within the framework of an anti-vector disease control strategy, a global gene expression profiling of trypanosome infected (susceptible), non-infected, and self-cured (refractory) tsetse flies was performed, on their midguts, to determine differential genes expression resulting from in vivo trypanosomes, tsetse flies (and their microbiome) interactions. An RNAseq de novo assembly was achieved. The assembled transcripts were mapped to reference sequences for functional annotation. Twenty-four percent of the 16,936 contigs could not be annotated, possibly representing untranslated mRNA regions, or Gpg- or Tbg-specific ORFs. The remaining contigs were classified into 65 functional groups. Only a few transposable elements were present in the Gpg midgut transcriptome, which may represent active transpositions and play regulatory roles. One thousand three hundred and seventy three genes differentially expressed (DEGs) between stimulated and non-stimulated flies were identified at day-3 post-feeding; 52 and 1025 between infected and self-cured flies at 10 and 20 days post-feeding, respectively. The possible roles of several DEGs regarding fly susceptibility and refractoriness are discussed. The results provide new means to decipher fly infection mechanisms, crucial to develop anti-vector control strategies.

  7. Transcriptome Sequencing of Diverse Peanut (Arachis Wild Species and the Cultivated Species Reveals a Wealth of Untapped Genetic Variability

    Directory of Open Access Journals (Sweden)

    Ratan Chopra

    2016-12-01

    Full Text Available To test the hypothesis that the cultivated peanut species possesses almost no molecular variability, we sequenced a diverse panel of 22 Arachis accessions representing Arachis hypogaea botanical classes, A-, B-, and K- genome diploids, a synthetic amphidiploid, and a tetraploid wild species. RNASeq was performed on pools of three tissues, and de novo assembly was performed. Realignment of individual accession reads to transcripts of the cultivar OLin identified 306,820 biallelic SNPs. Among 10 naturally occurring tetraploid accessions, 40,382 unique homozygous SNPs were identified in 14,719 contigs. In eight diploid accessions, 291,115 unique SNPs were identified in 26,320 contigs. The average SNP rate among the 10 cultivated tetraploids was 0.5, and among eight diploids was 9.2 per 1000 bp. Diversity analysis indicated grouping of diploids according to genome classification, and cultivated tetraploids by subspecies. Cluster analysis of variants indicated that sequences of B genome species were the most similar to the tetraploids, and the next closest diploid accession belonged to the A genome species. A subset of 66 SNPs selected from the dataset was validated; of 782 SNP calls, 636 (81.32% were confirmed using an allele-specific discrimination assay. We conclude that substantial genetic variability exists among wild species. Additionally, significant but lesser variability at the molecular level occurs among accessions of the cultivated species. This survey is the first to report significant SNP level diversity among transcripts, and may explain some of the phenotypic differences observed in germplasm surveys. Understanding SNP variants in the Arachis accessions will benefit in developing markers for selection.

  8. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB882 (Link to dictyBase) - - - Contig-U13884-1 VFB882E (Link...) Clone ID VFB882 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13884-1 Ori... m_ : 1.00 48.0 %: cytoplasmic 44.0 %: nuclear 4.0 %: mitochondrial 4.0 %: peroxisomal >> predict

  9. Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing.

    Directory of Open Access Journals (Sweden)

    Alexander William Eastman

    2015-01-01

    Full Text Available Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing

  10. Analyses of carnivore microsatellites and their intimate association with tRNA-derived SINEs.

    Science.gov (United States)

    López-Giráldez, Francesc; Andrés, Olga; Domingo-Roura, Xavier; Bosch, Montserrat

    2006-10-23

    The popularity of microsatellites has greatly increased in the last decade on account of their many applications. However, little is currently understood about the factors that influence their genesis and distribution among and within species genomes. In this work, we analyzed carnivore microsatellite clones from GenBank to study their association with interspersed repeats and elucidate the role of the latter in microsatellite genesis and distribution. We constructed a comprehensive carnivore microsatellite database comprising 1236 clones from GenBank. Thirty-three species of 11 out of 12 carnivore families were represented, although two distantly related species, the domestic dog and cat, were clearly overrepresented. Of these clones, 330 contained tRNALys-derived SINEs and 357 contained other interspersed repeats. Our rough estimates of tRNA SINE copies per haploid genome were much higher than published ones. Our results also revealed a distinct juxtaposition of AG and A-rich repeats and tRNALys-derived SINEs suggesting their coevolution. Both microsatellites arose repeatedly in two regions of the interspersed repeat. Moreover, microsatellites associated with tRNALys-derived SINEs showed the highest complexity and less potential instability. Our results suggest that tRNALys-derived SINEs are a significant source for microsatellite generation in carnivores, especially for AG and A-rich repeat motifs. These observations indicate two modes of microsatellite generation: the expansion and variation of pre-existing tandem repeats and the conversion of sequences with high cryptic simplicity into a repeat array; mechanisms which are not specific to tRNALys-derived SINEs. Microsatellite and interspersed repeat coevolution could also explain different distribution of repeat types among and within species genomes.Finally, due to their higher complexity and lower potential informative content of microsatellites associated with tRNALys-derived SINEs, we recommend avoiding

  11. Analysis of newly established EST databases reveals similarities between heart regeneration in newt and fish

    Directory of Open Access Journals (Sweden)

    Weis Patrick

    2010-01-01

    Full Text Available Abstract Background The newt Notophthalmus viridescens possesses the remarkable ability to respond to cardiac damage by formation of new myocardial tissue. Surprisingly little is known about changes in gene activities that occur during the course of regeneration. To begin to decipher the molecular processes, that underlie restoration of functional cardiac tissue, we generated an EST database from regenerating newt hearts and compared the transcriptional profile of selected candidates with genes deregulated during zebrafish heart regeneration. Results A cDNA library of 100,000 cDNA clones was generated from newt hearts 14 days after ventricular injury. Sequencing of 11520 cDNA clones resulted in 2894 assembled contigs. BLAST searches revealed 1695 sequences with potential homology to sequences from the NCBI database. BLAST searches to TrEMBL and Swiss-Prot databases assigned 1116 proteins to Gene Ontology terms. We also identified a relatively large set of 174 ORFs, which are likely to be unique for urodele amphibians. Expression analysis of newt-zebrafish homologues confirmed the deregulation of selected genes during heart regeneration. Sequences, BLAST results and GO annotations were visualized in a relational web based database followed by grouping of identified proteins into clusters of GO Terms. Comparison of data from regenerating zebrafish hearts identified biological processes, which were uniformly overrepresented during cardiac regeneration in newt and zebrafish. Conclusion We concluded that heart regeneration in newts and zebrafish led to the activation of similar sets of genes, which suggests that heart regeneration in both species might follow similar principles. The design of the newly established newt EST database allows identification of molecular pathways important for heart regeneration.

  12. Dicty_cDB: SHA665 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHA665 (Link to dictyBase) - - - Contig-U15540-1 - (Link to Or...iginal site) - - SHA665Z 676 - - - - Show SHA665 Library SH (Link to library) Clone ID SHA665 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15540-1 Original site URL http://dictycdb.b...TQIDDKTGIFDSQRFLAFNNPQAMSKYESYRIYIHPSLGY SGNAKRFKQQPDVNEKALILDGNVYDGHLNPIYNCKICTE...ACWNLLEPMHR NYYQPIQFKLPSFPDTSLPITQIDDKTGIFDSQRFLAFNNPQAMSKYESYRIYIHPSLGY SGNAKRFKQQPDVNEKALILDGNVYDGHLNPIYNCKICT

  13. Dicty_cDB: CHE687 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHE687 (Link to dictyBase) - - - Contig-U16336-1 - (Link to Or...iginal site) CHE687F 622 - - - - - - Show CHE687 Library CH (Link to library) Clone ID CHE687 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16336-1 Original site URL http://dictycdb.b...KFINQCINEIKEELKGDMQKKTVAVQKLTYIQ MLGFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSNQS EAYLALNCLSNICT...LGFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSNQS EAYLALNCLSNICTTDLARELANDILTLLSTQKTHILKRAITVLYKIF

  14. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB595 (Link to dictyBase) - - - Contig-U09552-1 VFB595E (Link...) Clone ID VFB595 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U09552-1 Ori...KLLKSDNWISTCQNLIQEYEPQ IIAVVEGFMAPSELCQKIKFCSSSSSTNDFDFIGSSTTDCEICTFISGYAENFLEENKTL EDIIKVVDDFCKILPAAYKTDCVA...A: VEGSGECLVCEFISEKIVTYLEANQTETQILQYLDNDCKLLKSDNWISTCQNLIQEYEPQ IIAVVEGFMAPSELCQKIKFCSSSSSTNDFDFIGSSTTDCEICT... Sequences producing significant alignments: (bits) Value N U66367 |U66367.1 Dictyostelium discoideum SapA (

  15. Dicty_cDB: VHJ417 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHJ417 (Link to dictyBase) - G23450 DDB0202154 Contig-U12507-1...brary) Clone ID VHJ417 (Link to dictyBase) Atlas ID - NBRP ID G23450 dictyBase ID DDB0202154 Link to Contig ...Contig-U12507-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/VH/VHJ4-...LLSAVSDSSVQSLSSSI--- ---IDEIKQSVFGATAIIGQSKYIMEGQLSLLKGVVASTKKDLFIYLFKDCIICTEINSN...QSKYIMEGQLSLLKGVVASTKKDLFIYLFKDCIICTEINSN YQKDGSNNNNNNNNNNSNNSNSNSSSNNSGSNNSSNNNSPNNLTPKKQTFKSLSTSSSTP NNLSQ

  16. Dicty_cDB: CHK172 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHK172 (Link to dictyBase) - - - Contig-U11104-1 - (Link to Or...iginal site) CHK172F 626 - - - - - - Show CHK172 Library CH (Link to library) Clone ID CHK172 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11104-1 Original site URL http://dictycdb.b...HEECKTQGNNYFKQSQYMDAIRCYTQAIELSNGTIA AYYGNRAAAYLAICTKSSLQDSIKDSLKAIELERSFIKGYTRASKAYIHLAQYDQAASII VRGLVFDPRN...KMDHEECKTQGNNYFKQSQYMDAIRCYTQAIELSNGTIA AYYGNRAAAYLAICTKSSLQDSIKDSLKAIELERSFIKGYT

  17. Dicty_cDB: VFH641 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFH641 (Link to dictyBase) - - - Contig-U16151-1 VFH641E (Link... Clone ID VFH641 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16151-1 Original site URL http://dict...KLLGVQTQAKQTRATWKIVVGDGAGAVTFKLATNGGTTEGDFTTTLTSKVLS GSDPKEVGTYYMDVTVPTGTTCTGTNGICT...KLLGVQTQAKQTRATWKIVVGDGAGAVTFKLATNGGTTEGDFTTTLTSKVLS GSDPKEVGTYYMDVTVPTGTTCTGTNGICT...nificant alignments: (bits) Value N AC116984 |AC116984.2 Dictyostelium discoideum

  18. Dicty_cDB: CHA557 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHA557 (Link to dictyBase) - - - Contig-U15635-1 - (Link to Or...iginal site) CHA557F 620 - - - - - - Show CHA557 Library CH (Link to library) Clone ID CHA557 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15635-1 Original site URL http://dictycdb.b...HSIEIGKVEILPNSLIGIDEDGVIQHMKSN YEDLKQLEKDVTMICTDNGINEQESVIDMGNKFLIPGFIDTHAHAPQYHNAGTGTDLPLL KWLEKYTFPVESKFKD...EIGKVEILPNSLIGIDEDGVIQHMKSN YEDLKQLEKDVTMICTDNGINEQESVIDMGNKFLIPGFIDTHAHAPQYHNAGT

  19. Dicty_cDB: SLA117 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLA117 (Link to dictyBase) - - - Contig-U15540-1 - (Link to Or...iginal site) - - SLA117Z 649 - - - - Show SLA117 Library SL (Link to library) Clone ID SLA117 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15540-1 Original site URL http://dictycdb.b...--EPMHRNYYQPIQFKLPSFPDTSLPITQIDDKTGIFDSQRFLAFNNPQAMSKYESYRI YIHPSLGYSGNAKRFKQQPDVNEKALILDGNVYDGHLNPIYNCKICTE...SYRI YIHPSLGYSGNAKRFKQQPDVNEKALILDGNVYDGHLNPIYNCKICTEYYQTKSYFSANP HAKGKVLLVKNNILTRVKDGGFTLSLKPMCCSGHNSHIPLYF

  20. Dicty_cDB: SHE695 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHE695 (Link to dictyBase) - G22060 DDB0217012 Contig-U12072-1...brary) Clone ID SHE695 (Link to dictyBase) Atlas ID - NBRP ID G22060 dictyBase ID DDB0217012 Link to Contig ...Contig-U12072-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/SH/SHE6-...CPISDCSDHGYCDTSIGKC NCDSSHQGAGCEMDLVSCPTSNKAICSGWGTCNNQTGICTCDANRVGSDCSGISCPVPNC NGNGNCDTTVGQCSCDPSHQGANCEMN...IGKC NCDSSHQGAGCEMDLVSCPTSNKAICSGWGTCNNQTGICTCDANRVGSDCSGISCPVPNC NGNGNCDTTVGQCSCDPSHQGANCEMNFVSCPTPNRSVCSGWGTCNNQTGICT

  1. A physical map of the heterozygous grapevine 'Cabernet Sauvignon' allows mapping candidate genes for disease resistance

    Directory of Open Access Journals (Sweden)

    Scalabrin Simone

    2008-06-01

    Full Text Available Abstract Background Whole-genome physical maps facilitate genome sequencing, sequence assembly, mapping of candidate genes, and the design of targeted genetic markers. An automated protocol was used to construct a Vitis vinifera 'Cabernet Sauvignon' physical map. The quality of the result was addressed with regard to the effect of high heterozygosity on the accuracy of contig assembly. Its usefulness for the genome-wide mapping of genes for disease resistance, which is an important trait for grapevine, was then assessed. Results The physical map included 29,727 BAC clones assembled into 1,770 contigs, spanning 715,684 kbp, and corresponding to 1.5-fold the genome size. Map inflation was due to high heterozygosity, which caused either the separation of allelic BACs in two different contigs, or local mis-assembly in contigs containing BACs from the two haplotypes. Genetic markers anchored 395 contigs or 255,476 kbp to chromosomes. The fully automated assembly and anchorage procedures were validated by BAC-by-BAC blast of the end sequences against the grape genome sequence, unveiling 7.3% of chimerical contigs. The distribution across the physical map of candidate genes for non-host and host resistance, and for defence signalling pathways was then studied. NBS-LRR and RLK genes for host resistance were found in 424 contigs, 133 of them (32% were assigned to chromosomes, on which they are mostly organised in clusters. Non-host and defence signalling genes were found in 99 contigs dispersed without a discernable pattern across the genome. Conclusion Despite some limitations that interfere with the correct assembly of heterozygous clones into contigs, the 'Cabernet Sauvignon' physical map is a useful and reliable intermediary step between a genetic map and the genome sequence. This tool was successfully exploited for a quick mapping of complex families of genes, and it strengthened previous clues of co-localisation of major NBS-LRR clusters and

  2. Salmon louse (Lepeophtheirus salmonis transcriptomes during post molting maturation and egg production, revealed using EST-sequencing and microarray analysis

    Directory of Open Access Journals (Sweden)

    Jonassen Inge

    2008-03-01

    Full Text Available Abstract Background Lepeophtheirus salmonis is an ectoparasitic copepod feeding on skin, mucus and blood from salmonid hosts. Initial analysis of EST sequences from pre adult and adult stages of L. salmonis revealed a large proportion of novel transcripts. In order to link unknown transcripts to biological functions we have combined EST sequencing and microarray analysis to characterize female salmon louse transcriptomes during post molting maturation and egg production. Results EST sequence analysis shows that 43% of the ESTs have no significant hits in GenBank. Sequenced ESTs assembled into 556 contigs and 1614 singletons and whenever homologous genes were identified no clear correlation with homologous genes from any specific animal group was evident. Sequence comparison of 27 L. salmonis proteins with homologous proteins in humans, zebrafish, insects and crustaceans revealed an almost identical sequence identity with all species. Microarray analysis of maturing female adult salmon lice revealed two major transcription patterns; up-regulation during the final molting followed by down regulation and female specific up regulation during post molting growth and egg production. For a third minor group of ESTs transcription decreased during molting from pre-adult II to immature adults. Genes regulated during molting typically gave hits with cuticula proteins whilst transcripts up regulated during post molting growth were female specific, including two vitellogenins. Conclusion The copepod L.salmonis contains high a level of novel genes. Among analyzed L.salmonis proteins, sequence identities with homologous proteins in crustaceans are no higher than to homologous proteins in humans. Three distinct processes, molting, post molting growth and egg production correlate with transcriptional regulation of three groups of transcripts; two including genes related to growth, one including genes related to egg production. The function of the regulated

  3. Comparative transcriptome analysis reveals the genetic basis of skin color variation in common carp.

    Directory of Open Access Journals (Sweden)

    Yanliang Jiang

    Full Text Available The common carp is an important aquaculture species that is widely distributed across the world. During the long history of carp domestication, numerous carp strains with diverse skin colors have been established. Skin color is used as a visual criterion to determine the market value of carp. However, the genetic basis of common carp skin color has not been extensively studied.In this study, we performed Illumina sequencing on two common carp strains: the reddish Xingguo red carp and the brownish-black Yellow River carp. A total of 435,348,868 reads were generated, resulting in 198,781 assembled contigs that were used as reference sequences. Comparisons of skin transcriptome files revealed 2,012 unigenes with significantly different expression in the two common carp strains, including 874 genes that were up-regulated in Xingguo red carp and 1,138 genes that were up-regulated in Yellow River carp. The expression patterns of 20 randomly selected differentially expressed genes were validated using quantitative RT-PCR. Gene pathway analysis of the differentially expressed genes indicated that melanin biosynthesis, along with the Wnt and MAPK signaling pathways, is highly likely to affect the skin pigmentation process. Several key genes involved in the skin pigmentation process, including TYRP1, SILV, ASIP and xCT, showed significant differences in their expression patterns between the two strains.In this study, we conducted a comparative transcriptome analysis of Xingguo red carp and Yellow River carp skins, and we detected key genes involved in the common carp skin pigmentation process. We propose that common carp skin pigmentation depends upon at least three pathways. Understanding fish skin color genetics will facilitate future molecular selection of the fish skin colors with high market values.

  4. Comparative transcriptome analysis reveals the genetic basis of skin color variation in common carp.

    Science.gov (United States)

    Jiang, Yanliang; Zhang, Songhao; Xu, Jian; Feng, Jianxin; Mahboob, Shahid; Al-Ghanim, Khalid A; Sun, Xiaowen; Xu, Peng

    2014-01-01

    The common carp is an important aquaculture species that is widely distributed across the world. During the long history of carp domestication, numerous carp strains with diverse skin colors have been established. Skin color is used as a visual criterion to determine the market value of carp. However, the genetic basis of common carp skin color has not been extensively studied. In this study, we performed Illumina sequencing on two common carp strains: the reddish Xingguo red carp and the brownish-black Yellow River carp. A total of 435,348,868 reads were generated, resulting in 198,781 assembled contigs that were used as reference sequences. Comparisons of skin transcriptome files revealed 2,012 unigenes with significantly different expression in the two common carp strains, including 874 genes that were up-regulated in Xingguo red carp and 1,138 genes that were up-regulated in Yellow River carp. The expression patterns of 20 randomly selected differentially expressed genes were validated using quantitative RT-PCR. Gene pathway analysis of the differentially expressed genes indicated that melanin biosynthesis, along with the Wnt and MAPK signaling pathways, is highly likely to affect the skin pigmentation process. Several key genes involved in the skin pigmentation process, including TYRP1, SILV, ASIP and xCT, showed significant differences in their expression patterns between the two strains. In this study, we conducted a comparative transcriptome analysis of Xingguo red carp and Yellow River carp skins, and we detected key genes involved in the common carp skin pigmentation process. We propose that common carp skin pigmentation depends upon at least three pathways. Understanding fish skin color genetics will facilitate future molecular selection of the fish skin colors with high market values.

  5. Dicty_cDB: SFH450 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFH450 (Link to dictyBase) - - - Contig-U13857-1 SFH450F (Link... to Original site) SFH450F 623 - - - - - - Show SFH450 Library SF (Link to library) Clone ID SFH450 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13857-1 Original site URL http://dict...IMIIFLCLEATFDQPYHSIGGFIIVCSGYQLGQGIAGGAFSGIIPDVVHPSQSGI ASGWLGVGFSLGLLIGTILFGTLLEVKNVIHTWYLYGATIAFLGISALITICT...GTILFGTLLEVKNVIHTWYLYGATIAFLGISALITICTMHEDSND EWSFDGSLPSFFKSLHLPSSIYFNFYWVLITRFFNTLGIYMIFSFLLYFATDIIGQTNLM T

  6. Dicty_cDB: SFH236 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFH236 (Link to dictyBase) - G00315 DDB0191340 Contig-U16349-1...brary) Clone ID SFH236 (Link to dictyBase) Atlas ID - NBRP ID G00315 dictyBase ID DDB0191340 Link to Contig ...Contig-U16349-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/SF/SFH2-...VDCVTNSSDASCTNFQYPLANITADINNL CGSMPYMPVCTIQQSCNQESSTSGICDPFSILGDSCLHDMPGMSG--- ---ASGSVVEQCSSVDSISNLPTTMQLFAGIKSICT...ii yvvaclicqfvpfnnhviknpqqvvfvihsqflvivvfticqa*vv--- ---ASGSVVEQCSSVDSISNLPTTMQLFAGIKSICTEMAMDGCEKCSGNSPTTTC

  7. Dicty_cDB: SFC488 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFC488 (Link to dictyBase) - - - Contig-U10843-1 SFC488F (Link... to Original site) SFC488F 556 - - - - - - Show SFC488 Library SF (Link to library) Clone ID SFC488 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10843-1 Original site URL http://dict...ETEITTESILGDGSFGTVYKGRCRLKDVAVKVMLKQVDQKTLTDFRKE VAIMSKIFHPNIVLFLGACTSTPGKLMICTELMKGNLESLLLDPMVKLPLITRMRMAKD...yhtht**KTMATQQQQQQQQQQQQQIKARKDIQIQQAQ SASDILGPPEISETEITTESILGDGSFGTVYKGRCRLKDVAVKVMLKQVDQKTLTDFRKE VAIMSKIFHPNIVLFLGACTSTPGKLMICT

  8. Dicty_cDB: CFH353 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFH353 (Link to dictyBase) - - - Contig-U16205-1 CFH353Z (Link... to Original site) - - CFH353Z 625 - - - - Show CFH353 Library CF (Link to library) Clone ID CFH353 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16205-1 Original site URL http://dict...RDLYTVSQDVTKLAIQSAEQNGIVFLDEIDKICTSRESIKNGGDASTDGVQRDLLPI VEGCMVSTKYGQIDTSRILFIASGAFHNTKPSDLISELQGRLPIRVELKP...: ---RDLYTVSQDVTKLAIQSAEQNGIVFLDEIDKICTSRESIKNGGDASTDGVQRDLLPI VEGCMVSTKYGQIDTSRILFIASGAFHNTKPSDLISELQGRLPIR

  9. Dicty_cDB: SFF779 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFF779 (Link to dictyBase) - - - Contig-U16255-1 SFF779Z (Link... to Original site) - - SFF779Z 731 - - - - Show SFF779 Library SF (Link to library) Clone ID SFF779 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16255-1 Original site URL http://dict...NATCSSLNCNAQQMSCKYVXQACHETSCCPDIPQCQIPATGGGPATGSATGQGTSGG TPGSCDKVNCPNGYICTIVNQLA...DIPQCQIPATGGGPATGSATGQGTSGG TPGSCDKVNCPNGYICTIVNQLAVCVSPSSSSSSSSSTTGSHTTTGGSTTGSH

  10. Dicty_cDB: SHL605 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHL605 (Link to dictyBase) - - - Contig-U16336-1 - (Link to Or...iginal site) SHL605F 632 - - - - - - Show SHL605 Library SH (Link to library) Clone ID SHL605 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16336-1 Original site URL http://dictycdb.b...VDLIRGIRNHKKNETKFINQCINEIKEELKGDMQKKTVAVQK LTYIQMLGFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFL SSNQSEAYLALNCLSNICT...KRIGYLAASQSFNEGTDVIVLATHQIRKDFL SSNQSEAYLALNCLSNICTTDLARELANDILTLLSTQKTHILKRAITVLYKIFLRYPESL RPAFPKLREKLDDPE

  11. Dicty_cDB: CHD753 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHD753 (Link to dictyBase) - - - Contig-U15579-1 - (Link to Or...iginal site) - - CHD753Z 650 - - - - Show CHD753 Library CH (Link to library) Clone ID CHD753 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15579-1 Original site URL http://dictycdb.b...---IQACGPDNNYQCEFVDKICNTTNDKCLVESCEIGFGCLAIPKNCNDNDPCTTDHCDP AIGCYYDKFDNCDACNAVDTCITNDLCFPRECNPRGNPPCLINPINCTSTDPCIFSYCEN GVCIPTYICT...NPRGNPPCLINPINCTSTDPCIFSYCEN GVCIPTYICTPTPSVTPTVTPTVTPTVTPTVTPTVTPTVTPTPTTTPTPSPTTVPPRPTP TPLPADPPPYDLEEGCLV

  12. Dicty_cDB: CFG115 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFG115 (Link to dictyBase) - - - Contig-U03237-1 CFG115E (Link...) Clone ID CFG115 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U03237-1 Ori...k*NNMNGFLKSQLENKTNATTTTTTRYCNNDESCNDENLCTN EMCDPVIGCIYENISCDDDNGCTKDFCDPLTGCFHKRVVCDDNDKCTDNICNPETGTCSF RRRICT...TTTRYCNNDESCNDENLCTN EMCDPVIGCIYENISCDDDNGCTKDFCDPLTGCFHKRVVCDDNDKCTDNICNPETGTCSF RRRICTDNNDCTMDWCNQLTGECVYQ...ng significant alignments: (bits) Value N AC116984 |AC116984.2 Dictyostelium discoideum chromosome 2 map 256

  13. Dicty_cDB: SSA581 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSA581 (Link to dictyBase) - - - Contig-U12576-1 SSA581Z (Link... to Original site) - - SSA581Z 504 - - - - Show SSA581 Library SS (Link to library) Clone ID SSA581 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12576-1 Original site URL http://dict...SARPLGVAMILIGIDDELGPQLFKVDPAGVFTGYKATAAGEK EQESTNFLEKKFKSNPQLSKDETIQMAISTLQSVLGADLKSSDLEIGICTMDNRKFVIMT DEDI...QHASARPLGVAMILIGIDDELGPQLFKVDPAGVFTGYKATAAGEK EQESTNFLEKKFKSNPQLSKDETIQMAISTLQSVLGADLKSSDLEIGICTMDNRKFVIMT D

  14. Dicty_cDB: SFI222 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFI222 (Link to dictyBase) - - - Contig-U13893-1 - (Link to Or...iginal site) - - SFI222Z 716 - - - - Show SFI222 Library SF (Link to library) Clone ID SFI222 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13893-1 Original site URL http://dictycdb.b...LLSIVQLLMGEIPERNTFS QKQLKIALKPYFHLTEAVRVGDLGSFNQALEQNSDIFKSDQTFTLVQRLRSNVIKAGLKK LNTAYSRISFNDICT...SIVQLLMGEIPERNTFS QKQLKIALKPYFHLTEAVRVGDLGSFNQALEQNSDIFKSDQTFTLVQRLRSNVIKAGLKK LNTAYSRISFNDICT

  15. Dicty_cDB: SLE232 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLE232 (Link to dictyBase) - - - Contig-U16255-1 SLE232F (Link... to Original site) SLE232F 614 - - - - - - Show SLE232 Library SL (Link to library) Clone ID SLE232 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16255-1 Original site URL http://dict...PSSGFTDFIPSNATCSSLNCNAQQMSCKYVQQACHETSCCPDIPQC QIPATGGGPATGSATGQGTSGGTPGSCDKVNCPNGYICTIVNQLAVCVSPSSSSSSSSST ...CHETSCCPDIPQC QIPATGGGPATGSATGQGTSGGTPGSCDKVNCPNGYICTIVNQLAVCVSPSSSSSSSSST TGSHTTTGGSTTGSHTTTGGSTTGSHTTTGGSA

  16. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC278 (Link to dictyBase) - - - Contig-U13504-1 VFC278E (Link... Clone ID VFC278 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13504-1 Original site URL http://dict...0.0 %: mitochondrial 4.0 %: plasma membrane >> prediction for VFC278 is cyt 5' end seq. ID VFC278F 5' end se

  17. Dicty_cDB: VFJ144 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFJ144 (Link to dictyBase) - - - Contig-U12576-1 - (Link to Or...iginal site) - - VFJ144Z 698 - - - - Show VFJ144 Library VF (Link to library) Clone ID VFJ144 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12576-1 Original site URL http://dictycdb.b...IGIDDELGPQLFKVDPAGVFTGYKATAAGEKEQESTNFLEKK FKSNPQLSKDETIQMAISTLQSVLGADLKPSDLEIGICTMDNRKFVIMTDED Translated A...PQLFKVDPAGVFTGYKATAAGEKEQESTNFLEKK FKSNPQLSKDETIQMAISTLQSVLGADLKPSDLEIGICTMDNRKFVIMTDED Frame B: ---givrkyl*

  18. Molecular characterization and combined genotype association study of bovine cluster of differentiation 14 gene with clinical mastitis in crossbred dairy cattle

    Directory of Open Access Journals (Sweden)

    A. Sakthivel Selvan

    2016-07-01

    Full Text Available Aim: The present study was undertaken with the objectives to characterize and to analyze combined genotypes of cluster of differentiation 14 (CD14 gene to explore its association with clinical mastitis in Karan Fries (KF cows maintained in the National Dairy Research Institute herd, Karnal. Materials and Methods: Genomic DNA was extracted using blood of randomly selected 94 KF lactating cattle by phenolchloroform method. After checking its quality and quantity, polymerase chain reaction (PCR was carried out using six sets of reported gene-specific primers to amplify complete KF CD14 gene. The forward and reverse sequences for each PCR fragments were assembled to form complete sequence for the respective region of KF CD14 gene. The multiple sequence alignments of the edited sequence with the corresponding reference with reported Bos taurus sequence (EU148610.1 were performed with ClustalW software to identify single nucleotide polymorphisms (SNPs. Basic Local Alignment Search Tool analysis was performed to compare the sequence identity of KF CD14 gene with other species. The restriction fragment length polymorphism (RFLP analysis was carried out in all KF cows using Helicobacter pylori 188I (Hpy188I (contig 2 and Haemophilus influenzae I (HinfI (contig 4 restriction enzyme (RE. Cows were assigned genotypes obtained by PCRRFLP analysis, and association study was done using Chi-square (χ2 test. The genotypes of both contigs (loci number 2 and 4 were combined with respect to each animal to construct combined genotype patterns. Results: Two types of sequences of KF were obtained: One with 2630 bp having one insertion at 616 nucleotide (nt position and one deletion at 1117 nt position, and the another sequence was of 2629 bp having only one deletion at 615 nt position. ClustalW, multiple alignments of KF CD14 gene sequence with B. taurus cattle sequence (EU148610.1, revealed 24 nt changes (SNPs. Cows were also screened using PCR-RFLP with Hpy188I

  19. Two sequence-ready contigs spanning the two copies of a 200-kb duplication on human 21q: partial sequence and polymorphisms.

    Science.gov (United States)

    Potier, M; Dutriaux, A; Orti, R; Groet, J; Gibelin, N; Karadima, G; Lutfalla, G; Lynn, A; Van Broeckhoven, C; Chakravarti, A; Petersen, M; Nizetic, D; Delabar, J; Rossier, J

    1998-08-01

    Physical mapping across a duplication can be a tour de force if the region is larger than the size of a bacterial clone. This was the case of the 170- to 275-kb duplication present on the long arm of chromosome 21 in normal human at 21q11.1 (proximal region) and at 21q22.1 (distal region), which we described previously. We have constructed sequence-ready contigs of the two copies of the duplication of which all the clones are genuine representatives of one copy or the other. This required the identification of four duplicon polymorphisms that are copy-specific and nonallelic variations in the sequence of the STSs. Thirteen STSs were mapped inside the duplicated region and 5 outside but close to the boundaries. Among these STSs 10 were end clones from YACs, PACs, or cosmids, and the average interval between two markers in the duplicated region was 16 kb. Eight PACs and cosmids showing minimal overlaps were selected in both copies of the duplication. Comparative sequence analysis along the duplication showed three single-basepair changes between the two copies over 659 bp sequenced (4 STSs), suggesting that the duplication is recent (less than 4 mya). Two CpG islands were located in the duplication, but no genes were identified after a 36-kb cosmid from the proximal copy of the duplication was sequenced. The homology of this chromosome 21 duplicated region with the pericentromeric regions of chromosomes 13, 2, and 18 suggests that the mechanism involved is probably similar to pericentromeric-directed mechanisms described in interchromosomal duplications. Copyright 1998 Academic Press.

  20. Dicty_cDB: SHI867 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHI867 (Link to dictyBase) - - - Contig-U11503-1 - (Link to Or...iginal site) SHI867F 646 - - - - - - Show SHI867 Library SH (Link to library) Clone ID SHI867 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11503-1 Original site URL http://dictycdb.b...fsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGC GFNNEPICTSLKDAVSRAFLLISNNSRVCIGIIGNINVTSEQITLGNYCGA...l*lsif--- Frame C: qkkqfsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGC GFNNEPICT

  1. Dicty_cDB: FC-AJ15 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available FC (Link to library) FC-AJ15 (Link to dictyBase) - G01152 DDB0232964 Contig-U16520-...to library) Clone ID FC-AJ15 (Link to dictyBase) Atlas ID - NBRP ID G01152 dictyBase ID DDB0232964 Link to C...ontig Contig-U16520-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/FC...KIEFPLPDIKTKRKIFEI HTAKMNLSEDVNLEEFVMSKDDLSGADIKAICTESGLLALRERRMRVTHTDFKKAKEKVL YRKTAGAPEGLYM*kkknqnqk Trans...vih*qlviwrrlsmxitpschqp*tralcpyhvfv--- ---ELLNQLDGFDASTDVKVIMATNRIETLDPALIRPGRIDRKIEFPLPDIKTKRKIFEI HTAKMNLSEDVNLEEFVMSKDDLSGADIKAICT

  2. Dicty_cDB: SSI339 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSI339 (Link to dictyBase) - - - Contig-U04467-1 SSI339Z (Link... to Original site) - - SSI339Z 563 - - - - Show SSI339 Library SS (Link to library) Clone ID SSI339 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U04467-1 Original site URL http://dict...1998. 1.22 Translated Amino Acid sequence ---FTCSNNQVISSSLVSENNCIYTVEMSGNIFCPTPTPTPTPTPTPTPNPTSNVTCKSS NGISITSSDIITCIGYGQSICT...NQVISSSLVSENNCIYTVEMSGNIFCPTPTPTPTPTPTPTPNPTSNVTCKSS NGISITSSDIITCIGYGQSICTTSSGYSCETNQTNGVLKCISPDNSISCIGNQFY

  3. Dicty_cDB: SLB394 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLB394 (Link to dictyBase) - G24120 DDB0184167BP Contig-U13982... (Link to library) Clone ID SLB394 (Link to dictyBase) Atlas ID - NBRP ID G24120 dictyBase ID DDB0184167BP L...ink to Contig Contig-U13982-1 Original site URL http://dictycdb.biol.tsukuba.ac.j...2 Translated Amino Acid sequence TDTNPKEPSNVESIITTSEVTPSPPPSTTTSTTTNATTTITTSQPQANIIGGKRSRKDDE IISIQEALNDQLEEEKNLLEEAKEQEQEDWGDESICT...e B: TDTNPKEPSNVESIITTSEVTPSPPPSTTTSTTTNATTTITTSQPQANIIGGKRSRKDDE IISIQEALNDQLEEEKNLLEEAKEQEQEDWGDESICT

  4. Dicty_cDB: SHI777 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHI777 (Link to dictyBase) - - - Contig-U12085-1 - (Link to Or...iginal site) SHI777F 332 - - - - - - Show SHI777 Library SH (Link to library) Clone ID SHI777 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12085-1 Original site URL http://dictycdb.b...IKEIIQHAKEYGIRVELEIDMPGHAYSWGIGYPS VLPANFSHSIQCQPSICTNK***ikknk*kk*kniylkikkkx*kk...IKEIIQHAKEYGIRVELEIDMPGHAYSWGIGYPS VLPANFSHSIQCQPSICTNK***ikknk*kk*kniylkikkkx*kk

  5. Dicty_cDB: VHB478 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHB478 (Link to dictyBase) - - - Contig-U16349-1 - (Link to Or...iginal site) - - VHB478Z 556 - - - - Show VHB478 Library VH (Link to library) Clone ID VHB478 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16349-1 Original site URL http://dictycdb.b...cid sequence ---ASGSVVXQCSSVDSISNLPTXMQLFAGIKSICTEMAMDGCEKCSGNSPTTTCDVLPV YSSLCMAMPDMSQCANWTKMCSSSGQLYNSQITT...xxcxlf*trknp*kyfrkkmdsqqkrfxr*xfn*vhlsl sgxy Frame C: ---ASGSVVXQCSSVDSISNLPTXMQLFAGIKSICTEMAMDGCEKCSGNSPTTT

  6. Dicty_cDB: SHF448 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHF448 (Link to dictyBase) - - - Contig-U11503-1 SHF448E (Link...) Clone ID SHF448 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11503-1 Ori...quence qkkqfsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGC GFNNEPICTSLKD...mnvnlkimrigltqlxisyrfy Frame C: qkkqfsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGC GFNNEPICTSLKDAV...nts: (bits) Value N AC115680 |AC115680.3 Dictyostelium discoideum chromosome 2 map 4915084-5005461 strain AX

  7. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB533 (Link to dictyBase) - G02515 DDB0218082 Contig-U10369-1...brary) Clone ID VFB533 (Link to dictyBase) Atlas ID - NBRP ID G02515 dictyBase ID DDB0218082 Link to Contig ...Contig-U10369-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/VF/VFB5-... %: cytoskeletal 4.0 %: extracellular, including cell wall 4.0 %: peroxisomal >> predict

  8. Dicty_cDB: SFC123 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFC123 (Link to dictyBase) - - - Contig-U16368-1 SFC123E (Link...) Clone ID SFC123 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16368-1 Ori...CVPHHDGCGNIQCPWGHYCVNEHGKCRCVPHRPPPRPPVDQCRNQHCPH GYSCRVIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQIC GSVNCGPGYICT...CRCVPHRPPPRPPVDQCRNQHCPH GYSCRVIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQIC GSVNCGPGYICTIINGHPTCIR...(bits) Value N D13973 |D13973.1 Dictyostelium discoideum DNA for Dp87 protein, complete cds. 1643 0.0 5 BJ17

  9. Dicty_cDB: SHL102 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHL102 (Link to dictyBase) - - - Contig-U11892-1 - (Link to Or...iginal site) - - SHL102Z 634 - - - - Show SHL102 Library SH (Link to library) Clone ID SHL102 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11892-1 Original site URL http://dictycdb.b...TCRXKFDNQKIHDCDWIIQGPTTPSLCANCGKICTSKCTTNYCDRDCQTVDWQSDHKL ICGLKSFKDR*detpikn*ik*...A FSTCRXKFDNQKIHDCDWIIQGPTTPSLCANCGKICTSKCTTNYCDRDCQTVDWQSDHKL ICGLKSFKDR*detpikn*ik*kkiklnnk Frame C: ---lk

  10. Transcriptomic landscape of Dendrobium huoshanense and its genes related to polysaccharide biosynthesis

    Directory of Open Access Journals (Sweden)

    Rongchun Han

    2018-03-01

    Full Text Available Dendrobium huoshanense has long been used to treat various diseases in oriental medicine. In order to study its gene expression profile, transcripts involved in the biosynthesis of precursors of polysaccharides, as well as mechanisms underlining morphological differences between wild and cultivated plants, three organs of both wild type and cultivated D. huoshanense were collected and sequenced by Illumina HiSeq4000 platform, yielding 919,409,540 raw reads in FASTQ format. After Trinity de novo assembly and quality control, 241,242 nonredundant contigs with the average length of 967.5 bp were generated. qRT-PCR experiment on the selected transcripts showed the transcriptomic data were reliable. BLASTx was conducted against NR, SwissProt, String, Pfam, and KEGG. Gene ontology annotation revealed more than 40,000 contigs assigned to catalytic activity and metabolic process, suggesting its dynamic physiological activities. By searching KEGG pathway, six genes potentially involved in mannose biosynthetic pathway were retrieved. Gene expression analysis for stems between wild and cultivated D. huoshanense resulted in 956 genes differentially expressed. Simple sequence repeats (SSRs analysis revealed 143 SSRs with the unit size of 4 and 3,437 SSRs the size of 3. The obtained SSRs are the potential molecular markers for discriminating distinct cultivars of D. huoshanense.

  11. Behavioural mapping of a pelagic seabird: combining multiple sensors and a hidden Markov model reveals the distribution of at-sea behaviour

    Science.gov (United States)

    Dean, Ben; Freeman, Robin; Kirk, Holly; Leonard, Kerry; Phillips, Richard A.; Perrins, Chris M.; Guilford, Tim

    2013-01-01

    The use of miniature data loggers is rapidly increasing our understanding of the movements and habitat preferences of pelagic seabirds. However, objectively interpreting behavioural information from the large volumes of highly detailed data collected by such devices can be challenging. We combined three biologging technologies—global positioning system (GPS), saltwater immersion and time–depth recorders—to build a detailed picture of the at-sea behaviour of the Manx shearwater (Puffinus puffinus) during the breeding season. We used a hidden Markov model to explore discrete states within the combined GPS and immersion data, and found that behaviour could be organized into three principal activities representing (i) sustained direct flight, (ii) sitting on the sea surface, and (iii) foraging, comprising tortuous flight interspersed with periods of immersion. The additional logger data verified that the foraging activity corresponded well to the occurrence of diving. Applying this approach to a large tracking dataset revealed that birds from two different colonies foraged in local waters that were exclusive, but overlapped in one key area: the Irish Sea Front (ISF). We show that the allocation of time to each activity differed between colonies, with birds breeding furthest from the ISF spending the greatest proportion of time engaged in direct flight and the smallest proportion of time engaged in foraging activity. This type of analysis has considerable potential for application in future biologging studies and in other taxa. PMID:23034356

  12. Dicty_cDB: FC-BR21 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available FC (Link to library) FC-BR21 (Link to dictyBase) - - - Contig-U15384-1 | Contig-U16443-1 FC-BR2...1P (Link to Original site) FC-BR21F 551 FC-BR21Z 122 FC-BR21P 673 - - Show FC-BR21 Library FC (L...ink to library) Clone ID FC-BR21 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Cont...ig-U15384-1 | Contig-U16443-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/FC/FC-BR/FC-BR2...1Q.Seq.d/ Representative seq. ID FC-BR21P (Link to Original site) Representative DNA sequence >FC-BR21 (FC-BR2

  13. Dicty_cDB: FC-AI07 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available FC (Link to library) FC-AI07 (Link to dictyBase) - - - Contig-U15296-1 | Contig-U15756-1 FC-AI...07P (Link to Original site) FC-AI07F 580 FC-AI07Z 723 FC-AI07P 1303 - - Show FC-AI07 Library FC (...Link to library) Clone ID FC-AI07 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Con...tig-U15296-1 | Contig-U15756-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/FC/FC-AI/FC-AI...07Q.Seq.d/ Representative seq. ID FC-AI07P (Link to Original site) Representative DNA sequence >FC-AI07 (FC-AI

  14. Dicty_cDB: FC-AI22 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available FC (Link to library) FC-AI22 (Link to dictyBase) - - - Contig-U15369-1 | Contig-U15732-1 FC-AI...22P (Link to Original site) FC-AI22F 583 FC-AI22Z 683 FC-AI22P 1266 - - Show FC-AI22 Library FC (...Link to library) Clone ID FC-AI22 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Con...tig-U15369-1 | Contig-U15732-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/FC/FC-AI/FC-AI...22Q.Seq.d/ Representative seq. ID FC-AI22P (Link to Original site) Representative DNA sequence >FC-AI22 (FC-AI

  15. Dicty_cDB: AFJ353 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFJ353 (Link to dictyBase) - - - Contig-U10972-1 AFJ353P (Link... to Original site) AFJ353F 592 AFJ353Z 580 AFJ353P 1172 - - Show AFJ353 Library AF (Link to library) Clone ID AFJ353 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10972-1 Original site URL http://dict...SISTGLPNLSGGKINTNVYSSLGNIGSGMNGSSTKDDSVSPTKKSTDHQP RKPMTKSVSMSSIGLN--- ---NYLIQSFLKQFSPEVNQFVCDAIMGNIEEICT...TNVYSSLGNIGSGMNGSSTKDDSVSPTKKSTDHQP RKPMTKSVSMSSIGLN--- ---NYLIQSFLKQFSPEVNQFVCDAIMGNIEEICTHKVGCTVVNRCIDNANP

  16. Dicty_cDB: SFA408 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFA408 (Link to dictyBase) - - - Contig-U10972-1 SFA408P (Link... to Original site) SFA408F 509 SFA408Z 686 SFA408P 1195 - - Show SFA408 Library SF (Link to library) Clone ID SFA408 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10972-1 Original site URL http://dict...YLSXDQVESIIASIKGKVIQLSKDNKGNYLIQSFLKQFSPEVNQFV CDAIMGNIEEICTHKVGCTVVNRCIDNANPEQLEKLVDKITEHSLKLVQDQFGNYVVQHL ...KDNKGNYLIQSFLKQFSPEVNQFV CDAIMGNIEEICTHKVGCTVVNRCIDNANPEQLEKLVDKITEHSLKLVQDQFGNYV

  17. Dicty_cDB: SLG821 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLG821 (Link to dictyBase) - - - Contig-U15540-1 SLG821P (Link... to Original site) SLG821F 638 SLG821Z 554 SLG821P 1192 - - Show SLG821 Library SL (Link to library) Clone ID SLG821 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15540-1 Original site URL http://dict...MSPSSPICLPESPMGAQHWESGL--- ---IDDKTGIFDSQRFLAFNNPQAMSKYESYRIYIHPSLGYSGNAKRFKQQPDVNEKALI LDGNVYDGHLNPIYNCKICT...*--- ---IDDKTGIFDSQRFLAFNNPQAMSKYESYRIYIHPSLGYSGNAKRFKQQPDVNEKALI LDGNVYDGHLNPIYNCKICT

  18. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB866 (Link to dictyBase) - - - Contig-U16349-1 VFB866Z (Link... to Original site) - - VFB866Z 631 - - - - Show VFB866 Library VF (Link to library) Clone ID VFB866 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16349-1 Original site URL http://dict...ce ---SSVDSISNLPTTMQLFAGIKSICTEMAMDGCEKCSGNSPTTTCDVLPVYSSLCMAMP DMSQCANWTKMCSSSGQLYNSQITSDYCVASVADAVPIMRMYFH...RGCLHAIELTCSYALMLVAMTFNVALFFAV Translated Amino Acid sequence (All Frames) Frame A: ---SSVDSISNLPTTMQLFAGIKSICT

  19. Dicty_cDB: VHL364 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHL364 (Link to dictyBase) - - - Contig-U16205-1 - (Link to Or...iginal site) - - VHL364Z 668 - - - - Show VHL364 Library VH (Link to library) Clone ID VHL364 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16205-1 Original site URL http://dictycdb.b...mino Acid sequence ---KKVTIAEAKEIFEKQYRDLYTVSQDVTKLAIQSAEQNGIVFLDEIDKICTSRESIKN GGDASTDGVQRDLLPIVEGCMVSTKYGQ...itwyh*tyrrgykf*lxys*r*nscfrysrh*ktfk Frame B: ---KKVTIAEAKEIFEKQYRDLYTVSQDVTKLAIQSAEQNGIVFLDEIDKICTSRESIKN G

  20. Dicty_cDB: VHK472 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHK472 (Link to dictyBase) - - - Contig-U16349-1 - (Link to Or...iginal site) - - VHK472Z 392 - - - - Show VHK472 Library VH (Link to library) Clone ID VHK472 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16349-1 Original site URL http://dictycdb.b....10 Translated Amino Acid sequence ---CSSVDSISNLPTXMQLFAGIKSICTEMAMDGCEKCSGNSPTTT...wstlqlsnhirllcrlcc*rrsnhenvlshwylglypl*ilgtk n*psicwfmv Frame B: ---CSSVDSISNLPTXMQLFAGIKSICTEMAMDGCEKCSGNSP

  1. Dicty_cDB: CFJ816 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFJ816 (Link to dictyBase) - - - Contig-U16255-1 CFJ816P (Link... to Original site) CFJ816F 562 CFJ816Z 721 CFJ816P 1263 - - Show CFJ816 Library CF (Link to library) Clone ID CFJ816 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16255-1 Original site URL http://dict...ATGSATGQGTSGGTPGSCDKV NCPNGYICTIVNQLAVCVSPSSSSSSSSSTTGSHTTTGGSTTGSHTTTGGSTTGSHTTT...PATGSATGQGTSGGTPGSCDKV NCPNGYICTIVNQLAVCVSPSSSSSSSSSTTGSHTTTGGSTTGSHTTTGGSTTGSHTTTG GSTTGSHTTTGGSTTGSHTTTGGS

  2. Dicty_cDB: CFI686 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFI686 (Link to dictyBase) - - - Contig-U15579-1 CFI686P (Link... to Original site) CFI686F 185 CFI686Z 445 CFI686P 610 - - Show CFI686 Library CF (Link to library) Clone ID CFI686 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15579-1 Original site URL http://dict...CYYDKFDXCXACN XVDTCXXNDLCXXRECNXRGXPPCLINPINCXSXDPCIFSYCENGVCIXTYICTPTPSVT PTVTPT...LVXXCXIGFGCXAIXKNCNXNDPXXXDXCDXXIGCYYDKFDXCXACN XVDTCXXNDLCXXRECNXRGXPPCLINPINCXSXDPCIFSYCENGVCIXTYICTPTPSVT

  3. Dicty_cDB: VHA365 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHA365 (Link to dictyBase) - - - Contig-U16349-1 - (Link to Or...iginal site) - - VHA365Z 352 - - - - Show VHA365 Library VH (Link to library) Clone ID VHA365 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16349-1 Original site URL http://dictycdb.b...TTGGGTACCAAGAACTGACCGTCAATTTGCTGGTTCATGGTTT sequence update 2002. 9.10 Translated Amino Acid sequence ---QLFAGIKSICT...wfmv Frame C: ---QLFAGIKSICTEMAMDGCEKCSGNSPTTTCDVLPVYSSLCMAMPDMSQCANWTKMCS SSGQLY

  4. Dicty_cDB: VFF110 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFF110 (Link to dictyBase) - - - Contig-U10843-1 VFF110P (Link... to Original site) VFF110F 496 VFF110Z 306 VFF110P 802 - - Show VFF110 Library VF (Link to library) Clone ID VFF110 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10843-1 Original site URL http://dict...AVKVMLKQVDQKTLTDF RKEVAIMSKIFHPNIVLFLGACTSTPGKLMICTELMKGNLESLL--- ---XIFKVXXNGGIXXQXIXXPQGKFQXXKSKXXRXKXLITG...VYKGRCRLKDVAVKVMLKQVDQKTLTDF RKEVAIMSKIFHPNIVLFLGACTSTPGKLMICTELMKGNLESLL--- ---qflklxxmxvfxxkxfxxpkvnskxxkv

  5. Dicty_cDB: SLE110 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLE110 (Link to dictyBase) - - - Contig-U16520-1 SLE110E (Link... to Original site) - - - - - - SLE110E 237 Show SLE110 Library SL (Link to library) Clone ID SLE110 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16520-1 Original site URL http://dict...NLSEDVNLEEFVMSKDDLSGADIKAICTESGLLALRERRMRVTYXDFKKAKEKVLYR KTAGAPEGLYM*kkknqnq Translated Amino Acid sequence... (All Frames) Frame A: AKMNLSEDVNLEEFVMSKDDLSGADIKAICTESGLLALRERRMRVTYXDFKKAKEKVL

  6. Dicty_cDB: SSB322 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSB322 (Link to dictyBase) - - - Contig-U03539-1 SSB322Z (Link... to Original site) - - SSB322Z 393 - - - - Show SSB322 Library SS (Link to library) Clone ID SSB322 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U03539-1 Original site URL http://dict...KGKXYILQDKQYKERGVGTIRVNKDLEEKXRIIMNAD GSKKNILNVNIFPKMKVTSPNEKTLTFIAFEDDKICTFVLIAKPEEIKNFSTVINKQISS LEVA*nfil...NAD GSKKNILNVNIFPKMKVTSPNEKTLTFIAFEDDKICTFVLIAKPEEIKNFSTVINKQISS LEVA*nfilkkkk Homology vs CSM-cDNA Score E

  7. Dicty_cDB: AHB126 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AH (Link to library) AHB126 (Link to dictyBase) - - - Contig-U16336-1 AHB126P (Link... to Original site) AHB126F 598 AHB126Z 500 AHB126P 1078 - - Show AHB126 Library AH (Link to library) Clone ID AHB126 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16336-1 Original site URL http://dict...RGIRNHKKNETKFINQCINEIKEELKGDMQKKTVAVQKLTYIQMLGFDI SWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSNQSEAYLAL NCLSNICT...KKNETKFINQCINEIKEELKGDMQKKTVAVQKLTYIQMLGFDI SWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSNQSEAYLAL NCLSNICT

  8. Dicty_cDB: VHC776 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHC776 (Link to dictyBase) - - - Contig-U15722-1 VHC776P (Link... to Original site) VHC776F 593 VHC776Z 895 VHC776P 1468 - - Show VHC776 Library VH (Link to library) Clone ID VHC776 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15722-1 Original site URL http://dict...CDNNTGNCTCHNEXFGNSCEFTRCP LDCSTPNGTCDNNXGNCTCHNEHFGNGCEFTQCPLYCSTPNGTCDINSGICTCDN...XFGNSCEFTRCP LDCSTPNGTCDNNXGNCTCHNEHFGNGCEFTQCPLYCSTPNGTCDINSGICTCDNEHIGN GCEIKFIECKHKCSTKHGICDNDSGNCKCDTQTK

  9. Dicty_cDB: VHK477 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHK477 (Link to dictyBase) - - - Contig-U11536-1 VHK477P (Link... to Original site) VHK477F 685 VHK477Z 794 VHK477P 1459 - - Show VHK477 Library VH (Link to library) Clone ID VHK477 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11536-1 Original site URL http://dict...QIGTYSFEGKFIKKAIINGNRIVTINN KLLNENQLNSNSTITKEKDSSNQLSSFISIEIPHYDSNVLLDPSFSVLLDSNESYKNSEN SICTSKKSSKLTTAQIIAI...NQLSSFISIEIPHYDSNVLLDPSFSVLLDSNESYKNSEN SICTSKKSSKLTTAQIIAIVIVSVVFVIILALFIIVKFSKS

  10. Dicty_cDB: SFD793 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFD793 (Link to dictyBase) - - - Contig-U16255-1 SFD793P (Link... to Original site) SFD793F 660 SFD793Z 689 SFD793P 1349 - - Show SFD793 Library SF (Link to library) Clone ID SFD793 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16255-1 Original site URL http://dict...ATCSSLNCNAQQMSCKYVQQACHETSC CPDIPQCQIPATGGGPATGSATGQGTSGGTPGSCDKVNCPNGYICTIVNQLAVCVSPSSS SSSSSSTTGSHTTTGGSTT...TGGGPATGSATGQGTSGGTPGSCDKVNCPNGYICTIVNQLAVCVSPSSS SSSSSSTTGSHTTTGGSTTGSHTTTGGSTTG

  11. Dicty_cDB: FC-AC21 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available FC (Link to library) FC-AC21 (Link to dictyBase) - - - Contig-U15104-1 FC-AC21E (Li...nk to Original site) - - - - - - FC-AC21E 527 Show FC-AC21 Library FC (Link to library) Clone ID FC-AC21 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15104-1 Original site URL http://dict...ce KDSLDVIIFPEMVKLVGLTPNTMEKVLTYFQDNDTIDLSTFPMEIQVEQLSGKYIFICTH KQKDQRCGYCGPILVDQLRDQIKERSLEKEIQVFGTSHVGGHKY... Frames) Frame A: KDSLDVIIFPEMVKLVGLTPNTMEKVLTYFQDNDTIDLSTFPMEIQVEQLSGKYIFICTH KQ

  12. Dicty_cDB: VHI141 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHI141 (Link to dictyBase) - - - Contig-U15722-1 VHI141P (Link... to Original site) VHI141F 635 VHI141Z 922 VHI141P 1537 - - Show VHI141 Library VH (Link to library) Clone ID VHI141 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15722-1 Original site URL http://dict...LDCSTPNGTCDNNTGNCTCHNEHFGNSC EFTRCPLDCSTPNGTCDNNTGNCTCHNEHFGNGCEFTQCPLYCSTPNGTCDINSGICTCD NEHIGNGCEIKFIECKHK...PNGTCDNNTGNCTCHNEHFGNGCEFTQCPLYCSTPNGTCDINSGICTCD NEHIGNGCEIKFIECKHKCSTKHGICDNDSG

  13. Dicty_cDB: VFL117 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFL117 (Link to dictyBase) - G22229 DDB0204982 Contig-U10461-1... - (Link to Original site) - - VFL117Z 591 - - - - Show VFL117 Library VF (Link to library) Clone ID VFL117 (Link to dict...yBase) Atlas ID - NBRP ID G22229 dictyBase ID DDB0204982 Link to Contig Contig-U10461-1 Original site URL http://dict...GMALLRSLQNHKKNTPFQHFQDMIFKTK VLITTVSLVFICTVIEEGIYQFAFEEFSTSTSVQSNFITYLIDTTQMIVVMYILANGKFS NYILFRKVKTTSFNSNEK...k**ytkfkynn**hw Frame B: ---TSMLMDIKFGQAVGFLILLAIYCAMVIGFGMALLRSLQNHKKNTPFQHFQDMIFKTK VLITTVSLVFICT

  14. Dicty_cDB: VHC263 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHC263 (Link to dictyBase) - - - Contig-U16349-1 - (Link to Or...iginal site) - - VHC263Z 429 - - - - Show VHC263 Library VH (Link to library) Clone ID VHC263 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16349-1 Original site URL http://dictycdb.b...ATGG ACTCCCAAC sequence update 2002. 9.10 Translated Amino Acid sequence ---QLFAGIKSICT...kyfrkkmdsq Frame B: ---QLFAGIKSICTXMAMDGCEKCSGNSPTTTCDVLPVYSSLCMAMPDMSQCANWTKMCS

  15. Dicty_cDB: VHO201 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHO201 (Link to dictyBase) - - - Contig-U12425-1 VHO201P (Link... to Original site) VHO201F 596 VHO201Z 398 VHO201P 974 - - Show VHO201 Library VH (Link to library) Clone ID VHO201 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12425-1 Original site URL http://dict...SGSQMEIQNRRGIVNLGDYSTSR DDNPHVLTVLLKQFLRDLPEPICTNALYDLFLAASDQINFQQCKENGFEVLKKLINS...CKREXMELRIPTFVSNILNTLFLHSLGVEGLFRISGSQMEIQNRRGIVNLGDYSTSR DDNPHVLTVLLKQFLRDLPEPICT

  16. Dicty_cDB: AFI379 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFI379 (Link to dictyBase) - - - Contig-U08795-1 AFI379P (Link... to Original site) AFI379F 645 AFI379Z 836 AFI379P 1471 - - Show AFI379 Library AF (Link to library) Clone ID AFI379 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U08795-1 Original site URL http://dict...ITNNYFFYFLKIFCYYYFYYHKMALVLPIFATENTLLVTENDL GRGFSVDDNGANAKKSNVYICTKEDMTTVRSITDSNKLTLINDQESLTSALNVSGELSLS YGL...KIFCYYYFYYHKMALVLPIFATENTLLVTENDL GRGFSVDDNGANAKKSNVYICTKEDMTTVRSITDSNKLTLINDQESLTSALNVSGELSLS YGLMSGKIMGEYL

  17. Dicty_cDB: SFH459 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFH459 (Link to dictyBase) - - - Contig-U16336-1 SFH459P (Link... to Original site) SFH459F 606 SFH459Z 728 SFH459P 1334 - - Show SFH459 Library SF (Link to library) Clone ID SFH459 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16336-1 Original site URL http://dict...KTVAVQKLTYIQ MLGFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSNQS EAYLALNCLSNICTTDLARELANDILTLLSTQKT...TVAVQKLTYIQ MLGFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSNQS EAYLALNCLSNICTTDLARELANDILTLLSTQKTH

  18. Dicty_cDB: CHR591 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHR591 (Link to dictyBase) - - - Contig-U11865-1 CHR591P (Link... to Original site) CHR591F 557 CHR591Z 706 CHR591P 1243 - - Show CHR591 Library CH (Link to library) Clone ID CHR591 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11865-1 Original site URL http://dict... IDPYEIQQNKQSNNSNSNSNRNLTPNSSSPTNQRKNKQEDDDDESKLDDESDLNERFQKV VLRFREFPSLDTNLYRLQEICT...DLNERFQKV VLRFREFPSLDTNLYRLQEICTDLLHISQDFIHTVKTYGRIIIEERYLKEKTIQSRSIGG H--- ---RK

  19. Dicty_cDB: SFF125 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFF125 (Link to dictyBase) - - - Contig-U16368-1 SFF125P (Link... to Original site) SFF125F 540 SFF125Z 608 SFF125P 1148 - - Show SFF125 Library SF (Link to library) Clone ID SFF125 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16368-1 Original site URL http://dict...RNQHCPHGYSCR VIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQICGSVNC GPGYICTIINGHPTCIRGDGYLCNQTRCPHDYQC...ACCVPHHDGCGNIQCPWGHYCVNEHGKCRCVPHRPPPRPPVDQCRNQHCPHGYSCR VIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQICGSVNC GPGYICT

  20. Dicty_cDB: VHL434 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHL434 (Link to dictyBase) - - - Contig-U16336-1 VHL434P (Link... to Original site) VHL434F 546 VHL434Z 778 VHL434P 1304 - - Show VHL434 Library VH (Link to library) Clone ID VHL434 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16336-1 Original site URL http://dict...EVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLS SNQSEAYLALNCLSNICTTDLARELANDILTLLSTQKTHILKRAITVLYKIFLRYPES-- - --...GFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLS SNQSEAYLALNCLSNICTTDLARELANDILTLLSTQKTHILKRAITVLYKIFL

  1. Dicty_cDB: CHN802 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHN802 (Link to dictyBase) - - - Contig-U16336-1 CHN802P (Link... to Original site) CHN802F 543 CHN802Z 788 CHN802P 1311 - - Show CHN802 Library CH (Link to library) Clone ID CHN802 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16336-1 Original site URL http://dict...ASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSN QSEAYLALNCLSNICTTDLARELANDILTLLSTQKTHILKRAITVLYKIFLRYPESL...NETKFINQCINEIKEELKGDMQKKTVAVQKLTY IQMLGFDISWASFKIVEVMSCNKFSSKRIGYLAASQSFNEGTDVIVLATHQIRKDFLSSN QSEAYLALNCLSNICT

  2. Dicty_cDB: AFJ513 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFJ513 (Link to dictyBase) - G20675 DDB0218542 Contig-U10004-1...brary) Clone ID AFJ513 (Link to dictyBase) Atlas ID - NBRP ID G20675 dictyBase ID DDB0218542 Link to Contig ...Contig-U10004-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/AF/AFJ5-...LLVNLVV VNDSGDIEPDTIIPLVDG--- ---SCFECSLXAFPPQVSYAICTIANTPRVPEHCIQWALLFGLQDATLEKPFDPKQFDND NPDHMNWLFECAKKRAE...r*r*rll*ti*ncycrfrfnrskkmd*wfis*fsc ck**w*y*trynhsig*ww--- ---SCFECSLXAFPPQVSYAICT

  3. Dicty_cDB: SLA340 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLA340 (Link to dictyBase) - - - Contig-U16510-1 - (Link to Or...iginal site) - - SLA340Z 466 - - - - Show SLA340 Library SL (Link to library) Clone ID SLA340 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16510-1 Original site URL http://dictycdb.b.... 2 Translated Amino Acid sequence ---CGKPSPIPTCTRKLSPISICIIKLCPIFICITKLCSIYICTISICIIFICIICSNYS CNYNCNYSCNYN...qpqlqlqlqlqpqpqlqpqlqpq lplkfnqktffikyhynnffnnnsikk*y*kspffl Frame C: ---CGKPSPIPTCTRKLSPISICIIKLCPIFICITKLCSIYICT

  4. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB676 (Link to dictyBase) - - - Contig-U12091-1 VFB676E (Link...) Clone ID VFB676 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12091-1 Ori...NA, complete cds. 2155 0.0 1 U66525 |U66525.1 Dictyostelium discoideum ORFveg114 ...toplasmic 16.0 %: mitochondrial 4.0 %: cytoskeletal 4.0 %: vesicles of secretory system >> prediction for VF

  5. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB713 (Link to dictyBase) - - - Contig-U16602-1 VFB713F (Link... to Original site) VFB713F 471 - - - - - - Show VFB713 Library VF (Link to library) Clone ID VFB713 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16602-1 Original site URL http://dict...ar 12.0 %: cytoplasmic 8.0 %: vacuolar 8.0 %: mitochondrial 8.0 %: endoplasmic reticulum >> predict

  6. Dicty_cDB: AFN878 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFN878 (Link to dictyBase) - - - Contig-U15540-1 AFN878P (Link... to Original site) AFN878F 592 AFN878Z 531 AFN878P 1123 - - Show AFN878 Library AF (Link to library) Clone ID AFN878 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15540-1 Original site URL http://dict...D VNEKALILDGNVYDGHLNPIYNCKICTEYYQTKSYFSANPHAKGKVLLVKNNILTRVKDG GFTLSLKPMCCSGHNSHIPLYFHFTLTNPLTNEVVLQSLINVNVK...YESYRIYIHPSLGYSGNAKRFKQQPD VNEKALILDGNVYDGHLNPIYNCKICTEYYQTKSYFSANPHAKGKVLLVKNNILTRVKDG GFTLSLKPMCCSGHNSHIPL

  7. Dicty_cDB: AFL122 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFL122 (Link to dictyBase) - - - Contig-U11144-1 AFL122P (Link... to Original site) AFL122F 837 AFL122Z 711 AFL122P 1538 - - Show AFL122 Library AF (Link to library) Clone ID AFL122 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11144-1 Original site URL http://dict...GPSVILLDEPTSGLDASTSFYVMSALKKLAKSGRTIICTIHQPRSNIYDM FDNLLLLGDGNTIYYGKANKALEYFNANGYHCSEKTNPADFFLDLINTQVEDQADSD...TLNFYAQLKMPRDVPLKEKLQRVQDIIDEMGLNRCADTLVGTADNKIRGISGGERR RVTISIELLTGPSVILLDEPTSGLDASTSFYVMSALKKLAKSGRTIICT

  8. Dicty_cDB: VHH831 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHH831 (Link to dictyBase) - - - Contig-U15722-1 VHH831P (Link... to Original site) VHH831F 586 VHH831Z 930 VHH831P 1496 - - Show VHH831 Library VH (Link to library) Clone ID VHH831 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15722-1 Original site URL http://dict...HFGNGCEFTQCPLYCSTPNGTCDINSGICTC DNEHIGNGCEIKFIECKHKCSTKHGICDNDSGNCKCDTQTKGLTCEESRLLIESLDSINS KGGTINIIGYFGNTT...STPNGTCDNNTGNCTCHNEHFGNG CEFTRCPLDCSTPNGTCDNNTGNCTCHNEHFGNGCEFTQCPLYCSTPNGTCDINSGICTC DNEHIGNGCEIKFIECKHKCST

  9. Dicty_cDB: CFE262 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFE262 (Link to dictyBase) - - - Contig-U16368-1 CFE262P (Link... to Original site) CFE262F 652 CFE262Z 401 CFE262P 1053 - - Show CFE262 Library CF (Link to library) Clone ID CFE262 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16368-1 Original site URL http://dict...HRPPPRPPVDQCRNQHCPHGYSC RVIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQICGSVN CGPGYICTIINGHPTCIRGDGYL...VNEHGKCRCVPHRPPPRPPVDQCRNQHCPHGYSC RVIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQICGSVN CGPGYICTIING

  10. Mapping of the locus for autosomal dominant amelogenesis imperfecta (AIH2) to a 4-Mb YAC contig on chromosome 4q11-q21

    Energy Technology Data Exchange (ETDEWEB)

    Kaerrman, C.; Holmgren, G.; Forsman, K. [Univ. Hospital, Umea (Sweden)]|[Univ. of Umea (Sweden)] [and others

    1997-01-15

    Amelogenesis imperfecta (Al) is a clinically and genetically heterogeneous group of inherited enamel defects. We recently mapped a locus for autosomal dominant local hypoplastic amelogenesis imperfecta (AIH2) to the long arm of chromosome 4. The disease gene was localized to a 17.6-cM region between the markers D4S392 and D4S395. The albumin gene (ALB), located in the same interval, was a candidate gene for autosomal dominant AI (ADAI) since albumin has a potential role in enamel maturation. Here we describe refined mapping of the AIH2 locus and the construction of marker maps by radiation hybrid mapping and yeast artificial chromosome (YAC)-based sequence tagged site-content mapping. A radiation hybrid map consisting of 11 microsatellite markers in the 5-cM interval between D4S409 and D4S1558 was constructed. Recombinant haplotypes in six Swedish ADAI families suggest that the disease gene is located in the interval between D4S2421 and ALB. ALB is therefore not likely to be the disease-causing gene. Affected members in all six families share the same allele haplotypes, indicating a common ancestral mutation in all families. The AIH2 critical region is less than 4 cM and spans a physical distance of approximately 4 Mb as judged from radiation hybrid maps. A YAC contig over the AIH2 critical region including several potential candidate genes was constructed. 35 refs., 4 figs., 1 tab.

  11. Specific patterns of gene space organisation revealed in wheat by using the combination of barley and wheat genomic resources

    Directory of Open Access Journals (Sweden)

    Waugh Robbie

    2010-12-01

    Full Text Available Abstract Background Because of its size, allohexaploid nature and high repeat content, the wheat genome has always been perceived as too complex for efficient molecular studies. We recently constructed the first physical map of a wheat chromosome (3B. However gene mapping is still laborious in wheat because of high redundancy between the three homoeologous genomes. In contrast, in the closely related diploid species, barley, numerous gene-based markers have been developed. This study aims at combining the unique genomic resources developed in wheat and barley to decipher the organisation of gene space on wheat chromosome 3B. Results Three dimensional pools of the minimal tiling path of wheat chromosome 3B physical map were hybridised to a barley Agilent 15K expression microarray. This led to the fine mapping of 738 barley orthologous genes on wheat chromosome 3B. In addition, comparative analyses revealed that 68% of the genes identified were syntenic between the wheat chromosome 3B and barley chromosome 3 H and 59% between wheat chromosome 3B and rice chromosome 1, together with some wheat-specific rearrangements. Finally, it indicated an increasing gradient of gene density from the centromere to the telomeres positively correlated with the number of genes clustered in islands on wheat chromosome 3B. Conclusion Our study shows that novel structural genomics resources now available in wheat and barley can be combined efficiently to overcome specific problems of genetic anchoring of physical contigs in wheat and to perform high-resolution comparative analyses with rice for deciphering the organisation of the wheat gene space.

  12. Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map

    Directory of Open Access Journals (Sweden)

    Xu Xiangming

    2010-12-01

    Full Text Available Abstract Background Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method. Results The strategy was tested on a draft genome of the fungal pathogen Venturia inaequalis, the causal agent of apple scab, and further validated using sequence contigs derived from the diploid plant genome Fragaria vesca. Using our novel method we were able to anchor 70% and 92% of sequences assemblies for V. inaequalis and F. vesca, respectively, to genetic linkage maps. Conclusions We demonstrated the utility of this approach by accurately determining the bin map positions of the majority of the large sequence contigs from each genome sequence and validated our method by mapping single sequence repeat markers derived from sequence contigs on a full mapping population.

  13. [Reconstruction of long polynucleotide sequences from fragments using the Iskra-226 personal computer

    Science.gov (United States)

    Kostetskiĭ, P V; Dobrova, I E

    1988-04-01

    An algorithm for reconstructing long DNA sequences, i.e. arranging all overlapping gel readings in the contigs, and the corresponding BASIC programme for personal computer "Iskra-226" (USSR) are described. The contig construction begins with the search for all fragments overlapping the basic (longest) one follower by determination of coordinates of 5' ends of the overlapping fragments. Then the gel reading with minimal 5' end coordinate and the gel reading with maximal 3' end coordinate are selected and used as basic ones at the next assembly steps. The procedure is finished when no gel reading overlapping the basic one can be found. All gel readings entered the contig are ignored at the next steps of the assembly. Finally, one or several contigs consisted of DNA fragments are obtained. Effectiveness of the algorithm was tested on a model based on the multiple assembly of the nucleotide sequence, encoding the Na, K-ATPase alpha-subunit of pig kidney. The programme does not call for user's participation and can comprise contigs up to 10,000 nucleotides long.

  14. Yeast artificial chromosome cloning in the glycerol kinase and adrenal hypoplasia congenita region of Xp21

    Energy Technology Data Exchange (ETDEWEB)

    Worley, K.C.; Ellison, K.A.; Zhang, Y.H.; Wang, D.F.; Mason, J.; Roth, E.J.; Adams, V.; Fogt, D.D.; Zhu, X.M.; Towbin, J.A. [Baylor College of Medicine, Houston, TX (United States)] [and others

    1993-05-01

    The adrenal hypoplasia congenita (AHC) and glycerol kinase (GK) loci are telomeric to the Duchenne muscular dystrophy locus in Xp21. The authors developed a pair of yeast artificial chromosome (YAC) contigs spanning at least 1.2 Mb and encompassing the region from the telomeric end of the Duchenne muscular dystrophy (DMD) locus to beyond YHX39 (DXS727), including the genes for AHC and GK. The centromeric contig consists of 13 YACs reaching more than 600 kb from DMD through GK. The telomeric contig group consists of 8 YACs containing more than 600 kb including the markers YHX39 (DXS727) and QST-59 (DXS319). Patient deletion breakpoints in the region of the two YAC contigs define at least eight intervals, and seven deletion breakpoints are contained within these contigs. In addition to the probes developed from YAC ends, they have mapped eight Alu-PCR probes amplified from a radiation-reduced somatic cell hybrid, two anonymous DNA probes, and one Alu-PCR product amplified from a cosmid end, for a total of 26 new markers within this region of 2 Mb or less. One YAC in the centromeric contig contains an insert encompassing the minimum interval for GK deficiency defined by patient deletion breakpoints, and this clone includes all or part of the GK gene. 33 refs., 3 figs., 5 tabs.

  15. Dicty_cDB: CHA851 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHA851 (Link to dictyBase) - - - Contig-U16368-1 - (Link to Or...iginal site) CHA851F 614 - - - - - - Show CHA851 Library CH (Link to library) Clone ID CHA851 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16368-1 Original site URL http://dictycdb.b...TCCXXXXXXXXXX sequence update 2002.10.25 Translated Amino Acid sequence VRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQICGSVNCGPGYICT...nly*skttgttttllnlcraiism*srwn dlysstkqlyqy*ipmlpis--- Frame C: VRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQICGSVNCGPGYICT

  16. Dicty_cDB: SFF103 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFF103 (Link to dictyBase) - - - Contig-U11967-1 SFF103Z (Link... to Original site) - - SFF103Z 655 - - - - Show SFF103 Library SF (Link to library) Clone ID SFF103 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11967-1 Original site URL http://dict...nslated Amino Acid sequence ---rgsf*FLKIIITLLAYKIICTPNHMHLTRGNHETTDMNRFYGFQGEVVAKYSEMVFD LFSELFNWFPLAFVLDESF...*rkrlsnr**wfshhcflc skll*siw*swliykynxdkikittxklxtsexppmhsqk Frame C: ---rgsf*FLKIIITLLAYKIICT

  17. Dicty_cDB: VHA622 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHA622 (Link to dictyBase) - - - Contig-U15767-1 VHA622P (Link... to Original site) VHA622F 458 VHA622Z 778 VHA622P 1216 - - Show VHA622 Library VH (Link to library) Clone ID VHA622 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15767-1 Original site URL http://dict...HVQVTCGGCETCSYATGKCEPDSSLCNDNNICTIDICVHEGILDGLP QGNCSNTPVDCGANDEDKCKTWSCDPTKGGCQSTPVVCEDKGKCLVGTCQPSTGQCEYSD...qisnqkmrenlf*lrelsnqilikkkefqf*ivwmpmil*ik rqelvgqmvlsiflxitklviqnlpnlvk--- ---ACNEDEKXITHVQVTCGGCETCSYATGKCEPDSSLCNDNNICT

  18. Dicty_cDB: CHF177 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHF177 (Link to dictyBase) - - - Contig-U11892-1 - (Link to Or...iginal site) - - CHF177Z 395 - - - - Show CHF177 Library CH (Link to library) Clone ID CHF177 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11892-1 Original site URL http://dictycdb.b...LLLWDVQGFPCXFAVEG GQCIDPSSLKVGGKYSFIAFSTCRXKFDNQKIHDCDWIIQGPTTPSXCANCGKICTSKCT TNYCDRDXQT Translated Amino A...XKFDNQKIHDCDWIIQGPTTPSXCANCGKICTSKCT TNYCDRDXQT Homology vs CSM-cDNA Score E Sequences producing significant

  19. Dicty_cDB: VFJ256 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFJ256 (Link to dictyBase) - - - Contig-U10140-1 VFJ256E (Link...) Clone ID VFJ256 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10140-1 Ori...*nq*fylv*l*vx*KMNKLHLPIKENHHQXIKSIELIKNEFPEILICTDLCLC AYTDHGHCGVLTEEGFIENEKSIIRLG...iiikiih iiyivdmpiqlldhgkvimsf*nln*fiqfllqi*liqklklnpyqdnikfqvi**lnf* dhwlrkd*nq*fylv*l*vx*KMNKLHLPIKENHHQXIKSIELIKNEFPEILICT...pdate 2002.12.18 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N ( BJ432755 ) Dict

  20. Dicty_cDB: AFF341 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFF341 (Link to dictyBase) - - - Contig-U15579-1 AFF341P (Link... to Original site) AFF341F 158 AFF341Z 283 AFF341P 441 - - Show AFF341 Library AF (Link to library) Clone ID AFF341 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15579-1 Original site URL http://dict...GCYYDKFDNCDACNAVDXCITNDLCFPRECNPRGNPPCLINPINCTSTDP CIFSYCENGVCIPTYICTPTPSVTPTVTPXVTXTVT Translated Amino Aci...*fliikkk--- ---DHCDPAIGCYYDKFDNCDACNAVDXCITNDLCFPRECNPRGNPPCLINPINCTSTDP CIFSYCENGVCIPTYICT

  1. Dicty_cDB: CFF487 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFF487 (Link to dictyBase) - - - Contig-U15579-1 CFF487P (Link... to Original site) CFF487F 143 CFF487Z 518 CFF487P 661 - - Show CFF487 Library CF (Link to library) Clone ID CFF487 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15579-1 Original site URL http://dict...CFPRECNPRGNPPCLINPINCTSTDPCIFSYCE NGVCIPTYICTPTPSVTPTVTPTVTPTVTPTVTPTVTPTVTPTPTTTPTPSPTTV Translated Amino A...ESCEIGFGCLAIPKNCNDNDPCTTDHCD PAIGCYYDKFDNCDACNAVDTCITNDLCFPRECNPRGNPPCLINPINCTSTDPCIFSYCE NGVCIPTYICTPTPSVTP

  2. Dicty_cDB: CHI134 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHI134 (Link to dictyBase) - - - Contig-U11723-1 CHI134P (Link... to Original site) CHI134F 581 CHI134Z 736 CHI134P 1297 - - Show CHI134 Library CH (Link to library) Clone ID CHI134 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11723-1 Original site URL http://dict...gl*kta*g*IIMESNKSSSHGDVSTSPSFLNNHHQFNNGGDIIPKKKKNRIMHVG SYEVGKTLGNGTFGKVKLGTNICTK...PVIEAPKTRRMSLDSR MLNGDQQSLVEKNQHMASPRTSKGIFKXSTTTTKSPEKTIIELKRSLEESGLFTKKKGPY LXLCFDEDNSVKFQIEIVKICNLDLTGIQLKRLSGDTWKYKDICT

  3. Dicty_cDB: VFM148 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFM148 (Link to dictyBase) - - - Contig-U08795-1 VFM148P (Link... to Original site) VFM148F 479 VFM148Z 710 VFM148P 1169 - - Show VFM148 Library VF (Link to library) Clone ID VFM148 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U08795-1 Original site URL http://dict...YYFYYHKMALVLPIFATENTLLVTENDLGRGFS VDDNGANAKKSNVYICTKEDMTTVRSITDSNKLTLINDQESLTSALNVSGELSLSYGLMS GKIMGEYLDTSSS...(All Frames) Frame A: iylflnqk*fv*ITNNYFFYFLKIFCYYYFYYHKMALVLPIFATENTLLVTENDLGRGFS VDDNGANAKKSNVYICTKEDMTTVR

  4. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB590 (Link to dictyBase) - - - Contig-U09552-1 VFB590P (Link... to Original site) VFB590F 225 VFB590Z 118 VFB590P 343 - - Show VFB590 Library VF (Link to library) Clone ID VFB590 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U09552-1 Original site URL http://dict...ed Amino Acid sequence IAVVEGFMAPSELCQKIKFCSSSSSTNDFDFIGSSTTDCEICTFISGYAENFLEEXKT...--- ---riyqinkvvmxhxlhn*lxvaxivxlgxvnvkihvexix Frame C: IAVVEGFMAPSELCQKIKFCSSSSSTNDFDFIGSSTTDCEICT

  5. Dicty_cDB: CHC113 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHC113 (Link to dictyBase) - - - Contig-U15579-1 CHC113P (Link... to Original site) CHC113F 198 CHC113Z 396 CHC113P 574 - - Show CHC113 Library CH (Link to library) Clone ID CHC113 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15579-1 Original site URL http://dict...nif*KLENIIKKRNKLIFNYK KK--- ---GFGCLAIPKNCNDNDPCTTDHCDPAIGCYYDKFDNCDACNAVDTCITNDLCFPRECN PRGNPPCLINPINCTSTDPCIFSYCENGVCIPTYICT...KK--- ---GFGCLAIPKNCNDNDPCTTDHCDPAIGCYYDKFDNCDACNAVDTCITNDLCFPRECN PRGNPPCLINPINCTSTDPCIFSYCENGVCIPTYICTPTPS

  6. Dicty_cDB: VHP253 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHP253 (Link to dictyBase) - - - Contig-U16349-1 - (Link to Or...iginal site) - - VHP253Z 355 - - - - Show VHP253 Library VH (Link to library) Clone ID VHP253 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16349-1 Original site URL http://dictycdb.b...CTTGGGTACCAAGAACTGACCGTCAATTTGCTGGTTCATGGTTTGC sequence update 2002. 9.10 Translated Amino Acid sequence ---MQLFAGIKSICT...VPIMRMYFHTGILDYILFKSWVPRTDRQFAGSWF Translated Amino Acid sequence (All Frames) Frame A: ---MQLFAGIKSICTEMAMD

  7. Dicty_cDB: VHP243 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHP243 (Link to dictyBase) - - - Contig-U16236-1 - (Link to Or...iginal site) VHP243F 134 - - - - - - Show VHP243 Library VH (Link to library) Clone ID VHP243 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16236-1 Original site URL http://dictycdb.b...AXXXXXXXXXX sequence update 2002.10.25 Translated Amino Acid sequence CWPTGIXKTTICT...kilsif*ynfkyyqqpkkk--- Frame B: llaywyxqnnnlyqyyyyfyl*kyflsfniilniinnpkk--- Frame C: CWPTGIXKTTICTNTTIISICKN

  8. Dicty_cDB: AFA460 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFA460 (Link to dictyBase) - - - Contig-U15574-1 AFA460Z (Link... to Original site) - - AFA460Z 170 - - - - Show AFA460 Library AF (Link to library) Clone ID AFA460 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15574-1 Original site URL http://dict...date 2001. 6. 2 Translated Amino Acid sequence ---QIHQTIQVVKITLSSASSSSSSSSSSILNKTRICTYINSNSTHSLXXNIYKYKLPK T...it* Frame B: ---QIHQTIQVVKITLSSASSSSSSSSSSILNKTRICTYINSNSTHSLXXNIYKYKLPK Frame C:

  9. Dicty_cDB: VHM522 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHM522 (Link to dictyBase) - - - Contig-U15767-1 VHM522P (Link... to Original site) VHM522F 575 VHM522Z 747 VHM522P 1302 - - Show VHM522 Library VH (Link to library) Clone ID VHM522 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15767-1 Original site URL http://dict...PV--- ---ITHVQVTCGGCETCSYATGKCEPDSSLCNDNNICTIDICVHEGILDGLPQGNCSNTP VDCGANDEDKCKTWSCDPTKGGCQSTPVVCEDKGKCLVGTC...rtrf q*cklss--- ---ITHVQVTCGGCETCSYATGKCEPDSSLCNDNNICTIDICVHEGILDGLPQGNCSNTP VDCGANDEDKCKTWSCDPTKGGCQSTPVVCE

  10. Dicty_cDB: VHE138 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHE138 (Link to dictyBase) - - - Contig-U15767-1 VHE138P (Link... to Original site) VHE138F 569 VHE138Z 621 VHE138P 1170 - - Show VHE138 Library VH (Link to library) Clone ID VHE138 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15767-1 Original site URL http://dict...FVDNQAGDSXSAKSGKNLPIQRDIELNWNGEAYEYSNSNYFPINGQGF NDVSYP--- ---QVTCGGCETCSYATGKCEPDSSLCNDNNICT...rsi*i**fkllpn*rtrf q*ckls--- ---QVTCGGCETCSYATGKCEPDSSLCNDNNICTIDICVHEGILDGLPQGNCSNTPVDCG ANDEDKCKTWSCDPTKGG

  11. Dicty_cDB: VHD652 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHD652 (Link to dictyBase) - - - Contig-U15722-1 VHD652P (Link... to Original site) VHD652F 573 VHD652Z 707 VHD652P 1260 - - Show VHD652 Library VH (Link to library) Clone ID VHD652 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15722-1 Original site URL http://dict...NYTIPCGSNSNNVLCGSYHSTVPRKYLSLGFVNGSTHSYSKDGLMITYISQDTTNDNDC KRYTTNV--- ---LVIIILVIVHAIMNILVMVVNSLNAHSIVQPPNGTCDINSGICT...i*vlflql viilflvvvivimycvdhiiqlfhasifhwdllmavhiviaktd***hiyhriqqmimia kdiqqm*--- ---LVIIILVIVHAIMNILVMVVNSLNAHSIVQPPNGTCDINSGICT

  12. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB603 (Link to dictyBase) - - - Contig-U16505-1 VFB603Z (Link... to Original site) - - VFB603Z 496 - - - - Show VFB603 Library VF (Link to library) Clone ID VFB603 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16505-1 Original site URL http://dict...al 4.0 %: cytoskeletal 4.0 %: vesicles of secretory system 4.0 %: endoplasmic reticulum >> prediction for VF

  13. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC221 (Link to dictyBase) - - - Contig-U09438-1 VFC221Z (Link... to Original site) - - VFC221Z 496 - - - - Show VFC221 Library VF (Link to library) Clone ID VFC221 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U09438-1 Original site URL http://dict... mitochondrial 8.0 %: peroxisomal 4.0 %: cytoskeletal 4.0 %: Golgi >> prediction for VFC221 is cyt 5' end se

  14. Dicty_cDB: VHL663 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHL663 (Link to dictyBase) - - - Contig-U15767-1 VHL663P (Link... to Original site) VHL663F 574 VHL663Z 702 VHL663P 1256 - - Show VHL663 Library VH (Link to library) Clone ID VHL663 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15767-1 Original site URL http://dict...WPDGFKYFFVDNQAGDSESAKSGKNLPIQRDIELNWNGEAYEYSNSNYFPINGQG FNDVSYPV--- ---SYATGKCEPDSSLCNDNNICTIDICVHEGILDGLPQG...ik rqelvgqmvlsifl*itklviqnlpnlvkifqfkeiss*igmekhmniviqitsqltdkv smm*aiq--- ---SYATGKCEPDSSLCNDNNICTIDICVHEGI

  15. SIS: a program to generate draft genome sequence scaffolds for prokaryotes

    Directory of Open Access Journals (Sweden)

    Dias Zanoni

    2012-05-01

    Full Text Available Abstract Background Decreasing costs of DNA sequencing have made prokaryotic draft genome sequences increasingly common. A contig scaffold is an ordering of contigs in the correct orientation. A scaffold can help genome comparisons and guide gap closure efforts. One popular technique for obtaining contig scaffolds is to map contigs onto a reference genome. However, rearrangements that may exist between the query and reference genomes may result in incorrect scaffolds, if these rearrangements are not taken into account. Large-scale inversions are common rearrangement events in prokaryotic genomes. Even in draft genomes it is possible to detect the presence of inversions given sufficient sequencing coverage and a sufficiently close reference genome. Results We present a linear-time algorithm that can generate a set of contig scaffolds for a draft genome sequence represented in contigs given a reference genome. The algorithm is aimed at prokaryotic genomes and relies on the presence of matching sequence patterns between the query and reference genomes that can be interpreted as the result of large-scale inversions; we call these patterns inversion signatures. Our algorithm is capable of correctly generating a scaffold if at least one member of every inversion signature pair is present in contigs and no inversion signatures have been overwritten in evolution. The algorithm is also capable of generating scaffolds in the presence of any kind of inversion, even though in this general case there is no guarantee that all scaffolds in the scaffold set will be correct. We compare the performance of sis, the program that implements the algorithm, to seven other scaffold-generating programs. The results of our tests show that sis has overall better performance. Conclusions sis is a new easy-to-use tool to generate contig scaffolds, available both as stand-alone and as a web server. The good performance of sis in our tests adds evidence that large

  16. Diversity and strain specificity of plant cell wall degrading enzymes revealed by the draft genome of Ruminococcus flavefaciens FD-1.

    Directory of Open Access Journals (Sweden)

    Margret E Berg Miller

    Full Text Available BACKGROUND: Ruminococcus flavefaciens is a predominant cellulolytic rumen bacterium, which forms a multi-enzyme cellulosome complex that could play an integral role in the ability of this bacterium to degrade plant cell wall polysaccharides. Identifying the major enzyme types involved in plant cell wall degradation is essential for gaining a better understanding of the cellulolytic capabilities of this organism as well as highlighting potential enzymes for application in improvement of livestock nutrition and for conversion of cellulosic biomass to liquid fuels. METHODOLOGY/PRINCIPAL FINDINGS: The R. flavefaciens FD-1 genome was sequenced to 29x-coverage, based on pulsed-field gel electrophoresis estimates (4.4 Mb, and assembled into 119 contigs providing 4,576,399 bp of unique sequence. As much as 87.1% of the genome encodes ORFs, tRNA, rRNAs, or repeats. The GC content was calculated at 45%. A total of 4,339 ORFs was detected with an average gene length of 918 bp. The cellulosome model for R. flavefaciens was further refined by sequence analysis, with at least 225 dockerin-containing ORFs, including previously characterized cohesin-containing scaffoldin molecules. These dockerin-containing ORFs encode a variety of catalytic modules including glycoside hydrolases (GHs, polysaccharide lyases, and carbohydrate esterases. Additionally, 56 ORFs encode proteins that contain carbohydrate-binding modules (CBMs. Functional microarray analysis of the genome revealed that 56 of the cellulosome-associated ORFs were up-regulated, 14 were down-regulated, 135 were unaffected, when R. flavefaciens FD-1 was grown on cellulose versus cellobiose. Three multi-modular xylanases (ORF01222, ORF03896, and ORF01315 exhibited the highest levels of up-regulation. CONCLUSIONS/SIGNIFICANCE: The genomic evidence indicates that R. flavefaciens FD-1 has the largest known number of fiber-degrading enzymes likely to be arranged in a cellulosome architecture. Functional

  17. Dicty_cDB: SHI251 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHI251 (Link to dictyBase) - - - Contig-U11819-1 - (Link to Or...iginal site) SHI251F 125 - - - - - - Show SHI251 Library SH (Link to library) Clone ID SHI251 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11819-1 Original site URL http://dictycdb.b...XX sequence update 2002.10.25 Translated Amino Acid sequence ilfqilkistnk**IKNYYVNRVYEIIIIINICT...YKKK--- Translated Amino Acid sequence (All Frames) Frame A: ilfqilkistnk**IKNYYVNRVYEIIIIINICT

  18. Dicty_cDB: SLJ344 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLJ344 (Link to dictyBase) - - - Contig-U16255-1 SLJ344P (Link... to Original site) SLJ344F 253 SLJ344Z 273 SLJ344P 526 - - Show SLJ344 Library SL (Link to library) Clone ID SLJ344 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16255-1 Original site URL http://dict...Amino Acid sequence GTSGGTPGSCDKVNCPNGYICTIVNQLAVCVSPSSSSSSSSSTTGSHTTTGGSTTGSHTT TGGSTTGSHTTTGGSTTGSHTTTG---...li tilffniqrlykkkkkkkkkknkp*tklkin*kk Frame B: GTSGGTPGSCDKVNCPNGYICTIVNQLAVCVSPSSSSSSSSSTTGSHTTTGGSTTGSHTT

  19. Dicty_cDB: SFC534 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFC534 (Link to dictyBase) - - - Contig-U15579-1 SFC534P (Link... to Original site) SFC534F 158 SFC534Z 334 SFC534P 492 - - Show SFC534 Library SF (Link to library) Clone ID SFC534 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15579-1 Original site URL http://dict...fiyl**hymni*klenif*KLENIIKKRNKLIFNYKKK--- ---DPXIGCYYDKFDNCDACNAVDTCITNDLCFPRECNPRGXPPCLINPINCTSTDPCIF SYCENGVCIPTYICT...lfiyl**hymni*klenif*KLENIIKKRNKLIFNYKKK--- ---DPXIGCYYDKFDNCDACNAVDTCITNDLCFPRECNPRGXPPCLINPINCTSTDPCIF SYCENGVCIPTYICT

  20. Dicty_cDB: VHH756 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHH756 (Link to dictyBase) - - - Contig-U15767-1 VHH756P (Link... to Original site) VHH756F 658 VHH756Z 686 VHH756P 1324 - - Show VHH756 Library VH (Link to library) Clone ID VHH756 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15767-1 Original site URL http://dict...YEYSNSNYFPINGQGF NDVSYPVPRGYVPISGESWTSMSTSTSLKGNNYNFC--- ---DSSLCNDNNICTIDICVHEGILDGLPQGNCSNTPVDCGANDEDKCKTW...*psw*fricqiw*kssnskryrvklewrsi*i**fkllpn*rtrfq *cklssskrlctnfwrildinvnfnfikg**l*fl--- ---DSSLCNDNNICTIDICVHE

  1. Dicty_cDB: SHD573 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHD573 (Link to dictyBase) - - - Contig-U11503-1 SHD573E (Link...Clone ID SHD573 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11503-1 Original site URL http://dict...LFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGCGFNN EPICTSLKDAVSRAFLLISNNSRVCIGIIGNINVTSEQITLGNYCGALWITSENINNENN NYTI...ststtts ax***d*eyyhcysyfgldl Frame C: fsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGCGFNN EPICTSLKD...vegicus clone CH230-428C17, WORKING DRAFT SEQUENCE, 3 unordered pieces. 48 3e-12 3 AC116984 |AC116984.2 Dict

  2. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB760 (Link to dictyBase) - - - Contig-U12286-1 VFB760P (Link... to Original site) VFB760F 474 VFB760Z 691 VFB760P 1165 - - Show VFB760 Library VF (Link to library) Clone ID VFB760 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12286-1 Original site URL http://dict...CCACTTAATTCCAGAAGG sequence update 2001. 6. 1 Translated Amino Acid sequence LLAYWNKCQVNSCDKTTGNCKPENLKCPDRSNECLKNTGCDDLTGCKYVSICT...cmfllqsfxnplnsrr Frame C: LLAYWNKCQVNSCDKTTGNCKPENLKCPDRSNECLKNTGCDDLTGCKYVSICTDS

  3. Dicty_cDB: VSK196 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VS (Link to library) VSK196 (Link to dictyBase) - - - Contig-U10274-1 VSK196P (Link... to Original site) VSK196F 423 VSK196Z 453 VSK196P 876 - - Show VSK196 Library VS (Link to library) Clone ID VSK196 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10274-1 Original site URL http://dict...TTATNAACAATTAAAAAAAA sequence update 2001. 3.22 Translated Amino Acid sequence kdgklvslkdfikdqkpivlyfypkdetsict...*NKIX--- ---KLKVGDQAPDFTCPDKDGKLVSLKDFIKDQKPIVLYFYPKDETSICTKEACEFRDKY QKFIEAGADVI

  4. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB670 (Link to dictyBase) - - - Contig-U16447-1 VFB670Z (Link... to Original site) - - VFB670Z 541 - - - - Show VFB670 Library VF (Link to library) Clone ID VFB670 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16447-1 Original site URL http://dict... 0.00 m_ : 1.00 44.0 %: nuclear 28.0 %: cytoplasmic 16.0 %: cytoskeletal 8.0 %: peroxisomal 4.0 %: mitochondrial >> predict

  5. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB779 (Link to dictyBase) - - - Contig-U15331-1 VFB779P (Link... to Original site) VFB779F 328 VFB779Z 402 VFB779P 730 - - Show VFB779 Library VF (Link to library) Clone ID VFB779 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15331-1 Original site URL http://dict...cuolar 4.0 %: peroxisomal 4.0 %: endoplasmic reticulum >> prediction for VFB779 i

  6. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB893 (Link to dictyBase) - - - Contig-U15009-1 VFB893P (Link... to Original site) VFB893F 337 VFB893Z 727 VFB893P 1064 - - Show VFB893 Library VF (Link to library) Clone ID VFB893 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15009-1 Original site URL http://dict... 4.0 %: cytoskeletal 4.0 %: vesicles of secretory system >> prediction for VFB893

  7. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB615 (Link to dictyBase) - - - Contig-U16602-1 VFB615F (Link... to Original site) VFB615F 636 - - - - - - Show VFB615 Library VF (Link to library) Clone ID VFB615 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16602-1 Original site URL http://dict...0 %: vesicles of secretory system 4.0 %: peroxisomal >> prediction for VFB615 is cyt 5' end seq. ID VFB615F

  8. Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes.

    Science.gov (United States)

    Kumar, Vikas; Kutschera, Verena E; Nilsson, Maria A; Janke, Axel

    2015-08-07

    The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus) which is adapted to the arctic region. A close relative to the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated and analyzed for signatures of positive selection. In addition, the data allowed for a phylogenetic analysis and divergence time estimate between the two fox species. The de novo assembly of reads resulted in more than 160,000 contigs/transcripts per individual. Approximately 17,000 homologous genes were identified using human and the non-redundant databases. Positive selection analyses revealed several genes involved in various metabolic and molecular processes such as energy metabolism, cardiac gene regulation, apoptosis and blood coagulation to be under positive selection in foxes. Branch site tests identified four genes to be under positive selection in the Arctic fox transcriptome, two of which are fat metabolism genes. In the red fox transcriptome eight genes are under positive selection, including molecular process genes, notably genes involved in ATP metabolism. Analysis of the three transcriptomes and five Sanger re-sequenced genes in additional individuals identified a lower genetic variability within Arctic foxes compared to red foxes, which is consistent with distribution range differences and demographic responses to past climatic fluctuations. A phylogenomic analysis estimated that the Arctic and red fox lineages diverged about three million years ago. Transcriptome data are an economic way to generate genomic resources for evolutionary studies. Despite not representing an entire genome, this transcriptome analysis identified numerous genes that are relevant to arctic

  9. Optimization of de novo transcriptome assembly from high-throughput short read sequencing data improves functional annotation for non-model organisms

    Directory of Open Access Journals (Sweden)

    Haznedaroglu Berat Z

    2012-07-01

    Full Text Available Abstract Background The k-mer hash length is a key factor affecting the output of de novo transcriptome assembly packages using de Bruijn graph algorithms. Assemblies constructed with varying single k-mer choices might result in the loss of unique contiguous sequences (contigs and relevant biological information. A common solution to this problem is the clustering of single k-mer assemblies. Even though annotation is one of the primary goals of a transcriptome assembly, the success of assembly strategies does not consider the impact of k-mer selection on the annotation output. This study provides an in-depth k-mer selection analysis that is focused on the degree of functional annotation achieved for a non-model organism where no reference genome information is available. Individual k-mers and clustered assemblies (CA were considered using three representative software packages. Pair-wise comparison analyses (between individual k-mers and CAs were produced to reveal missing Kyoto Encyclopedia of Genes and Genomes (KEGG ortholog identifiers (KOIs, and to determine a strategy that maximizes the recovery of biological information in a de novo transcriptome assembly. Results Analyses of single k-mer assemblies resulted in the generation of various quantities of contigs and functional annotations within the selection window of k-mers (k-19 to k-63. For each k-mer in this window, generated assemblies contained certain unique contigs and KOIs that were not present in the other k-mer assemblies. Producing a non-redundant CA of k-mers 19 to 63 resulted in a more complete functional annotation than any single k-mer assembly. However, a fraction of unique annotations remained (~0.19 to 0.27% of total KOIs in the assemblies of individual k-mers (k-19 to k-63 that were not present in the non-redundant CA. A workflow to recover these unique annotations is presented. Conclusions This study demonstrated that different k-mer choices result in various quantities

  10. Graph mining for next generation sequencing: leveraging the assembly graph for biological insights.

    Science.gov (United States)

    Warnke-Sommer, Julia; Ali, Hesham

    2016-05-06

    The assembly of Next Generation Sequencing (NGS) reads remains a challenging task. This is especially true for the assembly of metagenomics data that originate from environmental samples potentially containing hundreds to thousands of unique species. The principle objective of current assembly tools is to assemble NGS reads into contiguous stretches of sequence called contigs while maximizing for both accuracy and contig length. The end goal of this process is to produce longer contigs with the major focus being on assembly only. Sequence read assembly is an aggregative process, during which read overlap relationship information is lost as reads are merged into longer sequences or contigs. The assembly graph is information rich and capable of capturing the genomic architecture of an input read data set. We have developed a novel hybrid graph in which nodes represent sequence regions at different levels of granularity. This model, utilized in the assembly and analysis pipeline Focus, presents a concise yet feature rich view of a given input data set, allowing for the extraction of biologically relevant graph structures for graph mining purposes. Focus was used to create hybrid graphs to model metagenomics data sets obtained from the gut microbiomes of five individuals with Crohn's disease and eight healthy individuals. Repetitive and mobile genetic elements are found to be associated with hybrid graph structure. Using graph mining techniques, a comparative study of the Crohn's disease and healthy data sets was conducted with focus on antibiotics resistance genes associated with transposase genes. Results demonstrated significant differences in the phylogenetic distribution of categories of antibiotics resistance genes in the healthy and diseased patients. Focus was also evaluated as a pure assembly tool and produced excellent results when compared against the Meta-velvet, Omega, and UD-IDBA assemblers. Mining the hybrid graph can reveal biological phenomena captured

  11. PAVE: Program for assembling and viewing ESTs

    Directory of Open Access Journals (Sweden)

    Bomhoff Matthew

    2009-08-01

    Full Text Available Abstract Background New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. Results The PAVE (Program for Assembling and Viewing ESTs assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. Conclusion The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.

  12. PAVE: program for assembling and viewing ESTs.

    Science.gov (United States)

    Soderlund, Carol; Johnson, Eric; Bomhoff, Matthew; Descour, Anne

    2009-08-26

    New sequencing technologies are rapidly emerging. Many laboratories are simultaneously working with the traditional Sanger ESTs and experimenting with ESTs generated by the 454 Life Science sequencers. Though Sanger ESTs have been used to generate contigs for many years, no program takes full advantage of the 5' and 3' mate-pair information, hence, many tentative transcripts are assembled into two separate contigs. The new 454 technology has the benefit of high-throughput expression profiling, but introduces time and space problems for assembling large contigs. The PAVE (Program for Assembling and Viewing ESTs) assembler takes advantage of the 5' and 3' mate-pair information by requiring that the mate-pairs be assembled into the same contig and joined by n's if the two sub-contigs do not overlap. It handles the depth of 454 data sets by "burying" similar ESTs during assembly, which retains the expression level information while circumventing time and space problems. PAVE uses MegaBLAST for the clustering step and CAP3 for assembly, however it assembles incrementally to enforce the mate-pair constraint, bury ESTs, and reduce incorrect joins and splits. The PAVE data management system uses a MySQL database to store multiple libraries of ESTs along with their metadata; the management system allows multiple assemblies with variations on libraries and parameters. Analysis routines provide standard annotation for the contigs including a measure of differentially expressed genes across the libraries. A Java viewer program is provided for display and analysis of the results. Our results clearly show the benefit of using the PAVE assembler to explicitly use mate-pair information and bury ESTs for large contigs. The PAVE assembler provides a software package for assembling Sanger and/or 454 ESTs. The assembly software, data management software, Java viewer and user's guide are freely available.

  13. 454 sequencing of pooled BAC clones on chromosome 3H of barley

    Directory of Open Access Journals (Sweden)

    Yamaji Nami

    2011-05-01

    Full Text Available Abstract Background Genome sequencing of barley has been delayed due to its large genome size (ca. 5,000Mbp. Among the fast sequencing systems, 454 liquid phase pyrosequencing provides the longest reads and is the most promising method for BAC clones. Here we report the results of pooled sequencing of BAC clones selected with ESTs genetically mapped to chromosome 3H. Results We sequenced pooled barley BAC clones using a 454 parallel genome sequencer. A PCR screening system based on primer sets derived from genetically mapped ESTs on chromosome 3H was used for clone selection in a BAC library developed from cultivar "Haruna Nijo". The DNA samples of 10 or 20 BAC clones were pooled and used for shotgun library development. The homology between contig sequences generated in each pooled library and mapped EST sequences was studied. The number of contigs assigned on chromosome 3H was 372. Their lengths ranged from 1,230 bp to 58,322 bp with an average 14,891 bp. Of these contigs, 240 showed homology and colinearity with the genome sequence of rice chromosome 1. A contig annotation browser supplemented with query search by unique sequence or genetic map position was developed. The identified contigs can be annotated with barley cDNAs and reference sequences on the browser. Homology analysis of these contigs with rice genes indicated that 1,239 rice genes can be assigned to barley contigs by the simple comparison of sequence lengths in both species. Of these genes, 492 are assigned to rice chromosome 1. Conclusions We demonstrate the efficiency of sequencing gene rich regions from barley chromosome 3H, with special reference to syntenic relationships with rice chromosome 1.

  14. Refined mapping and YAC contig construction of the X-linked cleft palate and ankyloglossia locus (CPX) including the proximal X-Y homology breakpoint within Xq21.3

    Energy Technology Data Exchange (ETDEWEB)

    Forbes, S.A.; Brennan, L.; Richardson, M. [Queen Charlotte`s Hospital, London (United Kingdom)] [and others

    1996-01-01

    The gene for X-linked cleft palate (CPX) has previously been mapped in an Icelandic kindred between the unordered proximal markers DXS1002/DXS349/DXS95 and the distal marker DXYS1X, which maps to the proximal end of the X-Y homology region in Xq21.3. Using six sequence-tagged sites (STSs) within the region, a total of 91 yeast artificial chromosome (YAC) clones were isolated and overlapped in a single contig that spans approximately 3.1 Mb between DXS1002 and DXYS1X. The order of microsatellite and STS markers in this was established as DXS1002-DXS1168-DXS349-DXS95-DXS364-DXS1196-DXS472-DXS1217-DXYS1X. A long-range restriction map of this region was created using eight nonchimeric, overlapping YAC clones. Analysis of newly positioned polymorphic markers in recombinant individuals from the Icelandic family has enabled us to identify DXS1196 and DXS1217 as the flanking markers for CPX. The maximum physical distance containing the CPX gene has been estimated to be 2.0 Mb, which is spanned by a minimum set of five nonchimeric YAC clones. In addition, YAC end clone and STS analyses have pinpointed the location of the proximal boundary of the X-Y homology region within the map. 40 refs., 2 figs., 2 tabs.

  15. Comparative sequence analysis of the potato cyst nematode resistance locus H1 reveals a major lack of co-linearity between three haplotypes in potato (Solanum tuberosum ssp.).

    Science.gov (United States)

    Finkers-Tomczak, Anna; Bakker, Erin; de Boer, Jan; van der Vossen, Edwin; Achenbach, Ute; Golas, Tomasz; Suryaningrat, Suwardi; Smant, Geert; Bakker, Jaap; Goverse, Aska

    2011-02-01

    The H1 locus confers resistance to the potato cyst nematode Globodera rostochiensis pathotypes 1 and 4. It is positioned at the distal end of chromosome V of the diploid Solanum tuberosum genotype SH83-92-488 (SH) on an introgression segment derived from S. tuberosum ssp. andigena. Markers from a high-resolution genetic map of the H1 locus (Bakker et al. in Theor Appl Genet 109:146-152, 2004) were used to screen a BAC library to construct a physical map covering a 341-kb region of the resistant haplotype coming from SH. For comparison, physical maps were also generated of the two haplotypes from the diploid susceptible genotype RH89-039-16 (S. tuberosum ssp. tuberosum/S. phureja), spanning syntenic regions of 700 and 319 kb. Gene predictions on the genomic segments resulted in the identification of a large cluster consisting of variable numbers of the CC-NB-LRR type of R genes for each haplotype. Furthermore, the regions were interspersed with numerous transposable elements and genes coding for an extensin-like protein and an amino acid transporter. Comparative analysis revealed a major lack of gene order conservation in the sequences of the three closely related haplotypes. Our data provide insight in the evolutionary mechanisms shaping the H1 locus and will facilitate the map-based cloning of the H1 resistance gene.

  16. Dicty_cDB: SHA393 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHA393 (Link to dictyBase) - - - Contig-U11503-1 SHA393E (Link... Clone ID SHA393 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11503-1 Original site URL http://dict...lated Amino Acid sequence ekqfsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGCG FNNXPICTSLKDAVXRAFLLI...yhcysyfg Translated Amino Acid sequence (All Frames) Frame A: ekqfsl*iy*YMIRKSNNFSILFAIFLKIVFVVSAPLCPNSTILLNYNILTVYNSSEGCG FNNXPICT...Homology vs Protein Score E Sequences producing significant alignments: (bits) Value AF020283_1( AF020283 |pid:none) Dict

  17. Dicty_cDB: SFE109 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFE109 (Link to dictyBase) - - - Contig-U10771-1 SFE109P (Link... to Original site) SFE109F 197 SFE109Z 630 SFE109P 827 - - Show SFE109 Library SF (Link to library) Clone ID SFE109 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10771-1 Original site URL http://dict...anslated Amino Acid sequence elnykfhftlnttqiei*inknfllflf*fffyfqi*iqlcfilkllllptiink*INKI FLKKK--- ---VTTSQCESLIQAGVDGLRVGMGVGSICT...fkfnsvsy*nyyyyqpslink*ik ff*kk--- ---VTTSQCESLIQAGVDGLRVGMGVGSICTTQEVMACGRPQATAVFKCALYSSQYNVPI IADGGIRTIGHII

  18. Dicty_cDB: SFL482 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFL482 (Link to dictyBase) - - - Contig-U15494-1 SFL482P (Link... to Original site) SFL482F 434 SFL482Z 394 SFL482P 828 - - Show SFL482 Library SF (Link to library) Clone ID SFL482 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15494-1 Original site URL http://dict...cftclww*qmcqmcp*qnrscfln*rtknr*nclqeatkrfkt ker*EIIQINVVININHFLLIKKK--- ---ICTHIEKMVQRLTYRRRLSYRTTSNATKIVKTP...KQQK DLKQKKDKKSSK*m Translated Amino Acid sequence (All Frames) Frame A: icthiekmvqrltyrrrlsyrttsnatkivktpgg

  19. Dicty_cDB: SSL592 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSL592 (Link to dictyBase) - - - Contig-U14332-1 SSL592E (Link... to Original site) - - - - - - SSL592E 244 Show SSL592 Library SS (Link to library) Clone ID SSL592 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14332-1 Original site URL http://dict...ence niyi*IYMFLTLIHLWTSKNTVIIFICTLNGI*ik*nnvkniyi*iyn*kkkkkklkn*h lvdlnktv*lyk*kkliy*k Translated Amino Acid...kqncitl*ikkinllkk Frame C: niyi*IYMFLTLIHLWTSKNTVIIFICTLNGI*ik*nnvkniyi*iyn*kkkkkklkn*h lvdlnktv*lyk*kkliy*k

  20. Dicty_cDB: CHN208 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHN208 (Link to dictyBase) - - - Contig-U11696-1 - (Link to Or...iginal site) CHN208F 157 - - - - - - Show CHN208 Library CH (Link to library) Clone ID CHN208 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11696-1 Original site URL http://dictycdb.b...ces producing significant alignments: (bits) Value N AC149612 |AC149612.1 Ictalurus punctatus clone CH212-98...nuclear 20.0 %: mitochondrial 8.0 %: vacuolar 4.0 %: peroxisomal >> prediction fo

  1. Dicty_cDB: VSK446 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VS (Link to library) VSK446 (Link to dictyBase) - - - Contig-U15105-1 VSK446E (Link... Clone ID VSK446 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15105-1 Original site URL http://dict... r... 269 4e-71 AF402813_1( AF402813 |pid:none) Ictalurus punctatus 40S ribosomal ... 269 4e-71 FJ196315_1( ....00 84.0 %: cytoplasmic 12.0 %: mitochondrial 4.0 %: nuclear >> prediction for VSK446 is cyt 5' end seq. ID

  2. Dicty_cDB: SLH678 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLH678 (Link to dictyBase) - - - Contig-U04159-1 SLH678E (Link... to Original site) - - - - - - SLH678E 373 Show SLH678 Library SL (Link to library) Clone ID SLH678 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U04159-1 Original site URL http://dict... producing significant alignments: (bits) Value N ( AU039970 ) Dictyostelium discoideum slug cDNA, clone SLG...865. 323 e-129 2 ( AU039381 ) Dictyostelium discoideum slug cDNA, clone SLH678. 3

  3. Dicty_cDB: VHA386 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHA386 (Link to dictyBase) - - - Contig-U11201-1 - (Link to Or...iginal site) - - VHA386Z 730 - - - - Show VHA386 Library VH (Link to library) Clone ID VHA386 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11201-1 Original site URL http://dictycdb.b...Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N L08646 |L08646.1 Dict...19 1 CX835130 |AF255664_1 major vault protein [Ictalurus punctatus], mRNA sequenc

  4. Dicty_cDB: SLD420 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLD420 (Link to dictyBase) - - - Contig-U16325-1 SLD420E (Link... to Original site) - - - - - - SLD420E 434 Show SLD420 Library SL (Link to library) Clone ID SLD420 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16325-1 Original site URL http://dict... 7 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N ( AF066071 ) Dict...yostelium discoideum SP85 (pspB) gene, comple... 860 0.0 1 ( AC117075 ) Dictyostelium

  5. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB689 (Link to dictyBase) - - - Contig-U16603-1 VFB689P (Link... to Original site) VFB689F 476 VFB689Z 663 VFB689P 1139 - - Show VFB689 Library VF (Link to library) Clone ID VFB689 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16603-1 Original site URL http://dict...mbrane 4.0 %: vesicles of secretory system 4.0 %: peroxisomal >> prediction for VFB689 is cyt 5' end seq. ID

  6. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB593 (Link to dictyBase) - - - Contig-U02438-1 VFB593E (Link...) Clone ID VFB593 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U02438-1 Ori...0.009 6 AC116986 |AC116986.2 Dictyostelium discoideum chromosome 2 map 2234041-25...sequence. 46 0.031 2 AC115577 |AC115577.2 Dictyostelium discoideum chromosome 2 m...ap 4657875-4914984 strain AX4, complete sequence. 34 0.051 14 AC116960 |AC116960.2 Dictyostelium discoideum

  7. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB551 (Link to dictyBase) - - - Contig-U16449-1 VFB551P (Link... to Original site) VFB551F 194 VFB551Z 178 VFB551P 372 - - Show VFB551 Library VF (Link to library) Clone ID VFB551 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16449-1 Original site URL http://dict...0 %: cytoplasmic 16.0 %: cytoskeletal 12.0 %: mitochondrial 4.0 %: vacuolar 4.0 %: endoplasmic reticulum >> predict

  8. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB826 (Link to dictyBase) - - - Contig-U16584-1 VFB826F (Link... to Original site) VFB826F 430 - - - - - - Show VFB826 Library VF (Link to library) Clone ID VFB826 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16584-1 Original site URL http://dict... (bits) Value N AC115577 |AC115577.2 Dictyostelium discoideum chromosome 2 map 4657875-4914984 strain AX4, c...vesicles of secretory system 4.0 %: endoplasmic reticulum >> prediction for VFB82

  9. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB490 (Link to dictyBase) - - - Contig-U16603-1 VFB490P (Link... to Original site) VFB490F 588 VFB490Z 683 VFB490P 1271 - - Show VFB490 Library VF (Link to library) Clone ID VFB490 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16603-1 Original site URL http://dict...: extracellular, including cell wall 4.0 %: plasma membrane 4.0 %: vesicles of secretory system 4.0 %: peroxisomal >> predict

  10. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB838 (Link to dictyBase) - - - Contig-U14973-1 VFB838P (Link... to Original site) VFB838F 562 VFB838Z 443 VFB838P 1005 - - Show VFB838 Library VF (Link to library) Clone ID VFB838 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14973-1 Original site URL http://dict...plasmic 12.0 %: Golgi 12.0 %: nuclear 4.0 %: plasma membrane 4.0 %: vesicles of secretory system 4.0 %: peroxisomal >> predict

  11. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB585 (Link to dictyBase) - - - Contig-U09875-1 VFB585Z (Link... to Original site) - - VFB585Z 664 - - - - Show VFB585 Library VF (Link to library) Clone ID VFB585 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U09875-1 Original site URL http://dict...Score E Sequences producing significant alignments: (bits) Value N AC116551 |AC116551.2 Dictyostelium discoi...ces producing significant alignments: (bits) Value AC116551_43( AC116551 |pid:none) Dictyostelium discoideum

  12. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB668 (Link to dictyBase) - G00394 DDB0168247 Contig-U09555-1...brary) Clone ID VFB668 (Link to dictyBase) Atlas ID - NBRP ID G00394 dictyBase ID DDB0168247 Link to Contig ...Contig-U09555-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/VF/VFB6-...uences producing significant alignments: (bits) Value N AC117072 |AC117072.2 Dictyostelium discoideum chromo...tein Score E Sequences producing significant alignments: (bits) Value AC117076_25

  13. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB625 (Link to dictyBase) - - - Contig-U13825-1 VFB625P (Link... to Original site) VFB625F 492 VFB625Z 685 VFB625P 1177 - - Show VFB625 Library VF (Link to library) Clone ID VFB625 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13825-1 Original site URL http://dict... %: cytoplasmic 16.0 %: nuclear 4.0 %: cytoskeletal 4.0 %: plasma membrane >> prediction for VFB625 is mit 5

  14. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB677 (Link to dictyBase) - G02518 DDB0188526 Contig-U02066-1...brary) Clone ID VFB677 (Link to dictyBase) Atlas ID - NBRP ID G02518 dictyBase ID DDB0188526 Link to Contig ...Contig-U02066-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/VF/VFB6-...nfnytrgccsygdl*wy*gtr*tih*kdr*lwcassfenfndtictit mfnr**sstwfeilskrsih*rgikc*hd*nh..._25( AC116960 |pid:none) Dictyostelium discoideum chromoso... 90 2e-16 CP001276_520( CP001276 |pid:none) The

  15. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB742 (Link to dictyBase) - - - Contig-U14919-1 VFB742P (Link... to Original site) VFB742F 590 VFB742Z 468 VFB742P 1058 - - Show VFB742 Library VF (Link to library) Clone ID VFB742 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14919-1 Original site URL http://dict...mbrane 4.0 %: peroxisomal 4.0 %: endoplasmic reticulum >> prediction for VFB742 is cyt 5' end seq. ID VFB742

  16. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB817 (Link to dictyBase) - - - Contig-U16229-1 VFB817P (Link... to Original site) VFB817F 504 VFB817Z 193 VFB817P 697 - - Show VFB817 Library VF (Link to library) Clone ID VFB817 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16229-1 Original site URL http://dict...%: peroxisomal 4.0 %: endoplasmic reticulum >> prediction for VFB817 is cyt 5' end seq. ID VFB817F 5' end se

  17. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB720 (Link to dictyBase) - - - Contig-U16603-1 VFB720P (Link... to Original site) VFB720F 521 VFB720Z 353 VFB720P 874 - - Show VFB720 Library VF (Link to library) Clone ID VFB720 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16603-1 Original site URL http://dict...: cytoskeletal 4.0 %: vacuolar 4.0 %: vesicles of secretory system >> prediction for VFB720 is nuc 5' end se

  18. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB644 (Link to dictyBase) - - - Contig-U10683-1 VFB644F (Link... to Original site) VFB644F 519 - - - - - - Show VFB644 Library VF (Link to library) Clone ID VFB644 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U10683-1 Original site URL http://dict...7 9 AC116920 |AC116920.2 Dictyostelium discoideum chromosome 2 map 3879572-407176... %: nuclear 8.0 %: mitochondrial 4.0 %: cytoskeletal 4.0 %: vesicles of secretory system >> predict

  19. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB868 (Link to dictyBase) - - - Contig-U15649-1 VFB868F (Link... to Original site) VFB868F 110 - - - - - - Show VFB868 Library VF (Link to library) Clone ID VFB868 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15649-1 Original site URL http://dict...c 4.0 %: cytoskeletal >> prediction for VFB868 is nuc 5' end seq. ID VFB868F 5' end seq. >VFB868F.Seq TATTAA

  20. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB804 (Link to dictyBase) - - - Contig-U13849-1 VFB804P (Link... to Original site) VFB804F 150 VFB804Z 637 VFB804P 787 - - Show VFB804 Library VF (Link to library) Clone ID VFB804 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13849-1 Original site URL http://dict... nuclear 4.0 %: cytoskeletal 4.0 %: vacuolar 4.0 %: plasma membrane 4.0 %: vesicles of secretory system >> predict

  1. Short interspersed elements (SINEs) are a major source of canine genomic diversity.

    Science.gov (United States)

    Wang, Wei; Kirkness, Ewen F

    2005-12-01

    SINEs are retrotransposons that have enjoyed remarkable reproductive success during the course of mammalian evolution, and have played a major role in shaping mammalian genomes. Previously, an analysis of survey-sequence data from an individual dog (a poodle) indicated that canine genomes harbor a high frequency of alleles that differ only by the absence or presence of a SINEC_Cf repeat. Comparison of this survey-sequence data with a draft genome sequence of a distinct dog (a boxer) has confirmed this prediction, and revealed the chromosomal coordinates for >10,000 loci that are bimorphic for SINEC_Cf insertions. Analysis of SINE insertion sites from the genomes of nine additional dogs indicates that 3%-5% are absent from either the poodle or boxer genome sequences--suggesting that an additional 10,000 bimorphic loci could be readily identified in the general dog population. We describe a methodology that can be used to identify these loci, and could be adapted to exploit these bimorphic loci for genotyping purposes. Approximately half of all annotated canine genes contain SINEC_Cf repeats, and these elements are occasionally transcribed. When transcribed in the antisense orientation, they provide splice acceptor sites that can result in incorporation of novel exons. The high frequency of bimorphic SINE insertions in the dog population is predicted to provide numerous examples of allele-specific transcription patterns that will be valuable for the study of differential gene expression among multiple dog breeds.

  2. Mapping analysis of scaffold/matrix attachment regions (s/MARs) from two different mammalian cell lines

    Energy Technology Data Exchange (ETDEWEB)

    Pilus, Nur Shazwani Mohd; Ahmad, Azrin; Yusof, Nurul Yuziana Mohd [School of Bioscience and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor (Malaysia); Johari, Norazfa [Department of Chemical and Process Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor (Malaysia)

    2014-09-03

    Scaffold/matrix attachment regions (S/MARs) are potential element that can be integrated into expression vector to increase expression of recombinant protein. Many studies on S/MAR have been done but none has revealed the distribution of S/MAR in a genome. In this study, we have isolated S/MAR sequences from HEK293 and Chinese hamster ovary cell lines (CHO DG44) using two different methods utilizing 2 M NaCl and lithium-3,5-diiodosalicylate (LIS). The isolated S/MARs were sequenced using Next Generation Sequencing (NGS) platform. Based on reference mapping analysis against human genome database, a total of 8,994,856 and 8,412,672 contigs of S/MAR sequences were retrieved from 2M NaCl and LIS extraction of HEK293 respectively. On the other hand, reference mapping analysis of S/MAR derived from CHO DG44 against our own CHO DG44 database have generated a total of 7,204,348 and 4,672,913 contigs from 2 M NaCl and LIS extraction method respectively.

  3. Mapping analysis of scaffold/matrix attachment regions (s/MARs) from two different mammalian cell lines

    International Nuclear Information System (INIS)

    Pilus, Nur Shazwani Mohd; Ahmad, Azrin; Yusof, Nurul Yuziana Mohd; Johari, Norazfa

    2014-01-01

    Scaffold/matrix attachment regions (S/MARs) are potential element that can be integrated into expression vector to increase expression of recombinant protein. Many studies on S/MAR have been done but none has revealed the distribution of S/MAR in a genome. In this study, we have isolated S/MAR sequences from HEK293 and Chinese hamster ovary cell lines (CHO DG44) using two different methods utilizing 2 M NaCl and lithium-3,5-diiodosalicylate (LIS). The isolated S/MARs were sequenced using Next Generation Sequencing (NGS) platform. Based on reference mapping analysis against human genome database, a total of 8,994,856 and 8,412,672 contigs of S/MAR sequences were retrieved from 2M NaCl and LIS extraction of HEK293 respectively. On the other hand, reference mapping analysis of S/MAR derived from CHO DG44 against our own CHO DG44 database have generated a total of 7,204,348 and 4,672,913 contigs from 2 M NaCl and LIS extraction method respectively

  4. Deciphering the hybridisation history leading to the Lager lineage based on the mosaic genomes of Saccharomyces bayanus strains NBRC1948 and CBS380.

    Directory of Open Access Journals (Sweden)

    Huu-Vang Nguyen

    Full Text Available Saccharomyces bayanus is a yeast species described as one of the two parents of the hybrid brewing yeast S. pastorianus. Strains CBS380(T and NBRC1948 have been retained successively as pure-line representatives of S. bayanus. In the present study, sequence analyses confirmed and upgraded our previous finding: S. bayanus type strain CBS380(T harbours a mosaic genome. The genome of strain NBRC1948 was also revealed to be mosaic. Both genomes were characterized by amplification and sequencing of different markers, including genes involved in maltotriose utilization or genes detected by array-CGH mapping. Sequence comparisons with public Saccharomyces spp. nucleotide sequences revealed that the CBS380(T and NBRC1948 genomes are composed of: a predominant non-cerevisiae genetic background belonging to S. uvarum, a second unidentified species provisionally named S. lagerae, and several introgressed S. cerevisiae fragments. The largest cerevisiae-introgressed DNA common to both genomes totals 70kb in length and is distributed in three contigs, cA, cB and cC. These vary in terms of length and presence of MAL31 or MTY1 (maltotriose-transporter gene. In NBRC1948, two additional cerevisiae-contigs, cD and cE, totaling 12kb in length, as well as several smaller cerevisiae fragments were identified. All of these contigs were partially detected in the genomes of S. pastorianus lager strains CBS1503 (S. monacensis and CBS1513 (S. carlsbergensis explaining the noticeable common ability of S. bayanus and S. pastorianus to metabolize maltotriose. NBRC1948 was shown to be inter-fertile with S. uvarum CBS7001. The cross involving these two strains produced F1 segregants resembling the strains CBS380(T or NRRLY-1551. This demonstrates that these S. bayanus strains were the offspring of a cross between S. uvarum and a strain similar to NBRC1948. Phylogenies established with selected cerevisiae and non-cerevisiae genes allowed us to decipher the complex hybridisation

  5. Comparison of bacterial genome assembly software for MinION data and their applicability to medical microbiology.

    Science.gov (United States)

    Judge, Kim; Hunt, Martin; Reuter, Sandra; Tracey, Alan; Quail, Michael A; Parkhill, Julian; Peacock, Sharon J

    2016-09-01

    Translating the Oxford Nanopore MinION sequencing technology into medical microbiology requires on-going analysis that keeps pace with technological improvements to the instrument and release of associated analysis software. Here, we use a multidrug-resistant Enterobacter kobei isolate as a model organism to compare open source software for the assembly of genome data, and relate this to the time taken to generate actionable information. Three software tools (PBcR, Canu and miniasm) were used to assemble MinION data and a fourth (SPAdes) was used to combine MinION and Illumina data to produce a hybrid assembly. All four had a similar number of contigs and were more contiguous than the assembly using Illumina data alone, with SPAdes producing a single chromosomal contig. Evaluation of the four assemblies to represent the genome structure revealed a single large inversion in the SPAdes assembly, which also incorrectly integrated a plasmid into the chromosomal contig. Almost 50 %, 80 % and 90 % of MinION pass reads were generated in the first 6, 9 and 12 h, respectively. Using data from the first 6 h alone led to a less accurate, fragmented assembly, but data from the first 9 or 12 h generated similar assemblies to that from 48 h sequencing. Assemblies were generated in 2 h using Canu, indicating that going from isolate to assembled data is possible in less than 48 h. MinION data identified that genes responsible for resistance were carried by two plasmids encoding resistance to carbapenem and to sulphonamides, rifampicin and aminoglycosides, respectively.

  6. Deep RNA sequencing of the skeletal muscle transcriptome in swimming fish.

    Directory of Open Access Journals (Sweden)

    Arjan P Palstra

    Full Text Available Deep RNA sequencing (RNA-seq was performed to provide an in-depth view of the transcriptome of red and white skeletal muscle of exercised and non-exercised rainbow trout (Oncorhynchus mykiss with the specific objective to identify expressed genes and quantify the transcriptomic effects of swimming-induced exercise. Pubertal autumn-spawning seawater-raised female rainbow trout were rested (n = 10 or swum (n = 10 for 1176 km at 0.75 body-lengths per second in a 6,000-L swim-flume under reproductive conditions for 40 days. Red and white muscle RNA of exercised and non-exercised fish (4 lanes was sequenced and resulted in 15-17 million reads per lane that, after de novo assembly, yielded 149,159 red and 118,572 white muscle contigs. Most contigs were annotated using an iterative homology search strategy against salmonid ESTs, the zebrafish Danio rerio genome and general Metazoan genes. When selecting for large contigs (>500 nucleotides, a number of novel rainbow trout gene sequences were identified in this study: 1,085 and 1,228 novel gene sequences for red and white muscle, respectively, which included a number of important molecules for skeletal muscle function. Transcriptomic analysis revealed that sustained swimming increased transcriptional activity in skeletal muscle and specifically an up-regulation of genes involved in muscle growth and developmental processes in white muscle. The unique collection of transcripts will contribute to our understanding of red and white muscle physiology, specifically during the long-term reproductive migration of salmonids.

  7. Exploring viral diversity in a unique South African soil habitat

    CSIR Research Space (South Africa)

    Segobola, J

    2018-01-01

    Full Text Available Refseq protein data- base (release of 2016-01-19). The results revealed that 37.6% of the contigs represented a significant hit (threshold of 50 on the BLAST bit score). MetaVir identified 18 virus families, in which prokaryotic viruses were the most... metaviromics analyses in soil environments18,20,23,30. This finding was not surprising, given the observations from previous studies35,53 which showed high prokaryotic abundances in the Kogelberg soil environment. Nevertheless, signature sequences from large...

  8. A Near-Complete Haplotype-Phased Genome of the Dikaryotic Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici Reveals High Interhaplotype Diversity

    Directory of Open Access Journals (Sweden)

    Benjamin Schwessinger

    2018-02-01

    Full Text Available A long-standing biological question is how evolution has shaped the genomic architecture of dikaryotic fungi. To answer this, high-quality genomic resources that enable haplotype comparisons are essential. Short-read genome assemblies for dikaryotic fungi are highly fragmented and lack haplotype-specific information due to the high heterozygosity and repeat content of these genomes. Here, we present a diploid-aware assembly of the wheat stripe rust fungus Puccinia striiformis f. sp. tritici based on long reads using the FALCON-Unzip assembler. Transcriptome sequencing data sets were used to infer high-quality gene models and identify virulence genes involved in plant infection referred to as effectors. This represents the most complete Puccinia striiformis f. sp. tritici genome assembly to date (83 Mb, 156 contigs, N50 of 1.5 Mb and provides phased haplotype information for over 92% of the genome. Comparisons of the phase blocks revealed high interhaplotype diversity of over 6%. More than 25% of all genes lack a clear allelic counterpart. When we investigated genome features that potentially promote the rapid evolution of virulence, we found that candidate effector genes are spatially associated with conserved genes commonly found in basidiomycetes. Yet, candidate effectors that lack an allelic counterpart are more distant from conserved genes than allelic candidate effectors and are less likely to be evolutionarily conserved within the P. striiformis species complex and Pucciniales. In summary, this haplotype-phased assembly enabled us to discover novel genome features of a dikaryotic plant-pathogenic fungus previously hidden in collapsed and fragmented genome assemblies.

  9. Dicty_cDB: SSB721 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSB721 (Link to dictyBase) ssb721 G20804 DDB0230090 Contig-U04...594-1 SSB721E (Link to Original site) - - - - - - SSB721E 549 Show SSB721 Library SS (Link to library) Clone ID SSB721 (Link to dict...yBase) Atlas ID ssb721 NBRP ID G20804 dictyBase ID DDB0230090 Link to Contig Contig-...U04594-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/SS/SSB7-A/SSB72...quence update 1997.11.25 Translated Amino Acid sequence NDINSIVCKANSQCPTSHICTSSNKCIIKYSSKVGEKCTDSPLQCRVFNGEI

  10. Dicty_cDB: SLG865 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLG865 (Link to dictyBase) - G22488 DDB0184443 Contig-U04159-1... SLG865E (Link to Original site) - - - - - - SLG865E 373 Show SLG865 Library SL (Link to library) Clone ID SLG865 (Link to dict...yBase) Atlas ID - NBRP ID G22488 dictyBase ID DDB0184443 Link to Contig Contig-U04159-1 O...riginal site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/SL/SLG8-C/SLG865Q.Seq.d/ ...quence update 1998. 6.28 Translated Amino Acid sequence KAQLETDLKNICTLVPSNITMECKF

  11. Dicty_cDB: VFB113 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB113 (Link to dictyBase) - - - Contig-U16478-1 VFB113P (Link... to Original site) VFB113F 584 VFB113Z 643 VFB113P 1227 - - Show VFB113 Library VF (Link to library) Clone ID VFB113 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16478-1 Original site URL http://dict...rlstttrlptttrlptttrlstttr lptttrlptttrlptttrlptttrlsttrlstttrlstswctswctswictrygswissr llcwynhs*l*t*c*sfkksn...sequence. 42 5e-29 8 U03413 |U03413.1 Dictyostelium discoideum AX2 calcium binding protein mRNA, complete cd

  12. Dicty_cDB: CFE213 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFE213 (Link to dictyBase) - - - Contig-U16381-1 CFE213F (Link... to Original site) CFE213F 111 - - - - - - Show CFE213 Library CF (Link to library) Clone ID CFE213 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dict... E Sequences producing significant alignments: (bits) Value N AC115685 |AC115685.1 Dict...yostelium discoideum chromosome 2 map 4718821-4752388 strain AX4, complete sequence. 80 9e-24 3 X51892 |X51892.1 Dict

  13. Dicty_cDB: SSF210 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSF210 (Link to dictyBase) - - - Contig-U16581-1 SSF210P (Link... to Original site) SSF210F 183 SSF210Z 197 SSF210P 380 - - Show SSF210 Library SS (Link to library) Clone ID SSF210 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dict...nt alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoideum mRNA. 339...01670 Skin cDNA library Ictalurus punctatus cDNA 5' similar to Ictacalcin, mRNA s

  14. Dicty_cDB: VFI196 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFI196 (Link to dictyBase) - - - Contig-U16512-1 - (Link to Or...iginal site) - - VFI196Z 168 - - - - Show VFI196 Library VF (Link to library) Clone ID VFI196 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16512-1 Original site URL http://dictycdb.b...ces. 40 8.8 1 CK407044 |CK407044.1 AUF_IfLvr_212_p04 Ictalurus furcatus liver cDNA library Ictalurus furcatu...%: nuclear 36.0 %: mitochondrial 16.0 %: cytoplasmic 4.0 %: cytoskeletal >> prediction for VFI196 is nuc 5'

  15. Dicty_cDB: SSB373 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSB373 (Link to dictyBase) - G00609 DDB0216216 Contig-U04543-1...nk to library) Clone ID SSB373 (Link to dictyBase) Atlas ID - NBRP ID G00609 dictyBase ID DDB0216216 Link to... Contig Contig-U04543-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/...Value N AB016728 |AB016728.1 Dictyostelium discoideum sapA mRNA for saposin A, co... 44 7e-04 12 AC116330 |AC116330.2 Dictyostelium discoideum chromosome 2 map 3191214-3323468 strain AX4, comp

  16. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC171 (Link to dictyBase) - - - Contig-U16478-1 VFC171P (Link... to Original site) VFC171F 482 VFC171Z 633 VFC171P 1115 - - Show VFC171 Library VF (Link to library) Clone ID VFC171 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16478-1 Original site URL http://dict...ptttrlptt trlstttrlptttrlptttrlptttrlptttrlsttrlxtttrlstswctswctswictr ygswissrllcwynhs*l*t*c*sfkksnerywyk*i...ologous to C-terminal repeat sequence of rhodopsin and synaptophysin. 88 1e-15 3 X54062 |X54062.1 Dictyostel

  17. Dicty_cDB: VSE812 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VS (Link to library) VSE812 (Link to dictyBase) - - - Contig-U14773-1 VSE812E (Link... to Original site) - - - - - - VSE812E 721 Show VSE812 Library VS (Link to library) Clone ID VSE812 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14773-1 Original site URL http://dict...re E Sequences producing significant alignments: (bits) Value N L08391 |L08391.1 Dict...9 AF401554_1( AF401554 |pid:none) Ictalurus punctatus ribosomal prot... 132 1e-29 protein update 2009. 7.31

  18. Dicty_cDB: SSD329 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSD329 (Link to dictyBase) - - - Contig-U16581-1 SSD329F (Link... to Original site) SSD329F 444 - - - - - - Show SSD329 Library SS (Link to library) Clone ID SSD329 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dict...A Score E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoide... DNA sequence. 50 0.027 1 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ictal

  19. Dicty_cDB: CFE853 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFE853 (Link to dictyBase) - - - Contig-U16381-1 CFE853F (Link... to Original site) CFE853F 109 - - - - - - Show CFE853 Library CF (Link to library) Clone ID CFE853 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dict...uences producing significant alignments: (bits) Value N X51892 |X51892.1 Dictyost...elium discoideum SP60 gene for spore coat protein. 80 6e-21 2 X52105 |X52105.1 Dictyostelium discoideum SP60

  20. Dicty_cDB: CFH668 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFH668 (Link to dictyBase) - - - Contig-U13860-1 - (Link to Or...iginal site) - - CFH668Z 651 - - - - Show CFH668 Library CF (Link to library) Clone ID CFH668 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13860-1 Original site URL http://dictycdb.b...s) Value N AC116987 |AC116987.2 Dictyostelium discoideum chromosome 2 map 3527441-3568052 strain AX4, comple...te sequence. 58 1e-11 6 AC116305 |AC116305.2 Dictyostelium discoideum chromosome

  1. Dicty_cDB: SFD117 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SF (Link to library) SFD117 (Link to dictyBase) - - - Contig-U16381-1 SFD117F (Link... to Original site) SFD117F 107 - - - - - - Show SFD117 Library SF (Link to library) Clone ID SFD117 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dict...s producing significant alignments: (bits) Value N AC115685 |AC115685.1 Dictyoste...lium discoideum chromosome 2 map 4718821-4752388 strain AX4, complete sequence. 80 2e-21 3 X51892 |X51892.1 Dict

  2. Dicty_cDB: AFK740 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFK740 (Link to dictyBase) - - - Contig-U16381-1 AFK740F (Link... to Original site) AFK740F 107 - - - - - - Show AFK740 Library AF (Link to library) Clone ID AFK740 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dict...es producing significant alignments: (bits) Value N X51892 |X51892.1 Dictyosteliu...m discoideum SP60 gene for spore coat protein. 80 6e-21 2 X52105 |X52105.1 Dictyostelium discoideum SP60 gen

  3. Dicty_cDB: SLF689 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SL (Link to library) SLF689 (Link to dictyBase) - - - Contig-U16284-1 SLF689Z (Link... to Original site) - - SLF689Z 320 - - - - Show SLF689 Library SL (Link to library) Clone ID SLF689 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16284-1 Original site URL http://dict...DNA Score E Sequences producing significant alignments: (bits) Value N ( AC116305 ) Dictyostelium discoideum... chromosome 2 map 1005175... 478 e-131 1 ( AU053477 ) Dictyostelium discoideum slug cDNA, clone SLI726. 478 e-131 1 ( AU053102 ) Dict

  4. Dicty_cDB: CFH744 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFH744 (Link to dictyBase) - - - Contig-U16381-1 - (Link to Or...iginal site) CFH744F 118 - - - - - - Show CFH744 Library CF (Link to library) Clone ID CFH744 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dictycdb.b...ng significant alignments: (bits) Value N X51892 |X51892.1 Dictyostelium discoide...um SP60 gene for spore coat protein. 80 1e-23 3 AC115685 |AC115685.1 Dictyostelium discoideum chromosome 2 m

  5. Dicty_cDB: SSJ546 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSJ546 (Link to dictyBase) - - - Contig-U16581-1 SSJ546F (Link... to Original site) SSJ546F 445 - - - - - - Show SSJ546 Library SS (Link to library) Clone ID SSJ546 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dict...DNA Score E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoi...2, DNA sequence. 50 0.027 1 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ict

  6. Dicty_cDB: SSF689 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSF689 (Link to dictyBase) - - - Contig-U16581-1 SSF689F (Link... to Original site) SSF689F 443 - - - - - - Show SSF689 Library SS (Link to library) Clone ID SSF689 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dict...core E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoideum ...A sequence. 50 0.027 1 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ictaluru

  7. Dicty_cDB: CFH713 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFH713 (Link to dictyBase) - - - Contig-U16381-1 - (Link to Or...iginal site) CFH713F 133 - - - - - - Show CFH713 Library CF (Link to library) Clone ID CFH713 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dictycdb.b...ogy vs DNA Score E Sequences producing significant alignments: (bits) Value N X51892 |X51892.1 Dict...yostelium discoideum SP60 gene for spore coat protein. 80 2e-23 3 AC115685 |AC115685.1 Dict

  8. Dicty_cDB: CFH518 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFH518 (Link to dictyBase) - - - Contig-U16381-1 - (Link to Or...iginal site) CFH518F 131 - - - - - - Show CFH518 Library CF (Link to library) Clone ID CFH518 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dictycdb.b...vs DNA Score E Sequences producing significant alignments: (bits) Value N X51892 |X51892.1 Dict...yostelium discoideum SP60 gene for spore coat protein. 80 2e-22 3 AC115685 |AC115685.1 Dictyos

  9. Dicty_cDB: VHI285 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHI285 (Link to dictyBase) - - - Contig-U16073-1 - (Link to Or...iginal site) VHI285F 136 - - - - - - Show VHI285 Library VH (Link to library) Clone ID VHI285 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16073-1 Original site URL http://dictycdb.b... DNA Score E Sequences producing significant alignments: (bits) Value N ( BJ423035 ) Dict...yostelium discoideum cDNA clone:ddv47i22, 5' ... 72 2e-27 3 ( BJ419916 ) Dictyostelium discoideum cD

  10. Dicty_cDB: CHQ307 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHQ307 (Link to dictyBase) - - - Contig-U16403-1 CHQ307P (Link... to Original site) CHQ307F 441 CHQ307Z 669 CHQ307P 1090 - - Show CHQ307 Library CH (Link to library) Clone ID CHQ307 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16403-1 Original site URL http://dict...EDPATQLSSLKFESDKEKEQALLWAFLASYLPEDKGS LQKEFVKHVEYTLAQTKSECTDF--- ---fvmplvmkictslvlvlkrstk*rklfmmenshqtldglv...0.0 own update 2004.12.25 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N M77492 |M77492.1 Dict

  11. Dicty_cDB: VHH893 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHH893 (Link to dictyBase) - - - Contig-U15693-1 - (Link to Or...iginal site) - - VHH893Z 385 - - - - Show VHH893 Library VH (Link to library) Clone ID VHH893 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15693-1 Original site URL http://dictycdb.b... Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N AF188717 |AF188717.1 Dict...anio rerio alanyl-tRNA synthetase... 48 9e-05 DQ353802_1( DQ353802 |pid:none) Ictalurus punctatus isolate C1

  12. Dicty_cDB: CHD534 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CH (Link to library) CHD534 (Link to dictyBase) - - - Contig-U15540-1 CHD534E (Link...) Clone ID CHD534 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15540-1 Ori...nts: (bits) Value N AC114263 |AC114263.2 Dictyostelium discoideum chromosome 2 ma...p 215673-367476 strain AX4, complete sequence. 40 1e-05 6 AC117081 |AC117081.2 Dictyostelium discoideum chro...mosome 2 map 5862124-6045772 strain AX4, complete sequence. 40 2e-05 5 AJ277590 |AJ277590.1 Dictyostelium di

  13. Dicty_cDB: CFI225 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFI225 (Link to dictyBase) - - - Contig-U16381-1 - (Link to Or...iginal site) CFI225F 133 - - - - - - Show CFI225 Library CF (Link to library) Clone ID CFI225 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dictycdb.b...ogy vs DNA Score E Sequences producing significant alignments: (bits) Value N X51892 |X51892.1 Dict...yostelium discoideum SP60 gene for spore coat protein. 80 2e-23 3 AC115685 |AC115685.1 Dict

  14. Dicty_cDB: SSG552 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSG552 (Link to dictyBase) - - - Contig-U16581-1 - (Link to Or...iginal site) SSG552F 449 - - - - - - Show SSG552 Library SS (Link to library) Clone ID SSG552 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dictycdb.b...A Score E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoide.... 44 1.7 1 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ictalurus punctatus cDNA 5' similar to Ictacalc

  15. Dicty_cDB: SHE682 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHE682 (Link to dictyBase) - - - Contig-U11295-1 - (Link to Or...iginal site) SHE682F 142 - - - - - - Show SHE682 Library SH (Link to library) Clone ID SHE682 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11295-1 Original site URL http://dictycdb.b... 2 CB936849 |CB936849.1 IpCGJx13_9_A11_23 IpCGJx13 Ictalurus punctatus cDNA clone...8.0 %: cytoplasmic 28.0 %: nuclear 4.0 %: cytoskeletal 4.0 %: plasma membrane 4.0

  16. Dicty_cDB: SHH247 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SH (Link to library) SHH247 (Link to dictyBase) - - - Contig-U15693-1 - (Link to Or...iginal site) - - SHH247Z 601 - - - - Show SHH247 Library SH (Link to library) Clone ID SHH247 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15693-1 Original site URL http://dictycdb.b...pdate 2002.12. 6 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N AF188717 |AF188717.1 Dict...ogaster SD01519 fu... 56 6e-07 DQ353802_1( DQ353802 |pid:none) Ictalurus punctatu

  17. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC154 (Link to dictyBase) - - - Contig-U16363-1 VFC154Z (Link... to Original site) - - VFC154Z 551 - - - - Show VFC154 Library VF (Link to library) Clone ID VFC154 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16363-1 Original site URL http://dict...s: (bits) Value N U82513 |U82513.1 Dictyostelium discoideum random slug cDNA25 protein (rsc25) mRNA, partial...producing significant alignments: (bits) Value U82513_1( U82513 |pid:none) Dictyostelium discoideum random s

  18. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC175 (Link to dictyBase) - - - Contig-U16358-1 VFC175F (Link... to Original site) VFC175F 507 - - - - - - Show VFC175 Library VF (Link to library) Clone ID VFC175 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16358-1 Original site URL http://dict...12.25 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N AC117072 |AC117072.2 Dict...eletal 4.0 %: mitochondrial 4.0 %: vacuolar 4.0 %: vesicles of secretory system >> prediction for VFC175 is

  19. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB752 (Link to dictyBase) - - - Contig-U14717-1 VFB752E (Link...) Clone ID VFB752 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14717-1 Ori...s: (bits) Value N AC116984 |AC116984.2 Dictyostelium discoideum chromosome 2 map 2567470-3108875 strain AX4,... complete sequence. 1215 0.0 11 AC115594 |AC115594.2 Dictyostelium discoideum chr...omosome 2 map 4071862-4101005 strain AX4, complete sequence. 113 3e-47 8 AC116920 |AC116920.2 Dictyostelium

  20. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB614 (Link to dictyBase) - - - Contig-U16382-1 VFB614Z (Link... to Original site) - - VFB614Z 219 - - - - Show VFB614 Library VF (Link to library) Clone ID VFB614 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16382-1 Original site URL http://dict...ments: (bits) Value N AC116957 |AC116957.2 Dictyostelium discoideum chromosome 2 ...tin mRNA ITL-1, 3' end. 339 9e-90 1 AC116986 |AC116986.2 Dictyostelium discoideum chromosome 2 map 2234041-2

  1. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB606 (Link to dictyBase) - - - Contig-U16382-1 VFB606E (Link...) Clone ID VFB606 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16382-1 Ori...ucing significant alignments: (bits) Value N X03281 |X03281.1 Dictyostelium discoideum gene for actin A8. 22...28 0.0 1 AC116957 |AC116957.2 Dictyostelium discoideum chromosome 2 map 1685067-2...2028 0.0 1 AC115579 |AC115579.2 Dictyostelium discoideum chromosome 2 map 4915084-5005461 strain AX4, comple

  2. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB714 (Link to dictyBase) - - - Contig-U12859-1 VFB714F (Link... to Original site) VFB714F 545 - - - - - - Show VFB714 Library VF (Link to library) Clone ID VFB714 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12859-1 Original site URL http://dict... Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N U36936 |U36936.1 Dict...Score E Sequences producing significant alignments: (bits) Value U36936_1( U36936 |pid:none) Dictyostelium d

  3. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB830 (Link to dictyBase) - - - Contig-U13825-1 VFB830P (Link... to Original site) VFB830F 566 VFB830Z 634 VFB830P 1200 - - Show VFB830 Library VF (Link to library) Clone ID VFB830 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13825-1 Original site URL http://dict...m3a: 0.00 m3b: 0.00 m_ : 1.00 56.0 %: mitochondrial 28.0 %: cytoplasmic 16.0 %: nuclear >> prediction for VF

  4. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB667 (Link to dictyBase) - - - Contig-U13894-1 VFB667Z (Link... to Original site) - - VFB667Z 633 - - - - Show VFB667 Library VF (Link to library) Clone ID VFB667 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13894-1 Original site URL http://dict... update 2009. 4. 4 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N AC117075 |AC117075.2 Dict...chondrial 4.0 %: vacuolar 4.0 %: peroxisomal >> prediction for VFB667 is cyt 5' e

  5. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC129 (Link to dictyBase) - - - Contig-U16543-1 VFC129F (Link... to Original site) VFC129F 276 - - - - - - Show VFC129 Library VF (Link to library) Clone ID VFC129 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16543-1 Original site URL http://dict... vs DNA Score E Sequences producing significant alignments: (bits) Value N M91382 |M91382.1 Dictyostelium di...scoideum thioredoxin (TRX2) mRNA, 5' end. 281 2e-96 3 M91384 |M91384.1 Dictyostelium discoideum thioredoxin

  6. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC191 (Link to dictyBase) - - - Contig-U16281-1 VFC191F (Link... to Original site) VFC191F 350 - - - - - - Show VFC191 Library VF (Link to library) Clone ID VFC191 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16281-1 Original site URL http://dict...ts) Value N AC116305 |AC116305.2 Dictyostelium discoideum chromosome 2 map 1005175-1418323 strain AX4, compl... 186 3e-46 AC116305_8( AC116305 |pid:none) Dictyostelium discoideum chromosom...

  7. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC222 (Link to dictyBase) - - - Contig-U16046-1 VFC222Z (Link... to Original site) - - VFC222Z 339 - - - - Show VFC222 Library VF (Link to library) Clone ID VFC222 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16046-1 Original site URL http://dict...pdate 2002.12.15 Homology vs DNA Score E Sequences producing significant alignments: (bits) Value N AC116989 |AC116989.2 Dict...etical LO... 88 8e-17 AC115592_28( AC115592 |pid:none) Dictyostelium discoideum c

  8. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB776 (Link to dictyBase) - - - Contig-U13978-1 VFB776P (Link... to Original site) VFB776F 513 VFB776Z 602 VFB776P 1115 - - Show VFB776 Library VF (Link to library) Clone ID VFB776 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U13978-1 Original site URL http://dict...ences producing significant alignments: (bits) Value N L41839 |L41839.1 Dictyoste....00 48.0 %: cytoplasmic 32.0 %: nuclear 16.0 %: cytoskeletal 4.0 %: peroxisomal >> predict

  9. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC231 (Link to dictyBase) - - - Contig-U16593-1 VFC231P (Link... to Original site) VFC231F 428 VFC231Z 202 VFC231P 630 - - Show VFC231 Library VF (Link to library) Clone ID VFC231 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16593-1 Original site URL http://dict...amoeba histolytica genomic, DNA sequence. 48 0.13 1 AC116984 |AC116984.2 Dictyostelium discoideum chromosome...mbrane 4.0 %: vesicles of secretory system 4.0 %: extracellular, including cell wall >> predict

  10. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB552 (Link to dictyBase) - - - Contig-U16480-1 VFB552P (Link... to Original site) VFB552F 529 VFB552Z 562 VFB552P 1091 - - Show VFB552 Library VF (Link to library) Clone ID VFB552 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16480-1 Original site URL http://dict...nce (All Frames) Frame A: kvtv*liiylkiknetficisifdfsyfhcksrlpicfl*iq*iinwwc*rycssfky*w nnikyyi*fyniictkyticc...cuolar 8.0 %: endoplasmic reticulum 4.0 %: cytoplasmic 4.0 %: mitochondrial 4.0 %: Golgi >> prediction for V

  11. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB751 (Link to dictyBase) - - - Contig-U16480-1 VFB751P (Link... to Original site) VFB751F 134 VFB751Z 560 VFB751P 694 - - Show VFB751 Library VF (Link to library) Clone ID VFB751 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16480-1 Original site URL http://dict...4.0 %: endoplasmic reticulum >> prediction for VFB751 is cyt 5' end seq. ID VFB751F 5' end seq. >VFB751F.Seq

  12. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB594 (Link to dictyBase) - G00395 DDB0218201 Contig-U09553-1...brary) Clone ID VFB594 (Link to dictyBase) Atlas ID - NBRP ID G00395 dictyBase ID DDB0218201 Link to Contig ...Contig-U09553-1 Original site URL http://dictycdb.biol.tsukuba.ac.jp/CSM/VF/VFB5-...s thaliana BAC F21J9 from chromosome I, complete sequence. 44 0.14 3 AC115612 |AC115612.2 Dictyostelium disc...: cytoskeletal 4.0 %: mitochondrial 4.0 %: vesicles of secretory system >> prediction for VFB594 is nuc 5' e

  13. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB703 (Link to dictyBase) - - - Contig-U14188-1 VFB703Z (Link... to Original site) - - VFB703Z 680 - - - - Show VFB703 Library VF (Link to library) Clone ID VFB703 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14188-1 Original site URL http://dict...ENCE, 18 unordered pieces. 36 0.038 5 AC115680 |AC115680.2 Dictyostelium discoideum chromosome 2 map 4415041...4-226K1, *** SEQUENCING IN PROGRESS ***, 52 unordered pieces. 46 0.31 5 AC116977 |AC116977.2 Dict

  14. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB662 (Link to dictyBase) - - - Contig-U15118-1 VFB662E (Link...) Clone ID VFB662 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15118-1 Ori... significant alignments: (bits) Value N AF039211 |AF039211.1 Dictyostelium discoideum ADP/ATP translocase mR...NA, complete cds. 1011 0.0 2 AF100676 |AF100676.1 Dictyostelium discoideum ADP/ATP translocase gene, complet...drial 4.0 %: extracellular, including cell wall 4.0 %: vacuolar 4.0 %: vesicles of secretory system >> predict

  15. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB843 (Link to dictyBase) - - - Contig-U15729-1 VFB843F (Link... to Original site) VFB843F 566 - - - - - - Show VFB843 Library VF (Link to library) Clone ID VFB843 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15729-1 Original site URL http://dict...roducing significant alignments: (bits) Value N U64830 |U64830.1 Dictyostelium discoideum AX2 protein tyrosi...ne kinase gene, complete cds. 1114 0.0 1 U01064 |U01064.1 Dictyostelium discoideum AX2 protein tyrosine kina

  16. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFB632 (Link to dictyBase) - - - Contig-U16585-1 VFB632Z (Link... to Original site) - - VFB632Z 181 - - - - Show VFB632 Library VF (Link to library) Clone ID VFB632 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16585-1 Original site URL http://dict...g significant alignments: (bits) Value N AC115577 |AC115577.2 Dictyostelium discoideum chromosome 2 map 4657...%: cytoplasmic 12.0 %: mitochondrial 8.0 %: cytoskeletal 4.0 %: peroxisomal >> prediction for VFB632 is nuc

  17. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC124 (Link to dictyBase) - - - Contig-U12017-1 VFC124Z (Link... to Original site) - - VFC124Z 496 - - - - Show VFC124 Library VF (Link to library) Clone ID VFC124 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12017-1 Original site URL http://dict...e E Sequences producing significant alignments: (bits) Value N AC116957 |AC116957.2 Dictyostelium discoideum... chromosome 2 map 1685067-2090751 strain AX4, complete sequence. 835 0.0 2 U67089 |U67089.1 Dict

  18. Dicty_cDB: [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VF (Link to library) VFC242 (Link to dictyBase) - - - Contig-U16280-1 VFC242F (Link... to Original site) VFC242F 431 - - - - - - Show VFC242 Library VF (Link to library) Clone ID VFC242 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16280-1 Original site URL http://dict...3( S18663 ) coronin - slime mold (Dictyostelium discoideum) ... 270 1e-71 AY52578...asmic 4.0 %: cytoskeletal 4.0 %: vesicles of secretory system >> prediction for VFC242 is nuc 5' end seq. ID

  19. Dicty_cDB: VHF145 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHF145 (Link to dictyBase) - - - Contig-U15430-1 VHF145E (Link...) Clone ID VHF145 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15430-1 Ori...ology vs DNA Score E Sequences producing significant alignments: (bits) Value N AC116984 |AC116984.2 Dictyos... theta DNA for complete sequence of nucleomorph chromosome 2. 48 2e-07 2 ES451909 | PREDICTED: similar to PI...al 16.0 %: nuclear 8.0 %: vacuolar 8.0 %: endoplasmic reticulum 4.0 %: cytoskeletal >> prediction for VHF145

  20. De novo characterisation of the greenlip abalone transcriptome (Haliotis laevigata) with a focus on the heat shock protein 70 (HSP70) family.

    Science.gov (United States)

    Shiel, Brett P; Hall, Nathan E; Cooke, Ira R; Robinson, Nicholas A; Strugnell, Jan M

    2015-02-01

    Abalone (Haliotis) are economically important molluscs for fisheries and aquaculture industries worldwide. Despite this, genomic resources for abalone and molluscs are still limited. Here we present a description and functional annotation of the greenlip abalone (Haliotis laevigata) transcriptome. We present a focused analysis on the heat shock protein 70 (HSP70) family of genes with putative functions affecting temperature stress and immunity. A total of ~38 million paired end Illumina reads were obtained, resulting in a Trinity assembly of 222,172 contigs with minimum length of 200 base pairs and maximum length of 33 kilobases. The 20,702 contigs were annotated with gene descriptions by BLAST. We created a program to maximise the number of functionally annotated genes, and over 10,000 contigs were assigned Gene ontologies (GO terms). By using CateGOrizer, immunity related GO terms for stressors such as heat, hypoxia, oxidative stress and wounding received the highest counts. Twenty-six contigs with homology to the HSP70 family of genes were identified. Ninety-one putative single-nucleotide polymorphisms were observed in the abalone HSP70 contigs. Eleven of these were considered non-synonymous. The annotated transcriptome described in this study will be a useful basis for future work investigating the genetic response of abalone to stress.

  1. On the placement of practice questions during study.

    Science.gov (United States)

    Weinstein, Yana; Nunes, Ludmila D; Karpicke, Jeffrey D

    2016-03-01

    Retrieval practice improves retention of information on later tests. A question remains: When should retrieval occur during learning-interspersed throughout study or at the end of each study period? In a lab experiment, an online experiment, and a classroom study, we aimed to determine the ideal placement (interspersed vs. at-the-end) of retrieval practice questions. In the lab experiment, 64 subjects viewed slides about APA style and answered short-answer practice questions about the content or restudied the slides (restudy condition). The practice questions either appeared 1 every 1-2 slides (interspersed condition), or all at the end of the presentation (at-the-end condition). One week later, subjects returned and answered the same questions on a final test. In the online experiment, 175 subjects completed the same procedure. In the classroom study, 62 undergraduate students took quizzes as part of class lectures. Short-answer practice questions appeared either throughout the lectures (interspersed condition) or at the end of the lectures (at-the-end condition). Nineteen days after the last quiz, students were given a surprise final test. Results from the 3 experiments converge in demonstrating an advantage for interspersing practice questions on the initial tests, but an absence of this advantage on the final test. (c) 2016 APA, all rights reserved).

  2. Atomic force microscopy reveals multiple patterns of antenna organization in purple bacteria: implications for energy transduction mechanisms and membrane modeling.

    Science.gov (United States)

    Sturgis, James N; Niederman, Robert A

    2008-01-01

    Recent topographs of the intracytoplasmic membrane (ICM) of purple bacteria obtained by atomic force microscopy (AFM) have provided the first surface views of the native architecture of a multicomponent biological membrane at submolecular resolution, representing an important landmark in structural biology. A variety of species-dependent, closely packed arrangements of light-harvesting (LH) complexes was revealed: the most highly organized was found in Rhodobacter sphaeroides in which the peripheral LH2 antenna was seen either in large clusters or in fixed rows interspersed among ordered arrays of dimeric LH1-reaction center (RC) core complexes. A more random organization was observed in other species containing both the LH1 and LH2 complexes, as typified by Rhododspirillum photometricum with randomly packed monomeric LH1-RC core complexes intermingled with large, paracrystalline domains of LH2 antenna. Surprisingly, no structures that could be identified as the ATP synthase or cytochrome bc (1) complexes were observed, which may reflect their localization at ICM vesicle poles or in curved membrane areas, out of view from the flat regions imaged by AFM. This possible arrangement of energy transducing complexes has required a reassessment of energy tranduction mechanisms which place the cytochrome bc (1) complex in close association with the RC. Instead, more plausible proposals must account for the movement of quinone redox species over considerable membrane distances on appropriate time scales. AFM, together with atomic resolution structures are also providing the basis for molecular modeling of the ICM that is leading to an improved picture of the supramolecular organization of photosynthetic complexes, as well as the forces that drive their segregation into distinct domains.

  3. A first generation integrated physical and genetic map of the rainbow trout genome

    Science.gov (United States)

    The rainbow trout physical map was previously constructed from DNA fingerprinting of 192,096 BAC clones using the 4-color high-information content fingerprinting (HICF) method. The clones were assembled into physical map contigs using the finger-printing contig (FPC) program. The map is composed of ...

  4. Integration of transcriptomic and proteomic data from a single wheat cultivar provides new tools for understanding the roles of individual alpha gliadin proteins in flour quality and celiac disease

    Science.gov (United States)

    One-hundred-thirty-six expressed sequence tags (ESTs) encoding alpha gliadins from Triticum aestivum cv Butte 86 were identified in public databases and assembled into 19 contigs. Consensus sequences for 12 of the contigs encoded complete alpha gliadin proteins, but only two were identical to protei...

  5. Short interspersed element (SINE) depletion and long interspersed element (LINE) abundance are not features universally required for imprinting.

    Science.gov (United States)

    Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner

    2011-04-20

    Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.

  6. Short interspersed element (SINE depletion and long interspersed element (LINE abundance are not features universally required for imprinting.

    Directory of Open Access Journals (Sweden)

    Michael Cowley

    2011-04-01

    Full Text Available Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.

  7. Identifying wrong assemblies in de novo short read primary

    Indian Academy of Sciences (India)

    Finally, some mis-assembly detecting tools have been evaluated for their ability to detect the wrongly assembledprimary contigs, suggesting a lot of scope for improvement in this area. The present work also proposes a simpleunsupervised learning-based novel approach to identify mis-assemblies in the contigs which was ...

  8. Data set for diet specific differential gene expression analysis in three Spodoptera moths

    Directory of Open Access Journals (Sweden)

    A. Roy

    2016-09-01

    Full Text Available Examination of closely related species pairs is suggested for evolutionary comparisons of different degrees of polyphagy, which we did here with three taxa of lepidopteran herbivores, Spodoptera spp (S. littoralis, S. frugiperda maize (C and rice (R strains for a RNAseq analysis of the midguts from the 3rd instar insect larvae for differential metabolic responses after feeding on pinto bean based artificial diet vs maize leaves. Paired-end (2×100 bp Illumina HiSeq2500 sequencing resulted in a total of 24, 23, 24, and 21 million reads for the SF-C-Maize, SF-C-Pinto, SF-R-Maize, SF-R Pinto, and a total of 35 and 36 million reads for the SL-Maize and SL-Pinto samples, respectively. After quality control measures, a total of 62.2 million reads from SL and 71.7 million reads from SF were used for transcriptome assembly (TA. The resulting final de novo reference TA (backbone for the SF taxa contained 37,985 contigs with a N50 contig size of 1030 bp and a maximum contig length of 17,093 bp, while for SL, 28,329 contigs were generated with a N50 contig size of 1980 bp and a maximum contig length of 18,267 bp. The data presented herein contains supporting information related to our research article Roy et al. (2016 http://dx.doi.org/10.1016/j.ibmb.2016.02.006 [1]. Keywords: Differential expression analysis (DGE, Transcriptomics, Spodoptera, Adaptation, Generalist, Specialist, RPKM (reads per kilo base of transcript per million mapped reads, RNA seq

  9. Dicty_cDB: SSL472 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSL472 (Link to dictyBase) - - - Contig-U14592-1 SSL472F (Link... to Original site) SSL472F 185 - - - - - - Show SSL472 Library SS (Link to library) Clone ID SSL472 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U14592-1 Original site URL http://dict...group) genomic DNA, chromosome 6, PAC clone:P0036F10, WORKING DRAFT SEQUENCE, 1 ordered pieces. 44 0.59 1 AC114263 |AC114263.2 Dict...library Plasmodium falciparum 3D7 cDNA 5' similar to TR:O96129 O96129 PREDICTED MEMBRANE ASSOCIATED PROTEIN.

  10. Dicty_cDB: SSH414 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSH414 (Link to dictyBase) - - - Contig-U16581-1 SSH414Z (Link... to Original site) - - SSH414Z 448 - - - - Show SSH414 Library SS (Link to library) Clone ID SSH414 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dict...ogy vs DNA Score E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium... SAMPLING. 44 1.7 1 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ictalurus punctatus cDNA 5' similar to Ict

  11. Dicty_cDB: VHE867 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHE867 (Link to dictyBase) - G02701 DDB0204977 Contig-U14789-1... - (Link to Original site) VHE867F 383 - - - - - - Show VHE867 Library VH (Link to library) Clone ID VHE867 (Link to dict...yBase) Atlas ID - NBRP ID G02701 dictyBase ID DDB0204977 Link to Contig Contig-U14789-1 Original site URL http://dict...one) Salmo salar clone ssal-rgb2-599-16... 116 2e-25 DQ363469_1( DQ363469 |pid:none) Ict...m_ : 1.00 40.0 %: cytoplasmic 36.0 %: nuclear 12.0 %: cytoskeletal 8.0 %: mitochondrial 4.0 %: peroxisomal >> predict

  12. Dicty_cDB: SSD571 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSD571 (Link to dictyBase) - - - Contig-U16581-1 SSD571Z (Link... to Original site) - - SSD571Z 415 - - - - Show SSD571 Library SS (Link to library) Clone ID SSD571 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16581-1 Original site URL http://dict...omology vs DNA Score E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dict...Brassica oleracea genomic clone BONRK12, DNA sequence. 50 0.025 1 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ict

  13. Dicty_cDB: VHO349 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available VH (Link to library) VHO349 (Link to dictyBase) - - - Contig-U11939-1 - (Link to Or...iginal site) - - VHO349Z 412 - - - - Show VHO349 Library VH (Link to library) Clone ID VHO349 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U11939-1 Original site URL http://dictycdb.b...4 |CK424794.1 AUF_IpSto_12_h11 Stomach cDNA library Ictalurus punctatus cDNA 5', mRNA sequence. 42 6.1 1 dna...ial 8.0 %: Golgi 4.0 %: vesicles of secretory system >> prediction for VHO349 is

  14. Dicty_cDB: CFH521 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFH521 (Link to dictyBase) - - - Contig-U16381-1 - (Link to Or...iginal site) CFH521F 134 - - - - - - Show CFH521 Library CF (Link to library) Clone ID CFH521 (Link to dicty...Base) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16381-1 Original site URL http://dictycdb.b...mology vs DNA Score E Sequences producing significant alignments: (bits) Value N AC115685 |AC115685.1 Dict.... 82 2e-29 3 X51892 |X51892.1 Dictyostelium discoideum SP60 gene for spore coat protein. 82 4e-29 2 X52105 |X52105.1 Dict

  15. Dicty_cDB: FC-IC0102 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available FC-IC (Link to library) FC-IC0102 (Link to dictyBase) - - - Contig-U16527-1 FC-IC01...02F (Link to Original site) FC-IC0102F 434 - - - - - - Show FC-IC0102 Library FC-IC (Link to library) Clone ...ID FC-IC0102 (Link to dictyBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16527-1 Original site URL http://dict... (bits) Value N AB088483 |AB088483.1 Dictyostelium discoideum gene for gamete and mating-type specific prote...oducing significant alignments: (bits) Value AB088483_1( AB088483 |pid:none) Dictyostelium discoideum gmsA g

  16. Dicty_cDB: SSK552 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSK552 (Link to dictyBase) - - - Contig-U04708-1 SSK552Z (Link... to Original site) - - SSK552Z 713 - - - - Show SSK552 Library SS (Link to library) Clone ID SSK552 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U04708-1 Original site URL http://dict...equence. 40 0.26 5 BM029272 |BM029272.1 IpSkn00291 Skin cDNA library Ictalurus pu...0 m3a: 0.00 m3b: 0.00 m_ : 1.00 76.0 %: nuclear 20.0 %: cytoplasmic 4.0 %: plasma membrane >> prediction for

  17. Dicty_cDB: SSG335 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSG335 (Link to dictyBase) - - - Contig-U16509-1 SSG335F (Link... to Original site) SSG335F 200 - - - - - - Show SSG335 Library SS (Link to library) Clone ID SSG335 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16509-1 Original site URL http://dict...y vs DNA Score E Sequences producing significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium d...iscoideum mRNA. 64 1e-17 2 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Icta

  18. Dicty_cDB: CFE801 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available CF (Link to library) CFE801 (Link to dictyBase) - - - Contig-U12894-1 CFE801Z (Link... to Original site) - - CFE801Z 690 - - - - Show CFE801 Library CF (Link to library) Clone ID CFE801 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U12894-1 Original site URL http://dict...6e1 5', mRNA sequence. 36 5.4 2 CK426247 |CK426247.1 AUF_IpTes_24_l09 Testis cDNA library Ict...A26838 )prestalk protein precursor - slime mold (Dictyoste... 66 9e-10 AC117072_6

  19. Dicty_cDB: AFJ817 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available AF (Link to library) AFJ817 (Link to dictyBase) - - - Contig-U15574-1 AFJ817F (Link... to Original site) AFJ817F 172 - - - - - - Show AFJ817 Library AF (Link to library) Clone ID AFJ817 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U15574-1 Original site URL http://dict...alue N AC116920 |AC116920.2 Dictyostelium discoideum chromosome 2 map 3879572-4071762 strain AX4, complete s...es. 42 1.1 2 BE213059 |BE213059.1 IpBrn01690 Brain cDNA library Ictalurus punctatus cDNA 5', mRNA sequence.

  20. Dicty_cDB: SSG316 [Dicty_cDB

    Lifescience Database Archive (English)

    Full Text Available SS (Link to library) SSG316 (Link to dictyBase) - - - Contig-U16509-1 SSG316F (Link... to Original site) SSG316F 231 - - - - - - Show SSG316 Library SS (Link to library) Clone ID SSG316 (Link to dict...yBase) Atlas ID - NBRP ID - dictyBase ID - Link to Contig Contig-U16509-1 Original site URL http://dict...g significant alignments: (bits) Value N D16417 |D16417.1 Dictyostelium discoideu...m mRNA. 64 3e-12 2 BM028890 |BM028890.1 IpSkn01670 Skin cDNA library Ictalurus punctatus cDNA 5' similar to Ict