Sample records for gene partial sequence

  1. Sequence Comparison of Partial Cytochrome b Genes of Two Coilia species

    Institute of Scientific and Technical Information of China (English)

    LIU Jinxian; GAO Tianxiang; WANG Yujiang; ZHANG Yaping


    Sequence variation of partial cytochrome b genes between two Coilia species, C. ectenes and C. mystus, was investigated. Of the 402 nucleotides, twenty-seven (6.72%) are polymorphic and all are synonymous substitutions. At the third positions of genetic condon of cytochrome b gene, the two species show an extreme anti-G bias (< 4 % ) and a pronounced bias towards A and C (>68%). There is no amino acid sequence divergence between the partial cytochrome b genes of the two species, indicating a close genetic relationship between them. The k-2p genetic distance of partial cytochrome b segment of the two species is 0.072, suggesting that the species were separated 3.6 Ma ago, in the middle Pliocene. Our result reveals that the cytochrome b gene is an appropriate marker for studies of population genetic structures and phylogeographic patterns of the two species.

  2. Sequencing and phylogenetic analysis of partial CXCR2 gene of Murrah buffalo

    Directory of Open Access Journals (Sweden)

    S. A. Wani


    Full Text Available Aim: Present study was carried out to sequence and phylogenetic analysis of CXCR2 gene of Murrah buffalo. Materials and Methods: For the present investigation, from a group of forty eight Murrah buffaloes (Bubalus bubalis, blood samples were collected randomly from eight animals, out of which four were healthy and four were mastitic. Results: The amplification of Interleukin-8B (IL-8B receptor gene target sequence was carried out using the primer pair in an optimized polymerase chain reaction. Partial sequencing of IL-8B receptor gene of Bubalus bubalis (Murrah has been done successfully. The sequences of IL-8B receptor gene showed 99% homology to that of Bos indicus × Bos taurus, 98% to that of Bos taurus, 97% to that of Ovis aries, 93% to that of Sus scrofa, 92% to that of Equus caballus and 90% to that of Felis catus. Conclusion: From the present study it can be concluded that the PCR amplification procedure for target region of IL-8B receptor gene yielding 459 bp products has been standardized, which yielded consistent and specific amplification. Amplification of partial IL-8B receptor gene (exon 2- 459 bp using self designed primers specific for cattle ortholog sequence signifies that the locus is conserved in cattle and buffaloes. In phylogenetic tree, the target sequence of IL-8B receptor gene of Bubalus bubalis were found to be more closely related to Bos indicus × Bos Taurus and Bos taurus than to Ovis aries and Sus scrofa.

  3. Structural features of conopeptide genes inferred from partial sequences of the Conus tribblei genome. (United States)

    Barghi, Neda; Concepcion, Gisela P; Olivera, Baldomero M; Lluisma, Arturo O


    The evolvability of venom components (in particular, the gene-encoded peptide toxins) in venomous species serves as an adaptive strategy allowing them to target new prey types or respond to changes in the prey field. The structure, organization, and expression of the venom peptide genes may provide insights into the molecular mechanisms that drive the evolution of such genes. Conus is a particularly interesting group given the high chemical diversity of their venom peptides, and the rapid evolution of the conopeptide-encoding genes. Conus genomes, however, are large and characterized by a high proportion of repetitive sequences. As a result, the structure and organization of conopeptide genes have remained poorly known. In this study, a survey of the genome of Conus tribblei was undertaken to address this gap. A partial assembly of C. tribblei genome was generated; the assembly, though consisting of a large number of fragments, accounted for 2160.5 Mb of sequence. A large number of repetitive genomic elements consisting of 642.6 Mb of retrotransposable elements, simple repeats, and novel interspersed repeats were observed. We characterized the structural organization and distribution of conotoxin genes in the genome. A significant number of conopeptide genes (estimated to be between 148 and 193) belonging to different superfamilies with complete or nearly complete exon regions were observed, ~60 % of which were expressed. The unexpressed conopeptide genes represent hidden but significant conotoxin diversity. The conotoxin genes also differed in the frequency and length of the introns. The interruption of exons by long introns in the conopeptide genes and the presence of repeats in the introns may indicate the importance of introns in facilitating recombination, evolution and diversification of conotoxins. These findings advance our understanding of the structural framework that promotes the gene-level molecular evolution of venom peptides.

  4. [Clonning and sequencing of partial gene of hepatitis G virus (HGV) from Nanjing of China]. (United States)

    Xu, J; Zhou, Y; Xu, J; Ju, J; Wang, X; Chen, W; Wang, H


    Partial gene of hepatitis G virus (HGV) was cloned and sequenced from the serum of a patient with post-transfusion hepatitis C in Nanjing of China by reverse transcription-polymerase chain reaction (RT-PCR). The sequence showed 89.09%, 92.12%, 87.27%, 93.94% nucleotide identity over the corresponding region of HGU44402, HGU45966, HGU36380 in America and HGV isolate in Hebei Province of China respectively. Forty patients with post-transfusion hepatitis C and thirty patients with hepatitis non A-E were detected for HGV RNA by RT-PCR, the HGV RNA positive rate were 10.00% and 6.67% respectively.

  5. discussion on validity of rana maoershanensis based on partial sequence of 16s rrna gene

    Institute of Scientific and Technical Information of China (English)


    rana maoershanensis found in mt.maoershan in guangxi,china was reported as a new species in 2007,but there was no molecular data for this frog.the partial sequences (543 bp) of 16s rrna gene from 12 specimens of 3 brown frog species (rana hanluica,r.maoershanensis and r.chensinensis) were analyzed with 17 specimens of 9 species from genbank.the nucleotide sequence divergence between r.maoershanensis and the other brown frog species were 4.5%-6.5%,with 22-30 nucleotide substitutions at this locus.the phylogenetic relationships based on mp,ml,and bayesian inference indicate that the brown frogs from southern china were diverged into three groups (clades a,b and c).r.maoershanensis was clustered together a well-supported subclade (b-l).it is suggested that r.maoershanensis is a valid species.

  6. Molecular epidemiology of rabies in Colombia 1994-2005 based on partial nucleoprotein gene sequences. (United States)

    Páez, Andrés; Velasco-Villa, Andrés; Rey, Gloria; Rupprecht, Charles E


    One hundred and twenty-four rabies viruses (RABV) were isolated from humans and eight species of mammals in Colombia during 1994-2005. To determine the genetic and reservoir-associated diversity cDNA fragments encoding 88 amino acids at the carboxyl terminus of the nucleoprotein were sequenced and used in phylogenetic analyses. Eight genetic lineages (GL) were characterized. GL1, GL2 and GL3 consisted of dog-associated antigenic variant (AV) 1 RABV, isolated in the centre-east, north and southwest of Colombia, respectively. GL1 is apparently extinct in Colombia. The GL4 were AV3, AV8 and non-determined (ND) AV viruses associated with hematophagous bats. The GL5 and GL6 consisted of AV4 viruses. GL6 isolate was found associated with Tadarida brasiliensis bats. GL5 segregated independently. The GL7 and GL8 segregated independently within clades associated with colonial insectivorous and solitary bats, respectively. Both of these were represented by NDAV viruses. Viruses isolated from humans grouped within GL2, GL3 and GL4, which in turn corresponded to AV1, 3, 8 and ND. Dogs and D. rotundus are the two major rabies reservoirs and vectors in Colombia. Insectivorous bats may also be important rabies reservoirs but spillovers to other species are rare. Our data were consistent with previous studies in which partial Psi, G and L gene sequences were analyzed. Our results confirmed the existence of RABV of unclassified AV in Colombia.

  7. Chromosomal localization and partial sequencing of the 18S and 28S ribosomal genes from Bradysia hygida (Diptera: Sciaridae). (United States)

    Gaspar, V P; Shimauti, E L T; Fernandez, M A


    In insects, ribosomal genes are usually detected in sex chromosomes, but have also or only been detected in autosomal chromosomes in some cases. Previous results from our research group indicated that in Bradysia hygida, nucleolus organizer regions were associated with heterochromatic regions of the autosomal C chromosome, using the silver impregnation technique. The present study confirmed this location of the ribosomal genes using fluorescence in situ hybridization analysis. This analysis also revealed the partial sequences of the 18S and 28S genes for this sciarid. The sequence alignment showed that the 18S gene has 98% identity to Corydalus armatus and 91% identity to Drosophila persimilis and Drosophila melanogaster. The partial sequence analysis of the 28S gene showed 95% identity with Bradysia amoena and 93% identity with Schwenckfeldina sp. These results confirmed the location of ribosomal genes of B. hygida in an autosomal chromosome, and the partial sequence analysis of the 18S and 28S genes demonstrated a high percentage of identity among several insect ribosomal genes.

  8. Partial Sequence Analysis of Merozoite Surface Proteine-3α Gene in Plasmodium vivax Isolates from Malarious Areas of Iran

    Directory of Open Access Journals (Sweden)

    H Mirhendi


    Full Text Available Background: Approximately 85-90% of malaria infections in Iran are attributed to Plasmodium vivax, while little is known about the genetic of the parasite and its strain types in this region. This study was designed and performed for describing genetic characteristics of Plasmodium vivax population of Iran based on the merozoite surface protein-3α gene sequence. Methods: Through a descriptive study we analyzed partial P. vivax merozoite surface protein-3α gene sequences from 17 clinical P. vivax isolates collected from malarious areas of Iran. Genomic DNA was extracted by Q1Aamp® DNA blood mini kit, amplified through nested PCR for a partial nucleotide sequence of PvMSP-3 gene in P. vivax. PCR-amplified products were sequenced with an ABI Prism Perkin-Elmer 310 sequencer machine and the data were analyzed with clustal W software. Results: Analysis of PvMSP-3 gene sequences demonstrated extensive polymorphisms, but the sequence identity between isolates of same types was relatively high. We identified specific insertions and deletions for the types A, B and C variants of P. vivax in our isolates. In phylogenetic comparison of geographically separated isolates, there was not a significant geo­graphical branching of the parasite populations. Conclusion: The highly polymorphic nature of isolates suggests that more investigations of the PvMSP-3 gene are needed to explore its vaccine potential.

  9. Globicatella sanguinis bacteraemia identified by partial 16S rRNA gene sequencing

    DEFF Research Database (Denmark)

    Abdul-Redha, Rawaa Jalil; Balslew, Ulla; Christensen, Jens Jørgen;


    Globicatella sanguinis is a gram-positive coccus, resembling non-haemolytic streptococci. The organism has been isolated infrequently from normally sterile sites of humans. Three isolates obtained by blood culture could not be identified by Rapid 32 ID Strep, but partial sequencing of the 16S r...

  10. Molecular cloning and analysis of the partial sequence of Rhinopithecus roxellanae growth hormone gene

    Institute of Scientific and Technical Information of China (English)

    徐来祥; 孔繁华; 华育平


    Growth hormone gene (GH) of Rhinopithecus roxellanae was amplified by PCR based on the sequences of the reported mammalian growth hormone gene for the first time. The amplified fragment was about 1.8 kb. It was cloned and its upper stream was sequenced. This sequencing region consists of a 5¢ flanking regulatory region, exon I and part of exon II, intron I of growth hormone gene. Comparing the corresponding sequences of growth hormone gene between Rhinopithecus roxellanae and the porcine, we concluded that the homology reached 81% in the region, and there was high conservation in the 5¢ flanking sequence. The kinds of amino acids of exon I and exon II for about 90% were the same to those in pig. Many mutations occurred in the degenerate site of the triplet code. In the nucleotides of intron I, there were only 72% homologies with those in pig. It means that introns and 3¢ flanking sequence maybe play an important part in growth hormone gene regulation of the different animals.

  11. Genetic Analysis Using Partial Sequencing of Melanocortin 4 Receptor (MC4R Gene in Bligon Goat

    Directory of Open Access Journals (Sweden)

    Latifah Latifah


    Full Text Available Melanocortin 4 Receptor gene is involved in sympathetic nerve activity, adrenal and thyroid functions, and media for leptin in regulating energy balance and homeostasis. The aim of this research was to perform genetic analysis of MC4R gene sequences from Bligon goats. Fourty blood samples of Bligon does were used for DNA extraction. The primers were designed after alignment of 12 DNA sequences of MC4R gene from goat, sheep, and cattle. The primers were constructed on the Capra hircus MC4R gene sequence from GenBank (accession No. NM_001285591. Two DNA polymorphisms of MC4R were revealed in exon region (g.998 A/G and g.1079 C/T. The SNP g.998 A/G was a non-synonymous polymorphism i.e., changing of amino acid from methionine (Met to isoleucine (Ile. The SNP g.1079 C/T was a synonymous polymorphism. Restriction enzyme mapping on Bligon goat MC4R gene revealed three restriction enzymes (RsaI (GT’AC, Acc651 (G’GTAC_C, and KpnI (G_GTAC’C, which can recognize the SNP at g.1079 C/T. The restriction enzymes may be used for genotyping of the gene target using PCR-RFLP method in the future research.

  12. 16S rRNA partial gene sequencing for the differentiation and molecular subtyping of Listeria species. (United States)

    Hellberg, Rosalee S; Martin, Keely G; Keys, Ashley L; Haney, Christopher J; Shen, Yuelian; Smiley, R Derike


    Use of 16S rRNA partial gene sequencing within the regulatory workflow could greatly reduce the time and labor needed for confirmation and subtyping of Listeria monocytogenes. The goal of this study was to build a 16S rRNA partial gene reference library for Listeria spp. and investigate the potential for 16S rRNA molecular subtyping. A total of 86 isolates of Listeria representing L. innocua, L. seeligeri, L. welshimeri, and L. monocytogenes were obtained for use in building the custom library. Seven non-Listeria species and three additional strains of Listeria were obtained for use in exclusivity and food spiking tests. Isolates were sequenced for the partial 16S rRNA gene using the MicroSeq ID 500 Bacterial Identification Kit (Applied Biosystems). High-quality sequences were obtained for 84 of the custom library isolates and 23 unique 16S sequence types were discovered for use in molecular subtyping. All of the exclusivity strains were negative for Listeria and the three Listeria strains used in food spiking were consistently recovered and correctly identified at the species level. The spiking results also allowed for differentiation beyond the species level, as 87% of replicates for one strain and 100% of replicates for the other two strains consistently matched the same 16S type.

  13. Sequence analysis of a few species of termites (Order: Isoptera) on the basis of partial characterization of COII gene. (United States)

    Sobti, Ranbir Chander; Kumari, Mamtesh; Sharma, Vijay Lakshmi; Sodhi, Monika; Mukesh, Manishi; Shouche, Yogesh


    The present study was aimed to get the nucleotide sequences of a part of COII mitochondrial gene amplified from individuals of five species of Termites (Isoptera: Termitidae: Macrotermitinae). Four of them belonged to the genus Odontotermes (O. obesus, O. horni, O. bhagwatii and Odontotermes sp.) and one to Microtermes (M. obesi). Partial COII gene fragments were amplified by using specific primers. The sequences so obtained were characterized to calculate the frequencies of each nucleotide bases and a high A + T content was observed. The interspecific pairwise sequence divergence in Odontotermes species ranged from 6.5% to 17.1% across COII fragment. M. obesi sequence diversity ranged from 2.5 with Odontotermes sp. to 19.0% with O. bhagwatii. Phylogenetic trees drawn on the basis of distance neighbour-joining method revealed three main clades clustering all the individuals according to their genera and families.

  14. Dissection of two soybean QTL conferring partial resistance to Phytophthora sojae through sequence and gene expression analysis

    Directory of Open Access Journals (Sweden)

    Wang Hehe


    Full Text Available Abstract Background Phytophthora sojae is the primary pathogen of soybeans that are grown on poorly drained soils. Race-specific resistance to P. sojae in soybean is gene-for-gene, although in many areas of the US and worldwide there are populations that have adapted to the most commonly deployed resistance to P. sojae ( Rps genes. Hence, this system has received increased attention towards identifying mechanisms and molecular markers associated with partial resistance to this pathogen. Several quantitative trait loci (QTL have been identified in the soybean cultivar ‘Conrad’ that contributes to the expression of partial resistance to multiple P. sojae isolates. Results In this study, two of the Conrad QTL on chromosome 19 were dissected through sequence and expression analysis of genes in both resistant (Conrad and susceptible (‘Sloan’ genotypes. There were 1025 single nucleotide polymorphisms (SNPs in 87 of 153 genes sequenced from Conrad and Sloan. There were 304 SNPs in 54 genes sequenced from Conrad compared to those from both Sloan and Williams 82, of which 11 genes had SNPs unique to Conrad. Eleven of 19 genes in these regions analyzed with qRT-PCR had significant differences in fold change of transcript abundance in response to infection with P. sojae in lines with QTL haplotype from the resistant parent compared to those with the susceptible parent haplotype. From these, 8 of the 11 genes had SNPs in the upstream, untranslated region, exon, intron, and/or downstream region. These 11 candidate genes encode proteins potentially involved in signal transduction, hormone-mediated pathways, plant cell structural modification, ubiquitination, and basal resistance. Conclusions These findings may indicate a complex defense network with multiple mechanisms underlying these two soybean QTL conferring resistance to P. sojae. SNP markers derived from these candidate genes can contribute to fine mapping of QTL and marker assisted breeding for

  15. Molecular phylogenetics of cupped oysters based on partial 28S rRNA gene sequences. (United States)

    Littlewood, D T


    Partial sequences of 28S-like rDNA were amplified using PCR and sequenced for eight species of oyster and one species of mussel. Phylogenetic relationships among seven species of Crassostreinid oyster were inferred from aligned sequences by parsimony and maximum-likelihood methods. Of the 315 sites that varied, 90 were phylogenetically informative in parsimony analysis. Inference by maximum parsimony (MP) is consistent with maximum-likelihood (ML) analysis for the major lineages, yielding a tree with the topology (Mytilus edulis (Ostrea edulis ((Crassostrea rivularis (C. belcheri, C. gigas))(C. virginica, C. rhizophorae, Saccostrea cuccullata, S. commercialis)))). MP and ML analyses resolved the systematic relationships of the Saccostrea and Atlantic Crassostrea differently such that a polytomy linking these four taxa is preferred with the data available. Molecular data support a later divergence of the tropical Pacific Saccostrea from a common ancestor of the Atlantic Crassostrea species. Molecular data from domains D1, D2, and partial D3 of the 28S rDNA supply sufficient phylogenetic information to determine systematic relationships among the extant oyster taxa, from the major species groups to the family level, thus providing valuable characters that are able to supplement the paucity of morphological characters so far recognized.

  16. Cloning, sequencing and identification of single nucleotide polymorphisms of partial sequence on the porcine CACNA1S gene

    Institute of Scientific and Technical Information of China (English)

    FANG XiaoMin; XU NingYing; REN ShouWen


    CACNA1S gene encodes the α1 subunit of the calcium channel. The mutation of CACNA1S gene can cause hypokalemic periodic paralysis (HypoKPP) and maliglant hyperthermla synarome (MHS) in human beings. Current research on CACNA1S was mainly in human being and model animal, but rarely in livestock and poultry. In this study, Yorkshire pigs (23), Pietrain pigs (30), Jinhua pigs (115) and the second generation (126) of crossbred of Jinhua and Pietrein were used. Primers were designed according to the sequence of human CACNA1S gene and PCR was carried out using pig genome DNA.PCR products were sequenced and compared with that of human, and then single nucleotide polymorphisms (SNPs) were investigated by PCR-SSCP, while PCR-RFLP tests were performed to validate the mutations. Results indicated: (1) the 5211 bp DNA fragments of porcine CACNA1S gene were acquired (GenBank accession number: DQ767693 ) and the identity of the exon region was 82.6% between human and pig; (2) fifty-seven mutations were found within the cloned sequences, among which 24 were in exon region; (3) the results of PCR-RFLP were in accordance with that of PCR-SSCP. According to the EST of porcine CACNA1S gene published in GenBank (Bx914582, Bx666997), 8 of the 11 SNPs identified in the present study were consistent with the base difference between two EST fragments.

  17. Partial Sequencing of 16S rRNA Gene of Selected Staphylococcus aureus Isolates and its Antibiotic Resistance

    Directory of Open Access Journals (Sweden)

    Harsi Dewantari Kusumaningrum


    Full Text Available The choice of primer used in 16S rRNA sequencing for identification of Staphylococcus species found in food is important. This study aimed to characterize Staphylococcus aureus isolates by partial sequencing based on 16S rRNA gene employing primers 16sF, 63F or 1387R. The isolates were isolated from milk, egg dishes and chicken dishes and selected based on the presence of sea gene that responsible for formation of enterotoxin-A. Antibiotic susceptibility of the isolates towards six antibiotics was also tested. The use of 16sF resulted generally in higher identity percentage and query coverage compared to the sequencing by 63F or 1387R. BLAST results of all isolates, sequenced by 16sF, showed 99% homology to complete genome of four S. aureus strains, with different characteristics on enterotoxin production and antibiotic resistance. Considering that all isolates were carrying sea gene, indicated by the occurence of 120 bp amplicon after PCR amplification using primer SEA1/SEA2,  the isolates were most in agreeing to S. aureus subsp. aureus ST288. This study indicated that 4 out of 8 selected isolates were resistant towards streptomycin. The 16S rRNA gene sequencing using 16sF is useful for identification of S. aureus. However, additional analysis such as PCR employing specific gene target, should give a valuable supplementary information, when specific characteristic is expected.

  18. Cloning, sequencing and identification of single nu-cleotide polymorphisms of partial sequence on the porcine CACNA1S gene

    Institute of Scientific and Technical Information of China (English)


    CACNA1S gene encodes the α1 subunit of the calcium channel. The mutation of CACNA1S gene can cause hypokalemic periodic paralysis (HypoKPP) and maliglant hyperthermia synarome (MHS) in hu-man beings. Current research on CACNA1S was mainly in human being and model animal, but rarely in livestock and poultry. In this study, Yorkshire pigs (23), Pietrain pigs (30), Jinhua pigs (115) and the second generation (126) of crossbred of Jinhua and Pietrain were used. Primers were designed ac-cording to the sequence of human CACNA1S gene and PCR was carried out using pig genome DNA. PCR products were sequenced and compared with that of human, and then single nucleotide poly-morphisms (SNPs) were investigated by PCR-SSCP, while PCR-RFLP tests were performed to validate the mutations. Results indicated: (1) the 5211 bp DNA fragments of porcine CACNA1S gene were ac-quired (GenBank accession number: DQ767693 ) and the identity of the exon region was 82.6% be-tween human and pig; (2) fifty-seven mutations were found within the cloned sequences, among which 24 were in exon region; (3) the results of PCR-RFLP were in accordance with that of PCR-SSCP. Ac-cording to the EST of porcine CACNA1S gene published in GenBank (Bx914582, Bx666997), 8 of the 11 SNPs identified in the present study were consistent with the base difference between two EST frag-ments.

  19. Molecular and phylogenetic analysis of the partial tams1 gene sequence of a vaccine strain of Theileria annulata

    Directory of Open Access Journals (Sweden)

    Majid Esmaelizad


    Full Text Available The polypeptide Tams1 is an immunodominant major merozoite piroplasm surface antigen of the protozoan parasite Theileria annulata and is highly variable. In this study, the partial nucleotide (nt sequence of the Tams1 (522 nt gene of Iranian vaccine strain (Vaccine-ir68 recovered from an outbreak of disease in Iran was determined and compared with the corresponding sequences of eighteen previously published Tams1 genes. According to sequencing result, a novel amino acid substitution at the Tams1 region (K→Q was found exclusively in isolate Vaccine-ir68. Genetic distance values, estimated from the sequence data, revealed striking sequence homology (approximately 99% between Vaccine-ir68 isolate and Tunisian isolates, showing that they were same isolates of T. annulata which were spread in these areas. The phylogenetic tree constructed based on the sequence alignment of 19 Tams1 coding regions was distinctly divided into five lineages. There might be some unknown tick carrier birds immigrating to the different geographical regions. These birds have an effective role to distribute the T. annulata species in North Africa, Palestine and Iran.

  20. Identification and Partial Sequence of a PLD -like Gene Encoding for Phospholipase D in Peanut

    Institute of Scientific and Technical Information of China (English)


    Preharvest aflatoxin contamination has been identified by thepeanut industry as the most serious challenge facing the industry worldwide. Drought stress is the most important environmental factor exacerbat ing Aspergillus inection and aflatoxin contamination in peanut. Development of resistant peanut cultivars would represent a major advance for the U.S. Peanut industry. In this study, we identify a novel PLD - like gene, encoding a putative phospholipase D, a main enzyme responsible for the drought - induced degradation of membrane phospholipids in plants. This cloned PLDI fragment has 1069 bp nucleic acids and the deduced amino acid sequence shows high identity with known PLD genes, having similar conserved features, such as two HXKXXXXD motifs. Further study is needed to genetically and physiologically characterize the PLD in peanut and to gain a better understanding of its function and relationship with drought - tolerance.%花生工业界认为收获前花生黄曲霉毒素的污染是全世界花生工业界面临的最严峻的挑战.干旱胁迫是加重花生黄曲霉真菌侵染和毒素污染最重要的环境因素.选育花生抗性品种将使美国花生工业处于优势地位.在这一研究报告中,我们鉴定出了一个新的类PLD基因,它编码磷脂酶D.在植物体中,这个酶是负责干旱诱导降解细胞膜磷脂的主要酶.克隆的PLDI片段有1069个核甘酸对长.推导的氨基酸序列与已知的PLD基因有很高的同一性,包括相似的保守序列特征,比如两个HXKXXXXD基元.对花生PLD基因特性需要从遗传和生理上作进一步研究,以便更好地理解这个基因的功能及其与花生耐干旱性的关系.

  1. Identification of clinically relevant nonhemolytic Streptococci on the basis of sequence analysis of 16S-23S intergenic spacer region and partial gdh gene

    DEFF Research Database (Denmark)

    Nielsen, Xiaohui Chen; Justesen, Ulrik Stenz; Dargis, Rimtas;


    Nonhemolytic streptococci (NHS) cause serious infections, such as endocarditis and septicemia. Many conventional phenotypic methods are insufficient for the identification of bacteria in this group to the species level. Genetic analysis has revealed that single-gene analysis is insufficient...... for the identification of all species in this group of bacteria. The aim of the present study was to establish a method based on sequence analysis of the 16S-23S intergenic spacer (ITS) region and the partial gdh gene to identify clinical relevant NHS to the species level. Sequence analysis of the ITS region....... A phylogenetic algorithm based on the analysis of partial gdh gene sequences revealed three distinct clusters. We suggest that sequence analysis of the combination of the ITS region and the partial gdh gene can be used in the reference laboratory for the species-level identification of NHS....

  2. Limited gene flow and partial isolation phylogeography of Himalayan snowcock Tetraogallus himalayensis based on part mitochondrial D-loop sequences

    Institute of Scientific and Technical Information of China (English)

    Xiaoli WANG; Jiangyong QU; Naifa LIU; Xinkang BAO; Sen SONG


    Himalayan snowcock Tetraogallus himalayensis are distributed in alpine and subalpine areas in China.We used mitochondrial DNA control-region data to investigate the origin and past demographic change in sixty-seven Himalayan snowcock T.himalayensis.The fragments of 1155 nucleotides from the control region of mitochondrial DNA were sequenced,and 57 polymorphic positions defined 37 haplotypes.A high level of genetic diversity was detected in all populations sampled and may be associated isolation of the mountains and habitat fragmentation and deterioration from Quaternary glaciations.In the phylogenetic tree,all haplotypes grouped into four groups:clade A (Kunlun Mountains clade),clade B (Northern Qinghai-Tibetan Plateau clade),clade C (Tianshan Mountains clade) and clade D (Kalakunlun Mountains clade).We found a low level of gene flow and significant genetic differentiation among all populations.Based on divergence time we suggest that the divergence of Himalayan snowcock occurred in the middle Pleistocene inter-glaciation,and expansion occurred in the glaciation.Analysis of mtDNA D-loop sequences confirmed demographic population expansion,as did our non-significant mismatch distribution analysis.In conclusion,limited gene flow and a pattern of partial isolation phylogeographic was found in geographic populations of T.himalayansis based on the analysis on mtDNA D-loop sequences [Current Zoology 57 (6):758-767,2011 ].

  3. Sequence diversity patterns suggesting balancing selection in partially sex-linked genes of the plant Silene latifolia are not generated by demographic history or gene flow. (United States)

    Guirao-Rico, Sara; Sánchez-Gracia, Alejandro; Charlesworth, Deborah


    DNA sequence diversity in genes in the partially sex-linked pseudoautosomal region (PAR) of the sex chromosomes of the plant Silene latifolia is higher than expected from within-species diversity of other genes. This could be the footprint of sexually antagonistic (SA) alleles that are maintained by balancing selection in a PAR gene (or genes) and affect polymorphism in linked genome regions. SA selection is predicted to occur during sex chromosome evolution, but it is important to test whether the unexpectedly high sequence polymorphism could be explained without it, purely by the combined effects of partial linkage with the sex-determining region and the population's demographic history, including possible introgression from Silene dioica. To test this, we applied approximate Bayesian computation-based model choice to autosomal sequence diversity data, to find the most plausible scenario for the recent history of S. latifolia and then to estimate the posterior density of the most relevant parameters. We then used these densities to simulate variation to be expected at PAR genes. We conclude that an excess of variants at high frequencies at PAR genes should arise in S. latifolia populations only for genes with strong associations with fully sex-linked genes, which requires closer linkage with the fully sex-linked region than that estimated for the PAR genes where apparent deviations from neutrality were observed. These results support the need to invoke selection to explain the S. latifolia PAR gene diversity, and encourage further work to test the possibility of balancing selection due to sexual antagonism. © 2016 John Wiley & Sons Ltd.

  4. Partial Cloning and Nucleotide Sequencing of Glutamate Decarboxylase Gene Isoform 65 from Human Brain

    Directory of Open Access Journals (Sweden)

    Abolghasem Esmaeili


    Conclusion: Because obtaining fresh human brain is difficult and amount of mRNA is low, it may not be easy to clone full length of human gad gene. The approach described in this paper may be useful in cloning of other genes for which the corresponding mRNA is present at low levels.

  5. The phylogenetic position of Allocreadiidae (Trematoda: Digenea) from partial sequences of the 18S and 28S ribosomal RNA genes. (United States)

    Choudhury, Anindo; Rosas Valdez, Rogelio; Johnson, Ryan C; Hoffmann, Brian; Pérez-Ponce de León, Gerardo


    Species of Allocreadiidae are an important component of the parasite fauna of freshwater vertebrates, particularly fishes, and yet their systematic relationships with other trematodes have not been clarified. Partial sequences of the 18S and 28S ribosomal RNA genes from 3 representative species of Allocreadiidae, i.e., Crepidostomum cooperi, Bunodera mediovitellata, and Polylekithum ictaluri, and from 79 other taxa representing 78 families of trematodes obtained from GenBank, were used in a phylogenetic analysis to address the relationships of Allocreadiidae with other plagiorchiiforms/plagiorchiidans. Maximum parsimony and Bayesian analyses of combined 18S and 28S rRNA gene sequence data place 2 of the allocreadiids, Crepidostomum cooperi and Bunodera mediovitellata, in a clade with species of Callodistomidae and Gorgoderidae, which, in turn is sister to a clade containing Polylekithum ictaluri and representatives of Encyclometridae, Dicrocoelidae, and Orchipedidae, a grouping supported by high bootstrap values. These results suggest that Polylekithum ictaluri is not an allocreadiid, a conclusion that is supported by reported differences between its cercaria and that of other allocreadiids. Although details of the life cycle of callodistomids, the sister taxon to Allocreadiidae, remain unknown, the relationship of Allocreadiidae and Gorgoderidae is consistent with their larval development in bivalve, rather than gastropod, molluscs, and with their host relationships (predominantly freshwater vertebrates). The results also indicate that, whereas Allocreadiidae is not a basal taxon, it is not included within the suborder Plagiorchiata. No support was found for a direct relationship between allocreadiids and opecoelids either.

  6. Polymorphism Analysis on Partial Sequence of Pig Obese Gene of Different Breeds by PCR-SSCP

    Institute of Scientific and Technical Information of China (English)

    SONG Yuefen; YU Hao; YANG Xiuqin; LIU Di


    Polymorphisms of porcine ob exon 1 and exon 2 among different breeds including Landrace, Duroc, Min pig, Yorkshire pig, double-muscled Yorkshire, Sanjiang pig, wild boar and cross bred pig were analyzed by PCR-SSCP in the current study. Three pairs of primers according to the ob cDNA sequence obtained from GenBank database were designed to amplify the first two exons, which were then genotyped by SSCP. The T to C transversion was found in exon 2, which resulted in 3 genotypes named AA, AB and BB, respectively in these different porcine breeds. There was only genotype of BB in the Min pig, while no allele B was detected in double-muscled Yorkshire, and the 3genotypes all existed in other breeds. There was significant difference on the genotype frequencies in various breeds.There was a trend that the frequency of allele A was positively associated with muscle ratio distribution on the one hand, and on the other hand, it was linked to the selected direction. So the allele A could be used as a selective marker of high muscle ratio in pig breeding.

  7. Use of Partial 16S rRNA Gene Sequencing for Identification of Legionella pneumophila and Non-pneumophila Legionella spp.▿


    Wilson, D. A.; Reischl, U.; Hall, G. S.; Procop, G. W.


    We examined 49 Legionella species, 26 L. pneumophila and 23 non-pneumophila Legionella spp., using partial 16S rRNA gene sequencing. This approach accurately identified all the L. pneumophila isolates, characterized all non-pneumophila Legionella isolates as such within this genus, and classified most (20/23; 87%) of the non-pneumophila Legionella isolates to the species level.

  8. Fastidious Gram-Negatives: Identification by the Vitek 2 Neisseria-Haemophilus Card and by Partial 16S rRNA Gene Sequencing Analysis

    DEFF Research Database (Denmark)

    Wolff Sönksen, Ute; Christensen, Jens Jørgen; Nielsen, Lisbeth


    Taxonomy and identification of fastidious Gram negatives are evolving and challenging. We compared identifications achieved with the Vitek 2 Neisseria-Haemophilus (NH) card and partial 16S rRNA gene sequence (526 bp stretch) analysis with identifications obtained with extensive phenotypic charact...

  9. Phylogenetic analysis of Thai oyster (Ostreidae) based on partial sequences of the mitochondrial 16S rDNA gene

    DEFF Research Database (Denmark)

    Bussarawit, Somchai; Gravlund, Peter; Glenner, Henrik;


    Ten oyster species of the family Ostreidae (Subfamilies Crassostreinae and Lophinae) from Thailand were studied using morphological data and mitochondrial 16S rDNA gene sequences. Additional sequence data from five specimens of Ostreidae and one specimen of Tridacna gigas were downloaded from Gen...

  10. Screening, diversity and partial sequence comparison of vegetative insecticidal protein (vip3A) genes in the local isolates of Bacillus thuringiensis Berliner. (United States)

    Asokan, R; Swamy, H M Mahadeva; Arora, D K


    Characterization, direct sequencing of the PCR amplicon and phylogenetic relationship was done to discover a novel Vip protein genes of the Bt isolates, to improve the prospects for insect control, more Vip proteins should be sought out and researched to predict their insecticidal activity. Characterization was based on direct sequencing of PCR amplicon using primers specific to vip3A gene was presented here. 12 out of 18 isolates screened were positive for vip gene-specific primers. Homology search for the partial sequences using BLAST showed that 11 isolates had high similarity to vip3Aa gene and only one fragment with vip3Ae gene (25-100% at nucleotide and amino acid level). Phylogenetic analysis showed that the gene sequences were responsible for geographic separation for divergence within vip genes, consistent with the evaluation of distinct bacterial population. Despite the geographical distances, strains harbouring vip genes have originated from common ancestors may significantly contribute to control resistant insect pests. Some strains have evolved to be quite distinct and others remain as members of closely related groups. The reported method is a powerful tool to find novel Vip3A proteins from large-scale Bt strains which is effective in terms of time and cost. Further the Vip proteins produced by different strains of B. thuringiensis are unique in terms of the sequence divergence and hence may also differ in their insecticidal activities.

  11. Partial gene sequences for the A subunit of methyl-coenzyme M reductase (mcrI) as a phylogenetic tool for the family Methanosarcinaceae (United States)

    Springer, E.; Sachs, M. S.; Woese, C. R.; Boone, D. R.


    Representatives of the family Methanosarcinaceae were analyzed phylogenetically by comparing partial sequences of their methyl-coenzyme M reductase (mcrI) genes. A 490-bp fragment from the A subunit of the gene was selected, amplified by the PCR, cloned, and sequenced for each of 25 strains belonging to the Methanosarcinaceae. The sequences obtained were aligned with the corresponding portions of five previously published sequences, and all of the sequences were compared to determine phylogenetic distances by Fitch distance matrix methods. We prepared analogous trees based on 16S rRNA sequences; these trees corresponded closely to the mcrI trees, although the mcrI sequences of pairs of organisms had 3.01 +/- 0.541 times more changes than the respective pairs of 16S rRNA sequences, suggesting that the mcrI fragment evolved about three times more rapidly than the 16S rRNA gene. The qualitative similarity of the mcrI and 16S rRNA trees suggests that transfer of genetic information between dissimilar organisms has not significantly affected these sequences, although we found inconsistencies between some mcrI distances that we measured and and previously published DNA reassociation data. It is unlikely that multiple mcrI isogenes were present in the organisms that we examined, because we found no major discrepancies in multiple determinations of mcrI sequences from the same organism. Our primers for the PCR also match analogous sites in the previously published mcrII sequences, but all of the sequences that we obtained from members of the Methanosarcinaceae were more closely related to mcrI sequences than to mcrII sequences, suggesting that members of the Methanosarcinaceae do not have distinct mcrII genes.

  12. PCR-SSCP and sequence analysis of three Odontotermes spp. (order: isoptera; family: termitidae) on the basis of partial 16SrRNA gene. (United States)

    Kumari, Mamtesh; Sharma, Vijay Lakshmi; Sodhi, Monika; Mukesh, Manishi; Shouche, Yogesh; Sobti, Ranbir Chander


    Partial 16S gene fragments were amplified by using specific primers in few species/populations of termites of the genus Odontotermes (Isoptera:Termitidae:Macrotermitinae), and the PCR products were subjected to SSCP analysis. Three haplotypes obtained were subjected to sequencing. The sequences obtained were characterized to see the frequencies of each nucleotide bases. High A + T content was observed. The inter-specific pairwise sequence divergence in Odontotermes spp. ranged from 0% to 4.8% across the entire 16S gene fragment. Identical sequences were found between two populations of O. horni. Individuals of different species having Type-I conformational pattern, i.e. O. obesus (-AI) and O. horni (-MI), as well as Type-II of O. obesus (-UII) and O. bhagwatii (-CHII) had no percent diversity. Phylogenetic trees drawn on the basis of distance Neighbour-joining method revealed clustering of individuals according to their genera and families.

  13. Evolutionary relationships among the Braconidae (Hymenoptera: Ichneumonoidea) inferred from partial 16S rDNA gene sequences. (United States)

    Dowton, M; Austin, A D; Antolin, M F


    Phylogenetic relationships among the Braconidae were examined using homologous 16S rDNA gene sequence data. Analyses recovered the few well-supported relationships evident in this family from morphological analyses, viz the monophyly of the microgastroid complex of subfamilies, the monophyly of the cyclostome complex of subfamilies (= braconoids), a sister-group relationship between the Alysiinae and Opiinae, and a close relationship between the Helconinae and Blacinae. With respect to the braconoid complex of subfamilies, a sister-group relationship was recovered between Aphidiinae and Mesostoinae, and a clade composed of Gnamptodontinae + Histeromerinae + Rhyssalinae + Aphidiinae + Mesostoinae was also recovered. The Doryctinae and Rogadinae sensu lato (s.l.) were generally not resolved as monophyletic. With respect to the helconoid complex of subfamilies, a sister-group relationship was recovered between Sigalphinae and Agathidinae, whereas Neoneurinae fell out among other helconoid subfamilies. Other relationships among the helconoid subfamilies were unclear from these analyses. With respect to the microgastroid complex of subfamilies, our data conform to morphological estimates, recovering ((Microgastrinae + Miracinae) + Cardiochilinae) + Cheloninae. The topology of our trees suggests that the cyclostome subfamilies are a natural derived group, inferring that endoparasitism (not ectoparasitism) is the ancestral state for the Braconidae, unless all of the ectoparasitic ancestors of the helconoid + microgastroid subfamilies are now extinct.

  14. Four-locus phylogeny of Fusarium avenaceum and related species and their species-specific identification based on partial phosphate permease gene sequences. (United States)

    Stakheev, Alexander A; Khairulina, Dina R; Zavriev, Sergey K


    The fungus Fusarium avenaceum and its closest relatives are responsible for contamination of agricultural plants and their products by mycotoxins such as enniatins and moniliformin. Precise identification of mycotoxin producers is necessary for estimation of the accumulation risk of those compounds and for preventing the consumption of highly contaminated products. Nucleic acids amplification-based techniques proved to be the most rapid and reliable approach for pathogen diagnostics and identification. In this study partial phosphate permease gene (PHO) sequences were determined for Fusarium avenaceum (including one isolate identified as F. arthrosporioides), F. tricinctum, F. acuminatum and F. torulosum. Phylogenetic analysis of 40 isolates of those species from different climates and geographical regions of Russia and some neighboring countries based on sequences of PHO, translation elongation factor 1 alpha (TEF1α), beta-tubulin (β-TUB), enniatin synthetase (Esyn1) genes and combined data set demonstrated that the PHO gene possesses the highest rate of variability among them and can be considered as an informative marker for phylogenetic studies of these species. According to the combined data set phylogeny, the isolates of each species formed clusters with a high bootstrap support. Analysis of PHO sequences revealed a high intraspecific variability of F. avenaceum: there were 5 independent clusters on the dendrogram, including one cluster which was closer to F. torulosum than to other F. avenaceum isolates. Variable sites in PHO sequences have been used for the design of species-specific primers and a fluorescent hydrolysis probe. The specificity of the assay was shown for DNA samples extracted from 68 isolates of 23 Fusarium species. Quantitative PCR approach was applied to estimate the contamination rate of 17 naturally infected oat and barley samples, previously characterized by microbiological procedures.

  15. Identification in the human central nervous system, pituitary, and thyroid of a novel calcitonin gene-related peptide, and partial amino acid sequence in the spinal cord. (United States)

    Petermann, J B; Born, W; Chang, J Y; Fischer, J A


    Two human genes encoding precursors for two calcitonin gene-related peptides (CGRP) I (or alpha) and II (or beta) have been identified (Steenbergh, P. H., Höppener, J. W. M., Zandberg, J., Lips, C. J. M., and Jansz, H. S. (1985) FEBS Lett. 183, 403-407). The amino acid sequence of CGRP-I was obtained in medullary thyroid carcinoma extracts (Morris, H. R., Panico, M., Etienne, T., Tippins, J., Girgis, S. I., and MacIntyre, I. (1984) Nature 308, 746-748), but not in normal human tissues. The human CGRP-II peptide remained to be discovered. Here we have determined in the human spinal cord the amino acid composition and the partial amino acid sequence of the DNA-predicted CGRP-I and -II. The data indicate for the first time the existence of a second CGRP different from the known CGRP-I. CGRP-II has been identified in the central nervous system, pituitary, thyroid, and in medullary thyroid carcinoma as a major CGRP form together with CGRP-I.

  16. Phylogenetic relationships of Cyprinidae (Teleostei: Cypriniformes) inferred from the partial S6K1 gene sequences and implication of indel sites in intron 1

    Institute of Scientific and Technical Information of China (English)


    The family Cyprinidae is widely distributed in East Asia, and has the important phylogenetic signifi- cance in the fish evolution. In this study, the 5′ end partial sequences (containing exon 1, exon 2 and indel 1) of S6K1 gene were obtained from 30 representative species in Cyprinidae and outgroup using PCR amplification and sequencing. The phylogenetic relationships of Cyprinidae were reconstructed with neighbor joining (NJ), maximum parsimony (MP), maximum likelihood (ML), and Bayesian meth- ods. Myxocyprinus asiaticus (Catostomidae) was assigned to the outgroup taxon. Similar phylogenetic relationships within the family Cyprinidae were achieved with the four analyses. Leuciscini and Barbini were monophyletic lineages respectively with the high nodal supports. Leuciscini comprises Hypophthalmichthyinae, Xenocyprinae, Cultrinae, Gobioninae, Acheilognathinae and East Asian species of Leuciscinae and Danioninae. Monophyly of East Asian clade was supported with high nodal support. Barbini comprises Schizothoracinae, Barbinae, Cyprininae and Labeoninae. The monophyletic lineage consisting of Danio rerio, D. myersi, and Rasbora trilineata was basal in the tree. In addition, the large fragment indels in intron 1 were analyzed to improve the understanding of Cyprinidae relationships. The results showed that the large fragment indels were correlated with the relations among species. Some conserved regions in intron 1 were thought to be involved in the functional regulation. However, no correlation was found between sequence variations and species characteristic size.

  17. Molecular and phylogenetic characterizations of an Eimeria krijgsmanni Yakimoff & Gouseff, 1938 (Apicomplexa: Eimeriidae) mouse intestinal protozoan parasite by partial 18S ribosomal RNA gene sequence analysis. (United States)

    Takeo, Toshinori; Tanaka, Tetsuya; Matsubayashi, Makoto; Maeda, Hiroki; Kusakisako, Kodai; Matsui, Toshihiro; Mochizuki, Masami; Matsuo, Tomohide


    Previously, we characterized an undocumented strain of Eimeria krijgsmanni by morphological and biological features. Here, we present a detailed molecular phylogenetic analysis of this organism. Namely, 18S ribosomal RNA gene (rDNA) sequences of E. krijgsmanni were analyzed to incorporate this species into a comprehensive Eimeria phylogeny. As a result, partial 18S rDNA sequence from E. krijgsmanni was successfully determined, and two different types, Type A and Type B, that differed by 1 base pair were identified. E. krijgsmanni was originally isolated from a single oocyst, and thus the result show that the two types might have allelic sequence heterogeneity in the 18S rDNA. Based on phylogenetic analyses, the two types of E. krijgsmanni 18S rDNA formed one of two clades among murine Eimeria spp.; these Eimeria clades reflected morphological similarity among the Eimeria spp. This is the third molecular phylogenetic characterization of a murine Eimeria spp. in addition to E. falciformis and E. papillata.

  18. Genetic classification and distinguishing of Staphylococcus species based on different partial gap, 16S rRNA, hsp60, rpoB, sodA, and tuf gene sequences. (United States)

    Ghebremedhin, B; Layer, F; König, W; König, B


    The analysis of 16S rRNA gene sequences has been the technique generally used to study the evolution and taxonomy of staphylococci. However, the results of this method do not correspond to the results of polyphasic taxonomy, and the related species cannot always be distinguished from each other. Thus, new phylogenetic markers for Staphylococcus spp. are needed. We partially sequenced the gap gene (approximately 931 bp), which encodes the glyceraldehyde-3-phosphate dehydrogenase, for 27 Staphylococcus species. The partial sequences had 24.3 to 96% interspecies homology and were useful in the identification of staphylococcal species (F. Layer, B. Ghebremedhin, W. König, and B. König, J. Microbiol. Methods 70:542-549, 2007). The DNA sequence similarities of the partial staphylococcal gap sequences were found to be lower than those of 16S rRNA (approximately 97%), rpoB (approximately 86%), hsp60 (approximately 82%), and sodA (approximately 78%). Phylogenetically derived trees revealed four statistically supported groups: S. hyicus/S. intermedius, S. sciuri, S. haemolyticus/S. simulans, and S. aureus/epidermidis. The branching of S. auricularis, S. cohnii subsp. cohnii, and the heterogeneous S. saprophyticus group, comprising S. saprophyticus subsp. saprophyticus and S. equorum subsp. equorum, was not reliable. Thus, the phylogenetic analysis based on the gap gene sequences revealed similarities between the dendrograms based on other gene sequences (e.g., the S. hyicus/S. intermedius and S. sciuri groups) as well as differences, e.g., the grouping of S. arlettae and S. kloosii in the gap-based tree. From our results, we propose the partial sequencing of the gap gene as an alternative molecular tool for the taxonomical analysis of Staphylococcus species and for decreasing the possibility of misidentification.

  19. Variance of partial sums of stationary sequences

    CERN Document Server

    Deligiannidis, George


    Let $X_1, X_2,...$ be a centred sequence of weakly stationary random variables with spectral measure $F$ and partial sums $S_n = X_1 +...+ X_n$, and let $G(x) = \\int_{-x}^x F(\\rd x)$. We show that $\\var(S_n)$ is regularly varying of index $\\gamma$ at infinity, if and only if $G(x)$ is regularly varying of index $2-\\gamma$ at the origin ($0<\\gamma<2$).

  20. A computer program for fast and easy typing of a partial endoglucanase gene sequence into genospecies and sequevars 1&2 of the Ralstonia solanacearum species complex. (United States)

    Stulberg, Michael J; Huang, Qi


    The phytopathogen Ralstonia solanacearum is a species complex that contains race 3 biovar 2 strains belonging to phylotype IIB sequevars 1 and 2 that are quarantined or select agent pathogens. Recently, the R. solanacearum species complex strains have been reclassified into three genospecies: R. solanacearum, Ralstonia pseudosolanacearum and Ralstonia syzygii. An unidentified R. solanacearum strain is considered a select agent in the US until proven to be a non-race 3 biovar 2 (non-phylotype IIB sequevars 1&2). Currently, sequevars of R. solanacearum species complex strains can only be determined by phylogenetic analysis of a partial endoglucanase (egl) sequence of approximately 700-bp in length. Such analysis, however, requires expert knowledge to properly trim the sequence, to include the correct reference strains, and to interpret the results. By comparing GenBank egl sequences of representative R. solanacearum species-complex strains, we identified genospecies- and sequevar 1 and 2-specific single nucleotide polymorphisms (SNPs). We also designed primers to amplify a shorter, 526-bp, egl fragment from R. solanacearum species complex strains for easy sequencing of the amplicon, and to facilitate direct and specific amplification of egl from R. solanacearum-infected plant samples without the need of bacterial isolation. We wrote a computer program (Ralstonia solanacearum typing program) that analyzes a minimum 400-bp user-input egl sequence from a R. solanacearum strain for egl homology and SNP content to determine 1) whether it belongs to the R. solanacearum species complex, 2) if so, to which genospecies, and 3) whether it is of the sequevar type (sequevars 1 and 2) associated with the select agent/quarantined R. solanacearum strain. The program correctly typed all 371 tested egl sequences with known sequevars, obtained either from GenBank or through personal communication. Additionally, the program successfully typed 25 R. solanacearum strains in our

  1. Concurrent speciation in the eastern woodland salamanders (Genus Plethodon): DNA sequences of the complete albumin nuclear and partial mitochondrial 12s genes. (United States)

    Highton, Richard; Hastings, Amy Picard; Palmer, Catherine; Watts, Richard; Hass, Carla A; Culver, Melanie; Arnold, Stevan J


    Salamanders of the North American plethodontid genus Plethodon are important model organisms in a variety of studies that depend on a phylogenetic framework (e.g., chemical communication, ecological competition, life histories, hybridization, and speciation), and consequently their systematics has been intensively investigated over several decades. Nevertheless, we lack a synthesis of relationships among the species. In the analyses reported here we use new DNA sequence data from the complete nuclear albumin gene (1818 bp) and the 12s mitochondrial gene (355 bp), as well as published data for four other genes (Wiens et al., 2006), up to a total of 6989 bp, to infer relationships. We relate these results to past systematic work based on morphology, allozymes, and DNA sequences. Although basal relationships show a strong consensus across studies, many terminal relationships remain in flux despite substantial sequencing and other molecular and morphological studies. This systematic instability appears to be a consequence of contemporaneous bursts of speciation in the late Miocene and Pliocene, yielding many closely related extant species in each of the four eastern species groups. Therefore we conclude that many relationships are likely to remain poorly resolved in the face of additional sequencing efforts. On the other hand, the current classification of the 45 eastern species into four species groups is supported. The Plethodon cinereus group (10 species) is the sister group to the clade comprising the other three groups, but these latter groups (Plethodon glutinosus [28 species], Plethodon welleri [5 species], and Plethodon wehrlei [2 species]) probably diverged from each other at approximately the same time. Copyright © 2012. Published by Elsevier Inc.

  2. Species-Level Identification of Actinomyces Isolates Causing Invasive Infections: Multiyear Comparison of Vitek MS (Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry) to Partial Sequencing of the 16S rRNA Gene. (United States)

    Lynch, T; Gregson, D; Church, D L


    Actinomyces species are uncommon but important causes of invasive infections. The ability of our regional clinical microbiology laboratory to report species-level identification of Actinomyces relied on molecular identification by partial sequencing of the 16S ribosomal gene prior to the implementation of the Vitek MS (matrix-assisted laser desorption ionization-time of flight mass spectrometry [MALDI-TOF MS]) system. We compared the use of the Vitek MS to that of 16S rRNA gene sequencing for reliable species-level identification of invasive infections caused by Actinomyces spp. because limited data had been published for this important genera. A total of 115 cases of Actinomyces spp., either alone or as part of a polymicrobial infection, were diagnosed between 2011 and 2014. Actinomyces spp. were considered the principal pathogen in bloodstream infections (n = 17, 15%), in skin and soft tissue abscesses (n = 25, 22%), and in pulmonary (n = 26, 23%), bone (n = 27, 23%), intraabdominal (n = 16, 14%), and central nervous system (n = 4, 3%) infections. Compared to sequencing and identification from the SmartGene Integrated Database Network System (IDNS), Vitek MS identified 47/115 (41%) isolates to the correct species and 10 (9%) isolates to the correct genus. However, the Vitek MS was unable to provide identification for 43 (37%) isolates while 15 (13%) had discordant results. Phylogenetic analyses of the 16S rRNA sequences demonstrate high diversity in recovered Actinomyces spp. and provide additional information to compare/confirm discordant identifications between MALDI-TOF and 16S rRNA gene sequences. This study highlights the diversity of clinically relevant Actinomyces spp. and provides an important typing comparison. Based on our analysis, 16S rRNA gene sequencing should be used to rapidly identify Actinomyces spp. until MALDI-TOF databases are optimized.

  3. Comparison of eukaryotic phytobenthic community composition in a polluted river by partial 18S rRNA gene cloning and sequencing. (United States)

    Dorigo, U; Bérard, A; Humbert, J F


    We compared the species composition in phytobenthic communities at different sampling sites in a small French river presenting polluted and unpolluted areas. For each sampling point, the total DNA was extracted and used to construct an 18S rRNA gene clone library after PCR amplification of a ca 400 bp fragment. Phytobenthic community composition was estimated by random sequencing of several clones per library. Most of the sequences corresponded to the Bacillariophyceae and Chlorophyceae groups. By combining phylogenetic and correspondence analyses, we showed that our molecular approach is able to estimate and compare the species composition at different sampling sites in order to assess the environmental impact of xenobiotics on phytobenthic communities. Changes in species composition of these communities were found, but no evident decrease in the diversity. We discuss the significance of these changes with regard to the existing level of pollution and their impact on the functionality of the ecosystem. Our findings suggest that it is now possible to use faster molecular methods (DGGE, ARISA.) to test large numbers of samples in the context of ecotoxicological studies, and thus to assess the impact of pollution in an aquatic ecosystem.

  4. The first determination of DNA sequence of a specific gene. (United States)

    Inouye, Masayori


    How and when the first DNA sequence of a gene was determined? In 1977, F. Sanger came up with an innovative technology to sequence DNA by using chain terminators, and determined the entire DNA sequence of the 5375-base genome of bacteriophage φX 174 (Sanger et al., 1977). While this Sanger's achievement has been recognized as the first DNA sequencing of genes, we had determined DNA sequence of a gene, albeit a partial sequence, 11 years before the Sanger's DNA sequence (Okada et al., 1966).

  5. Cloning of partial putative gonadotropin hormone receptor sequence from fish

    Indian Academy of Sciences (India)

    G Kumaresan; T Venugopal; A Vikas; T J Pandian; S M Athavan


    A search for the presence of mariner-like elements in the Labeo rohita genome by polymerase chain reaction led to the amplification of a partial DNA sequence coding for a putative transmembrane domain of gonadotropin hormone receptor. The amplified DNA sequence shows a high degree of homology to the available turkey and human luteinizing and follicle stimulating hormone receptor coding sequences. This is the first report on cloning such sequences of piscine origin.

  6. DNA Cloning and Sequence Analysis of Partial env Gene of Avian Leukosis Virus Subgroup J in Wuhua Chicken Flocks%五华鸡J亚群白血病病毒env部分基因的和序列分析

    Institute of Scientific and Technical Information of China (English)

    戴银; 张丹俊; 胡晓苗; 赵瑞宏


    The partial sequence of avian leukosis virus subgroup J (ALV-J) env gene was cloned by PCR in Anhui province, which was named as ALV-J-envl. And we compared ALV-J-envl with the partial sequence of eight ALV-J env genes in the GenBank database. The results showed that ALV-J infection in Wuhua chicken was found in Anhui province. Alignment of nine gene sequences of ALV-J env, the homology between ALV-J-envl and the other sequences ranged from 94. 2% to 96. 6%. The result revealed that the env gene from different strains had the mutation at nucleotide sequence. However, the highest homology with ALV-J-envl were HQ425636 and HM235665 gene sequences isolated from strains in China, and they gathered together in the evolutionary tree, indicating a close genetic relationship, and they evolved from common strains. The lowest homology with ALV-J-envl was the gene sequence of AY312965 isolated from Malaysia, it could be further confirmed by phylogenetic analysis.%本试验通过PCR技术获得了安徽省地方品种五华鸡禽白血病病毒J亚群(avian leukosis virus subgroup J,ALV-J)env部分基因序列AL V-J-env1,并将该序列与GenBank数据库中登录的8条env基因相应序列进行了比对分析.结果表明,五华鸡已经感染了J亚群禽白血病,且部分鸡个体已经发病.通过分析可见,AL V-J-env1与所比较的基因序列同源性介于94.2%~96.6%之间,说明不同毒株之间的env基因有一定变异.但与ALV J-env1同源性最高的是来自于中国ALV-J毒株的HQ425636和HM235665基因序列,且在进化树中聚集为一组,暗示它们之间亲缘关系较近,由共同的毒株进化而来.与ALV-J-env1同源性最低的是来源于马来西亚毒株的AY312965基因序列,遗传进化分析也进一步证实,两者之间亲缘关系较远.

  7. On the extendibility of partially and Markov exchangeable binary sequences

    CERN Document Server

    Di Cecco, Davide


    In [Fortini et al., Stoch. Proc. Appl. 100 (2002), 147--165] it is demonstrated that a recurrent Markov exchangeable process in the sense of Diaconis and Freedman is essentially a partially exchangeable process in the sense of de Finetti. In case of finite sequences there is not such an equivalence. We analyze both finite partially exchangeable and finite Markov exchangeable binary sequences and formulate necessary and sufficient conditions for extendibility in both cases.

  8. Phylogenetic position of the genus Gonocerca Manter, 1925 (Trematoda, Hemiuroidea), based on partial sequences of 28S rRNA gene and a reconsideration of taxonomic status of Gonocercinae Skrjabin et Guschanskaja, 1955. (United States)

    Sokolov, Sergey G; Atopkin, Dmitry M; Gordeev, Ilya I; Shedko, Marina B


    Adult trematodes of the genus Gonocerca Manter, 1925, are parasites of marine fishes. Identification of the phylogenetic positions and a revision of the taxonomic status of the subfamily Gonocercinae Skrjabin et Guschanskaja, 1955 (Derogenidae) are the main purposes of this research article. Four Gonocerca species were used in the study, including the type-species G. phycidis Manter, 1925. Molecular phylogenetic analysis, based on partial sequences of 28S rRNA gene, revealed that Gonocerca spp. are phylogenetically distant from other hemiuroid trematodes, including Derogenes varicus (Müller, 1784), representative of the type-genus of the family Derogenidae. The taxonomic rank of Gonocercinae should be raised to the family level. The generic composition of the family Gonocercidae Skrjabin et Guschanskaja, 1955 stat. nov., requires further clarification as the molecular data do not support the inclusion of the genus Hemipera Nicoll, 1913, in this family. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Cloning and Analysis of Partial Sequence of the Lectin Gene in Pinellia%半夏属植物凝集素基因片段克隆与序列分析

    Institute of Scientific and Technical Information of China (English)

    赵静; 彭正松; 杨在君; 魏淑红; 彭丽娟


    半夏属植物凝集素同其他天南星科植物凝集素一样,有3个甘露糖专一结合位点,且具显著的抗虫性.本研究根据已知掌叶半夏凝集素基因表达序列的相关信息设计引物,采用RT-PCR方法,分别从滴水珠、石蜘蛛、盾叶半夏、掌叶半夏、三叶半夏中克隆出长约530bp的凝集素基因片段.使用Genesean软件、ORF finder软件、Ex2PASy Proteomics Server软件对5个凝集素基因片段进行分析,结果表明:克隆出的滴水珠、石蜘蛛、盾叶半夏、掌叶半夏、三叶半夏凝集素基因片段分别编码107、171、170、170、170个氨基酸,在它们所编码的肽链中,都具有酪蛋白激酶Ⅱ磷酸化位点、蛋白激酶C磷酸化位点和N-糖基化位点三类不同的功能位点.但由于编码序列碱基的突变,引起肽链中的氨基酸发生变化,从而导致这五种植株的凝集素基因存在较高的多态性以及存在功能位点的差异.其凝集素基因多态性及功能位点的差异对凝集素功能的影响需进一步分析.本实验为克隆五种植物凝集素基因的表达序列与结构基因以及深入研究半夏属凝集素的功能提供了有意义的参考资料.%Pinellia leetin, a mannose binding lectin with three mannose binding boxes like lectins from other Araceae species, has obvious insect resistance. A pair of primers was designed according to correlative information of Lection gene in P. Pedatisecta. About 530bp eDNA fragments of Lection gene in P. ternata, P. Pedatisecta, P.cordata, P. inlegrifolia, P. peltata were cloned by RT-PCR. These partial sequences data were analyzed by GenScan software, ORF finder software, ExPASy Proteomics Server software. The results showed that the partial sequences of lectin gene encode 170 amino acids in P. ternata, 171 amino acids in P. Pedatisecta, 170 amino acids in P. cordata, 171 amino acids in P. inlegrifolia and 170 amino acids in P. peltata. Three functional sites were found in

  10. [Molecular phylogeny and the time of divergence of minges (Chironomidae, Nematocera, Diptera) inferred from a partial nucleotide sequence of the cytochrome oxidase I gene (COI)]. (United States)

    Demin, A G; Polukonova, N V; Miuge, N S


    This is the first study to infer the phylogenetic structure of minges of the subfamily Chironominae from the amino acid sequence of cytochrome oxidase I (COI). The subdivision of Chironominae into two tribes, Chironomini and Tanytarsini, has been confirmed. The segregation of the genera Pseudochironomus and Riethia into a separate tribe has not been confirmed. Stenochironomus gibbus forms a branch considerably deviating from the subfamily Chironominae. The genus Micropsectra is formed by a large polyphyletic cluster that also includes the genera Virgotanytharsus, Reotanytharsus, Kenopsectra, and Parapsectra. Tanytarsus is the basal genus of the tribe Tanytarsini. The times of divergence of the main taxa of Chironominae have been estimated. The calculated time of divergence of the genus Chironomus disproves the assumption that it is phylogenetically old.

  11. Detection of Multiple Budding Yeast Cells and a Partial Sequence of 43-kDa Glycoprotein Coding Gene of Paracoccidioides brasiliensis from a Case of Lacaziosis in a Female Pacific White-Sided Dolphin (Lagenorhynchus obliquidens). (United States)

    Minakawa, Tomoko; Ueda, Keiichi; Tanaka, Miyuu; Tanaka, Natsuki; Kuwamura, Mitsuru; Izawa, Takeshi; Konno, Toshihiro; Yamate, Jyoji; Itano, Eiko Nakagawa; Sano, Ayako; Wada, Shinpei


    Lacaziosis, formerly called as lobomycosis, is a zoonotic mycosis, caused by Lacazia loboi, found in humans and dolphins, and is endemic in the countries on the Atlantic Ocean, Indian Ocean and Pacific Ocean of Japanese coast. Susceptible Cetacean species include the bottlenose dolphin (Tursiops truncatus), the Indian Ocean bottlenose dolphin (T. aduncus), and the estuarine dolphin (Sotalia guianensis); however, no cases have been recorded in other Cetacean species. We diagnosed a case of Lacaziosis in a Pacific white-sided dolphin (Lagenorhynchus obliquidens) nursing in an aquarium in Japan. The dolphin was a female estimated to be more than 14 years old at the end of June 2015 and was captured in a coast of Japan Sea in 2001. Multiple, lobose, and solid granulomatous lesions with or without ulcers appeared on her jaw, back, flipper and fluke skin, in July 2014. The granulomatous skin lesions from the present case were similar to those of our previous cases. Multiple budding and chains of round yeast cells were detected in the biopsied samples. The partial sequence of 43-kDa glycoprotein coding gene confirmed by a nested PCR and sequencing, which revealed a different genotype from both Amazonian and Japanese lacaziosis in bottlenose dolphins, and was 99 % identical to those derived from Paracoccidioides brasiliensis; a sister fungal species to L. loboi. This is the first case of lacaziosis in Pacific white-sided dolphin.

  12. Expressed genes in regenerating rat liver after partial hepatectomy

    Institute of Scientific and Technical Information of China (English)

    Cun-Shuan Xu; Salman Rahrnan; Jing-Bo Zhang; Cui-Fang Chang; Jin-Yun Yuan; Wen-Qiang Li; Hong-Peng Han; Ke-Jin Yang; Li-Feng Zhao; Yu-Chang Li; Hui-Yong Zhang


    AIM: To reveal the liver regeneration (LR) and its controlas well as the occurrence of liver disease and to study the gene expression profiles of 551 genes after partial hepatectomy (PH) in regenerating rat livers.METHODS: Five hundred and fifty-one expressed sequence tags screened by suppression subtractive hybridization were made into an in-house cDNA microarray, and the expressive genes and their expressive profiles in regenerating rat livers were analyzed by microarray and bioinformatics. RESULTS: Three hundred of the analyzed 551 genes were up- or downregulated more than twofolds at one or more time points during LR. Most of the genes were up- or downregulated 2-5 folds, but the highest reached 90 folds of the control. One hundred and thirty-nine of themshowed upregulation, 135 displayed downregulation, and up or down expression of 26 genes revealed a dependence on regenerating livers. The genes expressedin 24-h regenerating livers were much more than those in the others. Cluster analysis and generalization analysis showed that there were at least six distinct temporal patterns of gene expression in the regenerating livers, that is, genes were expressed in the immediate early phase, early phase, intermediate phase, early-late phase, late phase, terminal phase. CONCLUSION: In LR, the number of down-regulated genes was almost similar to that of the upregulated genes; the successively altered genes were more than the rapidly transient genes. The temporal patterns of gene expression were similar 2 and 4 h, 12 and 16 h, 48 and 96 h, 72 and 144 h after PH. Microarray combined with suppressive subtractive hybridization can effectively identify the genes related to LR.

  13. Phylogenetic characterization of Canine Parvovirus VP2 partial sequences from symptomatic dogs samples. (United States)

    Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A


    The aim of the present study was to detect canine parvovirus (CPV) from faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR) followed by VP2 gene partial sequencing and molecular characterization of circulating strains in Lithuania. Eleven clinically and antigen-tested positive dog faecal samples, collected during the period of 2014-2015, were investigated by using PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of investigated Lithuanian CPV VP2 sequences were closely associated with CPV 2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. The four of eleven investigated Lithuanian dogs with CPV infection symptoms were vaccinated with CPV-2 vaccine, but their VP2 sequences were phylogenetically distantly associated with CPV vaccine strains VP2 sequences (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences had monophyletic relations among the close geographically associated samples, but five of them were rather divergent (1.0% less sequence similarity). The one Lithuanian CPV VP2 sequence was closely related with CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conservative and phylogenetically low associated with most commonly used CPV vaccine strains.

  14. 泡状带绦虫线粒体部分基因克隆及序列比对分析%Cloning and sequence analysis of partial mitochondrial genes of Taenia hydatigena

    Institute of Scientific and Technical Information of China (English)

    项海涛; 尚清炎; 郝宝成; 邢小勇; 梁剑平; 胡永浩


    采集甘肃兰州市区宠物犬体内两株泡状带绦虫,运用分子生物学方法对其线粒体部分基因进行克隆,并使用基因分析软件Larsergene V7.1进行序列分析,结果发现这两株虫体同源性达到99.6%,与国内的泡状带绦虫广东虫株和甘肃虫株同源性分别达到98.7%和98.4%,鉴定为泡状带绦虫;与其他带科绦虫同源性均高于85%,低于88%。其中,与国内牛带绦虫同源性最高,为88%;与国内巨颈带绦虫同源性最低,只有85.1%。推断不同带科绦虫的cox1基因种间差异明显,在带科绦虫的分类上意义较大。%Partial mitochondrial genome of two strains of Taenia hydatigena in dogs selected in anzhou were amplified by PCR. Their sequence was analyzed with the software Larsergene V7.1 and compared with tile corresponding sequence of Guangdong strain and Gansu strain published in GenBank.It revealed that the two stratus shared 99.6% of sequence similarity, and had a homology of 98.7% and 98.4% with Guangdong strain and Gansu strain, respecfively. Taenia hydatigena in dogs in Lanzhou had a homology of 85%-88% with the other Taeniidae cestodes. It is conch,deal that coxl gene with species-specific may have great significance in the classification of different Taeniidae cestodes.

  15. A molecular phylogeny of eulophid wasps inferred from partial 18S gene sequences%基于18S基因序列的姬小蜂分子系统发育

    Institute of Scientific and Technical Information of China (English)

    沙忠利; 朱朝东; Robert W.MURPHY; 黄大卫; Stephen G.COMPTON


    We investigated the phylogeny of the parasitoid wasp family Eulophidae (Hymenoptera) using nuclear 18S rDNA partial sequences with maximum parsimony and Bayesian inference methods of sequence analysis. The status of the Eulophidae and its component subfamilies in relation to other taxa in the superfamily Chalcidoidea were examined. The analyses confirmed the monophyly of some subfamilies in Eulophidae, including the current Entedoninae, Eulophinae and Tetrastichinae. All three subfamilies formed independent branches and the phylogeny did not confidently resolve the monophyly of the Eulophidae. The tribes Cirrospilini, Elasmini and Elachertini were well defined. However, the status of the tribe Eulophini was problematic. Further extensive morphological studies and more gene sequences will be required before the wider relationships of some groups of eulophids within Chalcidoidea can be determined [ Acta Zoologica Sinica 52 (2): 288-301, 2006].%本文基于18S rDNA部分序列,用MP和Baysian方法研究了姬小蜂科的系统发育,对姬小蜂科的单系性及其与其它小蜂科间的关系进行了讨论.姬小蜂亚科、灿姬小蜂亚科和啮姬小蜂亚科形成三个独立的支系,研究结果支持它们各自的单系性,但本结果没有明确姬小蜂科的单系性.研究结果同时还支持瑟姬小蜂族、扁股姬小蜂族和狭面姬小蜂族三个族的地位,但不支持姬小蜂族的地位.姬小蜂科的单系性及其与其它小蜂间的关系还需更多的形态学数据和更多的基因序列来进一步研究[动物学报52(2):288-301,2006].

  16. Cloning and sequence analysis of the partial sequence of the rbcL from Bryopsis hypnoides

    Institute of Scientific and Technical Information of China (English)


    The partial sequence of the rbcL from Bryopsis hypnoides, including the sequences of the upstream, extron and partial intron, was amplified by PCR and their sequences were determined. With Spinacia oleracea as the outgroup, neighbor-joining method and maximum parsimony method were used respectively to build phylogenetic trees according to the rbcL exon sequence among 13species that were the typical species of six phyla. Two kinds of trees showed clearly that there were two groups among those species,the green lineage and the non-green lineage. And the relationships of algae in the green lineage were similar in the two trees but those in the non-green lineage were not consistent. Analysis of codon preference indicated that the codon preference of the rbcL exon of Bryopsis hypnoides distinctly differed from that of the relevant sequence of photosynthetic bacteria.

  17. Partial AUC maximization for essential gene prediction using genetic algorithms. (United States)

    Hwang, Kyu-Baek; Ha, Beom-Yong; Ju, Sanghun; Kim, Sangsoo


    Identifying genes indispensable for an organism's life and their characteristics is one of the central questions in current biological research, and hence it would be helpful to develop computational approaches towards the prediction of essential genes. The performance of a predictor is usually measured by the area under the receiver operating characteristic curve (AUC). We propose a novel method by implementing genetic algorithms to maximize the partial AUC that is restricted to a specific interval of lower false positive rate (FPR), the region relevant to follow-up experimental validation. Our predictor uses various features based on sequence information, protein-protein interaction network topology, and gene expression profiles. A feature selection wrapper was developed to alleviate the over-fitting problem and to weigh each feature's relevance to prediction. We evaluated our method using the proteome of budding yeast. Our implementation of genetic algorithms maximizing the partial AUC below 0.05 or 0.10 of FPR outperformed other popular classification methods.

  18. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin


    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  19. Partial sequence homogenization in the 5S multigene families may generate sequence chimeras and spurious results in phylogenetic reconstructions. (United States)

    Galián, José A; Rosato, Marcela; Rosselló, Josep A


    Multigene families have provided opportunities for evolutionary biologists to assess molecular evolution processes and phylogenetic reconstructions at deep and shallow systematic levels. However, the use of these markers is not free of technical and analytical challenges. Many evolutionary studies that used the nuclear 5S rDNA gene family rarely used contiguous 5S coding sequences due to the routine use of head-to-tail polymerase chain reaction primers that are anchored to the coding region. Moreover, the 5S coding sequences have been concatenated with independent, adjacent gene units in many studies, creating simulated chimeric genes as the raw data for evolutionary analysis. This practice is based on the tacitly assumed, but rarely tested, hypothesis that strict intra-locus concerted evolution processes are operating in 5S rDNA genes, without any empirical evidence as to whether it holds for the recovered data. The potential pitfalls of analysing the patterns of molecular evolution and reconstructing phylogenies based on these chimeric genes have not been assessed to date. Here, we compared the sequence integrity and phylogenetic behavior of entire versus concatenated 5S coding regions from a real data set obtained from closely related plant species (Medicago, Fabaceae). Our results suggest that within arrays sequence homogenization is partially operating in the 5S coding region, which is traditionally assumed to be highly conserved. Consequently, concatenating 5S genes increases haplotype diversity, generating novel chimeric genotypes that most likely do not exist within the genome. In addition, the patterns of gene evolution are distorted, leading to incorrect haplotype relationships in some evolutionary reconstructions.

  20. PAPR Reduction in OFDM Using Partial Transmit Sequence (PTS

    Directory of Open Access Journals (Sweden)



    Full Text Available In recent years, OFDM has gained a lot of interest in diverse-digital communication applications. This has been due to its favorable properties like high spectral efficiency and robustness to channel fading. Some of the major drawbacks in OFDM are its high peak-to-average-power ratio(PAPR of the transmitted signals and the synchronization of signals. OFDM consist of large number of independent subcarriers, as a result of which the amplitude of such a signal can have high peak values. Coding, phase rotation and clipping are among many PAPR reduction schemes that have been proposed to overcome this problem. Significant reduction in PAPR has been achieved using Partial Transmit Sequence(PTS which comes under Signal scrambling techniques.

  1. Disease gene identification strategies for exome sequencing

    NARCIS (Netherlands)

    Gilissen, C.; Hoischen, A.; Brunner, H.G.; Veltman, J.A.


    Next generation sequencing can be used to search for Mendelian disease genes in an unbiased manner by sequencing the entire protein-coding sequence, known as the exome, or even the entire human genome. Identifying the pathogenic mutation amongst thousands to millions of genomic variants is a major c

  2. Molecular identification of sibling species of Sclerodermus (Hymenoptera: Bethylidae that parasitize buprestid and cerambycid beetles by using partial sequences of mitochondrial DNA cytochrome oxidase subunit 1 and 28S ribosomal RNA gene.

    Directory of Open Access Journals (Sweden)

    Yuan Jiang

    Full Text Available The species belonging to Sclerodermus (Hymenoptera: Bethylidae are currently the most important insect natural enemies of wood borer pests, mainly buprestid and cerambycid beetles, in China. However, some sibling species of this genus are very difficult to distinguish because of their similar morphological features. To address this issue, we conducted phylogenetic and genetic analyses of cytochrome oxidase subunit I (COI and 28S RNA gene sequences from eight species of Sclerodermus reared from different wood borer pests. The eight sibling species were as follows: S. guani Xiao et Wu, S. sichuanensis Xiao, S. pupariae Yang et Yao, and Sclerodermus spp. (Nos. 1-5. A 594-bp fragment of COI and 750-bp fragment of 28S were subsequently sequenced. For COI, the G-C content was found to be low in all the species, averaging to about 30.0%. Sequence divergences (Kimura-2-parameter distances between congeneric species averaged to 4.5%, and intraspecific divergences averaged to about 0.09%. Further, the maximum sequence divergences between congeneric species and Sclerodermus sp. (No. 5 averaged to about 16.5%. All 136 samples analyzed were included in six reciprocally monophyletic clades in the COI neighbor-joining (NJ tree. The NJ tree inferred from the 28S rRNA sequence yielded almost identical results, but the samples from S. guani, S. sichuanensis, S. pupariae, and Sclerodermus spp. (Nos. 1-4 clustered together and only Sclerodermus sp. (No. 5 clustered separately. Our findings indicate that the standard barcode region of COI can be efficiently used to distinguish morphologically similar Sclerodermus species. Further, we speculate that Sclerodermus sp. (No. 5 might be a new species of Sclerodermus.

  3. Detection of Cryptosporidium species in feces or gastric contents from snakes and lizards as determined by polymerase chain reaction analysis and partial sequencing of the 18S ribosomal RNA gene. (United States)

    Richter, Barbara; Nedorost, Nora; Maderner, Anton; Weissenböck, Herbert


    Cryptosporidiosis is a well-known gastrointestinal disease of snakes and lizards. In the current study, 672 samples (feces and/or gastric contents or regurgitated food items) of various snakes and lizards were examined for the presence of cryptosporidia by polymerase chain reaction (PCR) assay targeting a part of the 18S ribosomal RNA gene. A consecutive sequencing reaction was used to identify the cryptosporidian species present in PCR-positive samples. Cryptosporidium varanii (saurophilum) was detected in 17 out of 106 (16%) samples from corn snakes (Pantherophis guttatus) and in 32 out of 462 (7%) samples from leopard geckos (Eublepharis macularius). Cryptosporidium serpentis was found in 8 out of 462 (2%) leopard gecko samples, but in no other reptile. The Cryptosporidium sp. "lizard genotype" was present in 1 leopard gecko sample, and 1 sample from a corn snake showed a single nucleotide mismatch to this genotype. Pseudoparasitic cryptosporidian species were identified in 5 out of 174 (3%) ophidian samples, but not in lizards. Other sequences did not show complete similarity to previously published Cryptosporidium sequences. The results stress the importance for diagnostic methods to be specific for Cryptosporidium species especially in snakes and show a relatively high prevalence of C. varanii in leopard geckos and corn snakes.

  4. Partial sequencing of the bottle gourd genome reveals markers useful for phylogenetic analysis and breeding

    Directory of Open Access Journals (Sweden)

    Wang Sha


    Full Text Available Abstract Background Bottle gourd [Lagenaria siceraria (Mol. Standl.] is an important cucurbit crop worldwide. Archaeological research indicates that bottle gourd was domesticated more than 10,000 years ago, making it one of the earliest plants cultivated by man. In spite of its widespread importance and long history of cultivation almost nothing has been known about the genome of this species thus far. Results We report here the partial sequencing of bottle gourd genome using the 454 GS-FLX Titanium sequencing platform. A total of 150,253 sequence reads, which were assembled into 3,994 contigs and 82,522 singletons were generated. The total length of the non-redundant singletons/assemblies is 32 Mb, theoretically covering ~ 10% of the bottle gourd genome. Functional annotation of the sequences revealed a broad range of functional types, covering all the three top-level ontologies. Comparison of the gene sequences between bottle gourd and the model cucurbit cucumber (Cucumis sativus revealed a 90% sequence similarity on average. Using the sequence information, 4395 microsatellite-containing sequences were identified and 400 SSR markers were developed, of which 94% amplified bands of anticipated sizes. Transferability of these markers to four other cucurbit species showed obvious decline with increasing phylogenetic distance. From analyzing polymorphisms of a subset of 14 SSR markers assayed on 44 representative China bottle gourd varieties/landraces, a principal coordinates (PCo analysis output and a UPGMA-based dendrogram were constructed. Bottle gourd accessions tended to group by fruit shape rather than geographic origin, although in certain subclades the lines from the same or close origin did tend to cluster. Conclusions This work provides an initial basis for genome characterization, gene isolation and comparative genomics analysis in bottle gourd. The SSR markers developed would facilitate marker assisted breeding schemes for efficient

  5. Identification of rat genes by TWINSCAN gene prediction, RT-PCR, and direct sequencing

    DEFF Research Database (Denmark)

    Wu, Jia Qian; Shteynberg, David; Arumugam, Manimozhiyan


    The publication of a draft sequence of a third mammalian genome--that of the rat--suggests a need to rethink genome annotation. New mammalian sequences will not receive the kind of labor-intensive annotation efforts that are currently being devoted to human. In this paper, we demonstrate...... an alternative approach: reverse transcription-polymerase chain reaction (RT-PCR) and direct sequencing based on dual-genome de novo predictions from TWINSCAN. We tested 444 TWINSCAN-predicted rat genes that showed significant homology to known human genes implicated in disease but that were partially...

  6. Synaptotagmin gene content of the sequenced genomes

    Directory of Open Access Journals (Sweden)

    Craxton Molly


    Full Text Available Abstract Background Synaptotagmins exist as a large gene family in mammals. There is much interest in the function of certain family members which act crucially in the regulated synaptic vesicle exocytosis required for efficient neurotransmission. Knowledge of the functions of other family members is relatively poor and the presence of Synaptotagmin genes in plants indicates a role for the family as a whole which is wider than neurotransmission. Identification of the Synaptotagmin genes within completely sequenced genomes can provide the entire Synaptotagmin gene complement of each sequenced organism. Defining the detailed structures of all the Synaptotagmin genes and their encoded products can provide a useful resource for functional studies and a deeper understanding of the evolution of the gene family. The current rapid increase in the number of sequenced genomes from different branches of the tree of life, together with the public deposition of evolutionarily diverse transcript sequences make such studies worthwhile. Results I have compiled a detailed list of the Synaptotagmin genes of Caenorhabditis, Anopheles, Drosophila, Ciona, Danio, Fugu, Mus, Homo, Arabidopsis and Oryza by examining genomic and transcript sequences from public sequence databases together with some transcript sequences obtained by cDNA library screening and RT-PCR. I have compared all of the genes and investigated the relationship between plant Synaptotagmins and their non-Synaptotagmin counterparts. Conclusions I have identified and compared 98 Synaptotagmin genes from 10 sequenced genomes. Detailed comparison of transcript sequences reveals abundant and complex variation in Synaptotagmin gene expression and indicates the presence of Synaptotagmin genes in all animals and land plants. Amino acid sequence comparisons indicate patterns of conservation and diversity in function. Phylogenetic analysis shows the origin of Synaptotagmins in multicellular eukaryotes and their

  7. Characterization and Structure Prediction of Partial Length Protein Sequences of pcoA, pcoR and chrB Genes from Heavy Metal Resistant Bacteria from the Klip River, South Africa

    Directory of Open Access Journals (Sweden)

    Patience Chihomvu


    Full Text Available The Klip River has suffered from severe anthropogenic effects from industrial activities such as mining. Long-term exposure to heavy metal pollution has led to the development of heavy metal resistant strains of Pseudomonas sp. KR23, Lysinibacillus sp. KR25, and E. coli KR29. The objectives of this study were to characterize the genetics of copper and chromate resistance of the isolates. Copper and chromate resistance determinants were cloned and sequenced. Open reading frames (ORFs related to the genes CopA and CopR were identified in E. coli KR29, PcoA in Lysinibacillus sp. KR25 and none related to chromate resistance were detected. The 3D-models predicted by I-TASSER disclose that the PcoA proteins consist of β-sheets, which form a part of the cupredoxin domain of the CopA copper resistance family of genes. The model for PcoR_29 revealed the presence of a helix turn helix; this forms part of a DNA binding protein, which is part of a heavy metal transcriptional regulator. The bacterial strains were cured using ethidium bromide. The genes encoding for heavy metal resistance and antibiotic resistance were found to be located on the chromosome for both Pseudomonas sp. (KR23 and E. coli (KR29. For Lysinibacillus (KR25 the heavy metal resistance determinants are suspected to be located on a mobile genetic element, which was not detected using gel electrophoresis.

  8. A new approach for the sequence spaces of fuzzy level sets with the partial metric

    Directory of Open Access Journals (Sweden)

    Uğur Kadak


    Full Text Available In this paper, we investigate the classical sets of sequences of fuzzy numbers by using partial metric which is based on a partial ordering. Some elementary notions and concepts for partial metric and fuzzy level sets are given. In addition, several necessary and sufficient conditions for partial completeness are established by means of fuzzy level sets. Finally, we give some illustrative examples and present some results between fuzzy and partial metric spaces.


    The abundance of expressed gene and protein sequences available in the biological information databases facilitates comparison of protein homologies. A high degree of sequence similarity typically implies homology regarding structure and function and may provide clues to antibody cross-react...

  10. Cloning and sequence analysis of US1 gene in duck enteritis virus%Cloning and sequence analysis of US1gene in duck enteritis virus

    Institute of Scientific and Technical Information of China (English)

    ZHAO Yan; WANG Jun-wei; MA Bo; ZHAO Xiao-yan


    In this paper, a 1,860 bp sequence in IRs region of duck enteritis virus(DEV)was amplified by single oligonucleotide nested PCR with a single primer designed according to partial sequence of USI and then a pair of primers designed according to the 3' UTR of US8 gene and 5'end of the new getting sequence were used to amplify a 2,426 bp sequence toward the TRs region.Sequence analysis revealed that the both sequences contained an identical 990 bp open reading frame of DEV US1 gene.The two ORFs were in opposite transcription orientation.Sequence comparison of the nucleotide sequence and the deduced amino acid sequence of US1 gene showed relatively high identity to Mardivirus.Phylogenetic tree analysis showed that the eleven herpesviruses viruses were classified into three groups, and the duck enteritis virus was most closely related to Mardivirus.

  11. Phylogeny of the genus Haemophilus as determined by comparison of partial infB sequences

    DEFF Research Database (Denmark)

    Hedegaard, J; Okkels, H; Bruun, B;


    A 453 bp fragment of infB, the gene encoding translation initiation factor 2, was sequenced and compared from 66 clinical isolates and type strains of Haemophilus species and related bacteria. Analysis of the partial infB sequences obtained suggested that the human isolates dependent on X and V...... factor, H. influenzae, H. haemolyticus, H. aegyptius and some cryptic genospecies of H. influenzae, were closely related to each other. H. parainfluenzae constituted a heterogeneous group within the boundaries of the genus, whereas H. aphrophilus/paraphrophilus and Actinobacillus actinomycetemcomitans...... were only remotely related to the type species of the genus Haemophilus H. parahaemolyticus and H. paraphrohaemolyticus took up an intermediary position and may not belong in the genus Haemophilus sensu stricto. Ambiguous results were obtained with seven isolates tentatively identified as H. segnis...

  12. Network of tRNA Gene Sequences

    Institute of Scientific and Technical Information of China (English)

    WEI Fang-ping; LI Sheng; MA Hong-ru


    A network of 3719 tRNA gene sequences was constructed using simplest alignment. Its topology, degree distribution and clustering coefficient were studied. The behaviors of the network shift from fluctuated distribution to scale-free distribution when the similarity degree of the tRNA gene sequences increases. The tRNA gene sequences with the same anticodon identity are more self-organized than those with different anticodon identities and form local clusters in the network. Some vertices of the local cluster have a high connection with other local clusters, and the probable reason was given. Moreover, a network constructed by the same number of random tRNA sequences was used to make comparisons. The relationships between the properties of the tRNA similarity network and the characters of tRNA evolutionary history were discussed.

  13. Joint Limit Distributions of Exceedances Point Processes and Partial Sums of Gaussian Vector Sequence

    Institute of Scientific and Technical Information of China (English)

    Zuo Xiang PENG; Jin Jun TONG; Zhi Chao WENG


    In this paper,we study the joint limit distributions of point processes of exceedances and partial sums of multivariate Gaussian sequences and show that the point processes and partial sums are asymptotically independent under some mild conditions.As a result,for a sequence of standardized stationary Gaussian vectors,we obtain that the point process of exceedances formed by the sequence (centered at the sample mean) converges in distribution to a Poisson process and it is asymptotically independent of the partial sums.The asymptotic joint limit distributions of order statistics and partial sums are also investigated under different conditions.

  14. Cloning and sequence analysis of the partial CO1 gene within mitochondrial DNA of Parabronema skrjabini%斯氏副柔线虫线粒体CO1基因的克隆与序列分析

    Institute of Scientific and Technical Information of China (English)

    李文生; 杨莲茹; 杨晓野; 赵治国; 赵晓慧; 李鑫; 王瑞


    为研究斯氏副柔线虫(Parabronema skrjabini)种间遗传标记特点,从内蒙古地区骆驼皱胃中采集斯氏副柔线虫成虫,提取虫体DNA,利用线虫通用引物扩增细胞色素氧化酶1(CO1)基因,将其克隆到pMD19-T载体后,用PCR技术鉴定阳性菌落并进行测序,运用DNAStar 5.0和MEGA 4.0软件将测序结果与GenBank公布的相关线虫CO1序列进行比对分析.结果显示:斯氏副柔线虫DNA扩增出的CO1基因序列片段长度为689bp.同其他相关属线虫CO1基因序列比对,斯氏副柔线虫与小口柔线虫(Habronema microstoma)、蝇柔线虫(Habronema muscae)的同源性最高,为86.7%,遗传距离为0.143;与加利西亚光丝虫(Litomosoides galizai)的同源性最低,为80.1%,遗传距离为0.229.通过两种不同方法建立的系统发育树比较,证实斯氏副柔线虫与柔线属线虫均位于同一分支,自展值都在47%以上.表明CO1基因不仅可以作为斯氏副柔线虫理想的种间遗传标记,也可为该线虫进一步的分子分类学研究提供重要基础.%To investigate the inter-specific genetic marker of Parabronema skrjabini, the genomic DNA of P. skrjabini collected from the infected camels in Inner Mongolia were extracted. The mitochondrial cytochrome oxidase subunit 1 ( CO1) gene was amplified by PCR using u-niversal primers, and then the PCR product was cloned into pMD19-T vectors. The insert was sequenced successfully and compared with other nematode sequences by DNAStar 5. 0 and MEGA 4.0. The results showed that the PCR product was 689 bp in length. Compared with other CO1 gene sequences of related nematode, the homology among P. skrjabini, Habronema microstoma, and H. muscae is 86. 7% and the nu-cleotide pairwise distance is 0. 143. Conversely, the homology between P. skrjabini and Litomosoides galizai is 80. 1% and the nucleotide pairwise distance is 0.229. It suggested that the C01 gene of P. skrjabini can be not only used as a genetic marker, but also used

  15. Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius

    Directory of Open Access Journals (Sweden)

    Saber Jelokhani-Niaraki


    Full Text Available Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of 50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene. We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR as well as reproduction-linked polymorphisms and breeding programs.

  16. Evidence of new species of Haematoloechus (Platyhelminthes: Digenea) using partial cox1 sequences. (United States)

    León-Règagnon, Virginia


    Haematoloechus, digeneans parasites of amphibians, is a species-rich genus with more than 50 species around the globe. Establishing an accurate taxonomy for this group has been difficult due to high intraspecific variability. Nuclear DNA sequences have given independent information about species validity and phylogeny of the group. In this paper, I test the performance of partial sequences of the mitochondrial cytochrome c oxidase subunit I (cox1) gene in the differentiation of recognized species of the genus and in the detection of potential new taxa. Samples from 13 nominal species were sequenced, plus four samples that could not be assigned to any described species based on morphology. Parsimony analysis of the amplified 360 bp fragment resulted in six most parsimonious trees showing the same grouping of samples, differing in the samples' arrangement within those groups. All 13 species were recovered on the trees, and five potential new species are shown. Additional sampling and sequencing is necessary to support this hypothesis, but with this preliminary information the search for diagnostic characters that allow the description of the new taxa is less difficult.

  17. Partial characterization of nif genes from the bacterium Azospirillum amazonense

    Directory of Open Access Journals (Sweden)

    D.P. Potrich


    Full Text Available Azospirillum amazonense revealed genomic organization patterns of the nitrogen fixation genes similar to those of the distantly related species A. brasilense. Our work suggests that A. brasilense nifHDK, nifENX, fixABC operons and nifA and glnB genes may be structurally homologous to the counterpart genes of A. amazonense. This is the first analysis revealing homology between A. brasilense nif genes and the A. amazonense genome. Sequence analysis of PCR amplification products revealed similarities between the amino acid sequences of the highly conserved nifD and glnB genes of A. amazonense and related genes of A. brasilense and other bacteria. However, the A. amazonense non-coding regions (the upstream activator sequence region and the region between the nifH and nifD genes differed from related regions of A. brasilense even in nitrogenase structural genes which are highly conserved among diazotrophic bacteria. The feasibility of the 16S ribosomal RNA gene-based PCR system for specific detection of A. amazonense was shown. Our results indicate that the PCR primers for 16S rDNA defined in this article are highly specific to A. amazonense and can distinguish this species from A. brasilense.

  18. Partial characterization of nif genes from the bacterium Azospirillum amazonense. (United States)

    Potrich, D P; Passaglia, L M; Schrank, I S


    Azospirillum amazonense revealed genomic organization patterns of the nitrogen fixation genes similar to those of the distantly related species A. brasilense. Our work suggests that A. brasilense nifHDK, nifENX, fixABC operons and nifA and glnB genes may be structurally homologous to the counterpart genes of A. amazonense. This is the first analysis revealing homology between A. brasilense nif genes and the A. amazonense genome. Sequence analysis of PCR amplification products revealed similarities between the amino acid sequences of the highly conserved nifD and glnB genes of A. amazonense and related genes of A. brasilense and other bacteria. However, the A. amazonense non-coding regions (the upstream activator sequence region and the region between the nifH and nifD genes) differed from related regions of A. brasilense even in nitrogenase structural genes which are highly conserved among diazotrophic bacteria. The feasibility of the 16S ribosomal RNA gene-based PCR system for specific detection of A. amazonense was shown. Our results indicate that the PCR primers for 16S rDNA defined in this article are highly specific to A. amazonense and can distinguish this species from A. brasilense.

  19. Isolation and identification of canine parainfluenza virus and partial gene sequence analysis of the isolate%犬副流感病毒的分离鉴定及部分基因序列分析

    Institute of Scientific and Technical Information of China (English)

    孙明; 刘巧荣; 秦亚嫚; 邓小雨; 杨欣艳; 刘伯华; 陈西钊


    目的:分离并鉴定犬副流感病毒(canine parainfluenza virus,CPIV),并对该病毒N、HN和F基因进行序列分析,研究其遗传变异情况;为CPIV诊断、治疗以及预防奠定分子生物学基础。方法用Vero细胞接种感染CPIV阳性犬肺组织,盲传3代,观察病变情况,收集72 h培养液进行PCR鉴定、血凝特性以及形态学观察。同时,用3对特异性引物,扩增N、HN和F全基因,进行序列测定和分析,并制作系统进化树。结果从犬肺中成功分离出一株犬副流感病毒,并命名为QF20100726。该病毒能凝集豚鼠、猪、鸡和人O型血,电镜观察病毒呈圆形、长丝状等形态多样、大小不等的颗粒状;序列分析表明,与人源、犬源等10株有代表性的副流感病毒基因相比,N基因核苷酸同源性为95.7%~99.8%,氨基酸同源性为97.4%~99.6%,其中有两处氨基酸发生新的变异,分别是第257位由V变成了A,第301位由A变成了T;HN基因核苷酸同源性为95.5~99.8%,氨基酸同源性为96.3~99.6%,F基因核苷酸同源性为94.7%~99.6%,氨基酸同源性为95.6%~99.3%。有两处发生特异变异,分别是56位由T变成了S,89位由T变成了M。结论成功分离犬副流感病毒QF20100726分离株,其血凝特性及超微形态特征均符合CPIV特性,N、HN和F全基因生物进化树显示,该分离株与犬副流感病毒分离株08-1990(登录号:KC237063)亲缘关系最近;N基因氨基酸在第257位(由V变成了A)和第301位(由A变成了T),F基因氨基酸在第56位(由T变成了S)和89位(由T变成了M)发生了特异性变异。这些数据将为CPIV的防控奠定基础。%Objective To isolate and identify canine parainfluenza virus ( CPIV) from the a dog suspected of CPIV infection, then to analyze the gene sequence variation of N,HN and F genes and provide a molecular biology basis for the diagnosis

  20. Partial genomic organization of ribosomal protein S7 gene from malaria vector Anopheles stephensi

    Institute of Scientific and Technical Information of China (English)



    In this study, we describe the partial genomic organization of ribosomal protein S7 gene isolated from the mosquito Anopheles stephensi. Initially a 558 bp partial cDNA sequence was amplified as precursor mRNA sequence containing 223 bp long intron. 5' and 3' end sequences were recovered using end specific rapid amplification of cDNA ends (RACE) polymerase chain reaction. The full-length cDNA sequence was 914 nucleotide long with an open reading frame capable of encoding 192 amino acid long protein with calculated molecular mass of 22174 Da and a pI point of 9.94. Protein homology search revealed >75% identity to other insect's S7 ribosomal proteins. Analysis of sequence alignment revealed several highly conserved domains, one of which is related to nuclear localization signal (NLS) region of human rpS7. Interestingly, intron nucleotide sequence comparison with A. gambiae showed a lesser degree of conservation as compared to coding and untranslated regions. Like this, early studies on the genomic organization and cDNA/Expressed sequence tag analysis (EST) could help in genome annotation ofA. stephensi, and would be likely to be sequenced in the future.

  1. Isolation of Hox cluster genes from insects reveals an accelerated sequence evolution rate.

    Directory of Open Access Journals (Sweden)

    Heike Hadrys

    Full Text Available Among gene families it is the Hox genes and among metazoan animals it is the insects (Hexapoda that have attracted particular attention for studying the evolution of development. Surprisingly though, no Hox genes have been isolated from 26 out of 35 insect orders yet, and the existing sequences derive mainly from only two orders (61% from Hymenoptera and 22% from Diptera. We have designed insect specific primers and isolated 37 new partial homeobox sequences of Hox cluster genes (lab, pb, Hox3, ftz, Antp, Scr, abd-a, Abd-B, Dfd, and Ubx from six insect orders, which are crucial to insect phylogenetics. These new gene sequences provide a first step towards comparative Hox gene studies in insects. Furthermore, comparative distance analyses of homeobox sequences reveal a correlation between gene divergence rate and species radiation success with insects showing the highest rate of homeobox sequence evolution.

  2. Phylogeny of the genus Haemophilus as determined by comparison of partial infB sequences. (United States)

    Hedegaard, J; Okkels, H; Bruun, B; Kilian, M; Mortensen, K K; Nørskov-Lauritsen, N


    A 453 bp fragment of infB, the gene encoding translation initiation factor 2, was sequenced and compared from 66 clinical isolates and type strains of Haemophilus species and related bacteria. Analysis of the partial infB sequences obtained suggested that the human isolates dependent on X and V factor, H. influenzae, H. haemolyticus, H. aegyptius and some cryptic genospecies of H. influenzae, were closely related to each other. H. parainfluenzae constituted a heterogeneous group within the boundaries of the genus, whereas H. aphrophilus/paraphrophilus and Actinobacillus actinomycetemcomitans were only remotely related to the type species of the genus Haemophilus H. parahaemolyticus and H. paraphrohaemolyticus took up an intermediary position and may not belong in the genus Haemophilus sensu stricto. Ambiguous results were obtained with seven isolates tentatively identified as H. segnis, which fell into two discrete clusters. The delineation of 'Haemophilus sensu stricto' as suggested by infB analysis supports previous results obtained by DNA hybridization, in contrast to the delineation inferred from 16S rRNA sequence comparison.

  3. Expression mediated by three partial sequences of the human tyrosine hydroxylase promoter in vivo

    Directory of Open Access Journals (Sweden)

    Anne-Sophie Rolland


    Full Text Available The use of viral vectors to transfect postmitotic neurons has provided an important research tool, and it offers promise for treatment of neurologic disease. The utility of vectors is enhanced by the use of selective promoters that permit control of the cellular site of expression. One potential clinical application is in the neurorestorative treatment of Parkinson's disease by the induction of new axon growth. However, many of the genes with an ability to restore axons have oncogenic potential. Therefore, clinical safety would be enhanced by restriction of expression to neurons affected by the disease, particularly dopamine neurons. To achieve this goal we have evaluated in vivo three partial sequences of the promoter for human tyrosine hydroxylase, the rate limiting enzyme in catecholamine synthesis. All sequences induced expression in dopamine neurons. None of them induced expression in glia or in nondopaminergic neurons in striatum or cortex. We conclude that these sequences have potential use for targeting dopamine neurons in research and clinical applications.

  4. Nemertean toxin genes revealed through transcriptome sequencing. (United States)

    Whelan, Nathan V; Kocot, Kevin M; Santos, Scott R; Halanych, Kenneth M


    Nemerteans are one of few animal groups that have evolved the ability to utilize toxins for both defense and subduing prey, but little is known about specific nemertean toxins. In particular, no study has identified specific toxin genes even though peptide toxins are known from some nemertean species. Information about toxin genes is needed to better understand evolution of toxins across animals and possibly provide novel targets for pharmaceutical and industrial applications. We sequenced and annotated transcriptomes of two free-living and one commensal nemertean and annotated an additional six publicly available nemertean transcriptomes to identify putative toxin genes. Approximately 63-74% of predicted open reading frames in each transcriptome were annotated with gene names, and all species had similar percentages of transcripts annotated with each higher-level GO term. Every nemertean analyzed possessed genes with high sequence similarities to known animal toxins including those from stonefish, cephalopods, and sea anemones. One toxin-like gene found in all nemerteans analyzed had high sequence similarity to Plancitoxin-1, a DNase II hepatotoxin that may function well at low pH, which suggests that the acidic body walls of some nemerteans could work to enhance the efficacy of protein toxins. The highest number of toxin-like genes found in any one species was seven and the lowest was three. The diversity of toxin-like nemertean genes found here is greater than previously documented, and these animals are likely an ideal system for exploring toxin evolution and industrial applications of toxins. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. EcoGene: a genome sequence database for Escherichia coli K-12. (United States)

    Rudd, K E


    The EcoGene database provides a set of gene and protein sequences derived from the genome sequence of Escherichia coli K-12. EcoGene is a source of re-annotated sequences for the SWISS-PROT and Colibri databases. EcoGene is used for genetic and physical map compilations in collaboration with the Coli Genetic Stock Center. The EcoGene12 release includes 4293 genes. EcoGene12 differs from the GenBank annotation of the complete genome sequence in several ways, including (i) the revision of 706 predicted or confirmed gene start sites, (ii) the correction or hypothetical reconstruction of 61 frame-shifts caused by either sequence error or mutation, (iii) the reconstruction of 14 protein sequences interrupted by the insertion of IS elements, and (iv) pre-dictions that 92 genes are partially deleted gene fragments. A literature survey identified 717 proteins whose N-terminal amino acids have been verified by sequencing. 12 446 cross-references to 6835 literature citations and s are provided. EcoGene is accessible at a new website: Users can search and retrieve individual EcoGene GenePages or they can download large datasets for incorporation into database management systems, facilitating various genome-scale computational and functional analyses.

  6. Sequencing and Gene Expression Analysis of Leishmania tropica LACK Gene.

    Directory of Open Access Journals (Sweden)

    Nour Hammoudeh


    Full Text Available Leishmania Homologue of receptors for Activated C Kinase (LACK antigen is a 36-kDa protein, which provokes a very early immune response against Leishmania infection. There are several reports on the expression of LACK through different life-cycle stages of genus Leishmania, but only a few of them have focused on L.tropica.The present study provides details of the cloning, DNA sequencing and gene expression of LACK in this parasite species. First, several local isolates of Leishmania parasites were typed in our laboratory using PCR technique to verify of Leishmania parasite species. After that, LACK gene was amplified and cloned into a vector for sequencing. Finally, the expression of this molecule in logarithmic and stationary growth phase promastigotes, as well as in amastigotes, was evaluated by Reverse Transcription-PCR (RT-PCR technique.The typing result confirmed that all our local isolates belong to L.tropica. LACK gene sequence was determined and high similarity was observed with the sequences of other Leishmania species. Furthermore, the expression of LACK gene in both promastigotes and amastigotes forms was confirmed.Overall, the data set the stage for future studies of the properties and immune role of LACK gene products.


    Institute of Scientific and Technical Information of China (English)

    张敏; 曹天文; 郭亚平; 张建珍; 马恩波; 王瑞


    The genus Mimathyma is one of the more controversial lineages of Apaturnae butterflies. We sequenced 432 bp of the mitochondrial cytochrome b gene from 19 individuals of seven species of the three genera: Mimathyma, Aptura and Athyma, and constructed the MP and the NJ trees using the Charaxes kahruba (Moore) as outgroup. The NJ and the MP phylogenetic trees show that M. chevana(Moore), M. nycteis (Ménétroès), M. schrenckii (Ménétriès) and M. ambica Kollar have closer relationship, supported the Moore's classification that these four species could be removed from the genus Apatura and included in the genus Mimathyma. M. chevana (Moore) appears to be the most distantly related to other species of the genus Mimathyma, despite of morphological similarity. In addition, our findings also indicate that the genus Mimathyma is more closely related to Apatura than Athyma.%迷蛱蝶属Mimathyma隶属于蛱蝶科Nymphalidae闪蛱蝶亚科Apaturinae,该属所包含的种类复杂,其分类学地位存在争议.本文对迷蛱蝶属、闪蛱蝶属Apatura和带蛱蝶属Athyma7个种共19个个体的线粒体DNA细胞色素b基因部分序列进行测定分析,并以花斑螯蛱蝶Charaxes kahruba (Moore)作为外群用PAUP软件构建MP和NJ分子系统树.结果显示迷蛱蝶Mimathyma chevana(Moore)、夜迷蛱蝶Mimathyma nycteis(Ménétriès)、白斑迷蛱蝶Mimathyma schrenckii (Ménétriès)和环带迷蛱蝶Mimathyma ambica Kollar形成1个聚类簇,支持Moore将这4个种由闪蛱蝶属移出并建立迷蛱蝶属的观点.同时,尽管迷蛱蝶在形态上与该属其余3种相似,但研究发现聚类簇Ⅰ中夜迷蛱蝶、白斑迷蛱蝶和环带迷蛱蝶首先相聚,然后再与迷蛱蝶聚在一起,表明迷蛱蝶与这3种亲缘关系较远.此外,本文的研究结果还显示迷蛱蝶属与闪蛱蝶属关系密切,而与带蛱蝶属的关系较远.

  8. The nucleotide sequences of two leghemoglobin genes from soybean

    DEFF Research Database (Denmark)

    Wiborg, O; Hyldig-Nielsen, J J; Jensen, E O


    We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences in identical positions. Comparison of the coding sequences with known amino-acid sequences of soybean leghemoglobins suggest that the two genes...

  9. Generalized bounds on the partial periodic correlation of complex roots of unity sequence set

    Institute of Scientific and Technical Information of China (English)


    In this paper, the generalized bounds are derived on the partial periodic correlation of complex roots of unity sequence set with zero or low correlation zone (ZCZ/LCZ) as the important criteria of the sequence design and application. The derived bounds are with respect to family size, subsequence length, maximum partial autocorrelation sidelobe, maximum partial crosscorrelation value and the ZCZ/LCZ. The results show that the derived bounds include the previous periodic bounds, such as Sarwate bound, Welch bound, Peng-Fan bound and Paterson-Lothian bound, as special cases.

  10. Cloning and sequencing genes related to preeclampsia

    Institute of Scientific and Technical Information of China (English)

    SHI Juan-zi; LIU Yan-fang; YAO Yuan-qing; YAN Wei; ZHU Feng; ZHAO Zhong-liang


    To clone genes specifically expressed in the placenta of patients with preeclampsia, and to explain the mechanism in the etiopathology ofpreeclampsia. Methods: The placentae ofpreeclamptic and normotensive subjects with pregnancy were used as models, and the cDNA Library was constructed and 20 differentially expressed fragments were cloned after a new version of PCR-based subtractive hybridization. The false positive clones were identified by reverse dot blot analysis. With one of the obtained gene taken as the probe, the placentas of 10 normal pregnant women and 10 preeclamptic patients were studied by using dot hybridization methods. Results: Six false positive clones were identified by reverse dot blot, and the rest 14 clones were identified as preeclampsia-related genes. These clones were sequenced, and analyzed with BLAST analysis system. Eleven of 14 clones were genes already known, among which one belongs to necdin family; the rest 3 were identified as novel genes. These 3 genes were acknowledged by GenBank, with the accession numbers AF232216, AF232217, AF233648. The results of dot hybridization using necdin gene as probe were as follows: (1) There was this mRNA in the placental tissues of normal pregnancy as well as in that ofpreeclampsia.(2) The intensity of transcription of this mRNA in the placental tissues of preeclampsia increased significantly compared with that of the normal pregnancy (P<0.05). Conclusions: This study for the first time reported this group of genes, especially necdin-expressing gene, which are related to the etiopathology of preeclampsia. In addition, the overtranscription ofnecdin gene has been found in preeclampsia. It is helpful in further studies of the etiology ofpreeclampsia.

  11. Partial primary structure of human pregnancy zone protein: extensive sequence homology with human alpha 2-macroglobulin

    DEFF Research Database (Denmark)

    Sottrup-Jensen, Lars; Folkersen, J; Kristensen, Torsten


    the results of complete or partial sequence determination of a random selection of 38 tryptic peptides covering 685 residues of the subunit of PZP, that PZP and alpha 2M indeed are extensively homologous. In the stretches of PZP sequenced so far, the degree of identically placed residues in the two proteins...

  12. Full-length minor ampullate spidroin gene sequence.

    Directory of Open Access Journals (Sweden)

    Gefei Chen

    Full Text Available Spider silk includes seven protein based fibers and glue-like substances produced by glands in the spider's abdomen. Minor ampullate silk is used to make the auxiliary spiral of the orb-web and also for wrapping prey, has a high tensile strength and does not supercontract in water. So far, only partial cDNA sequences have been obtained for minor ampullate spidroins (MiSps. Here we describe the first MiSp full-length gene sequence from the spider species Araneus ventricosus, using a multidimensional PCR approach. Comparative analysis of the sequence reveals regulatory elements, as well as unique spidroin gene and protein architecture including the presence of an unusually large intron. The spliced full-length transcript of MiSp gene is 5440 bp in size and encodes 1766 amino acid residues organized into conserved nonrepetitive N- and C-terminal domains and a central predominantly repetitive region composed of four units that are iterated in a non regular manner. The repeats are more conserved within A. ventricosus MiSp than compared to repeats from homologous proteins, and are interrupted by two nonrepetitive spacer regions, which have 100% identity even at the nucleotide level.

  13. Predicting gene expression from sequence: a reexamination.

    Directory of Open Access Journals (Sweden)

    Yuan Yuan


    Full Text Available Although much of the information regarding genes' expressions is encoded in the genome, deciphering such information has been very challenging. We reexamined Beer and Tavazoie's (BT approach to predict mRNA expression patterns of 2,587 genes in Saccharomyces cerevisiae from the information in their respective promoter sequences. Instead of fitting complex Bayesian network models, we trained naïve Bayes classifiers using only the sequence-motif matching scores provided by BT. Our simple models correctly predict expression patterns for 79% of the genes, based on the same criterion and the same cross-validation (CV procedure as BT, which compares favorably to the 73% accuracy of BT. The fact that our approach did not use position and orientation information of the predicted binding sites but achieved a higher prediction accuracy, motivated us to investigate a few biological predictions made by BT. We found that some of their predictions, especially those related to motif orientations and positions, are at best circumstantial. For example, the combinatorial rules suggested by BT for the PAC and RRPE motifs are not unique to the cluster of genes from which the predictive model was inferred, and there are simpler rules that are statistically more significant than BT's ones. We also show that CV procedure used by BT to estimate their method's prediction accuracy is inappropriate and may have overestimated the prediction accuracy by about 10%.

  14. Measurement of word frequencies in genomic DNA sequences based on partial alignment and fuzzy set. (United States)

    Shida, Fumiya; Mizuta, Satoshi


    Accompanied with the rapid increase of the amount of data registered in the databases of biological sequences, the need for a fast method of sequence comparison applicable to sequences of large size is also increasing. In general, alignment is used for sequence comparison. However, the alignment may not be appropriate for comparison of sequences of large size such as whole genome sequences due to its large time complexity. In this article, we propose a semi alignment-free method of sequence comparison based on word frequency distributions, in which we partially use the alignment to measure word frequencies along with the idea of fuzzy set theory. Experiments with ten bacterial genome sequences demonstrated that the fuzzy measurements has the effect that facilitates discrimination between close relatives and distant relatives.

  15. [cDNA cloning and sequence analysis of pluripotency genes in tree shrews (Tupaia belangeri)]. (United States)

    Wang, Cai-Yun; Ma, Yun-Han; He, Da-Jian; Yang, Shi-Hua


    In this paper, partial sequences of the tree shrew (Tupaia belangeri) Klf4, Sox2, and c-Myc genes were cloned and sequenced, which were 382, 612, and 485 bp in length and encoded 127, 204, and 161 amino acids, respectively. Whereas, their cDNA sequence identities with those of human were 89%, 98%, and 89%, respectively. Their phylogenetic tree results indicated different topologies and suggested individual evolutional pathways. These results can facilitate further functional studies.

  16. Molecular phylogeny of parasitic Platyhelminthes based on sequences of partial 28S rDNA D1 and mitochondrial cytochrome c oxidase subunit I. (United States)

    Lee, Soo-Ung; Chun, Ha-Chung; Huh, Sun


    The phylogenic relationships existing among 14 parasitic Platyhelminthes in the Republic of Korea were investigated via the use of the partial 28S ribosomal DNA (rDNA) D1 region and the partial mitochondrial cytochrome c oxidase subunit 1 (mCOI) DNA sequences. The nucleotide sequences were analyzed by length, G + C %, nucleotide differences and gaps in order to determine the analyzed phylogenic relationships. The phylogenic patterns of the 28S rDNA D1 and mCOI regions were closely related within the same class and order as analyzed by the PAUP 4.0 program, with the exception of a few species. These findings indicate that the 28S rDNA gene sequence is more highly conserved than are the mCOI gene sequences. The 28S rDNA gene may prove useful in studies of the systematics and population genetic structures of parasitic Platyhelminthes.

  17. Sequencing and identification of expressed Schistosoma mansoni genes by random selection of cDNA clones from a directional library

    Directory of Open Access Journals (Sweden)

    Glória R. Franco


    Full Text Available We have initiated a gene discovery program in Schistosoma mansoni based on the technique of Expressed Sequence Tags (ESTs, i.e. partial sequences of cDNAs obtained from single passes in automatic DNA sequencers. ESTs can be used to identify genese onf the basis of their homology whith sequences from other species deposited in DNA or protein databases. Trasncripts with sequences without matches in teh databases may represent novel parasite-specific genes. This approach has shown to be very efficient and in less than two years a broad range of novel genes has already been ascertained, more than doubling the number of known S. mansoni genes.

  18. Molecular phylogeny of Cricetulus griseus based on partial sequences of mitochondrial 16S rRNA gene analysis%中国地鼠线粒体DNA 16S rRNA基因序列分析及分子系统发育研究

    Institute of Scientific and Technical Information of China (English)

    宋国华; 高继萍; 王裕; 王春芳; 刘田福


    目的 通过克隆分析中国地鼠16S基因的部分序列,对中国地鼠16S基因的结构和功能进行初步探索和揭示.方法 从GenBank中已报道的啮齿动物16S基因保守区设计一对引物,进行PCR扩增,测序.用Blastn与GenBank中七种啮齿类动物的16S基因进行序列比较,分析其碱基组成及变异情况,并用邻接法(NJ)、非加权组平均法(UPGMA)构建分子系统树,在分子水平上探讨中国地鼠和其他啮齿类动物的进化关系,对中国地鼠的种属地位进行了进一步验证.结果 获得了中国地鼠线粒体16S基因的部分序列,其碱基组成A、T、C、G分别为40.5%、24.5%、18.7%、16.3%,与其他七种啮齿类动物的碱基含量相比,各碱基含量基本相似.NJ进化树表明,中国地鼠、金黄地鼠与欧洲仓鼠先聚为一支,小鼠与大鼠先聚为一支,东方田鼠、台湾田鼠与东欧田鼠先聚为一支.结论 中国地鼠和金黄地鼠的亲缘关系最近,与传统的分类地位基本吻合.%Objective To observe the structure and function of 16S gene by cloning and analyzing the partial sequence of Cricetulus griseus 16S, and to explore its molecular phylogeny. Methods According to the conservative domain of the published sequence of 16S gene of rodent animals in GenBank, a pair of primers that could amplify Cricetulus griseus 16S gene was designed and synthesized. The sequence was compared with the published 16S genes in GenBank by Blastn. Based on the 16S rRNA sequences the molecular phylogenetic trees were constructed by neighbor-joining method, unweighted air-group method with arithmetic means, and the taxonomic status of Cricetulus griseus was estimated at molecular level. Results A part of sequences of 16S rRNA gene in Cricetulus griseus was obtained,and the A, T, C, G base average contents were 40. 5% ,24. 5% ,18. 7% and 16. 3% , respectively. The 16S base content was similar to that in other 6 rodent species. The neighbor-joining ( NJ

  19. Some new sets of sequences of fuzzy numbers with respect to the partial metric. (United States)

    Kadak, Uğur; Ozluk, Muharrem


    In this paper, we essentially deal with Köthe-Toeplitz duals of fuzzy level sets defined using a partial metric. Since the utilization of Zadeh's extension principle is quite difficult in practice, we prefer the idea of level sets in order to construct some classical notions. In this paper, we present the sets of bounded, convergent, and null series and the set of sequences of bounded variation of fuzzy level sets, based on the partial metric. We examine the relationships between these sets and their classical forms and give some properties including definitions, propositions, and various kinds of partial metric spaces of fuzzy level sets. Furthermore, we study some of their properties like completeness and duality. Finally, we obtain the Köthe-Toeplitz duals of fuzzy level sets with respect to the partial metric based on a partial ordering.

  20. Partial sequence and toxic effects of granulitoxin, a neurotoxic peptide from the sea anemone Bunodosoma granulifera

    Directory of Open Access Journals (Sweden)

    A.N.C. Santana


    Full Text Available A neurotoxic peptide, granulitoxin (GRX, was isolated from the sea anemone Bunodosoma granulifera. The N-terminal amino acid sequence of GRX is AKTGILDSDGPTVAGNSLSGT and its molecular mass is 4958 Da by electrospray mass spectrometry. This sequence presents a partial degree of homology with other toxins from sea anemones such as Bunodosoma caissarum, Anthopleura fuscoviridis and Anemonia sulcata. However, important differences were found: the first six amino acids of the sequence are different, Arg-14 was replaced by Ala and no cysteine residues were present in the partial sequence, while two cysteine residues were present in the first 21 amino acids of other toxins described above. Purified GRX injected ip (800 µg/kg into mice produced severe neurotoxic effects such as circular movements, aggressive behavior, dyspnea, tonic-clonic convulsion and death. The 2-h LD50 of GRX was 400 ± 83 µg/kg.

  1. Unresolved orthology and peculiar coding sequence properties of lamprey genes: the KCNA gene family as test case

    Directory of Open Access Journals (Sweden)

    Kuraku Shigehiro


    Full Text Available Abstract Background In understanding the evolutionary process of vertebrates, cyclostomes (hagfishes and lamprey occupy crucial positions. Resolving molecular phylogenetic relationships of cyclostome genes with gnathostomes (jawed vertebrates genes is indispensable in deciphering both the species tree and gene trees. However, molecular phylogenetic analyses, especially those including lamprey genes, have produced highly discordant results between gene families. To efficiently scrutinize this problem using partial genome assemblies of early vertebrates, we focused on the potassium voltage-gated channel, shaker-related (KCNA family, whose members are mostly single-exon. Results Seven sea lamprey KCNA genes as well as six elephant shark genes were identified, and their orthologies to bony vertebrate subgroups were assessed. In contrast to robustly supported orthology of the elephant shark genes to gnathostome subgroups, clear orthology of any sea lamprey gene could not be established. Notably, sea lamprey KCNA sequences displayed unique codon usage pattern and amino acid composition, probably associated with exceptionally high GC-content in their coding regions. This lamprey-specific property of coding sequences was also observed generally for genes outside this gene family. Conclusions Our results suggest that secondary modifications of sequence properties unique to the lamprey lineage may be one of the factors preventing robust orthology assessments of lamprey genes, which deserves further genome-wide validation. The lamprey lineage-specific alteration of protein-coding sequence properties needs to be taken into consideration in tackling the key questions about early vertebrate evolution.

  2. Partial duplication of the APBA2 gene in chromosome 15q13 corresponds to duplicon structures

    Directory of Open Access Journals (Sweden)

    Kesterson Robert A


    Full Text Available Abstract Background Chromosomal abnormalities affecting human chromosome 15q11-q13 underlie multiple genomic disorders caused by deletion, duplication and triplication of intervals in this region. These events are mediated by highly homologous segments of DNA, or duplicons, that facilitate mispairing and unequal cross-over in meiosis. The gene encoding an amyloid precursor protein-binding protein (APBA2 was previously mapped to the distal portion of the interval commonly deleted in Prader-Willi and Angelman syndromes and duplicated in cases of autism. Results We show that this gene actually maps to a more telomeric location and is partially duplicated within the broader region. Two highly homologous copies of an interval containing a large 5' exon and downstream sequence are located ~5 Mb distal to the intact locus. The duplicated copies, containing the first coding exon of APBA2, can be distinguished by single nucleotide sequence differences and are transcriptionally inactive. Adjacent to APBA2 maps a gene termed KIAA0574. The protein encoded by this gene is weakly homologous to a protein termed X123 that in turn maps adjacent to APBA1 on 9q21.12; APBA1 is highly homologous to APBA2 in the C-terminal region and is distinguished from APBA2 by the N-terminal region encoded by this duplicated exon. Conclusion The duplication of APBA2 sequences in this region adds to a complex picture of different low copy repeats present across this region and elsewhere on the chromosome.

  3. Mutants of Streptomyces roseosporus that express enhanced recombination within partially homologous genes. (United States)

    Hosted, T J; Baltz, R H


    Streptomyces roseosporus mutants that express enhanced recombination between partially homologous (homeologous) sequences were isolated by selection for recombination between the bacteriophage phi C31 derivative KC570 containing the Streptomyces coelicolor glucose kinase (glk) gene and the S. roseosporus chromosome. The frequencies of homeologous recombination in the ehr mutants were determined by measuring the chromosomal insertion frequencies of plasmids containing S. coelicolor glnA or whiG genes. S. roseosporus ehr mutants showed 10(2)- to 10(4)-fold increases in homeologous recombination relative to Ehr+ strains, but no increase in homologous recombination. Southern hybridization analysis revealed single unique sites for the insertion of each of the plasmids, and the crossovers occurred in frame and in proper translational register, yielding functional chimeric glnA and whiG genes.

  4. Synthesis of a hexasaccharide partial sequence of hyaluronan for click chemistry and more

    Directory of Open Access Journals (Sweden)

    Marina Bantzi


    Full Text Available In the present work, the synthesis of a hexasaccharide partial sequence of hyaluronan equipped with a terminal azido moiety is reported. This hexasaccharide can be used for the attachment on surfaces by means of click chemistry and after suitable deprotection for biophysical studies.

  5. Improved identification of Gordonia, Rhodococcus and Tsukamurella species by 5'-end 16S rRNA gene sequencing. (United States)

    Wang, Tao; Kong, Fanrong; Chen, Sharon; Xiao, Meng; Sorrell, Tania; Wang, Xiaoyan; Wang, Shuo; Sintchenko, Vitali


    The identification of fastidious aerobic Actinomycetes such as Gordonia, Rhodococcus, and Tsukamurella has remained a challenge leading to clinically significant misclassifications. This study is intended to examine the feasibility of partial 5'-end 16S rRNA gene sequencing for the identification of Gordonia, Rhodococcus, and Tsukamurella, and defined potential reference sequences for species from each of these genera. The 16S rRNA gene sequence based identification algorithm for species identification was used and enhanced by aligning test sequences with reference sequences from the List of Prokaryotic Names with Standing in Nomenclature. Conventional PCR based 16S rRNA gene sequencing and the alignment of the isolate 16S rRNA gene sequence with reference sequences accurately identified 100% of clinical strains of aerobic Actinomycetes. While partial 16S rRNA gene sequences of reference type strains matched with the 16S rRNA gene sequences of 19 isolates in our data set, another 13 strains demonstrated a degree of polymorphism with a 1-4 bp difference in the regions of difference. 5'-end 606 bp 16S rRNA gene sequencing, coupled with the assignment of well defined reference sequences to clinically relevant species of bacteria, can be a useful strategy for improving the identification of clinically relevant aerobic Actinomycetes.

  6. Phylogenetics of Rhinolophidea, Vespertilionidae and Emballonuridae Based on Partial Sequences of the Nuclear RAG2 Gene%基于核基因RAG2部分序列探索菊头蝠总科、蝙蝠科和鞘尾蝠科物种的系统发育关系

    Institute of Scientific and Technical Information of China (English)

    袁小爱; 田东; 谷晓明


    采用PCR技术获得了翼手目(菊头蝠总科、蝙蝠科以及鞘尾蝠科)16种19个个体的核基因重组激活基因2(Recombination activating gene,RAG2)部分序列,长度为730~760 bp.结合从Genbank中提取的翼手目3科14个体的RAG2序列,通过构建贝叶斯(Bayesian inference,BI)和最大似然树对翼手目种属间的系统进化关系进行研究.研究表明:在葡头蝠总科中,菊头蝠科和蹄蝠科是两个独立的科,且在蹄蝠科中,大蹄蝠、中蹄蝠、普氏蹄蝠3种之间的亲缘关系非常近;蝙蝠科中的南蝠属与棕蝠属是姊妹群;长翼蝠亚科应提升为长翼蝠科.此外,鞘尾蝠科与犬吻蝠科形成姊妹群关系.%The partial sequences of nuclear RAG2 gene (Recombination activating gene, RAG2 )were obtained from 19 individuals of 16 species of Chiroptera( Rhinolophidea, Vespertilionidae and Emballonuridae) from Guizhou Province. The lengths of RAG2 are 730~760 bp. The evolutionary trees was produced by Bayesian inference (BI) and Maximum likelihood (ML) methods, combined with the sequences RAG2 of 14 individuals of 3 families from Genbank, to study the relationship of species or genera of Chiroptera It suggested that: Rhinolophidae and Hipposideridae were two separate families and H.pratti, H.armiger and H. larvatus have more closely genetic relationship in Hipposideridae, la genus and Eptesicus genus was sister group in Vespertilionidae. The subfamily Miniopterinae should be elevated to family status. In addition, Emballonuridae and Molossidae form sister group.

  7. Identification of differentially expressed genes after partial rat liver ischemia/reperfusion by suppression subtractive hybridization

    Institute of Scientific and Technical Information of China (English)

    Christine Fallsehr; Christina Zapletal; Michael Kremer; Resit Demir; Magnus von Knebel Doeberitz; Ernst Klar


    AIM: To identify potential diagnostic target genes in early reperfusion periods following warm liver ischemia before irreversible liver damage occurs.METHODS: We used two strategies (SSH suppression subtractive hybridization and hybridization of cDNA arrays)to determine early changes in gene expression profiles in a rat model of partial WI/R, comparing postischemic and adjacent nonischemic liver lobes. Differential gene expression was verified (WT/R; 1 h/2 h) and analyzed in more detail after warm ischemia (1 h) in a reperfusion time kinetics (0, 1, 2 and 6 h) and compared to untreated livers by Northern blot hybridizations. Protein expression was examined on Western blots and by immunohistochemistry for four differentially expressed target genes (Hsp70,Hsp27, Gadd45a and IL-1rl).RESULTS: Thirty-two individual WI/R target genes showing altered RNA levels after confirmation by Northern blot analyzes were identified. Among them, six functionally uncharacteristic expressed sequences and 26 known genes (12 induced in postischemic liver lobes, 14 with higher transcriptional expression in adjacent nonischemic liver lobes). Functional categories of the verified marker genes indicate on the one hand cellular stress and tissue damage but otherwise activation of protective cellular reactions (AP-1 transcription factors, apoptosis related genes, heat shock genes). In order to assign the transcriptional status to the biological relevant protein level we demonstrated that Hsp70, Hsp27, Gadd45a and IL-1rI were clearly up-regulated comparing postischemic and untreated rat livers, suggesting their involvement in the WI/R context.CONCLUSION: This study unveils a WI/R response gene set that will help to explore molecular pathways involved in the tissue damage after WI/R. In addition, these genes especially Hsp70and Gadd45a might represent promising new candidates indicating WI/R liver damage.

  8. Gene and translation initiation site prediction in metagenomic sequences

    Energy Technology Data Exchange (ETDEWEB)

    Hyatt, Philip Douglas [ORNL; LoCascio, Philip F [ORNL; Hauser, Loren John [ORNL; Uberbacher, Edward C [ORNL


    Gene prediction in metagenomic sequences remains a difficult problem. Current sequencing technologies do not achieve sufficient coverage to assemble the individual genomes in a typical sample; consequently, sequencing runs produce a large number of short sequences whose exact origin is unknown. Since these sequences are usually smaller than the average length of a gene, algorithms must make predictions based on very little data. We present MetaProdigal, a metagenomic version of the gene prediction program Prodigal, that can identify genes in short, anonymous coding sequences with a high degree of accuracy. The novel value of the method consists of enhanced translation initiation site identification, ability to identify sequences that use alternate genetic codes and confidence values for each gene call. We compare the results of MetaProdigal with other methods and conclude with a discussion of future improvements.

  9. Extensive mitochondrial genome rearrangements between Cerithioidea and Hypsogastropoda (Mollusca; Caenogastropoda) as determined from the partial nucleotide sequences of the mitochondrial DNA of Cerithidea djadjariensis and Batillaria cumingi. (United States)

    Kojima, Shigeaki


    Partial nucleotide sequences ( approximately 8000 bp) of the mitochondrial DNA of two cerithioidean gastropod species-Cerithidea djadjariensis and Batillaria cumingi-were determined. The order of mitochondrial genes (eight protein genes, two ribosomal RNA genes, and nine transfer RNA genes) was identical between these two species. and remarkably different from the previously reported order in other gastropods. The results indicate that the genome structure of the common ancestor of Cerithioidea and its sister group, Hypsogastropoda, is almost identical to that of the common ancestor of Gastropoda; moreover, independent mitochondrial genome rearrangements were identified between the lineages of Cerithioidea and Hypsogastropoda. The rearrangements within Cerithioidea can be explained by the inversion of a single tRNA gene, two translocations of a single tRNA gene, and three translocations of a genome fragment containing a tRNA gene and protein-coding gene(s).

  10. Molecular cloning and sequencing of a cDNA encoding partial putative molt-inhibiting hormone from Penaeus chinensis (United States)

    Wang, Zai-Zhao; Xiang, Jian-Hai


    Total RNA was extracted from eyestalks of shrimp Penaeus chinensis. Eyestalk cDNA was obtained from total RNA by reverse transcription. Reverse transcriptase-polymerase chain reaction (RT-PCR) was initiated using eyestalk cDNA and degenerate primers designed from the amino acid sequence of molt-inhibiting hormone from shrimp Penaeus japonicus. A specific cDNA was obtained and cloned into a T vector for sequencing. The cDNA consisted of 201 base pairs and encoding for a peptide of 67 amino acid residues. The peptide of P. chinensis had the highest identity with molt-inhibiting hormones of P. japonicus. The cDNA could be a partial gene of molt-inhibiting hormones from P. chinensis. This paper reports for the first time cDNA encoding for neuropeptide of P. chinensis.


    Institute of Scientific and Technical Information of China (English)

    王在照; 相建海


    Total RNA was extracted from eyestalks of shrimp Penaeus chinensis. Eyestalk cDNA was obtained from total RNA by reverse transcription. Reverse transcriptase-polymer ase chain reaction (RT-PCR) was initiated using eyestalk cDNA and degenerate primers designed from the amino acid sequence of molt-inhibiting hormone from shrimp Penaeus japonicus. A s pecific cDNA was obtained and cloned into a T vector for sequencing. The cDNA consisted of 201 ba se pairs and encoding for a peptide of 67 amino acid residues. The peptide of P. chinensis had the highest identity with molt-inhibiting hormones of P. japonicus. The cDNA could be a partial gene of molt-inhibiting hormones from P. chinensis. This paper reports for the first time cDNA encoding for neuropeptide of P. chinensis.


    Institute of Scientific and Technical Information of China (English)

    王在照; 相建海


    Total RNA was extracted from eyestalks of shrimp Penaeue chinensis. Eyestalk cDNA was obtained from total RNA by reverse transcription. Reverse transcriptase-polymerase chain reaction (RT-PCR) was initiated using eyestalk cDNA and degenerate primers designed from the amino acid sequence of molt-inhibiting hormone from shrimp Penaeus japonicus. A specific cDNA was obtained and cloned into a T vector for sequencing. The cDNA consisted of 201 base pairs and encoding for a peptide of 67 amino acid residues. The peptide of P. chinensis had the highest identity with molt-inhibiting hormones of P. japonicus. The cDNA could be a partial gene of molt-inhibiting hormones from P. chinensis. This paper reports for the first time cDNA encoding for neuropeptide of P. chinensis.

  13. Molecular cloning and nucleotide sequence of a transforming gene detected by transfection of chicken B-cell lymphoma DNA (United States)

    Goubin, Gerard; Goldman, Debra S.; Luce, Judith; Neiman, Paul E.; Cooper, Geoffrey M.


    A transforming gene detected by transfection of chicken B-cell lymphoma DNA has been isolated by molecular cloning. It is homologous to a conserved family of sequences present in normal chicken and human DNAs but is not related to transforming genes of acutely transforming retroviruses. The nucleotide sequence of the cloned transforming gene suggests that it encodes a protein that is partially homologous to the amino terminus of transferrin and related proteins although only about one tenth the size of transferrin.

  14. Using Partial Genomic Fosmid Libraries for Sequencing CompleteOrganellar Genomes

    Energy Technology Data Exchange (ETDEWEB)

    McNeal, Joel R.; Leebens-Mack, James H.; Arumuganathan, K.; Kuehl, Jennifer V.; Boore, Jeffrey L.; dePamphilis, Claude W.


    Organellar genome sequences provide numerous phylogenetic markers and yield insight into organellar function and molecular evolution. These genomes are much smaller in size than their nuclear counterparts; thus, their complete sequencing is much less expensive than total nuclear genome sequencing, making broader phylogenetic sampling feasible. However, for some organisms it is challenging to isolate plastid DNA for sequencing using standard methods. To overcome these difficulties, we constructed partial genomic libraries from total DNA preparations of two heterotrophic and two autotrophic angiosperm species using fosmid vectors. We then used macroarray screening to isolate clones containing large fragments of plastid DNA. A minimum tiling path of clones comprising the entire genome sequence of each plastid was selected, and these clones were shotgun-sequenced and assembled into complete genomes. Although this method worked well for both heterotrophic and autotrophic plants, nuclear genome size had a dramatic effect on the proportion of screened clones containing plastid DNA and, consequently, the overall number of clones that must be screened to ensure full plastid genome coverage. This technique makes it possible to determine complete plastid genome sequences for organisms that defy other available organellar genome sequencing methods, especially those for which limited amounts of tissue are available.

  15. DNA sequence explains seemingly disordered methylation levels in partially methylated domains of Mammalian genomes.

    Directory of Open Access Journals (Sweden)

    Dimos Gaidatzis


    Full Text Available For the most part metazoan genomes are highly methylated and harbor only small regions with low or absent methylation. In contrast, partially methylated domains (PMDs, recently discovered in a variety of cell lines and tissues, do not fit this paradigm as they show partial methylation for large portions (20%-40% of the genome. While in PMDs methylation levels are reduced on average, we found that at single CpG resolution, they show extensive variability along the genome outside of CpG islands and DNase I hypersensitive sites (DHS. Methylation levels range from 0% to 100% in a roughly uniform fashion with only little similarity between neighboring CpGs. A comparison of various PMD-containing methylomes showed that these seemingly disordered states of methylation are strongly conserved across cell types for virtually every PMD. Comparative sequence analysis suggests that DNA sequence is a major determinant of these methylation states. This is further substantiated by a purely sequence based model which can predict 31% (R(2 of the variation in methylation. The model revealed CpG density as the main driving feature promoting methylation, opposite to what has been shown for CpG islands, followed by various dinucleotides immediately flanking the CpG and a minor contribution from sequence preferences reflecting nucleosome positioning. Taken together we provide a reinterpretation for the nucleotide-specific methylation levels observed in PMDs, demonstrate their conservation across tissues and suggest that they are mainly determined by specific DNA sequence features.

  16. Marker2sequence, mine your QTL regions for candidate genes

    NARCIS (Netherlands)

    Chibon, P.Y.F.R.P.; Schoof, H.; Visser, R.G.F.; Finkers, H.J.


    Marker2sequence (M2S) aims at mining quantitative trait loci (QTLs) for candidate genes. For each gene, within the QTL region, M2S uses data integration technology to integrate putative gene function with associated gene ontology terms, proteins, pathways and literature. As a typical QTL region

  17. A Probabilistic Genome-Wide Gene Reading Frame Sequence Model

    DEFF Research Database (Denmark)

    Have, Christian Theil; Mørk, Søren

    We introduce a new type of probabilistic sequence model, that model the sequential composition of reading frames of genes in a genome. Our approach extends gene finders with a model of the sequential composition of genes at the genome-level -- effectively producing a sequential genome annotation...... and are evaluated by the effect on prediction performance. Since bacterial gene finding to a large extent is a solved problem it forms an ideal proving ground for evaluating the explicit modeling of larger scale gene sequence composition of genomes. We conclude that the sequential composition of gene reading frames...... as output. The model can be used to obtain the most probable genome annotation based on a combination of i: a gene finder score of each gene candidate and ii: the sequence of the reading frames of gene candidates through a genome. The model --- as well as a higher order variant --- is developed and tested...

  18. Characterization of the Helicoverpa assulta nucleopolyhedrovirus genome and sequence analysis of the polyhedrin gene region

    Indian Academy of Sciences (India)

    Soo-Dong Woo; Jae Young Choi; Yeon Ho Je; Byung Rae Jin


    A local strain of Helicoverpa assulta nucleopolyhedrovirus (HasNPV) was isolated from infected H. assulta larvae in Korea. Restriction endonuclease fragment analysis, using 4 restriction enzymes, estimated that the total genome size of HasNPV is about 138 kb. A degenerate polymerase chain reaction (PCR) primer set for the polyhedrin gene successfully amplified the partial polyhedrin gene of HasNPV. The sequencing results showed that the about 430 bp PCR product was a fragment of the corresponding polyhedrin gene. Using HasNPV partial predicted polyhedrin to probe the Southern blots, we identified the location of the polyhedrin gene within the 6 kb EcoRI, 15 kb NcoI, 20 kb XhoI, 17 kb BglII and 3 kb ClaI fragments, respectively. The 3 kb ClaI fragment was cloned and the nucleotide sequences of the polyhedrin coding region and its flaking regions were determined. Nucleotide sequence analysis indicated the presence of an open reading frame of 735 nucleotides which could encode 245 amino acids with a predicted molecular mass of 29 kDa. The nucleotide sequences within the coding region of HasNPV polyhedrin shared 73.7% identity with the polyhedrin gene from Autographa californica NPV but were most closely related to Helicoverpa and Heliothis species NPVs with over 99% sequence identity.

  19. Sequencing genes in silico using single nucleotide polymorphisms

    Directory of Open Access Journals (Sweden)

    Zhang Xinyi


    Full Text Available Abstract Background The advent of high throughput sequencing technology has enabled the 1000 Genomes Project Pilot 3 to generate complete sequence data for more than 906 genes and 8,140 exons representing 697 subjects. The 1000 Genomes database provides a critical opportunity for further interpreting disease associations with single nucleotide polymorphisms (SNPs discovered from genetic association studies. Currently, direct sequencing of candidate genes or regions on a large number of subjects remains both cost- and time-prohibitive. Results To accelerate the translation from discovery to functional studies, we propose an in silico gene sequencing method (ISS, which predicts phased sequences of intragenic regions, using SNPs. The key underlying idea of our method is to infer diploid sequences (a pair of phased sequences/alleles at every functional locus utilizing the deep sequencing data from the 1000 Genomes Project and SNP data from the HapMap Project, and to build prediction models using flanking SNPs. Using this method, we have developed a database of prediction models for 611 known genes. Sequence prediction accuracy for these genes is 96.26% on average (ranges 79%-100%. This database of prediction models can be enhanced and scaled up to include new genes as the 1000 Genomes Project sequences additional genes on additional individuals. Applying our predictive model for the KCNJ11 gene to the Wellcome Trust Case Control Consortium (WTCCC Type 2 diabetes cohort, we demonstrate how the prediction of phased sequences inferred from GWAS SNP genotype data can be used to facilitate interpretation and identify a probable functional mechanism such as protein changes. Conclusions Prior to the general availability of routine sequencing of all subjects, the ISS method proposed here provides a time- and cost-effective approach to broadening the characterization of disease associated SNPs and regions, and facilitating the prioritization of candidate

  20. Phylogenetic analysis of canine parvovirus partial VP2 gene in India. (United States)

    Mukhopadhyay, H K; Matta, Samyukta Lakshmi; Amsaveni, S; Antony, P X; Thanislass, J; Pillai, R M


    A total of 85 samples (58.0 %) were found to be positive for Canine parvovirus (CPV) by PCR assay (Hfor/Hrev primers) out of 158 suspected faecal samples of dogs collected from various states/union territories of India. Nine CPV isolates could be obtained in A-72 cell line. The sequencing of the partial VP2 gene of CPV identified the predominant CPV strain as CPV-2a (Ser297Ala) with one CPV-2b (Ser297Ala) and another CPV-2a variant strain (Ser297Gly). Several non-synonymous and synonymous mutations were also recorded in this study. The phylogenetic tree revealed that most of the CPV sequences from Tamil Nadu (Southern India) and Maharashtra (Western India) obtained during 2011 and few sequences from Northern India obtained during 2012 were grouped together along with CPV-2a (Ser297Ala) strains from China and India and followed the same evolution; although there was definitive indication of separate lineages too by few other sequences.

  1. Collection of mitochondrial cytochrome oxidase I gene sequences from Rhipicephalus ticks from various geographic locations around the world (United States)

    Determining the origin of the cattle tick, Rhipicephalus microplus, will be helpful to the effort to find biological control agents. Molecular phylogenetics can assist in this determination. Thus, we sequenced and assembled partial gene sequences from the mitochondrial cytochrome oxidase I coding r...

  2. Molecular phylogenetic relationship among seven Nemipterus species based on partial S7 ribosomal protein gene sequences%基于S7核糖体蛋白基因部分序列的7种金线鱼属鱼类分子系统进化关系

    Institute of Scientific and Technical Information of China (English)

    梁日深; 王超; 邹青; 周爱国; 邹记兴


      测定了7种金线鱼属(Nemipterus)及2种锥齿鲷属(Pentapus)鱼类S7核糖体蛋白基因(S7基因)内含子1部分序列,以二线眶棘鲈(Scolopsis bilineatus)做为外类群初步探讨了其分子进化关系。测序所得S7部分序列为734~746 bp,序列比对后得到同源序列743 bp。其中保守位点386个,变异位点351个,简约性信息位点289个。A+T含量(54.1%)高于G+C含量(45.9%)。基于Kimura 2-Parameter模型计算出7种金线鱼的遗传距离为0.042~0.294。S7序列存在大量碱基插入与缺失,其中日本金线鱼(Nemipterus japonicus)与苏门答腊金线鱼(N. mesoprion)在第167、182、474、608、662 bp位置,金线鱼(N. virgatus)与印度洋金线鱼(N. bipunctatus)第227、332、401、604 bp的位置具有相同的碱基插入缺失特征,且具有种类特异性。最后利用最大似然法(ML)与贝叶斯分析(BI)构建分子系统进化树,7种金线鱼聚在一起,其中日本金线鱼与苏门答腊金线鱼聚为一支,金线鱼、深水金线鱼和印度洋金线鱼聚成一支。结论认为需要结合多方面的形态与分子证据,才能进一步明确金线鱼属鱼类的系统进化关系。%Nemipterus species, belonging to the family Nemipteridae, order Perciformes, are widely distributed in the tropical and subtropical waters of the Indo-Western Pacific. Morphologically, a number of closely related Ne-mipterus species share similar external coloration and patterns, resulting in difficulty identifying and classifying them based on morphological evidence alone. Despite this, priorevaluation of Nemipterushas primarily relied on morphological data, with few molecular phylogenetic data currently available for this genus. We analyzed the mo-lecular phylogenetic relationship among seven common Nemipterus species collected from the Indo-Western Pa-cific. Partial sequences of the 1st intron of the S7 ribosomal protein gene were obtained for seven Nemipterus and two Pentapus species. We used

  3. Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites. (United States)

    Zhao, Ya-e; Wang, Zheng-hang; Xu, Yang; Xu, Ji-ru; Liu, Wen-yan; Wei, Meng; Wang, Chu-ying


    To our knowledge, few reports on Demodex studied at the molecular level are available at present. In this study our group, for the first time, cloned, sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum, Demodex brevis, and Demodex canis (three isolates from each species) from Xi'an China, by designing specific primers based on the only partial sequence of the CHS gene of D. canis from Japan, retrieved from GenBank. Results show that amplification was successful only in three D. canis isolates and one D. brevis isolate out of the nine Demodex isolates. The obtained fragments were sequenced to be 339 bp for D. canis and 338 bp for D. brevis. The CHS gene sequence similarities between the three Xi'an D. canis isolates and one Japanese D. canis isolate ranged from 99.7% to 100.0%, and those between four D. canis isolates and one D. brevis isolate were 99.1%-99.4%. Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters, according with the traditional classification. Two open reading frames (ORFs) were identified in each CHS gene sequenced, and their corresponding amino acid sequences were located at the catalytic domain. The relatively conserved sequences could be deduced to be a CHS class A gene, which is associated with chitin synthesis in the integument of Demodex mites.

  4. Discrimination of germline V genes at different sequencing lengths and mutational burdens: A new tool for identifying and evaluating the reliability of V gene assignment. (United States)

    Zhang, Bochao; Meng, Wenzhao; Prak, Eline T Luning; Hershberg, Uri


    Immune repertoires are collections of lymphocytes that express diverse antigen receptor gene rearrangements consisting of Variable (V), (Diversity (D) in the case of heavy chains) and Joining (J) gene segments. Clonally related cells typically share the same germline gene segments and have highly similar junctional sequences within their third complementarity determining regions. Identifying clonal relatedness of sequences is a key step in the analysis of immune repertoires. The V gene is the most important for clone identification because it has the longest sequence and the greatest number of sequence variants. However, accurate identification of a clone's germline V gene source is challenging because there is a high degree of similarity between different germline V genes. This difficulty is compounded in antibodies, which can undergo somatic hypermutation. Furthermore, high-throughput sequencing experiments often generate partial sequences and have significant error rates. To address these issues, we describe a novel method to estimate which germline V genes (or alleles) cannot be discriminated under different conditions (read lengths, sequencing errors or somatic hypermutation frequencies). Starting with any set of germline V genes, this method measures their similarity using different sequencing lengths and calculates their likelihood of unambiguous assignment under different levels of mutation. Hence, one can identify, under different experimental and biological conditions, the germline V genes (or alleles) that cannot be uniquely identified and bundle them together into groups of specific V genes with highly similar sequences.

  5. Molecular evidence of lateral gene transfer in rpoB gene of Mycobacterium yongonense strains via multilocus sequence analysis.

    Directory of Open Access Journals (Sweden)

    Byoung-Jun Kim

    Full Text Available Recently, a novel species, Mycobacterium yongonense (DSM 45126(T, was introduced and while it is phylogenetically related to Mycobacterium intracellulare, it has a distinct RNA polymerase β-subunit gene (rpoB sequence that is identical to that of Mycobacterium parascrofulaceum, which is a distantly related scotochromogen, which suggests the acquisition of the rpoB gene via a potential lateral gene transfer (LGT event. The aims of this study are to prove the presence of the LGT event in the rpoB gene of the M. yongonense strains via multilocus sequence analysis (MLSA. In order to determine the potential of an LGT event in the rpoB gene of the M. yongonense, the MLSA based on full rpoB sequences (3447 or 3450 bp and on partial sequences of five other targets [16S rRNA (1383 or 1395 bp, hsp65 (603 bp, dnaJ (192 bp, recA (1053 bp, and sodA (501 bp] were conducted. Incongruences between the phylogenetic analysis of the full rpoB and the five other genes in a total of three M. yongonense strains [two clinical strains (MOTT-12 and MOTT-27 and one type strain (DSM 45126(T] were observed, suggesting that rpoB gene of three M. yongonense strains may have been acquired very recently via an LGT event from M. parascrofulaceum, which is a distantly related scotochromogen.

  6. [Recent progress in gene mapping through high-throughput sequencing technology and forward genetic approaches]. (United States)

    Lu, Cairui; Zou, Changsong; Song, Guoli


    Traditional gene mapping using forward genetic approaches is conducted primarily through construction of a genetic linkage map, the process of which is tedious and time-consuming, and often results in low accuracy of mapping and large mapping intervals. With the rapid development of high-throughput sequencing technology and decreasing cost of sequencing, a variety of simple and quick methods of gene mapping through sequencing have been developed, including direct sequencing of the mutant genome, sequencing of selective mutant DNA pooling, genetic map construction through sequencing of individuals in population, as well as sequencing of transcriptome and partial genome. These methods can be used to identify mutations at the nucleotide level and has been applied in complex genetic background. Recent reports have shown that sequencing mapping could be even done without the reference of genome sequence, hybridization, and genetic linkage information, which made it possible to perform forward genetic study in many non-model species. In this review, we summarized these new technologies and their application in gene mapping.

  7. Validation of two-channel sequencing-by-synthesis for noninvasive prenatal testing of fetal whole and partial chromosome aberrations

    NARCIS (Netherlands)

    Neveling, K.; Thung, G.W.D.T.; Beulen, L.; Rens-Buijsman, W. van; Gomes, I.; Heuvel, S. van den; Mieloo, H.; Derks-Prinsen, I.; Kater-Baats, E.; Faas, B.H.W.


    OBJECTIVE: To validate Illumina's two-channel NextSeq 500 sequencing system for noninvasive prenatal testing (NIPT) of fetal whole chromosome and partial aberrations. METHODS: A total of 162 plasma samples, previously sequenced for NIPT on a SOLiD 5500xl platform, were sequenced on the NextSeq 500 u

  8. A human gut microbial gene catalogue established by metagenomic sequencing

    DEFF Research Database (Denmark)

    dos Santos, Marcelo Bertalan Quintanilha; Sicheritz-Pontén, Thomas; Nielsen, Henrik Bjørn


    , from faecal samples of 124 European individuals. The gene set, ,150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes......To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence...

  9. Low temperature biological phosphorus removal and partial nitrification in a pilot sequencing batch reactor system. (United States)

    Yuan, Qiuyan; Oleszkiewicz, Jan A


    Partial nitrification and biological phosphorus removal appear to hold promise of a cost-effective and sustainable biological nutrient removal process. Pilot sequencing batch reactors (SBRs) were operated under anaerobic/aerobic configuration for 8 months. It was found that biological phosphorus removal can be achieved in an SBR system, along with the partial nitrification process. Sufficient volatile fatty acids supply was the key for enhanced biological phosphorus removal. This experiment demonstrated that partial nitrification can be achieved even at low temperature with high dissolved oxygen (>3 mg/L) concentration. Shorter solid retention time (SRT) for nitrite oxidizing bacteria (NOB) than for ammonia oxidizing bacteria due to the nitrite substrate limitation at the beginning of the aeration cycle was the reason that caused NOB wash-out. Controlling SRT should be the strategy for an SBR operated in cold climate to achieve partial nitrification. It was also found that the aerobic phosphorus accumulating organisms' P-uptake was more sensitive to nitrite inhibition than the process of anaerobic P-release.

  10. Bridging PCR and partially overlapping primers for novel allergen gene cloning and expression insert decoration

    Institute of Scientific and Technical Information of China (English)

    Ai-Lin Tao; Shao-Heng He


    AIM: To obtain the entire gene open reading frame (ORF)and to construct the expression vectors for recombinant allergen production.METHODS: Gene fragments corresponding to the gene specific region and the cDNA ends of pollen allergens of short ragweed (Rg, Ambrosia artemisiifolia L.) were obtained by pan-degenerate primer-based PCR and rapid amplification of the cDNA ends (RACE), and the products were mixed to serve as the bridging PCR (BPCR) template. The full-length gene was then obtained. Partially overlapping primer-based PCR (POP-PCR) method was developed to overcome the other problem, i.e., the non-specific amplification of the ORF with routine long primers for expression insert decoration.Northern blot was conducted to confirm pollen sources of the gene. The full-length coding region was evaluated for its gene function by homologue search in GenBank database and Western blotting of the recombinant protein Amb a 8 (D106) expressed in Escherichia colipET-44 system.RESULTS: The full-length cDNA sequence of Amb a 8(D106)was obtained by using the above procedure and deduced to encode a 131 amino acid polypeptide. Multiple sequence alignment exhibited the gene D106sharing a homology as high as 54-89% and 79-89% to profilin from pollen and food sources, respectively. The expression vector of the allergen gene D106was successfully constructed by employing the combined method of BPCR and POP-PCR. Recombinant allergen rAmb a 8(D106) was then successfully generated.The allergenicity was hallmarked by immunoblotting with the allergic serum samples and its RNA source was confirmed by Northern blot.CONCLUSION: The combined procedure of POP-PCR and BPCR is a powerful method for full-length allergen gene retrieval and expression insert decoration, which would be useful for recombinant allergen production and subsequent diagnosis and immunotherapy of pollen and food allergy.

  11. Yeast DNA sequences initiating gene expression in Escherichia coli. (United States)

    Lewin, Astrid; Tran, Thi Tuyen; Jacob, Daniela; Mayer, Martin; Freytag, Barbara; Appel, Bernd


    DNA transfer between pro- and eukaryotes occurs either during natural horizontal gene transfer or as a result of the employment of gene technology. We analysed the capacity of DNA sequences from a eukaryotic donor organism (Saccharomyces cerevisiae) to serve as promoter region in a prokaryotic recipient (Escherichia coli) by creating fusions between promoterless luxAB genes from Vibrio harveyi and random DNA sequences from S. cerevisiae and measuring the luminescence of transformed E. coli. Fifty-four out of 100 randomly analysed S. cerevisiae DNA sequences caused considerable gene expression in E. coli. Determination of transcription start sites within six selected yeast sequences in E. coli confirmed the existence of bacterial -10 and -35 consensus sequences at appropriate distances upstream from transcription initiation sites. Our results demonstrate that the probability of transcription of transferred eukaryotic DNA in bacteria is extremely high and does not require the insertion of the transferred DNA behind a promoter of the recipient genome.

  12. Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler

    Directory of Open Access Journals (Sweden)

    Haines Albert


    Full Text Available Abstract Background The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. Results We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. Conclusion The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible.

  13. Operational conditions for successful partial nitrification in a sequencing batch reactor (SBR) based on process kinetics. (United States)

    Liu, Xiaoguang; Kim, Mingu; Nakhla, George


    The objective of this study is to analyze the factors affecting the performance of partial nitrification in a sequencing batch reactor (SBR) using kinetic models. During the 4-month operation, dissolved oxygen (DO) and influent ammonia concentration were selected as operating variables to evaluate nitrite accumulation. Stable partial nitrification was observed with two conditions, influent ammonia concentration of 190 mg N/L and a DO of 0.6-3.0 mg/L as well as influent ammonia concentration of 100 mg N/L and a DO of 0.15-2.0 mg/L with intermittent aeration. At a DO of 0.6-3.0 mg O2/L and influent ammonia concentration of 90 mg N/L, nitrite-oxidizing bacteria growth was not suppressed. Kinetic parameters were determined or estimated with batch tests and model simulation. The kinetic model predicted the SBR performance well.

  14. DNA methylation dynamics in the rat EGF gene promoter after partial hepatectomy

    Directory of Open Access Journals (Sweden)

    Deming Li


    Full Text Available Epidermal growth factor (EGF, a multifunctional growth factor, is a regulator in a wide variety of physiological processes. EGF plays an important role in the regulation of liver regeneration. This study was aimed at investigating the methylation level of EGF gene throughout liver regeneration. DNA of liver tissue from control rats and partial hepatectomy (PH rats at 10 time points was extracted and a 354 bp fragment including 10 CpG sites from the transcription start was amplified after DNA was modified by sodium bisulfate. The result of sequencing suggested that methylation ratio of four CpG sites was found to be significantly changed when PH group was compared to control group, in particular two of them were extremely striking. mRNA expression of EGF was down-regulated in total during liver regeneration. We think that the rat EGF promoter region is regulated by variation in DNA methylation during liver regeneration.


    Directory of Open Access Journals (Sweden)

    Anthonius Y.P.B.C. Widyatmoko


    Full Text Available Sequence polymorphisms among and within four Acacia species,  A. aulacocarpa, A. auriculiformis, A. crassicarpa, and A. mangium, were investigated using four chloroplast DNA genes (atpA, petA, rbcL, and rpoA. The phylogenetic relationship among these species is discussed in light of the results of the sequence information. No intraspecific sequence variation was found in the four genes of the four species, and a conservative rate of mutation of the chloroplast DNA genes was also confirmed in the Acacia species. In the atpA and petA of the four genes, all four species possessed identical sequences, and no sequence variation was found among the four Acacia species. In the rbcL and rpoA genes, however, sequence polymorphisms were revealed among these species. Acacia aulacocarpa and A. crassicarpa shared an identical sequence, and A. auriculiformis and A. mangium also showed no sequence variation.  The fact that A. mangium and A. auriculiformis shared identical sequences as did A. aulacocarpa and A. crassicarpa indicated that the two respective species were extremely closely related. Although a putative natural hybrid of A. aulacocarpa and A. auriculiformis has been reported, our results suggested that natural hybridization should be further verified using molecular markers.

  16. Isolamento e caracterização parcial de sequências homólogas a genes ribossomais (rDNA em Blastocladiella emersonii - DOI: 10.4025/actascibiolsci.v25i2.2037 Isolation and partial characterization of homologous sequences of ribosomal genes (rDNA in Blastocladiella emersonii

    Directory of Open Access Journals (Sweden)

    Luiz Carlos Correa


    Full Text Available A definição e a caracterização de regiões de origens de replicação nos eucariotos superiores são ainda controversas. A iniciação da replicação é sítio-específica em alguns sistemas e, em outros, parece estar contida em regiões extensas. Regiões rDNA são modelos atrativos para o estudo de origens de replicação pela sua organização in tandem, reduzindo a área de estudo para o espaço restrito que codifica uma unidade de transcrição. Neste trabalho nós isolamos e caracterizamos parcialmente um clone que contém uma sequência ribossomal do fungo aquático Blastocladiella emersonii, Be97M20. Southern blots mostraram diversos sítios para enzimas de restrição Eco RI, HindIII e SalI. Northern blot de RNA total hibridado contra uma sonda feita com Be97M20 confirmou a sua homologia com o gene ribossomal 18S. A caracterização detalhada, incluindo o mapeamento de restrição completo, subclonagem, sequenciamento e análise em géis bidimensionais proverão informações adicionais importantes sobre a estrutura e dinâmica desta regiãoThe definition and the characterization of replication origins regions in higher eukaryotes are still controversial. The initiation of the replication is site-specific in some systems but seems to occur in large regions in others. Because of its in tandem organization, reducing the area to the restricted space that codifies an unit of transcription, rDNA regions are attractive models to study replication origins. In this work we isolated and started to characterize a clone that contains a ribosomal sequence from the aquatic fungus B. emersonii, Be97M20. Southern blots showed several sites for the restrition enzymes Eco RI, HindIII and SalI. A northern blot of total RNA, hybridized against a probe made from Be97M20, confirmed its homology with the ribosomal 18S gene. The detailed characterization, including complete restriction map, subcloning, sequence and analysis on bidimensional gels will

  17. Complete nucleotide sequence of primitive vertebrate immunoglobulin light chain genes. (United States)

    Shamblott, M J; Litman, G W


    Antibody to Heterodontus francisci (horned shark) immunoglobulin light chain was used to screen a spleen cDNA expression library, and recombinant clones encoding light chain genes were isolated. The complete sequences of the mature coding regions of two light chain genes in this phylogenetically distant vertebrate have been determined and are reported here. Comparisons of the sequences are consistent with the presence of mammalian-like framework and complementarity-determining regions. The predicted amino acid sequences of the genes are more related to mammalian lambda than to kappa light chains. The nucleotide sequences of the genes are most related to mammalian T-cell antigen receptor beta chain. Heterodontus light chain genes may reflect characteristics of the common ancestor of immunoglobulin and T-cell antigen receptors before its evolutionary diversification.

  18. Partial summations of stationary sequences of non-Gaussian random variables

    DEFF Research Database (Denmark)

    Mohr, Gunnar; Ditlevsen, Ove Dalager


    -Gaussian variables up to the moments of the fourth order [Winterstein, S. R. Nonlinear vibration models for extremes and fatigue. J. Engng Mech. ASCE 114 (1988) 1772-1790](1). A method to obtain the Winterstein approximation to a partial sum of a sequence of Winterstein approximations is explained and results...... of convergence of the distribution of a sum (or an integral) of mutually dependent random variables to the Gaussian distribution. The paper is closely related to the work in Ditlevsen el al. [Ditlevsen, O., Mohr, G. & Hoffmeyer, P. Integration of non-Gaussian fields. Prob. Engng Mech 11 (1996) 15-23](2)....

  19. [Sequence analysis of LEAFY homologous gene from Dendrobium moniliforme and application for identification of medicinal Dendrobium]. (United States)

    Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu


    The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.

  20. Mutations in SRY and WT1 genes required for gonadal development are not responsible for XY partial gonadal dysgenesis

    Directory of Open Access Journals (Sweden)

    E.B. Tagliarini


    Full Text Available The WT1 transcription factor regulates SRY expression during the initial steps of the sex determination process in humans, activating a gene cascade leading to testis differentiation. In addition to causing Wilms' tumor, mutations in WT1 are often responsible for urogenital defects in men, while SRY mutations are mainly related to 46,XY pure gonadal dysgenesis. In order to evaluate their role in abnormal testicular organogenesis, we screened for SRY and WT1 gene mutations in 10 children with XY partial gonadal dysgenesis, 2 of whom with a history of Wilms' tumor. The open reading frame and 360 bp of the 5' flanking sequence of the SRY gene, and the ten exons and intron boundaries of the WT1 gene were amplified by PCR of genomic DNA. Single-strand conformation polymorphism was initially used for WT1 mutation screening. Since shifts in fragment migration were only observed for intron/exon 4, the ten WT1 exons from all patients were sequenced manually. No mutations were detected in the SRY 5' untranslated region or within SRY open-reading frame sequences. WT1 sequencing revealed one missense mutation (D396N in the ninth exon of a patient who also had Wilms' tumor. In addition, two silent point mutations were found in the first exon including one described here for the first time. Some non-coding sequence variations were detected, representing one new (IVS4+85A>G and two already described (-7ATG T>G, IVS9-49 T>C single nucleotide polymorphisms. Therefore, mutations in two major genes required for gonadal development, SRY and WT1, are not responsible for XY partial gonadal dysgenesis.

  1. Identification and sequence analysis of Tapasin gene in guinea fowl

    Directory of Open Access Journals (Sweden)

    Varuna P. Panicker


    Full Text Available Aim: An attempt has been made to identify and study the nucleotide sequence variability in exon 5 - exon 6 regions of guinea fowl Tapasin gene. Materials and Methods: Blood samples were collected from randomly selected birds (12 guinea fowl birds and Tapasin gene amplified using chicken specific primers designed from GenBank submitted sequences. Polymerase chain reaction conditions were standardized so as get only single amplicons. Obtained products were then cloned and sequenced; sequences were then analyzed using suitable software. Results: Amplicon size of the Tapasin gene in guinea fowl was same as reported in chicken with areas of transitions and transversions. The sequence variations reported in these coding sequences might have influence in the protein structure, which may be correlated with the increased immune status of the bird when compared with chicken breeds. Conclusion: Since Tapasin gene is an immunologically important gene, which plays an important role in the immune status of the bird. Sequence variations in the gene can be correlated with the altered immune status of the bird.

  2. Comparison of methods for genomic localization of gene trap sequences

    Directory of Open Access Journals (Sweden)

    Ferrin Thomas E


    Full Text Available Abstract Background Gene knockouts in a model organism such as mouse provide a valuable resource for the study of basic biology and human disease. Determining which gene has been inactivated by an untargeted gene trapping event poses a challenging annotation problem because gene trap sequence tags, which represent sequence near the vector insertion site of a trapped gene, are typically short and often contain unresolved residues. To understand better the localization of these sequences on the mouse genome, we compared stand-alone versions of the alignment programs BLAT, SSAHA, and MegaBLAST. A set of 3,369 sequence tags was aligned to build 34 of the mouse genome using default parameters for each algorithm. Known genome coordinates for the cognate set of full-length genes (1,659 sequences were used to evaluate localization results. Results In general, all three programs performed well in terms of localizing sequences to a general region of the genome, with only relatively subtle errors identified for a small proportion of the sequence tags. However, large differences in performance were noted with regard to correctly identifying exon boundaries. BLAT correctly identified the vast majority of exon boundaries, while SSAHA and MegaBLAST missed the majority of exon boundaries. SSAHA consistently reported the fewest false positives and is the fastest algorithm. MegaBLAST was comparable to BLAT in speed, but was the most susceptible to localizing sequence tags incorrectly to pseudogenes. Conclusion The differences in performance for sequence tags and full-length reference sequences were surprisingly small. Characteristic variations in localization results for each program were noted that affect the localization of sequence at exon boundaries, in particular.

  3. [CHL15--a new gene controlling the replication of chromosomes in saccharomycetes yeast: cloning, physical mapping, sequencing, and sequence analysis]. (United States)

    Kuprina, N Iu; Krol', E S; Koriabin, M Iu; Shestopalov, B V; Bliskovskiĭ, V V; Bannikov, V M; Gizatullin, R Z; Kirillov, A V; Kravtsov, V Iu; Zakhar'ev, V M


    We have analyzed the CHL15 gene, earlier identified in a screen for yeast mutants with increased loss of chromosome III and artificial circular and linear chromosomes in mitosis. Mutations in the CHL15 gene lead to a 100-fold increase in the rate of chromosome III loss per cell division and a 200-fold increase in the rate of marker homozygosis on this chromosome by mitotic recombination. Analysis of segregation of artificial circular minichromosome and artificially generated nonessential marker chromosome fragment indicated that sister chromatid loss (1:0 segregation) is a main reason of chromosome destabilization in the chl15-1 mutant. A genomic clone of CHL15 was isolated and used to map its physical position on chromosome XVI. Nucleotide sequence analysis of CHL15 revealed a 2.8-kb open reading frame with a 105-kD predicted protein sequence. At the N-terminal region of the protein sequences potentially able to form DNA-binding domains defined as zinc-fingers were found. The C-terminal region of the predicted protein displayed a similarity to sequence of regulatory proteins known as the helix-loop-helix (HLH) proteins. Data on partial deletion analysis suggest that the HLH domain is essential for the function of the CHL15 gene product. Analysis of the upstream untranslated region of CHL15 revealed the presence of the hexamer element, ACGCGT (an MluI restriction site) controlling both the periodic expression and coordinate regulation of the DNA synthesis genes in budding yeast. Deletion in the RAD52 gene, the product of which is involved in double-strand break/recombination repair and replication, leads to a considerable decrease in the growth rate of the chl15 mutant. We suggest that CHL15 is a new DNA synthesis gene in the yeast Saccharomyces cerevisiae.

  4. Molecular genotyping of human Ureaplasma species based on multiple-banded antigen (MBA) gene sequences. (United States)

    Kong, F; Ma, Z; James, G; Gordon, S; Gilbert, G L


    Ureaplasma urealyticum has been divided into 14 serovars. Recently, subdivision of U. urealyticum into two species has been proposed: U. parvum (previously U. urealyticum parvo biovar), comprising four serovars (1, 3, 6, 14) and U. urealyticum (previously U. urealyticum T-960 biovar), 10 serovars (2, 4, 5, 7-13). The multiple-banded antigen (MBA) genes of these species contain both species and serovar/subtype specific sequences. Based on whole sequences of the 5'-ends of MBA genes of U. parvum serovars and partial sequences of the 5'-ends of MBA genes of U. urealyticum serovars, we previously divided each of these species into three MBA genotypes. To further elucidate the relationships between serovars, we sequenced the whole 5'-ends of MBA genes of all 10 U. urealyticum serovars and partial repetitive regions of these genes from all serovars of U. parvum and U. urealyticum. For the first time, all four serovars of U. parvum were clearly differentiated from each other. In addition, the 10 serovars of U. urealyticum were divided into five MBA genotypes, as follows: MBA genotype A comprises serovars 2, 5, 8; MBA genotype B, serovar 10 only; MBA genotype C, serovars 4, 12, 13; MBA genotype D, serovar 9 only; and MBA genotype E comprises serovars 7 and 11. There were no sequence differences between members within each MBA genotype. Further work is required to identify other genes or other regions of the MBA genes that may be used to differentiate U. urealyticum serovars within MBA genotypes A, C and E. A better understanding of the molecular basis of serotype differentiation will help to improve subtyping methods for use in studies of the pathogenesis and epidemiology of these organisms.

  5. Changes in Gene Expression of Arabidopsis Thaliana Cell Cultures Upon Exposure to Real and Simulated Partial- g Forces (United States)

    Fengler, Svenja; Spirer, Ina; Neef, Maren; Ecke, Margret; Hauslage, Jens; Hampp, Rüdiger


    Cell cultures of the plant model organism Arabidopsis thaliana were exposed to partial- g forces during parabolic flight and clinostat experiments (0.16 g, 0.38 g and 0.5 g were tested). In order to investigate gravity-dependent alterations in gene expression, samples were metabolically quenched by the fixative RNA later Ⓡ to stabilize nucleic acids and used for whole-genome microarray analysis. An attempt to identify the potential threshold acceleration for the gravity-dependent response showed that the smaller the experienced g-force, the greater was the susceptibility of the cell cultures. Compared to short-term μ g during a parabolic flight, the number of differentially expressed genes under partial- g was lower. In addition, the effect on the alteration of amounts of transcripts decreased during partial- g parabolic flight due to the sequence of the different parabolas (0.38 g, 0.16 g and μ g). A time-dependent analysis under simulated 0.5 g indicates that adaptation occurs within minutes. Differentially expressed genes (at least 2-fold up- or down-regulated in expression) under real flight conditions were to some extent identical with those affected by clinorotation. The highest number of homologuous genes was detected within seconds of exposure to 0.38 g (both flight and clinorotation). To a considerable part, these genes deal with cell wall properties. Additionally, responses specific for clinorotation were observed.

  6. Degenerative primer design and gene sequencing validation for select turkey genes. (United States)

    Hutsko, Stephanie L; Lilburn, Michael S; Wick, Macdonald


    We successfully designed and validated degenerative primers for turkey genes MUC2, RPS13, TBP and TFF2 based on chicken sequences in order to use gene transcription analysis to evaluate (quantify) the mucin transcription to probiotic supplementation in turkeys. Primers were designed for the genes MUC2, TFF2, RPS13 and TBP using a degenerative primer design method based on the available Gallus gallus sequences. All primer sets, which produced a single PCR amplicon of the expected sizes, were cloned into the TOPO(®) vector and then transformed into TOP 10(®) competent cells. Plasmid DNA isolation was performed on the TOP10(®) cell culture and sent for sequencing. Sequences were analyzed using NCBI BLAST. All genes sequenced had over 90% homology with both the chicken and predicted turkey sequences. The sequences were used to design new 100% homologous primer sets for the genes of interest. © 2016 Poultry Science Association Inc.

  7. Regulatory sequence of cupin family gene

    Energy Technology Data Exchange (ETDEWEB)

    Hood, Elizabeth; Teoh, Thomas


    This invention is in the field of plant biology and agriculture and relates to novel seed specific promoter regions. The present invention further provide methods of producing proteins and other products of interest and methods of controlling expression of nucleic acid sequences of interest using the seed specific promoter regions.

  8. Variation of partial transferrin sequences and phylogenetic relationships among hares (Lepus capensis, Lagomorpha) from Tunisia. (United States)

    Awadi, Asma; Suchentrunk, Franz; Makni, Mohamed; Ben Slimen, Hichem


    North African hares are currently included in cape hares, Lepus capensis sensu lato, a taxon that may be considered a superspecies or a complex of closely related species. The existing molecular data, however, are not unequivocal, with mtDNA control region sequences suggesting a separate species status and nuclear loci (allozymes, microsatellites) revealing conspecificity of L. capensis and L. europaeus. Here, we study sequence variation in the intron 6 (468 bp) of the transferrin nuclear gene, of 105 hares with different coat colour from different regions in Tunisia with respect to genetic diversity and differentiation, as well as their phylogenetic status. Forty-six haplotypes (alleles) were revealed and compared phylogenetically to all available TF haplotypes of various Lepus species retrieved from GenBank. Maximum Likelihood, neighbor joining and median joining network analyses concordantly grouped all currently obtained haplotypes together with haplotypes belonging to six different Chinese hare species and the African scrub hare L. saxatilis. Moreover, two Tunisian haploypes were shared with L. capensis, L timidus, L. sinensis, L. yarkandensis, and L. hainanus from China. These results indicated the evolutionary complexity of the genus Lepus with the mixing of nuclear gene haplotypes resulting from introgressive hybridization or/and shared ancestral polymorphism. We report the presence of shared ancestral polymorphism between North African and Chinese hares. This has not been detected earlier in the mtDNA sequences of the same individuals. Genetic diversity of the TF sequences from the Tunisian populations was relatively high compared to other hare populations. However, genetic differentiation and gene flow analyses (AMOVA, FST, Nm) indicated little divergence with the absence of geographically meaningful phylogroups and lack of clustering with coat colour types. These results confirm the presence of a single hare species in Tunisia, but a sound inference on

  9. BAC-based sequencing of behaviorally-relevant genes in the prairie vole.

    Directory of Open Access Journals (Sweden)

    Lisa A McGraw

    Full Text Available The prairie vole (Microtus ochrogaster is an important model organism for the study of social behavior, yet our ability to correlate genes and behavior in this species has been limited due to a lack of genetic and genomic resources. Here we report the BAC-based targeted sequencing of behaviorally-relevant genes and flanking regions in the prairie vole. A total of 6.4 Mb of non-redundant or haplotype-specific sequence assemblies were generated that span the partial or complete sequence of 21 behaviorally-relevant genes as well as an additional 55 flanking genes. Estimates of nucleotide diversity from 13 loci based on alignments of 1.7 Mb of haplotype-specific assemblies revealed an average pair-wise heterozygosity (8.4×10(-3. Comparative analyses of the prairie vole proteins encoded by the behaviorally-relevant genes identified >100 substitutions specific to the prairie vole lineage. Finally, our sequencing data indicate that a duplication of the prairie vole AVPR1A locus likely originated from a recent segmental duplication spanning a minimum of 105 kb. In summary, the results of our study provide the genomic resources necessary for the molecular and genetic characterization of a high-priority set of candidate genes for regulating social behavior in the prairie vole.

  10. GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences. (United States)

    Antonov, Ivan; Baranov, Pavel; Borodovsky, Mark


    Database annotations of prokaryotic genomes and eukaryotic mRNA sequences pay relatively low attention to frame transitions that disrupt protein-coding genes. Frame transitions (frameshifts) could be caused by sequencing errors or indel mutations inside protein-coding regions. Other observed frameshifts are related to recoding events (that evolved to control expression of some genes). Earlier, we have developed an algorithm and software program GeneTack for ab initio frameshift finding in intronless genes. Here, we describe a database (freely available at containing genes with frameshifts (fs-genes) predicted by GeneTack. The database includes 206 991 fs-genes from 1106 complete prokaryotic genomes and 45 295 frameshifts predicted in mRNA sequences from 100 eukaryotic genomes. The whole set of fs-genes was grouped into clusters based on sequence similarity between fs-proteins (conceptually translated fs-genes), conservation of the frameshift position and frameshift direction (-1, +1). The fs-genes can be retrieved by similarity search to a given query sequence via a web interface, by fs-gene cluster browsing, etc. Clusters of fs-genes are characterized with respect to their likely origin, such as pseudogenization, phase variation, etc. The largest clusters contain fs-genes with programed frameshifts (related to recoding events).

  11. Sequence signatures involved in targeting the male-specific lethal complex to X-chromosomal genes in Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Philip Philge


    Full Text Available Abstract Background In Drosophila melanogaster, the dosage-compensation system that equalizes X-linked gene expression between males and females, thereby assuring that an appropriate balance is maintained between the expression of genes on the X chromosome(s and the autosomes, is at least partially mediated by the Male-Specific Lethal (MSL complex. This complex binds to genes with a preference for exons on the male X chromosome with a 3' bias, and it targets most expressed genes on the X chromosome. However, a number of genes are expressed but not targeted by the complex. High affinity sites seem to be responsible for initial recruitment of the complex to the X chromosome, but the targeting to and within individual genes is poorly understood. Results We have extensively examined X chromosome sequence variation within five types of gene features (promoters, 5' UTRs, coding sequences, introns, 3' UTRs and intergenic sequences, and assessed its potential involvement in dosage compensation. Presented results show that: the X chromosome has a distinct sequence composition within its gene features; some of the detected variation correlates with genes targeted by the MSL-complex; the insulator protein BEAF-32 preferentially binds upstream of MSL-bound genes; BEAF-32 and MOF co-localizes in promoters; and that bound genes have a distinct sequence composition that shows a 3' bias within coding sequence. Conclusions Although, many strongly bound genes are close to a high affinity site neither our promoter motif nor our coding sequence signatures show any correlation to HAS. Based on the results presented here, we believe that there are sequences in the promoters and coding sequences of targeted genes that have the potential to direct the secondary spreading of the MSL-complex to nearby genes.

  12. Sequence signatures involved in targeting the Male-Specific Lethal complex to X-chromosomal genes in Drosophila melanogaster. (United States)

    Philip, Philge; Pettersson, Fredrik; Stenberg, Per


    In Drosophila melanogaster, the dosage-compensation system that equalizes X-linked gene expression between males and females, thereby assuring that an appropriate balance is maintained between the expression of genes on the X chromosome(s) and the autosomes, is at least partially mediated by the Male-Specific Lethal (MSL) complex. This complex binds to genes with a preference for exons on the male X chromosome with a 3' bias, and it targets most expressed genes on the X chromosome. However, a number of genes are expressed but not targeted by the complex. High affinity sites seem to be responsible for initial recruitment of the complex to the X chromosome, but the targeting to and within individual genes is poorly understood. We have extensively examined X chromosome sequence variation within five types of gene features (promoters, 5' UTRs, coding sequences, introns, 3' UTRs) and intergenic sequences, and assessed its potential involvement in dosage compensation. Presented results show that: the X chromosome has a distinct sequence composition within its gene features; some of the detected variation correlates with genes targeted by the MSL-complex; the insulator protein BEAF-32 preferentially binds upstream of MSL-bound genes; BEAF-32 and MOF co-localizes in promoters; and that bound genes have a distinct sequence composition that shows a 3' bias within coding sequence. Although, many strongly bound genes are close to a high affinity site neither our promoter motif nor our coding sequence signatures show any correlation to HAS. Based on the results presented here, we believe that there are sequences in the promoters and coding sequences of targeted genes that have the potential to direct the secondary spreading of the MSL-complex to nearby genes.

  13. Mechanism of Gene Amplification via Yeast Autonomously Replicating Sequences

    Directory of Open Access Journals (Sweden)

    Shelly Sehgal


    Full Text Available The present investigation was aimed at understanding the molecular mechanism of gene amplification. Interplay of fragile sites in promoting gene amplification was also elucidated. The amplification promoting sequences were chosen from the Saccharomyces cerevisiae ARS, 5S rRNA regions of Plantago ovata and P. lagopus, proposed sites of replication pausing at Ste20 gene locus of S. cerevisiae, and the bend DNA sequences within fragile site FRA11A in humans. The gene amplification assays showed that plasmid bearing APS from yeast and human beings led to enhanced protein concentration as compared to the wild type. Both the in silico and in vitro analyses were pointed out at the strong bending potential of these APS. In addition, high mitotic stability and presence of TTTT repeats and SAR amongst these sequences encourage gene amplification. Phylogenetic analysis of S. cerevisiae ARS was also conducted. The combinatorial power of different aspects of APS analyzed in the present investigation was harnessed to reach a consensus about the factors which stimulate gene expression, in presence of these sequences. It was concluded that the mechanism of gene amplification was that AT rich tracts present in fragile sites of yeast serve as binding sites for MAR/SAR and DNA unwinding elements. The DNA protein interactions necessary for ORC activation are facilitated by DNA bending. These specific bindings at ORC promote repeated rounds of DNA replication leading to gene amplification.

  14. Biased distribution of DNA uptake sequences towards genome maintenance genes

    DEFF Research Database (Denmark)

    Davidsen, T.; Rodland, E.A.; Lagesen, K.


    coding regions are the DNA uptake sequences (DUS) required for natural genetic transformation. More importantly, we found a significantly higher density of DUS within genes involved in DNA repair, recombination, restriction-modification and replication than in any other annotated gene group......Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within...

  15. Targeted sequencing of cancer-related genes in colorectal cancer using next-generation sequencing.

    Directory of Open Access Journals (Sweden)

    Sae-Won Han

    Full Text Available Recent advance in sequencing technology has enabled comprehensive profiling of genetic alterations in cancer. We have established a targeted sequencing platform using next-generation sequencing (NGS technology for clinical use, which can provide mutation and copy number variation data. NGS was performed with paired-end library enriched with exons of 183 cancer-related genes. Normal and tumor tissue pairs of 60 colorectal adenocarcinomas were used to test feasibility. Somatic mutation and copy number alteration were analyzed. A total of 526 somatic non-synonymous sequence variations were found in 113 genes. Among these, 278 single nucleotide variations were 232 different somatic point mutations. 216 SNV were 79 known single nucleotide polymorphisms in the dbSNP. 32 indels were 28 different indel mutations. Median number of mutated gene per tumor was 4 (range 0-23. Copy number gain (>X2 fold was found in 65 genes in 40 patients, whereas copy number loss (genes in 39 patients. The most frequently altered genes (mutation and/or copy number alteration were APC in 35 patients (58%, TP53 in 34 (57%, and KRAS in 24 (40%. Altered gene list revealed ErbB signaling pathway as the most commonly involved pathway (25 patients, 42%. Targeted sequencing platform using NGS technology is feasible for clinical use and provides comprehensive genetic alteration data.

  16. Molecular phylogeny of the crab genera Helice and Helicana (Crustacea, Decapoda) based on the partial sequences of mitochondrial 16S rRNA gene%厚蟹线粒体16S rRNA基因序列分析及系统发育研究

    Institute of Scientific and Technical Information of China (English)

    徐敬明; 孙翰昌; 孙世春


    对我国沿海天津厚蟹6个群体、侧足厚蟹4个群体、伍氏仿厚蟹2个群体和日本仿厚蟹1个群体的线粒体16S rRNA基因片段进行了序列测定;结合从GenBank下载的其他厚蟹序列,分析了厚蟹分子系统发育关系.除天津厚蟹丹东群体的2个个体16S rRNA基因片段长度为526 bp外,其他天津厚蟹和侧足厚蟹均为525 bp;除日照群体和丹东群体外,其他每个群体的个体之间没有序列差异,天津厚蟹和侧足厚蟹共有5个单倍型.伍氏仿厚蟹16S rRNA基因片段长度均为525 bp,群体、个体之间均无序列差异.日本仿厚蟹16S rRNA基因片段长度均为523 bp,个体之间无序列差异.其A,T,G,C含量只有略微的差异,A+T含量(72.3%~73.7%)明显高于G+C含量.4种厚蟹所有序列比对,在526 bp序列上共有变异位点36处和4处插入/缺失,其中简约信息位点26处,转换/颠换的平均值为2.8.在5个单倍型中,侧足厚蟹泉州、宁波群体的6个个体与天津厚蟹宁波、日照群体的4个个体的序列都相同,共享1个单倍型;而且5个单倍型之间的遗传距离很小,仅为0.19%~1.15%,表明二者为同一物种的不同形态类型.采用NJ法,ML法和MP法构建的厚蟹/张口蟹复合群系统进化树的拓扑结构基本一致.结果显示,天津厚蟹和侧足厚蟹的10个群体的5个单倍型和台湾厚蟹首先聚到一起,形成侧足厚蟹复合体后,再与三齿厚蟹聚为一支;日本仿厚蟹与德氏仿厚蟹先聚在一起,然后再与伍氏仿厚蟹聚为一支;均得到99%置信度的支持,表明厚蟹属蟹类和仿厚蟹属蟹类均为单系,支持将厚蟹亚属和仿厚蟹亚属分别提升为属.分子数据及系统树的拓扑结构亦支持将假厚蟹亚属和拟厚蟹亚属分别提升为属,以及形态学上将厚蟹/张口蟹复合群分为7个属的结论.%The partial sequences of mitochondrial 16S rRNA gene of Helice tientsinensis (six populations), H. Latimera

  17. Coelacanth genome sequence reveals the evolutionary history of vertebrate genes. (United States)

    Noonan, James P; Grimwood, Jane; Danke, Joshua; Schmutz, Jeremy; Dickson, Mark; Amemiya, Chris T; Myers, Richard M


    The coelacanth is one of the nearest living relatives of tetrapods. However, a teleost species such as zebrafish or Fugu is typically used as the outgroup in current tetrapod comparative sequence analyses. Such studies are complicated by the fact that teleost genomes have undergone a whole-genome duplication event, as well as individual gene-duplication events. Here, we demonstrate the value of coelacanth genome sequence by complete sequencing and analysis of the protocadherin gene cluster of the Indonesian coelacanth, Latimeria menadoensis. We found that coelacanth has 49 protocadherin cluster genes organized in the same three ordered subclusters, alpha, beta, and gamma, as the 54 protocadherin cluster genes in human. In contrast, whole-genome and tandem duplications have generated two zebrafish protocadherin clusters comprised of at least 97 genes. Additionally, zebrafish protocadherins are far more prone to homogenizing gene conversion events than coelacanth protocadherins, suggesting that recombination- and duplication-driven plasticity may be a feature of teleost genomes. Our results indicate that coelacanth provides the ideal outgroup sequence against which tetrapod genomes can be measured. We therefore present L. menadoensis as a candidate for whole-genome sequencing.

  18. Q8IYL2 is a candidate gene for the familial epilepsy syndrome of Partial Epilepsy with Pericentral Spikes (PEPS). (United States)

    Leschziner, Guy D; Coffey, Alison J; Andrew, Toby; Gregorio, Sheila P; Dias-Neto, Emmanuel; Calafato, Maria; Bentley, David R; Kinton, Lucy; Sander, Josemir W; Johnson, Michael R


    Partial Epilepsy with Pericentral Spikes (PEPS) is a novel Mendelian idiopathic epilepsy with evidence of linkage to Chromosome 4p15. Our aim was to identify the causative mutation in this epilepsy syndrome. We re-annotated all 42 genes in the linked chromosomal region and sequenced all genes within the linked interval. All exons, intron-exon boundaries and untranslated regions were sequenced in the original pedigree, and novel changes segregating correctly were subjected to bioinformatic analysis. Quantitative polymerase chain reaction was performed to examine for potential copy number variation (CNV). 29 previously undescribed variants correctly segregating with the linked haplotype were identified. Bioinformatic analysis demonstrated that six variants were non-synonymous coding sequence polymorphisms, one of which, in Q8IYL2 (Gly400Ala), was found in neither Caucasian (n=243) and ancestry-matched Brazilian (n=180) control samples, nor subjects from the 1000 Genome Project. No gene duplications or deletions were identified in the linked region. We postulate that Q8IYL2 is a causative gene for PEPS, after exhaustive resequencing and bioinformatic analysis. The function of this gene is unknown, but it is expressed in brain tissue. Copyright © 2011 Elsevier B.V. All rights reserved.

  19. Efficient Query-by-Content Audio Retrieval by Locality Sensitive Hashing and Partial Sequence Comparison (United States)

    Yu, Yi; Joe, Kazuki; Downie, J. Stephen

    This paper investigates suitable indexing techniques to enable efficient content-based audio retrieval in large acoustic databases. To make an index-based retrieval mechanism applicable to audio content, we investigate the design of Locality Sensitive Hashing (LSH) and the partial sequence comparison. We propose a fast and efficient audio retrieval framework of query-by-content and develop an audio retrieval system. Based on this framework, four different audio retrieval schemes, LSH-Dynamic Programming (DP), LSH-Sparse DP (SDP), Exact Euclidian LSH (E2LSH)-DP, E2LSH-SDP, are introduced and evaluated in order to better understand the performance of audio retrieval algorithms. The experimental results indicate that compared with the traditional DP and the other three compititive schemes, E2LSH-SDP exhibits the best tradeoff in terms of the response time, retrieval accuracy and computation cost.

  20. Molecular characterization of 18S rDNA partial sequence in Microcosmus (Stolidobranchiata, Pyuridae

    Directory of Open Access Journals (Sweden)



    Full Text Available We present a 18S rDNA based molecular phylogeny of two species of the genus Microcosmus (M. sulcatus and M. claudicans sampled in the Mediterranean, to investigate their phylogenetic position relative to species of the order Stolidobranchiata. The analysis is based on partial sequences (739 bp of the 18S rDNA. Among the 18 variable sites found between the two species, 4 correspond to transitions (ts, 14 to transversions (tv and 4 to deletions/insertions. In the considered Stolidobranchiata, we found 4.3% overall mean number of nucleotide differences and 0.06 (S.E. ±0.01 Kimura 2-parameter distance. The mean number of nucleotide differences between Microcosmus spp. and other Stolidobranchiata species was of 6% and 0.08 (S.E. ±0.01 Kimura 2-parameter distance. A molecular phylogeny obtained by Maximum Parsimony corroborates results of the traditional taxonomy.

  1. Molecular characterization of 18S rDNA partial sequence in Microcosmus (Stolidobranchiata, Pyuridae

    Directory of Open Access Journals (Sweden)



    Full Text Available We present a 18S rDNA based molecular phylogeny of two species of the genus Microcosmus (M. sulcatus and M. claudicans sampled in the Mediterranean, to investigate their phylogenetic position relative to species of the order Stolidobranchiata. The analysis is based on partial sequences (739 bp of the 18S rDNA. Among the 18 variable sites found between the two species, 4 correspond to transitions (ts, 14 to transversions (tv and 4 to deletions/insertions. In the considered Stolidobranchiata, we found 4.3% overall mean number of nucleotide differences and 0.06 (S.E. ±0.01 Kimura 2-parameter distance. The mean number of nucleotide differences between Microcosmus spp. and other Stolidobranchiata species was of 6% and 0.08 (S.E. ±0.01 Kimura 2-parameter distance. A molecular phylogeny obtained by Maximum Parsimony corroborates results of the traditional taxonomy.

  2. High occurrence of functional new chimeric genes in survey of rice chromosome 3 short arm genome sequences. (United States)

    Zhang, Chengjun; Wang, Jun; Marowsky, Nicholas C; Long, Manyuan; Wing, Rod A; Fan, Chuanzhu


    In an effort to identify newly evolved genes in rice, we searched the genomes of Asian-cultivated rice Oryza sativa ssp. japonica and its wild progenitors, looking for lineage-specific genes. Using genome pairwise comparison of approximately 20-Mb DNA sequences from the chromosome 3 short arm (Chr3s) in six rice species, O. sativa, O. nivara, O. rufipogon, O. glaberrima, O. barthii, and O. punctata, combined with synonymous substitution rate tests and other evidence, we were able to identify potential recently duplicated genes, which evolved within the last 1 Myr. We identified 28 functional O. sativa genes, which likely originated after O. sativa diverged from O. glaberrima. These genes account for around 1% (28/3,176) of all annotated genes on O. sativa's Chr3s. Among the 28 new genes, two recently duplicated segments contained eight genes. Fourteen of the 28 new genes consist of chimeric gene structure derived from one or multiple parental genes and flanking targeting sequences. Although the majority of these 28 new genes were formed by single or segmental DNA-based gene duplication and recombination, we found two genes that were likely originated partially through exon shuffling. Sequence divergence tests between new genes and their putative progenitors indicated that new genes were most likely evolving under natural selection. We showed all 28 new genes appeared to be functional, as suggested by Ka/Ks analysis and the presence of RNA-seq, cDNA, expressed sequence tag, massively parallel signature sequencing, and/or small RNA data. The high rate of new gene origination and of chimeric gene formation in rice may demonstrate rice's broad diversification, domestication, its environmental adaptation, and the role of new genes in rice speciation.

  3. Sequence Variability in Staphylococcal Enterotoxin Genes seb, sec, and sed

    Directory of Open Access Journals (Sweden)

    Sophia Johler


    Full Text Available Ingestion of staphylococcal enterotoxins preformed by Staphylococcus aureus in food leads to staphylococcal food poisoning, the most prevalent foodborne intoxication worldwide. There are five major staphylococcal enterotoxins: SEA, SEB, SEC, SED, and SEE. While variants of these toxins have been described and were linked to specific hosts or levels or enterotoxin production, data on sequence variation is still limited. In this study, we aim to extend the knowledge on promoter and gene variants of the major enterotoxins SEB, SEC, and SED. To this end, we determined seb, sec, and sed promoter and gene sequences of a well-characterized set of enterotoxigenic Staphylococcus aureus strains originating from foodborne outbreaks, human infections, human nasal colonization, rabbits, and cattle. New nucleotide sequence variants were detected for all three enterotoxins and a novel amino acid sequence variant of SED was detected in a strain associated with human nasal colonization. While the seb promoter and gene sequences exhibited a high degree of variability, the sec and sed promoter and gene were more conserved. Interestingly, a truncated variant of sed was detected in all tested sed harboring rabbit strains. The generated data represents a further step towards improved understanding of strain-specific differences in enterotoxin expression and host-specific variation in enterotoxin sequences.

  4. Molecular characterisation of lineage IV peste des petits ruminants virus using multi gene sequence data. (United States)

    Kumar, K Senthil; Babu, Aravindh; Sundarapandian, G; Roy, Parimal; Thangavelu, A; Kumar, K Siva; Arumugam, R; Chandran, N D J; Muniraju, Murali; Mahapatra, Mana; Banyard, Ashley C; Manohar, B Murali; Parida, Satya


    Peste des petits ruminants is responsible for an economically important plague of small ruminants that is endemic across much of the developing world. Here we describe the detection and characterisation of a PPR virus from a recent outbreak in Tamil Nadu, India. We demonstrate the isolation of PPR virus from rectal swab and highlight the potential spread of disease to in-contact animals through faecal materials and use of faecal material as non-invasive method of sampling for susceptible wild ruminants. Finally we have performed a comprehensive 'multi-gene' assessment of lineage IV isolates of PPRV utilising sequence data from our study and publically available partial N, partial F and partial H gene data. We describe the effects of grouping PPRV isolates utilising different gene loci and conclude that the variable part of N gene at C terminus gives the best phylogenetic assessment of PPRV isolates with isolates generally clustering according to geographical isolation. This assessment highlights the importance of careful gene targeting with RT-PCR to enable thorough phylogenetic analysis. Copyright © 2014 The Authors. Published by Elsevier B.V. All rights reserved.

  5. Cloning and sequence analysis of chitin synthase gene fragments of Demodex mites

    Institute of Scientific and Technical Information of China (English)

    Ya-e ZHAO; Zheng-hang WANG; Yang XU; Ji-ru XU; Wen-yan LIU; Meng WEI; Chu-ying WANG


    To our knowledge,few reports on Demodex studied at the molecular level are available at present.In this study our group,for the first time,cloned,sequenced and analyzed the chitin synthase (CHS) gene fragments of Demodex folliculorum,Demodex brevis,and Demodex canis (three isolates from each species) from Xi'an China,by designing specific primers based on the only partial sequence of the CHS gene of D.canis from Japan,retrieved from GenBank.Results show that amplification was successful only in three D.canis isolates and one D.brevis isolate out of the nine Demodex isolates.The obtained fragments were sequenced to be 339 bp for D.canis and 338 bp for D.brevis.The CHS gene sequence similarities between the three Xi'an D.canis isolates and one Japanese D.canis isolate ranged from 99.7% to 100.0%,and those between four D.canis isolates and one D.brevis isolate were 99.1%-99.4%.Phylogenetic trees based on maximum parsimony (MP) and maximum likelihood (ML) methods shared the same clusters,according with the traditional classification.Two open reading frames (ORFs) were identified in each CHS gene sequenced,and their corresponding amino acid sequences were located at the catalytic domain.The relatively conserved sequences could be deduced to be a CHS class A gene,which is associated with chitin synthesis in the integument of Demodex mites.

  6. Combinatorial pooling enables selective sequencing of the barley gene space.

    Directory of Open Access Journals (Sweden)

    Stefano Lonardi


    Full Text Available For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.

  7. Partial Sequence Analysis of the Genome of Human Herpesvirus 7 YY5 Isolated from Saliva Samples

    Institute of Scientific and Technical Information of China (English)


    Objective To isolate and identify Nanjing local strains of Human Herpesvirus 7 (HH-V-7), and to analyze their partia l genome characteristic. Methods The saliva specimens were collected from 2 healthy adults and 5 children with kidney disease. After treatment with antibiotics and filtering. they were inoculated on to the phytohemagglutin stimulated umbilical cord blood mononuclear cells ( CBMCs). When the infected cells presented the typical ballooning and polykaryotic cytopathic effects (CPE), we identified them by transvnission electron microscopy and polymerase chain reaction.PCR product was also sequenced. Results Four strains were isolated from the seven saliva specimens. The 186-base-pair fragment of the isolated strain YY5 PCR products was sequenced, which encoded part of the HHV-7 U10 gene. The DNA sequence revealed an identity of 57. 5% and 36.0%, respectively with HHV-6 and human cytomegalovirus ( HCMV). At the amino acid level, the similarity of 51.6% was found between HHV-7 and HHV-6, and that of 25.8% between HHV-7and HCMV. Conclusion The isolated viruses were HHV-7, and 186 bp fragments revealed an identity with HHV-7 RK and Jl of 100%.

  8. Proteolipid protein 1 gene sequencing of hereditary spastic paraplegia

    Institute of Scientific and Technical Information of China (English)

    Yu Gao; Lumei Chi; Yinshi Jin; Guangxian Nan


    PCR amplification and sequencing of whole blood DNA from an individual with hereditary spastic paraplegia, as well as family members, revealed a fragment of proteolipid protein 1 (PLP1) gene exon 1, which excluded the possibility of isomer 1 expression for this family. The fragment sequence of exon 3 and exon 5 was consistent with the proteolipid protein 1 sequence at NCBI. In the proband samples, a PLP1 point mutation in exon 4 was detected at the basic group of position 844, T→C, phenylalanine→leucine. In proband samples from a male cousin, the basic group at position 844 was C, but gene sequencing signals revealed mixed signals of T and C, indicating possible mutation at this locus. Results demonstrated that changes in PLP1 exon 4 amino acids were associated with onset of hereditary spastic paraplegia.

  9. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J


    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  10. A human gut microbial gene catalogue established by metagenomic sequencing

    DEFF Research Database (Denmark)

    dos Santos, Marcelo Bertalan Quintanilha; Sicheritz-Pontén, Thomas; Nielsen, Henrik Bjørn;


    To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence...... gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively....

  11. Cloning and sequencing of the trpE gene from Arthrobacter globiformis ATCC 8010 and several related subsurface Arthrobacter isolates

    Energy Technology Data Exchange (ETDEWEB)

    Chernova, T.; Viswanathan, V.K.; Austria, N.; Nichols, B.P.


    Tryptophan dependent mutants of Arthrobacter globiformis ATCC 8010 were isolated and trp genes were cloned by complementation and marker rescue of the auxotrophic strains. Rescue studies and preliminary sequence analysis reveal that at least the genes trpE, trpC, and trpB are clustered together in this organism. In addition, sequence analysis of the entire trpE gene, which encodes component I of anthranilate synthase, is described. Segments of the trpE gene from 17 subsurface isolates of Arthrobacter sp. were amplified by PCR and sequenced. The partial trpE sequences from the various strains were aligned and subjected to phylogenetic analysis. The data suggest that in addition to single base changes, recombination and genetic exchange play a major role in the evolution of the Arthrobacter genome.

  12. Automated cleaning and pre-processing of immunoglobulin gene sequences from high-throughput sequencing

    Directory of Open Access Journals (Sweden)

    Miri eMichaeli


    Full Text Available High throughput sequencing (HTS yields tens of thousands to millions of sequences that require a large amount of pre-processing work to clean various artifacts. Such cleaning cannot be performed manually. Existing programs are not suitable for immunoglobulin (Ig genes, which are variable and often highly mutated. This paper describes Ig-HTS-Cleaner (Ig High Throughput Sequencing Cleaner, a program containing a simple cleaning procedure that successfully deals with pre-processing of Ig sequences derived from HTS, and Ig-Indel-Identifier (Ig Insertion – Deletion Identifier, a program for identifying legitimate and artifact insertions and/or deletions (indels. Our programs were designed for analyzing Ig gene sequences obtained by 454 sequencing, but they are applicable to all types of sequences and sequencing platforms. Ig-HTS-Cleaner and Ig-Indel-Identifier have been implemented in Java and saved as executable JAR files, supported on Linux and MS Windows. No special requirements are needed in order to run the programs, except for correctly constructing the input files as explained in the text. The programs' performance has been tested and validated on real and simulated data sets.

  13. Cloning and sequence analysis of 5' fragment of Hoxa-11 gene in Latimeria chalumnae. (United States)

    Xue, L Y; Qian, K X


    Hoxa-11 gene is essential for the development of fish fins and tetrapod limbs. Based on the published nucleotide sequences of human and mouse Hoxa-11 genes, two degenerate primers were designed. Latimeria Hoxa-11 gene fragment was amplified by PCR, cloned and sequenced. The acquired Hox gene fragment, which encodes 204 amino acids, is comprised of 2,065 bp, including most exon 1, intron and partial exon 2. The homology of latimeria Hoxa-11 protein is 66.0% to human, 67.6% to mouse, 74.4% to chick, 72.8% to frog, and 59.7% to zebrafish, respectively. The exon 2 region including the homeobox and the splice site are highly conserved. However, the exon 1 region has increased in size by 16% from latimeria to human. Sequence analysis further revealed that exon 1 of latimeria Hoxa-11 could be divided into four regions: two highly conserved regions, a moderately conserved region, and a variable region adjacent to the intron. The size variation is primarily caused by the accumulation of alanine repeats and of flanking segments rich in glycine and serine in the variable region. It implies that the variable region might be related to acquisition of new functions in the fin-limb transition and vertebrate evolution. Besides the homeobox, two highly conserved regions in exon 1 and two phylogenetic footprints in the intro were found. The strong sequence conservation suggests an important functional role of these regions.

  14. Dinoflagellate 17S rRNA sequence inferred from the gene sequence: Evolutionary implications (United States)

    Herzog, Michel; Maroteaux, Luc


    We present the complete sequence of the nuclear-encoded small-ribosomal-subunit RNA inferred from the cloned gene sequence of the dinoflagellate Prorocentrum micans. The dinoflagellate 17S rRNA sequence of 1798 nucleotides is contained in a family of 200 tandemly repeated genes per haploid genome. A tentative model of the secondary structure of P. micans 17S rRNA is presented. This sequence is compared with the small-ribosomal-subunit rRNA of Xenopus laevis (Animalia), Saccharomyces cerevisiae (Fungi), Zea mays (Planta), Dictyostelium discoideum (Protoctista), and Halobacterium volcanii (Monera). Although the secondary structure of the dinoflagellate 17S rRNA presents most of the eukaryotic characteristics, it contains sufficient archaeobacterial-like structural features to reinforce the view that dinoflagellates branch off very early from the eukaryotic lineage. PMID:16578795

  15. Sequence Analysis of the ank Gene of Granulocytic Ehrlichiae



    The ank gene of the agent of human granulocytic ehrlichiosis (HGE) codes for a protein with a predicted molecular size of 131.2 kDa that is recognized by serum from both dogs and humans infected with granulocytic ehrlichiae. As part of an effort to assess the phylogenetic relatedness of granulocytic ehrlichiae from different geographic regions and in different host species, the ank gene was PCR amplified and sequenced from a variety of sources. These included 10 blood specimens from patients ...

  16. Phylogeny of the cuttlefishes (Mollusca:Cephalopoda) based on mitochondrial COI and 16S rRNA gene sequence data

    Institute of Scientific and Technical Information of China (English)

    LIN Xiangzhi; ZHENG Xiaodong; XIAO Shu; WANG Rucai


    To clarify cuttlefish phylogeny, mitochondrial cytochrome c oxidase subunit I (COI) gene and partial 16S rRNA gene are sequenced for 13 cephalopod species. Phylogenetic trees are constructed, with the neighbor-joining method.Coleoids are divided into two main lineages, Decabrachia and Octobrachia. The monophyly of the order Sepioidea,which includes the families Sepiidae, Sepiolidae and Idiosepiidae, is not supported. From the two families of Sepioidea examined, the Sepiolidae are polyphyletic and are excluded from the order. On the basis of 16S rRNA and amino acid of COI gene sequences data, the two genera (Sepiella and Sepia) from the Sepiidae can be distinguished, but do not have a visible boundary using COI gene sequences. The reason is explained. This suggests that the 16S rDNA of cephalopods is a precious tool to analyze taxonomic relationships at the genus level, and COI gene is fitter at a higher taxonomic level (i.e., family).

  17. Sequence and gene expression evolution of paralogous genes in willows. (United States)

    Harikrishnan, Srilakshmy L; Pucholt, Pascal; Berlin, Sofia


    Whole genome duplications (WGD) have had strong impacts on species diversification by triggering evolutionary novelties, however, relatively little is known about the balance between gene loss and forces involved in the retention of duplicated genes originating from a WGD. We analyzed putative Salicoid duplicates in willows, originating from the Salicoid WGD, which took place more than 45 Mya. Contigs were constructed by de novo assembly of RNA-seq data derived from leaves and roots from two genotypes. Among the 48,508 contigs, 3,778 pairs were, based on fourfold synonymous third-codon transversion rates and syntenic positions, predicted to be Salicoid duplicates. Both copies were in most cases expressed in both tissues and 74% were significantly differentially expressed. Mean Ka/Ks was 0.23, suggesting that the Salicoid duplicates are evolving by purifying selection. Gene Ontology enrichment analyses showed that functions related to DNA- and nucleic acid binding were over-represented among the non-differentially expressed Salicoid duplicates, while functions related to biosynthesis and metabolism were over-represented among the differentially expressed Salicoid duplicates. We propose that the differentially expressed Salicoid duplicates are regulatory neo- and/or subfunctionalized, while the non-differentially expressed are dose sensitive, hence, functionally conserved. Multiple evolutionary processes, thus drive the retention of Salicoid duplicates in willows.

  18. Phylogenetic Analysis of Cynoglossidae in the Yangtze Estuary Based on Partial Sequence of Mitochondrial COⅠ

    Institute of Scientific and Technical Information of China (English)

    Song Chao; Yu Yanan; Zhang Tao; Yang Gang; Zhang Longzhen


    To determine the role of mitochondrial COⅠgene in classification and identification of species,a total of 39 single individuals from nine species pertaining to two genera of Cynoglossidae in the Yangtze Estuary were barcoded by COⅠ,sequenced and compared with that of other Cynoglossidae species recorded in the Gen Bank. Total genomic DNA was extracted from each scale sample using the classic phenol / chloroform extraction method. Six hundred and fifty base pairs( bp)COⅠfragments were amplified using the primers ’i. e. ’ F1: 5’- TCA ACC AAC CAC AAA GAC ATT GGC AC- 3’,R1: 5’- TAG ACT TCT GGG TGG CCA AAG AAT CA- 3’. Every PCR amplification was performed in a total volume of 50 μL of PCR mixture. PCR products were purified and then sequenced in both forward and reverse directions using an ABI PRISMTM 3730 XL Automated Sequencer. DNA sequences were aligned with clustal W using default parameters. Base composition,variable and parsimony informative sites were determined using MEGA 5. 0. Neighbor- joining( NJ) and Maximum parsimony( MP) phylogenetic trees were constructed for COⅠhaplotypes( Kimura 2 Parameter substitution model,K2P; 1 000 bootstraps pseudoreplications) using MEGA 5. 0. Using the MEGA5. 0 software for statistical analysis,the averaged AT content was greater than the GC content( Tab. 2). The GC content of codon position 1 averaged 53. 8%( 51. 8%-57. 3%),which of position 2 for 42. 0%,and that of position 3 ranged from 28. 1% to 37. 8% in average of 32. 4%( Tab. 4). The transitional pairs( si) was slightly more than the transversional pairs( sv),and the ratio( R = si/sv) was 1. 45( Tab. 3). Analysis of the frequency of amino acids in COⅠgene encoding protein showed that the highest frequency of amino acid was leucine,and the lowest frequency of amino acid was tryptophan( Tab. 5). The average K2 P distances pairwise-species and within-species were 0. 191 and 0. 003,respectively( Tab. 6). The K2 P distance pairwise-species was 63. 7


    Directory of Open Access Journals (Sweden)



    Full Text Available The objective of this research was to identify diversity of exon 5 UTMP gene fragment in Bali cattle using direct sequencing. The total 60 blood samples of Bali Cattle derived from BPTU Bali in Bali siland (20 heads, BPTU Serading in Sumbawa island (20 heads and Village Breeding Center in Barru District South Sulawesi (20 heads were used to evaluate their genetic diversity at exon 5 UTMP gene. The forward and reverse data sequences were analyzed using Bioedit program and alignment analysis was carried out using MEGA5 program. Meanwhile haplotype analysis was performed by DnaSPv5 program. The result showed that partial sequences in exon 5 UTMP gene had 16 haplotypes with the highest number of haplotypes ware found in VBC Barru district South Sulawesi (8 haplotypes. Moreover, the highest average of haplotype (h and nucleotide (p diversity were found in VBC Barru district South Sulawesi were 0.7949 and 0.0016, respectively. In addition, minisatellite insersion was found in exon 5 UTMP gene fragment on Bali cattle which are consist of 5'-CCA GTC ATG AAG AAG GCA GAG GTC GTC GTG CCG GCG AAA-3'. According to our results, haplotype and minisatellite variation in exon 5 UTMP gene fragment can be used as a candidate genetic marker specific for reproductive trait in the Bali cattle and for its strategy breeding program in the future.

  20. Phylogenetic position of the family Orientocreadiidae within the superfamily Plagiorchioidea (Trematoda) based on partial 28S rDNA sequence. (United States)

    Sokolov, S G; Shchenkov, S V


    Trematodes of the family Orientocreadiidae are mostly parasites of freshwater fishes. Here, the phylogenetic position of this family is inferred based on the partial 28S rDNA sequence from a representative of the genus Orientocreadium s. str.-О. pseudobagri Yamaguti, 1934. Sequences were analysed by maximum likelihood and Bayesian inference algorithms. Both approaches placed the Orientocreadiidae within a clade corresponding to the superfamily Plagiorchioidea and supported the family Leptophallidae as a sister taxon.

  1. Diagnosis of partial body radiation exposure in mice using peripheral blood gene expression profiles.

    Directory of Open Access Journals (Sweden)

    Sarah K Meadows

    Full Text Available In the event of a terrorist-mediated attack in the United States using radiological or improvised nuclear weapons, it is expected that hundreds of thousands of people could be exposed to life-threatening levels of ionizing radiation. We have recently shown that genome-wide expression analysis of the peripheral blood (PB can generate gene expression profiles that can predict radiation exposure and distinguish the dose level of exposure following total body irradiation (TBI. However, in the event a radiation-mass casualty scenario, many victims will have heterogeneous exposure due to partial shielding and it is unknown whether PB gene expression profiles would be useful in predicting the status of partially irradiated individuals. Here, we identified gene expression profiles in the PB that were characteristic of anterior hemibody-, posterior hemibody- and single limb-irradiation at 0.5 Gy, 2 Gy and 10 Gy in C57Bl6 mice. These PB signatures predicted the radiation status of partially irradiated mice with a high level of accuracy (range 79-100% compared to non-irradiated mice. Interestingly, PB signatures of partial body irradiation were poorly predictive of radiation status by site of injury (range 16-43%, suggesting that the PB molecular response to partial body irradiation was anatomic site specific. Importantly, PB gene signatures generated from TBI-treated mice failed completely to predict the radiation status of partially irradiated animals or non-irradiated controls. These data demonstrate that partial body irradiation, even to a single limb, generates a characteristic PB signature of radiation injury and thus may necessitate the use of multiple signatures, both partial body and total body, to accurately assess the status of an individual exposed to radiation.

  2. Nucleotide Sequence of the Protective Antigen Gene of Bacillus Anthracis (United States)


    Montie, S. Kadis, and S. I. Ajl (ed.), Microbial toxins, vol. 3. Academic Press, Inc., New York. 23. Little, S. F., and G. B. Knudaon. 1986...Takkinen, and L. Kaariainen. 1981. Nucleotide sequence of the promoter and NHa-terminal signal peptide region of the a- amylase gene from Bacillus

  3. Influence of temperature on the partial nitritation of reject water in a granular sequencing batch reactor. (United States)

    López-Palau, Sílvia; Sancho, Irene; Pinto, Antonio; Dosta, Joan; Mata-Alvarez, Joan


    Two Granular Sequencing Batch Reactors were operated to perform partial nitrification of sludge reject water at different temperatures, from 25-41 degrees C. Every temperature was fixed for about a month in order to evaluate the nitritation rate, morphological features of aggregates and bacterial populations. The optimum temperature was found between 33 and 37 degrees C in terms of nitritation rate. Morphological features of granules did not show significant changes with temperature in the range between 28 and 37 degrees C; Feret diameter remained at 5.8 +/- 0.7mm and roundness was 0.76 +/- 0.02. Lower temperatures promoted the appearance of filamentous bacteria, leading to an increase of the sludge volume index (SVI) and a consequent reduction of biomass concentration. When the temperature was increased to 39 degrees C, more than the 80% of aggregates showed a diameter higher than 6mm but density decreased from 28 to 19 g VSS L(-1), resulting in an increase of the SVI from 33 to 80 mL g(-1). The establishment of 41 degrees C caused a rapid destabilization of the system and nitritation activity disappeared. Bacterial populations did not experience significant changes during the experimental period and Nitrosomonas was the dominant species at all the temperatures assayed.

  4. An Enhanced Partial Transmit Sequence Segmentation Schemes to Reduce the PAPR in OFDM Systems

    Directory of Open Access Journals (Sweden)

    Yasir Amer Al-Jawhar


    Full Text Available Although the orthogonal frequency division multiplexing system (OFDM is widely used in high-speed data rate wire and wireless environment, the peak-to- average-power-ratio (PAPR is one of its major obstacles for the real applications. The high PAPR value leads some devices of the OFDM system such as power amplifiers and analog to digital converters to work out of band of these devices. Thus the system efficiency is degraded. Many techniques have been proposed to overcome the high PAPR in OFDM systems such as partial transmit sequences (PTS, selected mapping and interleaving technique. PTS is considered as one of the effective PAPR reduction methods; this scheme depends on segmentation of the input data into several subblocks and then combined again. The three well-known segmentation schemes are pseudo-random, adjacent and interleaving; each scheme has PAPR reduction performance, and computational complexity differs from one to another. In this paper, five types of segmentation schemes are proposed to improve the PAPR reduction execution including sine and cosine shape as well as hybrid interleaving and adjacent schemes in new approaches. From the simulation results, the proposed methods can achieve PAPR reduction performance greater than that of the adjacent and interleaving partition schemes, without increasing the computational complexity of the system. Moreover, the enhanced schemes can realize better PAPR performance with any number of subcarriers.

  5. Thermodynamics-based models of transcriptional regulation with gene sequence. (United States)

    Wang, Shuqiang; Shen, Yanyan; Hu, Jinxing


    Quantitative models of gene regulatory activity have the potential to improve our mechanistic understanding of transcriptional regulation. However, the few models available today have been based on simplistic assumptions about the sequences being modeled or heuristic approximations of the underlying regulatory mechanisms. In this work, we have developed a thermodynamics-based model to predict gene expression driven by any DNA sequence. The proposed model relies on a continuous time, differential equation description of transcriptional dynamics. The sequence features of the promoter are exploited to derive the binding affinity which is derived based on statistical molecular thermodynamics. Experimental results show that the proposed model can effectively identify the activity levels of transcription factors and the regulatory parameters. Comparing with the previous models, the proposed model can reveal more biological sense.

  6. Quantitative modeling of a gene's expression from its intergenic sequence.

    Directory of Open Access Journals (Sweden)

    Md Abul Hassan Samee


    Full Text Available Modeling a gene's expression from its intergenic locus and trans-regulatory context is a fundamental goal in computational biology. Owing to the distributed nature of cis-regulatory information and the poorly understood mechanisms that integrate such information, gene locus modeling is a more challenging task than modeling individual enhancers. Here we report the first quantitative model of a gene's expression pattern as a function of its locus. We model the expression readout of a locus in two tiers: 1 combinatorial regulation by transcription factors bound to each enhancer is predicted by a thermodynamics-based model and 2 independent contributions from multiple enhancers are linearly combined to fit the gene expression pattern. The model does not require any prior knowledge about enhancers contributing toward a gene's expression. We demonstrate that the model captures the complex multi-domain expression patterns of anterior-posterior patterning genes in the early Drosophila embryo. Altogether, we model the expression patterns of 27 genes; these include several gap genes, pair-rule genes, and anterior, posterior, trunk, and terminal genes. We find that the model-selected enhancers for each gene overlap strongly with its experimentally characterized enhancers. Our findings also suggest the presence of sequence-segments in the locus that would contribute ectopic expression patterns and hence were "shut down" by the model. We applied our model to identify the transcription factors responsible for forming the stripe boundaries of the studied genes. The resulting network of regulatory interactions exhibits a high level of agreement with known regulatory influences on the target genes. Finally, we analyzed whether and why our assumption of enhancer independence was necessary for the genes we studied. We found a deterioration of expression when binding sites in one enhancer were allowed to influence the readout of another enhancer. Thus, interference

  7. Sequence validation of candidates for selectively important genes in sunflower.

    Directory of Open Access Journals (Sweden)

    Mark A Chapman

    Full Text Available Analyses aimed at identifying genes that have been targeted by past selection provide a powerful means for investigating the molecular basis of adaptive differentiation. In the case of crop plants, such studies have the potential to not only shed light on important evolutionary processes, but also to identify genes of agronomic interest. In this study, we test for evidence of positive selection at the DNA sequence level in a set of candidate genes previously identified in a genome-wide scan for genotypic evidence of selection during the evolution of cultivated sunflower. In the majority of cases, we were able to confirm the effects of selection in shaping diversity at these loci. Notably, the genes that were found to be under selection via our sequence-based analyses were devoid of variation in the cultivated sunflower gene pool. This result confirms a possible strategy for streamlining the search for adaptively-important loci process by pre-screening the derived population to identify the strongest candidates before sequencing them in the ancestral population.

  8. Cloning and sequencing of a Moraxella bovis pilin gene. (United States)

    Marrs, C F; Schoolnik, G; Koomey, J M; Hardy, J; Rothbard, J; Falkow, S


    Moraxella bovis pili have been shown to play a major role in both infectivity and protective immunity of bovine infectious keratoconjunctivitis. Sonicated M. bovis DNA from the piliated strain EPP63 was inserted into the vector lambda gt11 with EcoRI linkers. Recombinant phage were screened with an oligonucleotide probe based on the amino-terminal portion of the DNA sequence of a Neisseria gonorrhoeae pilin gene. Two candidate phages produced a protein that comigrated with EPP63 beta pilin in sodium dodecyl sulfate-polyacrylamide gels and bound anti-pilus antisera. The 1.9-kilobase insert from one of these, lambda gt11M182, was subcloned in both orientations into pBR322, forming the plasmids pMxB7 and pMxB9, both of which produced beta pilin, as did pMxB12, a HindIII deletion derivative of pMxB7. In HB101(pMxB12), the M. bovis pilin protein was shown to be primarily localized in the inner membrane. The entire 939-base-pair insert of pMxB12 was sequenced, revealing a ribosome binding site just upstream of the coding region and an AT-rich region further upstream containing some potential RNA polymerase recognition sites. The translation of the sequence predicts a six-amino-acid leader sequence preceding the phenylalanine that begins the mature protein. Codon usage analysis of the M. bovis beta pilin gene revealed greater use of the CUA codon for leucine than usual for a well-expressed Escherichia coli gene. Comparisons of the M. bovis EPP63 beta pilin protein sequence with other pilin gene sequences are presented.

  9. Comparative genomic analysis of the arthropod muscle myosin heavy chain genes allows ancestral gene reconstruction and reveals a new type of 'partially' processed pseudogene

    Directory of Open Access Journals (Sweden)

    Kollmar Martin


    Full Text Available Abstract Background Alternative splicing of mutually exclusive exons is an important mechanism for increasing protein diversity in eukaryotes. The insect Mhc (myosin heavy chain gene produces all different muscle myosins as a result of alternative splicing in contrast to most other organisms of the Metazoa lineage, that have a family of muscle genes with each gene coding for a protein specialized for a functional niche. Results The muscle myosin heavy chain genes of 22 species of the Arthropoda ranging from the waterflea to wasp and Drosophila have been annotated. The analysis of the gene structures allowed the reconstruction of an ancient muscle myosin heavy chain gene and showed that during evolution of the arthropods introns have mainly been lost in these genes although intron gain might have happened in a few cases. Surprisingly, the genome of Aedes aegypti contains another and that of Culex pipiens quinquefasciatus two further muscle myosin heavy chain genes, called Mhc3 and Mhc4, that contain only one variant of the corresponding alternative exons of the Mhc1 gene. Mhc3 transcription in Aedes aegypti is documented by EST data. Mhc3 and Mhc4 inserted in the Aedes and Culex genomes either by gene duplication followed by the loss of all but one variant of the alternative exons, or by incorporation of a transcript of which all other variants have been spliced out retaining the exon-intron structure. The second and more likely possibility represents a new type of a 'partially' processed pseudogene. Conclusion Based on the comparative genomic analysis of the alternatively spliced arthropod muscle myosin heavy chain genes we propose that the splicing process operates sequentially on the transcript. The process consists of the splicing of the mutually exclusive exons until one exon out of the cluster remains while retaining surrounding intronic sequence. In a second step splicing of introns takes place. A related mechanism could be responsible for

  10. Clonning and sequence analysis of the 26 kDa glutathiones-transferase gene of Schistosoma mekongi. (United States)

    Vichasri-Grams, S; Grams, R; Korge, G; Viyanant, V; Upatham, E S


    The number of genomic DNA or cDNA sequences of Schistosoma mekongi accessible in Genbank or EMBL is very limited up to now. Recently, two reports have appeared on the molecular phylogeny of Schistosoma species inferred from partial sequence data of rRNA genes; no further sequence data of S. mekongi is available yet. Knowledge of the molecular structure of protein coding genes of S. mekongi will provide a better understanding of gene function in the genus Schistosoma. A cDNA library of S. mekongi adult male was constructed and a cDNA encoding the 26 kDa glutathione S-transferase protein of this species was cloned. Sequence analysis of this cDNA confirmed the close phylogenetic relationship of S. mekongi to S. japonicum.

  11. Rapid Amplification of Flanking Sequences of a Known DNA Region by Partial Restriction Digestion and Hot Start PCR

    Institute of Scientific and Technical Information of China (English)

    LOU Qun-feng; LIU Qiang; YANG Yin-gui; CHEN Jin-feng


    A simple and efficient method for cloning the flanking genomic sequences of a known DNA region is reported in this study. This method combined partial restriction endonuclease digestion, adaptor ligation, and a single round polymerase chain reaction. Total genomic DNA was partially digested with the frequent-cutting restriction enzyme Mse Ⅰ. The partially digested products were ligated to an unphosphorylated adaptor. A hot start PCR amplification with Taq polymerase and dNTP was performed with a DNA-specific primer and the adaptor primer complementary to the adaptor and the Mse Ⅰ recognition site. The amplified products were fractionated, cloned and sequenced. By this method, we cloned the downstream region of a gynoecious marker TG/CAC234 from cucumber (Cucumis sativus L.).

  12. Imprinted genes show unique patterns of sequence conservation

    Directory of Open Access Journals (Sweden)

    Helms Volkhard


    Full Text Available Abstract Background Genomic imprinting is an evolutionary conserved mechanism of epigenetic gene regulation in placental mammals that results in silencing of one of the parental alleles. In order to decipher interactions between allele-specific DNA methylation of imprinted genes and evolutionary conservation, we performed a genome-wide comparative investigation of genomic sequences and highly conserved elements of imprinted genes in human and mouse. Results Evolutionarily conserved elements in imprinted regions differ from those associated with autosomal genes in various ways. Whereas for maternally expressed genes strong divergence of protein-encoding sequences is most prominent, paternally expressed genes exhibit substantial conservation of coding and noncoding sequences. Conserved elements in imprinted regions are marked by enrichment of CpG dinucleotides and low (TpG+CpA/(2·CpG ratios indicate reduced CpG deamination. Interestingly, paternally and maternally expressed genes can be distinguished by differences in G+C and CpG contents that might be associated with unusual epigenetic features. Especially noncoding conserved elements of paternally expressed genes are exceptionally G+C and CpG rich. In addition, we confirmed a frequent occurrence of intronic CpG islands and observed a decelerated degeneration of ancient LINE-1 repeats. We also found a moderate enrichment of YY1 and CTCF binding sites in imprinted regions and identified several short sequence motifs in highly conserved elements that might act as additional regulatory elements. Conclusions We discovered several novel conserved DNA features that might be related to allele-specific DNA methylation. Our results hint at reduced CpG deamination rates in imprinted regions, which affects mostly noncoding conserved elements of paternally expressed genes. Pronounced differences between maternally and paternally expressed genes imply specific modes of evolution as a result of differences in

  13. Identification of Expressed Resistance Gene Analogs from Peanut (Arachis hypogaea L.) Expressed Sequence Tags

    Institute of Scientific and Technical Information of China (English)

    Zhanji Liu; Suping Feng; Manish K.Pandey; Xiaoping Chen; Albert K.Culbreath; Rajeev K.Varshney; Baozhu Guo


    Low genetic diversity makes peanut (Arachis hypogaea L.) very vulnerable to plant pathogens,causing severe yield loss and reduced seed quality.Several hundred partial genomic DNA sequences as nucleotide-binding-site leucine-rich repeat (NBS-LRR) resistance genes (R) have been identified,but a small portion with expressed transcripts has been found.We aimed to identify resistance gene analogs (RGAs) from peanut expressed sequence tags (ESTs) and to develop polymorphic markers.The protein sequences of 54 known R genes were used to identify homologs from peanut ESTs from public databases.A total of 1,053 ESTs corresponding to six different classes of known R genes were recovered,and assembled 156 contigs and 229 singletons as peanut-expressed RGAs.There were 69 that encoded for NBS-LRR proteins,191 that encoded for protein kinases,82 that encoded for LRR-PK/transmembrane proteins,28 that encoded for Toxin reductases,11 that encoded for LRR-domain containing proteins and four that encoded for TM-domain containing proteins.Twenty-eight simple sequence repeats (SSRs)were identified from 25 peanut expressed RGAs.One SSR polymorphic marker (RGA121) was identified.Two polymerase chain reaction-based markers (Ahsw-1 and Ahsw-2) developed from RGA013 were homologous to the Tomato Spotted Wilt Virus (TSWV) resistance gene.All three markers were mapped on the same linkage group AhlV.These expressed RGAs are the source for RGA-tagged marker development and identification of peanut resistance genes.

  14. Phylogenetic relationships in Demodex mites (Acari: Demodicidae) based on mitochondrial 16S rDNA partial sequences. (United States)

    Zhao, Ya-E; Wu, Li-Ping


    To confirm phylogenetic relationships in Demodex mites based on mitochondrial 16S rDNA partial sequences, mtDNA 16S partial sequences of ten isolates of three Demodex species from China were amplified, recombined, and sequenced and then analyzed with two Demodex folliculorum isolates from Spain. Lastly, genetic distance was computed, and phylogenetic tree was reconstructed. MEGA 4.0 analysis showed high sequence identity among 16S rDNA partial sequences of three Demodex species, which were 95.85 % in D. folliculorum, 98.53 % in Demodex canis, and 99.71 % in Demodex brevis. The divergence, genetic distance, and transition/transversions of the three Demodex species reached interspecies level, whereas there was no significant difference of the divergence (1.1 %), genetic distance (0.011), and transition/transversions (3/1) of the two geographic D. folliculorum isolates (Spain and China). Phylogenetic trees reveal that the three Demodex species formed three separate branches of one clade, where D. folliculorum and D. canis gathered first, and then gathered with D. brevis. The two Spain and five China D. folliculorum isolates did not form sister clades. In conclusion, 16S mtDNA are suitable for phylogenetic relationship analysis in low taxa (genus or species), but not for intraspecies determination of Demodex. The differentiation among the three Demodex species has reached interspecies level.

  15. Sequence variations in the FAD2 gene in seeded pumpkins. (United States)

    Ge, Y; Chang, Y; Xu, W L; Cui, C S; Qu, S P


    Seeded pumpkins are important economic crops; the seeds contain various unsaturated fatty acids, such as oleic acid and linoleic acid, which are crucial for human and animal nutrition. The fatty acid desaturase-2 (FAD2) gene encodes delta-12 desaturase, which converts oleic acid to linoleic acid. However, little is known about sequence variations in FAD2 in seeded pumpkins. Twenty-seven FAD2 clones from 27 accessions of Cucurbita moschata, Cucurbita maxima, Cucurbita pepo, and Cucurbita ficifolia were obtained (totally 1152 bp; a single gene without introns). More than 90% nucleotide identities were detected among the 27 FAD2 clones. Nucleotide substitution, rather than nucleotide insertion and deletion, led to sequence polymorphism in the 27 FAD2 clones. Furthermore, the 27 FAD2 selected clones all encoded the FAD2 enzyme (delta-12 desaturase) with amino acid sequence identities from 91.7 to 100% for 384 amino acids. The same main-function domain between 47 and 329 amino acids was identified. The four species clustered separately based on differences in the sequences that were identified using the unweighted pair group method with arithmetic mean. Geographic origin and species were found to be closely related to sequence variation in FAD2.

  16. Sequence length polymorphisms within primate amelogenin and amelogenin-like genes: usefulness in sex determination. (United States)

    Morrill, Benson H; Rickords, Lee F; Schafstall, Heather J


    Sequence length polymorphisms between the amelogenin (AMELX) and the amelogenin-like (AMELY) genes both within and between several mammalian species have been identified and utilized for sex determination, species identification, and to elucidate evolutionary relationships. Sex determination via polymerase chain reaction (PCR) assays of the AMELX and AMELY genes has been successful in greater apes, prosimians, and two species of old world monkeys. To date, no sex determination PCR assay using AMELX and AMELY has been developed for new world monkeys. In this study, we present partial AMELX and AMELY sequences for five old world monkey species (Mandrillus sphinx, Macaca nemestrina, Macaca fuscata, Macaca mulatta, and Macaca fascicularis) along with primer sets that can be used for sex determination of these five species. In addition, we compare the sequences we generated with other primate AMELX and AMELY sequences available on GenBank and discuss sequence length polymorphisms and their usefulness in sex determination within primates. The mandrill and four species of macaque all share two similar deletion regions with each other, the human, and the chimpanzee in the region sequenced. These two deletion regions are 176-181 and 8 nucleotides in length. In analyzing existing primate sequences on GenBank, we also discovered that a separate six-nucleotide polymorphism located approximately 300 nucleotides upstream of the 177 nucleotide polymorphism in sequences of humans and chimps was also present in two species of new world monkeys (Saimiri boliviensis and Saimiri sciureus). We designed primers that incorporate this polymorphism, creating the first AMELX and AMELY PCR primer set that has been used successfully to generate two bands in a new world monkey species.

  17. Informational structure of genetic sequences and nature of gene splicing (United States)

    Trifonov, E. N.


    Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.

  18. Variations in CCL3L gene cluster sequence and non-specific gene copy numbers

    Directory of Open Access Journals (Sweden)

    Edberg Jeffrey C


    Full Text Available Abstract Background Copy number variations (CNVs of the gene CC chemokine ligand 3-like1 (CCL3L1 have been implicated in HIV-1 susceptibility, but the association has been inconsistent. CCL3L1 shares homology with a cluster of genes localized to chromosome 17q12, namely CCL3, CCL3L2, and, CCL3L3. These genes are involved in host defense and inflammatory processes. Several CNV assays have been developed for the CCL3L1 gene. Findings Through pairwise and multiple alignments of these genes, we have shown that the homology between these genes ranges from 50% to 99% in complete gene sequences and from 70-100% in the exonic regions, with CCL3L1 and CCL3L3 being identical. By use of MEGA 4 and BioEdit, we aligned sense primers, anti-sense primers, and probes used in several previously described assays against pre-multiple alignments of all four chemokine genes. Each set of probes and primers aligned and matched with overlapping sequences in at least two of the four genes, indicating that previously utilized RT-PCR based CNV assays are not specific for only CCL3L1. The four available assays measured median copies of 2 and 3-4 in European and African American, respectively. The concordance between the assays ranged from 0.44-0.83 suggesting individual discordant calls and inconsistencies with the assays from the expected gene coverage from the known sequence. Conclusions This indicates that some of the inconsistencies in the association studies could be due to assays that provide heterogenous results. Sequence information to determine CNV of the three genes separately would allow to test whether their association with the pathogenesis of a human disease or phenotype is affected by an individual gene or by a combination of these genes.

  19. Gene expression differences of regenerating rat liver in a short interval successive partial hepatectomy

    Institute of Scientific and Technical Information of China (English)

    Cun-Shuan Xu; Hui-Yong Zhang; Salman Rahman; Jing-Bo Zhang; An-Shi Zhang; Hong-Peng Han; Jin-Yun Yuan; Cui-Fang Chang; Wen-Qiang Li; Ke-Jin Yang; Li-Feng Zhao; Yu-Chang Li


    AIM: To identify the genes expressed differentially in the regenerating rat liver in a short interval successive partial hepatectomy (SISPH), and to analyze their expression profiles. METHODS: Five hundred and fifty-one elements selected from subtractive cDNA libraries were conformed to a cDNAmicroarray (cDNA chip). An extensive gene expressionanalysis following 0-36-72-96-144 h SISPH was performed by microarray.RESETS: Two hundred and sixteen elements were identified either up- or down-regulated more than 2-fold at one or more time points of SISPH. By cluster analysis and generalization analysis, 8 kinds of ramose gene expression clusters were generated in the SISPH. Of the 216 elements, 111 were up-regulated and 105 down-regulated. Except 99 unreported genes, 117 reported genes were categorized into 22 groups based on their biological functions. Comparison of the gene expression in SISPH with that after partial hepatectomy (PH) disclosed that 56 genes were specially altered in SISPH, and 160 genes were simultaneously up regulated or down-regulated in SISPH and after PH, but in various amount and at different time points.CONCLUSION: Genes expressed consistently are far less than that intermittently; the genes strikingly increased are much less than that increased only 2-5 fold; the expression trends of most genes in SISPH and in PH are similar, but the expression of 56 genes is specifically altered in SISPH.Microarray combined with suppressive subtractive hybridization can in a large scale effectively identify the genes related to liver regeneration.

  20. The Complete Sequence of Mitochondrial COⅡ Gene of Fenneropenaeus chinensis and Its Applicability as a Marker for Phylogenetic Analysis

    Institute of Scientific and Technical Information of China (English)

    YU Shanshan; KONG Xiaoyu; LI Yulong; XU Hui


    The complete mitochondrial cytochrome oxidase subunit Ⅱ (COⅡ) gene of Penaeinae shrimp Fenneropenaeus chinensis was cloned and sequenced. The gene is 688 bp in length and codes for 229 amino acids. It shows 83.2%, 87.0% and 83.8% sequence similarity to Marsupenaeus Japonicus, Penaeus monodon and Farfantepenaeus notialis, respectively. The A+T content of the whole gene and that at the third position of codons are 64.7% and 78.2%, respectively. The phylogenetic relationship between F. chinensis and three other species representing genera Farfanatepenaeus, Marsupenaeus and Penaeus was analyzed. Results showed that the genetic distances among the four taxa ranged from 0.144 0 to 0.200 5, exceeding those estimated with COⅠ and partial 16S rRNA gene sequences among Marsupenaeus, Litopenaeus and Melicertus, and being therefore larger than the value among subgenera. It has been suggested that the COⅡ gene has a faster evolutionary rate than that of the COⅠ gene and partial 16S rRNA gene and could be used for phylogenetic analysis at genus or species level. The results of the present study indicated that Farfantepenaeus, Fenneropenaeus, Marsupenaeus and Penaeus are at a higher phylogenetic level than subgenus, which supports the opinion of the elevation of phylogenetic status of the four subgenera to genus level.

  1. Human case of bacteremia caused by Streptococcus canis sequence type 9 harboring the scm gene. (United States)

    Taniyama, Daisuke; Abe, Yoshihiko; Sakai, Tetsuya; Kikuchi, Takahide; Takahashi, Takashi


    Streptococcus canis (Sc) is a zoonotic pathogen that is transferred mainly from companion animals to humans. One of the major virulence factors in Sc is the M-like protein encoded by the scm gene, which is involved in anti-phagocytic activities, as well as the recruitment of plasminogen to the bacterial surface in cooperation with enolase, and the consequent enhancement of bacterial transmigration and survival. This is the first reported human case of uncomplicated bacteremia following a dog bite, caused by Streptococcus canis harboring the scm gene. The similarity of the 16S rRNA from the infecting species to that of the Sc type strain, as well as the amplification of the species-specific cfg gene, encoding a co-hemolysin, was used to confirm the species identity. Furthermore, the isolate was confirmed as sequence type 9. The partial scm gene sequence harbored by the isolate was closely related to those of other two Sc strains. While this isolate did not possess the erm(A), erm(B), or mef(A), macrolide/lincosamide resistance genes, it was not susceptible to azithromycin: its susceptibility was intermediate. Even though human Sc bacteremia is rare, clinicians should be aware of this microorganism, as well as Pasteurella sp., Prevotella sp., and Capnocytophaga sp., when examining and treating patients with fever who maintain close contact with companion animals.

  2. Cloning,sequencing and phylogenic analysis of duck prion gene

    Institute of Scientific and Technical Information of China (English)

    WANG Qigui; ZHANG Lei; HU Xiaoxiang; FAN Baoliang; LI Ning; LI Hui; WU Changxin


    Duck prion gene was cloned and sequenced. Similar to mammalian prion protein (PrP), duck prion is encoded by a single exon of a single copy in genome, which was confirmed by Southern blot analysis. All of the structural features of mammalian PrP were also identified in the duck PrP. Compared with mammalian PrP, it exhibited a 30 % of general similarity. When compared with chicken PrP, it showed a higher homology of 97%. A phylogenetic tree was constructed to trace evolution of prion gene in animals.

  3. Sequence Analysis of the Capsid Gene during a Genotype II.4 Dominated Norovirus Season in One University Hospital

    DEFF Research Database (Denmark)

    Holzknecht, Barbara Juliane; Franck, Kristina Træholt; Nielsen, Rikke Thoft


    transmission routes. All NoV positive samples submitted from one university hospital during the 2007/8 season were selected. Genotyping of selected samples by partial polymerase gene sequencing had shown that the majority belonged to the GII.4 variant Den Haag 2006b and had identical polymerase sequences...... by the same GII.4 variant were analyzed. Forty-seven of the inpatients (85%) were infected with the GII.4 variant Den Haag 2006b. Phylogenetic analysis of the Den Haag 2006b sequences identified four distinct outbreaks in different departments and a fifth outbreak with possible inter-department spread...

  4. Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus (Digenea): Species Differentiation Based on mtDNA (Barcode) and Partial LSUrDNA Sequences (United States)

    Bergmame, L.; Huffman, J.; Cole, R.; Dayanandan, S.; Tkach, V.; McLaughlin, J.D.


    Flukes belonging to Sphaeridiotrema are important parasites of waterfowl, and 2 morphologically similar species Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus, have been implicated in waterfowl mortality in North America. Cytochrome oxidase I (barcode region) and partial LSU-rDNA sequences from specimens of S. globulus and S. pseudoglobulus, obtained from naturally and experimentally infected hosts from New Jersey and Quebec, respectively, confirmed that these species were distinct. Barcode sequences of the 2 species differed at 92 of 590 nucleotide positions (15.6%) and the translated sequences differed by 13 amino acid residues. Partial LSU-rDNA sequences differed at 29 of 1,208 nucleotide positions (2.4%). Additional barcode sequences from specimens collected from waterfowl in Wisconsin and Minnesota and morphometric data obtained from specimens acquired along the north shore of Lake Superior revealed the presence of S. pseudoglobulus in these areas. Although morphometric data suggested the presence of S. globulus in the Lake Superior sample, it was not found among the specimens sequenced from Wisconsin or Minnesota. ?? 2011 American Society of Parasitologists.

  5. Cloning and sequencing of a Moraxella bovis pilin gene.



    Moraxella bovis pili have been shown to play a major role in both infectivity and protective immunity of bovine infectious keratoconjunctivitis. Sonicated M. bovis DNA from the piliated strain EPP63 was inserted into the vector lambda gt11 with EcoRI linkers. Recombinant phage were screened with an oligonucleotide probe based on the amino-terminal portion of the DNA sequence of a Neisseria gonorrhoeae pilin gene. Two candidate phages produced a protein that comigrated with EPP63 beta pilin in...

  6. Isolation and characterization of Phi class glutathione transferase partial gene from Iranian barley

    Directory of Open Access Journals (Sweden)

    Hassan Mohabatkar


    Full Text Available Glutathione transferases are multifunctional proteins involved in several diverse intracellular events such as primary and secondary metabolisms, signaling and stress metabolism. These enzymes have been subdivided into eight classes in plants. The Phi class, being plant specific, is the most represented. In the present study, based on the sequences available at GenBank, different primers were designed for amplifying the Phi class of glutathione transferase gene in the genome and transcriptome of Iranian barley, Karoun cultivar. After extraction of DNA and total RNA, Phi class was amplified and sequenced. Bioinformatics analysis predicted that the deduced protein sequence has two ß-sheets, eight α-helices and some intermediate loops in its secondary structure. Consequently, the sequences were submitted to NCBI GenBank with GS262333 and GW342614 accession numbers. Phylogenic relationships of the sequences were compared with existing sequences in GenBank.

  7. 16S partial gene mitochondrial DNA and internal transcribed spacers ribosomal DNA as differential markers of Trichuris discolor populations. (United States)

    Callejón, R; Halajian, A; de Rojas, M; Marrugal, A; Guevara, D; Cutillas, C


    Comparative morphological, biometrical and molecular studies of Trichuris discolor isolated from Bos taurus from Spain and Iran was carried out. Furthermore, Trichuris ovis isolated from B. taurus and Capra hircus from Spain has been, molecularly, analyzed. Morphological studies revealed clear differences between T. ovis and T. discolor isolated from B. taurus but differences were not observed between populations of T. discolor isolated from different geographical regions. Nevertheless, the molecular studies based on the amplification and sequencing of the internal transcribed spacers 1 and 2 ribosomal DNA and 16S partial gene mitochondrial DNA showed clear differences between both populations of T. discolor from Spain and Iran suggesting two cryptic species. Phylogenetic studies corroborated these data. Thus, phylogenetic trees based on ITS1, ITS2 and 16S partial gene sequences showed that individuals of T. discolor from B. taurus from Iran clustered together and separated, with high bootstrap values, of T. discolor isolated from B. taurus from Spain, while populations of T. ovis from B. taurus and C. hircus from Spain clustered together but separated with high bootstrap values of both populations of T. discolor. Furthermore, a comparative phylogenetic study has been carried out with the ITS1and ITS2 sequences of Trichuris species from different hosts. Three clades were observed: the first clustered all the species of Trichuris parasitizing herbivores (T. discolor, T. ovis, Trichuris leporis and Trichuris skrjabini), the second clustered all the species of Trichuris parasitizing omnivores (Trichuris trichiura and Trichuris suis) and finally, the third clustered species of Trichuris parasitizing carnivores (Trichuris muris, Trichuris arvicolae and Trichuris vulpis).

  8. Sequence Analysis of Bitter Taste Receptor Gene Repertoires in Different Ruminant Species.

    Directory of Open Access Journals (Sweden)

    Ana Monteiro Ferreira

    Full Text Available Bitter taste has been extensively studied in mammalian species and is associated with sensitivity to toxins and with food choices that avoid dangerous substances in the diet. At the molecular level, bitter compounds are sensed by bitter taste receptor proteins (T2R present at the surface of taste receptor cells in the gustatory papillae. Our work aims at exploring the phylogenetic relationships of T2R gene sequences within different ruminant species. To accomplish this goal, we gathered a collection of ruminant species with different feeding behaviors and for which no genome data is available: American bison, chamois, elk, European bison, fallow deer, goat, moose, mouflon, muskox, red deer, reindeer and white tailed deer. The herbivores chosen for this study belong to different taxonomic families and habitats, and hence, exhibit distinct foraging behaviors and diet preferences. We describe the first partial repertoires of T2R gene sequences for these species obtained by direct sequencing. We then consider the homology and evolutionary history of these receptors within this ruminant group, and whether it relates to feeding type classification, using MEGA software. Our results suggest that phylogenetic proximity of T2R genes corresponds more to the traditional taxonomic groups of the species rather than reflecting a categorization by feeding strategy.

  9. A Cluster of Vitellogenin Genes in the Mediterranean Fruit Fly Ceratitis Capitata: Sequence and Structural Conservation in Dipteran Yolk Proteins and Their Genes (United States)

    Rina, M.; Savakis, C.


    Four genes encoding the major egg yolk polypeptides of the Mediterranean fruit fly Ceratitis capitata, vitellogenins 1 and 2 (VG1 and VG2), were cloned, characterized and partially sequenced. The genes are located on the same region of chromosome 5 and are organized in pairs, each encoding the two polypeptides on opposite DNA strands. Restriction and nucleotide sequence analysis indicate that the gene pairs have arisen from an ancestral pair by a relatively recent duplication event. The transcribed part is very similar to that of the Drosophila melanogaster yolk protein genes Yp1, Yp2 and Yp3. The Vg1 genes have two introns at the same positions as those in D. melanogaster Yp3; the Vg2 genes have only one of the introns, as do D. melanogaster Yp1 and Yp2. Comparison of the five polypeptide sequences shows extensive homology, with 27% of the residues being invariable. The sequence similarity of the processed proteins extends in two regions separated by a nonconserved region of varying size. Secondary structure predictions suggest a highly conserved secondary structure pattern in the two regions, which probably correspond to structural and functional domains. The carboxy-end domain of the C. capitata proteins shows the same sequence similarities with triacylglycerol lipases that have been reported previously for the D. melanogaster yolk proteins. Analysis of codon usage shows significant differences between D. melanogaster and C. capitata vitellogenins with the latter exhibiting a less biased representation of synonymous codons. PMID:1903120

  10. Partial nephrogenic diabetes insipidus caused by a novel mutation in the AVPR2 gene

    DEFF Research Database (Denmark)

    Færch, Mia; Christensen, Jane H; Corydon, Thomas J


    and polyuria since infancy. Initial clinical testing confirmed a diagnosis of diabetes insipidus (DI). Urine osmolarity rose during fluid deprivation and after 20 microg of intranasal desmopressin [1-deamino-8-d-arginine-vasopressin (dDAVP)]. A similar DI phenotype was found in his brother. Methods The coding...... output. Discussion The affected members of this Belgian kindred have CNDI with partial resistance to AVP caused by a mutation in the AVPR2 gene that differs from any of the six mutations reported previously to produce this phenotype. Because the resistance to AVP is partial, this form of CNDI can...

  11. A Label Correcting Algorithm for Partial Disassembly Sequences in the Production Planning for End-of-Life Products

    Directory of Open Access Journals (Sweden)

    Pei-Fang (Jennifer Tsai


    Full Text Available Remanufacturing of used products has become a strategic issue for cost-sensitive businesses. Due to the nature of uncertain supply of end-of-life (EoL products, the reverse logistic can only be sustainable with a dynamic production planning for disassembly process. This research investigates the sequencing of disassembly operations as a single-period partial disassembly optimization (SPPDO problem to minimize total disassembly cost. AND/OR graph representation is used to include all disassembly sequences of a returned product. A label correcting algorithm is proposed to find an optimal partial disassembly plan if a specific reusable subpart is retrieved from the original return. Then, a heuristic procedure that utilizes this polynomial-time algorithm is presented to solve the SPPDO problem. Numerical examples are used to demonstrate the effectiveness of this solution procedure.

  12. Technology development for gene discovery and full-length sequencing

    Energy Technology Data Exchange (ETDEWEB)

    Marcelo Bento Soares


    In previous years, with support from the U.S. Department of Energy, we developed methods for construction of normalized and subtracted cDNA libraries, and constructed hundreds of high-quality libraries for production of Expressed Sequence Tags (ESTs). Our clones were made widely available to the scientific community through the IMAGE Consortium, and millions of ESTs were produced from our libraries either by collaborators or by our own sequencing laboratory at the University of Iowa. During this grant period, we focused on (1) the development of a method for preferential cloning of tissue-specific and/or rare transcripts, (2) its utilization to expedite EST-based gene discovery for the NIH Mouse Brain Molecular Anatomy Project, (3) further development and optimization of a method for construction of full-length-enriched cDNA libraries, and (4) modification of a plasmid vector to maximize efficiency of full-length cDNA sequencing by the transposon-mediated approach. It is noteworthy that the technology developed for preferential cloning of rare mRNAs enabled identification of over 2,000 mouse transcripts differentially expressed in the hippocampus. In addition, the method that we optimized for construction of full-length-enriched cDNA libraries was successfully utilized for the production of approximately fifty libraries from the developing mouse nervous system, from which over 2,500 full-ORF-containing cDNAs have been identified and accurately sequenced in their entirety either by our group or by the NIH-Mammalian Gene Collection Program Sequencing Team.

  13. Angiosperm phylogeny inferred from sequences of four mitochondrial genes

    Institute of Scientific and Technical Information of China (English)

    Yin-Long QIU; Zhi-Duan CHEN; Libo LI; Bin WANG; Jia-Yu XUE; Tory A. HENDRY; Rui-Qi LI; Joseph W. BROWN; Yang LIU; Geordan T. HUDSON


    An angiosperm phylogeny was reconstructed in a maximum likelihood analysis of sequences of four mitochondrial genes, atpl, matR, had5, and rps3, from 380 species that represent 376 genera and 296 families of seed plants. It is largely congruent with the phylogeny of angiosperms reconstructed from chloroplast genes atpB, matK, and rbcL, and nuclear 18S rDNA. The basalmost lineage consists of Amborella and Nymphaeales (including Hydatellaceae). Austrobaileyales follow this clade and are sister to the mesangiosperms, which include Chloranthaceae, Ceratophyllum, magnoliids, monocots, and eudicots. With the exception of Chloranthaceae being sister to Ceratophyllum, relationships among these five lineages are not well supported. In eudicots, Ranunculales, Sabiales, Proteales, Trochodendrales, Buxales, Gunnerales, Saxifragales, Vitales, Berberidopsidales, and Dilleniales form a basal grade of lines that diverged before the diversification of rosids and asterids. Within rosids, the COM (Celastrales-Oxalidales-Malpighiales) clade is sister to malvids (or rosid Ⅱ), instead of to the nitrogen-fixing clade as found in all previous large-scale molecular analyses of angiosperms. Santalales and Caryophyllales are members of an expanded asterid clade. This study shows that the mitochondrial genes are informative markers for resolving relationships among genera, families, or higher rank taxa across angiosperms. The low substitution rates and low homoplasy levels of the mitochondrial genes relative to the chloroplast genes, as found in this study, make them particularly useful for reconstructing ancient phylogenetic relationships. A mitochondrial gene-based angiosperm phylogeny provides an independent and essential reference for comparison with hypotheses of angiosperm phylogeny based on chloroplast genes, nuclear genes, and non-molecular data to reconstruct the underlying organismal phylogeny.

  14. Partial correction of a severe molecular defect in hemophilia A, because of errors during expression of the factor VIII gene

    Energy Technology Data Exchange (ETDEWEB)

    Young, M.; Antonarakis, S.E. [Univ. of Geneva (Switzerland); Inaba, Hiroshi [Tokyo Medical College (Japan)] [and others


    Although the molecular defect in patients in a Japanese family with mild to moderately severe hemophilia A was a deletion of a single nucleotide T within an A{sub 8}TA{sub 2} sequence of exon 14 of the factor VIII gene, the severity of the clinical phenotype did not correspond to that expected of a frameshift mutation. A small amount of functional factor VIII protein was detected in the patient`s plasma. Analysis of DNA and RNA molecules from normal and affected individuals and in vitro transcription/translation suggested a partial correction of the molecular defect, because of the following: (i) DNA replication/RNA transcription errors resulting in restoration of the reading frame and/or (ii) {open_quotes}ribosomal frameshifting{close_quotes} resulting in the production of normal factor VIII polypeptide and, thus, in a milder than expected hemophilia A. All of these mechanisms probably were promoted by the longer run of adenines, A{sub 10} instead of A{sub 8}TA{sub 2}, after the delT. Errors in the complex steps of gene expression therefore may partially correct a severe frameshift defect and ameliorate an expected severe phenotype. 36 refs., 6 figs.

  15. Detection of a functional insertion sequence responsible for deletion of the thermostable direct hemolysin gene (tdh) in Vibrio parahaemolyticus. (United States)

    Kamruzzaman, Muhammad; Bhoopong, Phuangthip; Vuddhakul, Varaporn; Nishibuchi, Mitsuaki


    The thermostable direct hemolysin coded by the tdh gene is a marker of virulent strains of Vibrio parahaemolyticus. The tdh genes are flanked by insertion sequences collectively named as ISVs or their remnants; but the ISVs so far examined have accumulated mutations in the transposase genes and underwent structural arrangements and their transposition activity could not be expected; the tdh gene was thus considered to have been acquired by V. parahaemolyticus through horizontal transfer in the past during evolution. We recently isolated from the same patient tdh+ strains and a tdh(-) strain (PCR examination) that were otherwise indistinguishable. The purpose of this study was to examine the hypothesis that the tdh(-) strain was derived from the tdh+ strain by a deletion of the tdh gene mediated by a functional ISV. Southern blot hybridization showed tdh+ sequences in the tdh(-) strain (PSU-1466). Nucleotide sequence analysis of the tdh and its flanking sequences revealed the tdh gene was split into two parts and they were located 3182-bp apart in PSU-1466. The two tdh sequences were flanked by one of the ISVs, named as ISVpa3, in PSU-1466. This genetic structure could be explained by an ISVpa3-mediated partial tdh deletion from a tdh+ strain followed by transposition of the duplicated ISVpa3 and the deleted tdh sequence into a neighboring location. The ISVpa3 of PSU-1466 coded for a full-length transposase and a DDE motif. We were able to demonstrate transposition activity of the ISVpa3 cloned from PSU-1466 using the replicon fusion assay with the conjugal transfer of a cointegrate from Escherichia coli to V. parahaemolyticus. Our data support ISVpa3-mediated partial tdh deletion resulted in the emergence of the tdh(-) strain.

  16. Detection and sequence analysis of accessory gene regulator genes of Staphylococcus pseudintermedius isolates

    Directory of Open Access Journals (Sweden)

    M. Ananda Chitra


    Full Text Available Background: Staphylococcus pseudintermedius (SP is the major pathogenic species of dogs involved in a wide variety of skin and soft tissue infections. The accessory gene regulator (agr locus of Staphylococcus aureus has been extensively studied, and it influences the expression of many virulence genes. It encodes a two-component signal transduction system that leads to down-regulation of surface proteins and up-regulation of secreted proteins during in vitro growth of S. aureus. The objective of this study was to detect and sequence analyzing the AgrA, B, and D of SP isolated from canine skin infections. Materials and Methods: In this study, we have isolated and identified SP from canine pyoderma and otitis cases by polymerase chain reaction (PCR and confirmed by PCR-restriction fragment length polymorphism. Primers for SP agrA and agrBD genes were designed using online primer designing software and BLAST searched for its specificity. Amplification of the agr genes was carried out for 53 isolates of SP by PCR and sequencing of agrA, B, and D were carried out for five isolates and analyzed using DNAstar and Mega5.2 software. Results: A total of 53 (59% SP isolates were obtained from 90 samples. 15 isolates (28% were confirmed to be methicillinresistant SP (MRSP with the detection of the mecA gene. Accessory gene regulator A, B, and D genes were detected in all the SP isolates. Complete nucleotide sequences of the above three genes for five isolates were submitted to GenBank, and their accession numbers are from KJ133557 to KJ133571. AgrA amino acid sequence analysis showed that it is mainly made of alpha-helices and is hydrophilic in nature. AgrB is a transmembrane protein, and AgrD encodes the precursor of the autoinducing peptide (AIP. Sequencing of the agrD gene revealed that the 5 canine SP strains tested could be divided into three Agr specificity groups (RIPTSTGFF, KIPTSTGFF, and RIPISTGFF based on the putative AIP produced by each strain

  17. Nuclear gene sequences from a late pleistocene sloth coprolite. (United States)

    Poinar, Hendrik; Kuch, Melanie; McDonald, Gregory; Martin, Paul; Pääbo, Svante


    The determination of nuclear DNA sequences from ancient remains would open many novel opportunities such as the resolution of phylogenies, the sexing of hominid and animal remains, and the characterization of genes involved in phenotypic traits. However, to date, single-copy nuclear DNA sequences from fossils have been determined only from bones and teeth of woolly mammoths preserved in the permafrost. Since the best preserved ancient nucleic acids tend to stem from cold environments, this has led to the assumption that nuclear DNA would be retrievable only from frozen remains. We have previously shown that Pleistocene coprolites stemming from the extinct Shasta sloth (Nothrotheriops shastensis, Megatheriidae) contain mitochondrial (mt) DNA from the animal that produced them as well as chloroplast (cp) DNA from the ingested plants. Recent attempts to resolve the phylogeny of two families of extinct sloths by using strictly mitochondrial DNA has been inconclusive. We have prepared DNA extracts from a ground sloth coprolite from Gypsum Cave, Nevada, and quantitated the number of mtDNA copies for three different fragment lengths by using real-time PCR. We amplified one multicopy and three single-copy nuclear gene fragments and used the concatenated sequence to resolve the phylogeny. These results show that ancient single-copy nuclear DNA can be recovered from warm, arid climates. Thus, nuclear DNA preservation is not restricted to cold climates.

  18. Expression detection and par-tial cloning of porcine leptin receptor (OBR) gene

    Institute of Scientific and Technical Information of China (English)


    The product of the obesity gene, called leptin, is an important regulator of adiposity, which mainly functions via its regulation of feed intake and energy metabolism. Both obesity gene (ob gene) and leptin receptor gene (OBR gene) are thought to play a major role in controlling of body fat mass as well as body weight. The result of RT-PCR shows that levels of pig OBR mRNA are higher in hypothalamus, lung and liver, and lower expression can be detected in other tissues. Total RNA purified from 11 different organs and tissues have been hybridized with pig OBR cDNA probes. The hybridization signals are shown in 7 organs and tissues. 4.1 and 3.8 kb bands were observed from hypothalamus, whereas 3.8 and 3.5 kb bands were observed in other tissues instead. The nearly complete sequence of the extracellular domain of pig OBR gene was obtained. The homology of sequence is 89.2% between pig and human, 80.3% between pig and mouse. Alignment of the predicted amino acid se-quence of OBR in pig, human and mouse shows that the overall identity is 86.5% between pig and human, 76.6% between pig and mouse. Two WSXWS motifs were found at aa313 and aa616.

  19. Sequence polymorphism and evolution of three cetacean MHC genes. (United States)

    Xu, Shi Xia; Ren, Wen Hua; Li, Shu Zhen; Wei, Fu Wen; Zhou, Kai Ya; Yang, Guang


    Sequence variability at three major histocompatibility complex (MHC) genes (DQB, DRA, and MHC-I) of cetaceans was investigated in order to get an overall understanding of cetacean MHC evolution. Little sequence variation was detected at the DRA locus, while extensive and considerable variability were found at the MHC-I and DQB loci. Phylogenetic reconstruction and sequence comparison revealed extensive sharing of identical MHC alleles among different species at the three MHC loci examined. Comparisons of phylogenetic trees for these MHC loci with the trees reconstructed only based on non-PBR sites revealed that allelic similarity/identity possibly reflected common ancestry and were not due to adaptive convergence. At the same time, trans-species evolution was also evidenced that the allelic diversity of the three MHC loci clearly pre-dated species divergence events according to the relaxed molecular clock. It may be the forces of balancing selection acting to maintain the high sequence variability and identical alleles in trans-specific manner at the MHC-I and DQB loci.

  20. Mining Association Rules in Dengue Gene Sequence with Latent Periodicity

    Directory of Open Access Journals (Sweden)

    Marimuthu Thangam


    Full Text Available The mining of periodic patterns in dengue database is an interesting research problem that can be used for predicting the future evolution of dengue viruses. In this paper, we propose an algorithm called Recurrence Finder (RECFIN that uses the suffix tree for detecting the periodic patterns of dengue gene sequence. Also, the RECFIN finds the presence of palindrome which indicates the possibilities of formation of proteins. Further, this paper computes the periodicity of nucleic acid and amino acid sequences of any length. The periodicity based association rules are used to diagnose the type of dengue. The time complexity of the proposed algorithm is O(n2. We demonstrate the effectiveness of the proposed approach by comparing the experimental results performed on dengue virus serotypes dataset with NCBI-BLAST algorithm.

  1. Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

    Directory of Open Access Journals (Sweden)

    Bharti Arvind K


    Full Text Available Abstract Background Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR and methylation spanning linker libraries (MSLL. These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. Results A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig, while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%. These two techniques were compared with other gene-enrichment methods, and shown to be complementary. Conclusion MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of

  2. A complete alpha1,3-galactosyltransferase gene is present in the human genome and partially transcribed. (United States)

    Lantéri, Marion; Giordanengo, Valérie; Vidal, Frédérique; Gaudray, Patrick; Lefebvre, Jean-Claude


    The synthesis of Galalpha1-3Gal-terminated oligosaccharides (alpha-Gal) epitopes has been interrupted during the course of evolution, starting with Old World primates. Partial sequences similar to the alpha1,3-galactosyltransferase (alpha1,3GalT) gene, which governs the synthesis of alpha-Gal epitopes, have been detected in the human genome and were found to correspond to pseudogenes. We completed the sequence of the human alpha1,3GalT pseudogene present on chromosome 9 and found it to be organized like the murine alpha1,3GalT gene. In human cell lines and several normal and tumor tissues we detected truncated transcripts corresponding to this pseudogene. Considering these mRNAs, translation of an open reading frame containing the first four translated exons but missing the two catalytic exons could predict a truncated alpha1,3GalT polypeptide that should be enzymatically inactive. We show that transcription of human alpha1,3GalT is prematurely terminated at the level of a strong transcriptional stop signal in the middle of intron VII. We were able to reproduce this effect in vitro by subcloning the implicated DNA region upstream from a reporter cDNA. The premature transcriptional arrest of human alpha1,3-GalT gene leads to an ectopic splicing event and to the connection of a short intronic sequence downstream from translated exons. Finally, we show that these truncated transcripts are overexpressed in cell lines with modifications of O-glycans.

  3. Evidence of geographical structuring in the Malaysian Snakehead, Channa striata based on partial segment of the CO1 gene. (United States)

    Siti-Balkhis, Abu Bakar; Jamsari, Amirul Firdaus Jamaluddin; Hwai, Tan Shau; Yasin, Zulfigar; Siti-Azizah, Mohd Nor


    Channa striata, locally known as "haruan", is economically important in fisheries and aquaculture industries in several Asian countries. DNA sequencing, based on a partial segment of the Cytochrome oxidase c subunit 1 (CO1) gene, was used to determine genetic variation in C. striata samples from four different populations on the west coast of Peninsular Malaysia. The highest nucleotide and haplotype diversities were observed in the Linggi population (π = 0.0067, h = 0.835), and the lowest in the Timah Tasoh population (π = 0.0008, h = 0.286). Apart from Kajang-Linggi, which was insignificant, F(ST) values were significant (p < 0.05) in all pairwise-population comparisons. Consequently, it is inferred that genetic structuring C. striata populations in this region was largely shaped by a common origin, with secondary influences from geographical factors and isolation.

  4. Evidence of geographical structuring in the Malaysian Snakehead, Channa striata based on partial segment of the CO1 gene

    Directory of Open Access Journals (Sweden)

    Abu Bakar Siti-Balkhis


    Full Text Available Channa striata, locally known as "haruan", is economically important in fisheries and aquaculture industries in several Asian countries. DNA sequencing, based on a partial segment of the Cytochrome oxidase c subunit 1 (CO1 gene, was used to determine genetic variation in C. striata samples from four different populations on the west coast of Peninsular Malaysia. The highest nucleotide and haplotype diversities were observed in the Linggi population (π = 0.0067, h = 0.835, and the lowest in the Timah Tasoh population (π = 0.0008, h = 0.286. Apart from Kajang-Linggi, which was insignificant, F ST values were significant (p < 0.05 in all pairwise-population comparisons. Consequently, it is inferred that genetic structuring C. striata populations in this region was largely shaped by a common origin, with secondary influences from geographical factors and isolation.

  5. In Vivo Gene Therapy of Hemophilia B: Sustained Partial Correction in Factor IX-Deficient Dogs (United States)

    Kay, Mark A.; Rothenberg, Steven; Landen, Charles N.; Bellinger, Dwight A.; Leland, Frances; Toman, Carol; Finegold, Milton; Thompson, Arthur R.; Read, M. S.; Brinkhous, Kenneth M.; Woo, Savio L. C.


    The liver represents a model organ for gene therapy. A method has been developed for hepatic gene transfer in vivo by the direct infusion of recombinant retroviral vectors into the portal vasculature, which results in the persistent expression of exogenous genes. To determine if these technologies are applicable for the treatment of hemophilia B patients, preclinical efficacy studies were done in a hemophilia B dog model. When the canine factor IX complementary DNA was transduced directly into the hepatocytes of affected dogs in vivo, the animals constitutively expressed low levels of canine factor IX for more than 5 months. Persistent expression of the clotting. factor resulted in reductions of whole blood clotting and partial thromboplastin times of the treated animals. Thus, long-term treatment of hemophilia B patients may be feasible by direct hepatic gene therapy in vivo.

  6. International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons. (United States)

    Olson, Nathan D; Lund, Steven P; Zook, Justin M; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B


    This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing(®), or Ion Torrent PGM(®). The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies.

  7. Detecting gene mutations in Japanese Alzheimer's patients by semiconductor sequencing. (United States)

    Yagi, Ryoichi; Miyamoto, Ryosuke; Morino, Hiroyuki; Izumi, Yuishin; Kuramochi, Masahito; Kurashige, Takashi; Maruyama, Hirofumi; Mizuno, Noriyoshi; Kurihara, Hidemi; Kawakami, Hideshi


    Alzheimer's disease (AD) is the most common form of dementia. To date, several genes have been identified as the cause of AD, including PSEN1, PSEN2, and APP. The association between APOE and late-onset AD has also been reported. We here used a bench top next-generation sequencer, which uses an integrated semiconductor device, detects hydrogen ions, and operates at a high-speed using nonoptical technology. We examined 45 Japanese AD patients with positive family histories, and 29 sporadic patients with early onset (useful for detecting genetic variations in familial AD.

  8. High throughput 16S rRNA gene amplicon sequencing

    DEFF Research Database (Denmark)

    Nierychlo, Marta; Larsen, Poul; Jørgensen, Mads Koustrup

    S rRNA gene amplicon sequencing has been developed over the past few years and is now ready to use for more comprehensive studies related to plant operation and optimization thanks to short analysis time, low cost, high throughput, and high taxonomic resolution. In this study we show how 16S r...... to the presence of filamentous microorganisms was monitored weekly over 4 months. Microthrix was identified as a causative filament and suitable control measures were introduced. The level of Microthrix was reduced after 1-2 months but a number of other filamentous species were still present, with most of them...

  9. Diversity of the luciferin binding protein gene in bioluminescent dinoflagellates--insights from a new gene in Noctiluca scintillans and sequences from gonyaulacoid genera. (United States)

    Valiadi, Martha; Iglesias-Rodriguez, Maria Debora


    Dinoflagellate bioluminescence systems operate with or without a luciferin binding protein, representing two distinct modes of light production. However, the distribution, diversity, and evolution of the luciferin binding protein gene within bioluminescent dinoflagellates are not well known. We used PCR to detect and partially sequence this gene from the heterotrophic dinoflagellate Noctiluca scintillans and a group of ecologically important gonyaulacoid species. We report an additional luciferin binding protein gene in N. scintillans which is not attached to luciferase, further to its typical combined bioluminescence gene. This supports the hypothesis that a profound re-organization of the bioluminescence system has taken place in this organism. We also show that the luciferin binding protein gene is present in the genera Ceratocorys, Gonyaulax, and Protoceratium, and is prevalent in bioluminescent species of Alexandrium. Therefore, this gene is an integral component of the standard molecular bioluminescence machinery in dinoflagellates. Nucleotide sequences showed high within-strain variation among gene copies, revealing a highly diverse gene family comprising multiple gene types in some organisms. Phylogenetic analyses showed that, in some species, the evolution of the luciferin binding protein gene was different from the organism's general phylogenies, highlighting the complex evolutionary history of dinoflagellate bioluminescence systems.

  10. A sequence-based approach to identify reference genes for gene expression analysis

    Directory of Open Access Journals (Sweden)

    Chari Raj


    Full Text Available Abstract Background An important consideration when analyzing both microarray and quantitative PCR expression data is the selection of appropriate genes as endogenous controls or reference genes. This step is especially critical when identifying genes differentially expressed between datasets. Moreover, reference genes suitable in one context (e.g. lung cancer may not be suitable in another (e.g. breast cancer. Currently, the main approach to identify reference genes involves the mining of expression microarray data for highly expressed and relatively constant transcripts across a sample set. A caveat here is the requirement for transcript normalization prior to analysis, and measurements obtained are relative, not absolute. Alternatively, as sequencing-based technologies provide digital quantitative output, absolute quantification ensues, and reference gene identification becomes more accurate. Methods Serial analysis of gene expression (SAGE profiles of non-malignant and malignant lung samples were compared using a permutation test to identify the most stably expressed genes across all samples. Subsequently, the specificity of the reference genes was evaluated across multiple tissue types, their constancy of expression was assessed using quantitative RT-PCR (qPCR, and their impact on differential expression analysis of microarray data was evaluated. Results We show that (i conventional references genes such as ACTB and GAPDH are highly variable between cancerous and non-cancerous samples, (ii reference genes identified for lung cancer do not perform well for other cancer types (breast and brain, (iii reference genes identified through SAGE show low variability using qPCR in a different cohort of samples, and (iv normalization of a lung cancer gene expression microarray dataset with or without our reference genes, yields different results for differential gene expression and subsequent analyses. Specifically, key established pathways in lung

  11. Phylogenetic relationships among Linguatula serrata isolates from Iran based on 18S rRNA and mitochondrial cox1 gene sequences. (United States)

    Ghorashi, Seyed Ali; Tavassoli, Mousa; Peters, Andrew; Shamsi, Shokoofeh; Hajipour, Naser


    The phylogenetic relationships among seven Linguatula serrata (L. serrata) isolates collected from cattle, goats, sheep, dogs and camels in different geographical locations of Iran were investigated using partial 18S ribosomal RNA (rRNA) and partial mitochondrial cytochrome c oxidase subunit 1 (cox1) gene sequences. The nucleotide sequences were analysed in order to determine the phylogenetic relationships between the isolates. Higher sequence diversity and intraspecies variation was observed in the cox1 gene compared to 18S rRNA sequences. Phylogenetic analysis of the cox1 gene placed all L. serrata isolates in a sister clade to L. arctica. The Mantel regression analysis revealed no association between genetic variations and host species or geographical location, perhaps due to the small sample size. However, genetic variations between L. serrata isolates in Iran and those isolated in other parts of the world may exist and could reveal possible evolutionary relationships.

  12. 基于线粒体 COI 基因部分序列的长江口虾虎鱼科鱼类系统分类%System classification for Gobiidae derived from the Yangtze River based on mitochondrial COI gene partial sequence

    Institute of Scientific and Technical Information of China (English)

    于亚男; 宋超; 侯俊利; 王妤; 庄平


    对采自长江口的7种虾虎鱼(Gobiidae)19个个体的线粒体COI基因进行测序,其他6种虾虎鱼的相应序列从Genbank中获得。序列分析结果显示7种虾虎鱼的COI基因部分序列密码子的第一位碱基的GC碱基含量最恒定,第三位碱基的GC含量最高,平均A+T的含量要高于G+C的含量,与多种鱼类的COI基因的碱基特点相似。运用MEGA5.0计算13种虾虎鱼的种间遗传距离,表明拉氏狼牙虾虎鱼(Oxyurichthy tentacularis)与孔虾虎鱼( Chaeturichthys stigmatias)归属于近盲虾虎鱼亚科,根据种间遗传距离与系统发育树的结构特征推测,触角沟虾虎鱼和矛尾虾虎鱼可能具有共同的起源。%A total of nineteen specimens of seven species of Gobiidae derived from the Yangtze River were couected to se -quence mitochondrial COI genes.Sequences from homologous regions of six other species of this family were gathered from Genbank.The results showed that the most constant GC content was in the first base of codon and the highest GC content existed in the third base of codon .The average content of A +T was higher than the average content of G +C.The feature was similar to most fishes .Comparing the COI gene of thirteen species of Gobiidae with MEGA 5.0 , the taxonomic status of Odontamblyopus lacepedii and Trypauchen vagina were proved legitimately .The genetic distance between species and the structure feature of Phylogeny tree indicated the same origin of Oxyurichthy tentacularis and Chaeturichthys stigmatias.

  13. Isolation of Canine parvovirus with a view to identify the prevalent serotype on the basis of partial sequence analysis

    Directory of Open Access Journals (Sweden)

    Gurpreet Kaur


    Full Text Available Aim: The aim of this study was to isolate Canine parvovirus (CPV from suspected dogs on madin darby canine kidney (MDCK cell line and its confirmation by polymerase chain reaction (PCR and nested PCR (NPCR. Further, VP2 gene of the CPV isolates was amplified and sequenced to determine prevailing antigenic type. Materials and Methods: A total of 60 rectal swabs were collected from dogs showing signs of gastroenteritis, processed and subjected to isolation in MDCK cell line. The samples showing cytopathic effects (CPE were confirmed by PCR and NPCR. These samples were subjected to PCR for amplification of VP2 gene of CPV, sequenced and analyzed to study the prevailing antigenic types of CPV. Results: Out of the 60 samples subjected to isolation in MDCK cell line five samples showed CPE in the form of rounding of cells, clumping of cells and finally detachment of the cells. When these samples and the two commercially available vaccines were subjected to PCR for amplification of VP2 gene, a 1710 bp product was amplified. The sequence analysis revealed that the vaccines belonged to the CPV-2 type and the samples were of CPV-2b type. Conclusion: It can be concluded from the present study that out of a total of 60 samples 5 samples exhibited CPE as observed in MDCK cell line. Sequence analysis of the VP2 gene among the samples and vaccine strains revealed that samples belonged to CPV-2b type and vaccines belonging to CPV-2.

  14. PCR Cloning of Partial "nbs" Sequences from Grape ("Vitis aestivalis" Michx) (United States)

    Chang, Ming-Mei; DiGennaro, Peter; Macula, Anthony


    Plants defend themselves against pathogens via the expressions of disease resistance (R) genes. Many plant R gene products contain the characteristic nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domains. There are highly conserved motifs within the NBS domain which could be targeted for polymerase chain reaction (PCR) cloning of R…

  15. PCR Cloning of Partial "nbs" Sequences from Grape ("Vitis aestivalis" Michx) (United States)

    Chang, Ming-Mei; DiGennaro, Peter; Macula, Anthony


    Plants defend themselves against pathogens via the expressions of disease resistance (R) genes. Many plant R gene products contain the characteristic nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domains. There are highly conserved motifs within the NBS domain which could be targeted for polymerase chain reaction (PCR) cloning of R…

  16. Molecular chaperone genes in the sugarcane expressed sequence database (SUCEST

    Directory of Open Access Journals (Sweden)

    Júlio C. Borges


    Full Text Available Some newly synthesized proteins require the assistance of molecular chaperones for their correct folding. Chaperones are also involved in the dissolution of protein aggregates making their study significant for both biotechnology and medicine and the identification of chaperones and stress-related protein sequences in different organisms is an important task. We used bioinformatic tools to investigate the information generated by the Sugarcane Expressed Sequence Tag (SUCEST genome project in order to identify and annotate molecular chaperones. We considered that the SUCEST sequences belonged to this category of proteins when their E-values were lower than 1.0e-05. Our annotation shows that 4,164 of the 5’ expressed sequence tag (EST sequences were homologous to molecular chaperones, nearly 1.8% of all the 5’ ESTs sequenced during the SUCEST project. About 43% of the chaperones which we found were Hsp70 chaperones and its co-chaperones, 10% were Hsp90 chaperones and 13% were peptidyl-prolyl cis, trans isomerase. Based on the annotation results we predicted 156 different chaperone gene subclasses in the sugarcane genome. Taken together, our results indicate that genes which encode chaperones were diverse and abundantly expressed in sugarcane cells, which emphasizes their biological importance.Algumas proteínas ao serem sintetizadas necessitam do auxílio de chaperones moleculares para seu correto enovelamento. Chaperones também estão envolvidas na dissolução de agregados protéicos, fazendo com que seu estudo seja de relevância biotecnológica e médica. Portanto, a identificação de seqüências de chaperones moleculares é uma tarefa importante. Nós usamos ferramentas de bioinformática para procurar informações geradas pelo sugarcane EST Genome Project (SUCEST a fim de identificar e anotar chaperones e proteínas relacionas ao estresse. As seqüências do SUCEST eram anotadas como pertencentes a uma categoria de proteínas se o E

  17. An automated annotation tool for genomic DNA sequences using GeneScan and BLAST

    Indian Academy of Sciences (India)

    Andrew M. Lynn; Chakresh Kumar Jain; K. Kosalai; Pranjan Barman; Nupur Thakur; Harish Batra; Alok Bhattacharya


    Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated annotation of genome DNA sequences.

  18. Solving the molecular diagnostic testing conundrum for Mendelian disorders in the era of next-generation sequencing: single-gene, gene panel, or exome/genome sequencing. (United States)

    Xue, Yuan; Ankala, Arunkanth; Wilcox, William R; Hegde, Madhuri R


    Next-generation sequencing is changing the paradigm of clinical genetic testing. Today there are numerous molecular tests available, including single-gene tests, gene panels, and exome sequencing or genome sequencing. As a result, ordering physicians face the conundrum of selecting the best diagnostic tool for their patients with genetic conditions. Single-gene testing is often most appropriate for conditions with distinctive clinical features and minimal locus heterogeneity. Next-generation sequencing-based gene panel testing, which can be complemented with array comparative genomic hybridization and other ancillary methods, provides a comprehensive and feasible approach for heterogeneous disorders. Exome sequencing and genome sequencing have the advantage of being unbiased regarding what set of genes is analyzed, enabling parallel interrogation of most of the genes in the human genome. However, current limitations of next-generation sequencing technology and our variant interpretation capabilities caution us against offering exome sequencing or genome sequencing as either stand-alone or first-choice diagnostic approaches. A growing interest in personalized medicine calls for the application of genome sequencing in clinical diagnostics, but major challenges must be addressed before its full potential can be realized. Here, we propose a testing algorithm to help clinicians opt for the most appropriate molecular diagnostic tool for each scenario.

  19. High-throughput sequencing of peptoids and peptide-peptoid hybrids by partial edman degradation and mass spectrometry. (United States)

    Thakkar, Amit; Cohen, Allison S; Connolly, Michael D; Zuckermann, Ronald N; Pei, Dehua


    A method for the rapid sequence determination of peptoids [oligo(N-substituted glycines)] and peptide-peptoid hybrids selected from one-bead-one-compound combinatorial libraries has been developed. In this method, beads carrying unique peptoid (or peptide-peptoid) sequences were subjected to multiple cycles of partial Edman degradation (PED) by treatment with a 1:3 (mol/mol) mixture of phenyl isothiocyanate (PITC) and 9-fluorenylmethyl chloroformate (Fmoc-Cl) to generate a series of N-terminal truncation products for each resin-bound peptoid. After PED, the Fmoc group was removed from the N-terminus and any reacted side chains via piperidine treatment. The resulting mixture of the full-length peptoid and its truncation products was analyzed by matrix-assisted laser desorption ionization (MALDI) mass spectrometry, to reveal the sequence of the full-length peptoid. With a slight modification, the method was also effective in the sequence determination of peptide-peptoid hybrids. This rapid, high-throughput, sensitive, and inexpensive sequencing method should greatly expand the utility of combinatorial peptoid libraries in biomedical and materials research.

  20. Membrane gene ontology bias in sequencing and microarray obtained by housekeeping-gene analysis. (United States)

    Zhang, Yijuan; Akintola, Oluwafemi S; Liu, Ken J A; Sun, Bingyun


    Microarray (MA) and high-throughput sequencing are two commonly used detection systems for global gene expression profiling. Although these two systems are frequently used in parallel, the differences in their final results have not been examined thoroughly. Transcriptomic analysis of housekeeping (HK) genes provides a unique opportunity to reliably examine the technical difference between these two systems. We investigated here the structure, genome location, expression quantity, microarray probe coverage, as well as biological functions of differentially identified human HK genes by 9 MA and 6 sequencing studies. These in-depth analyses allowed us to discover, for the first time, a subset of transcripts encoding membrane, cell surface and nuclear proteins that were prone to differential identification by the two platforms. We hope that the discovery can aid the future development of these technologies for comprehensive transcriptomic studies. Copyright © 2015 Elsevier B.V. All rights reserved.

  1. Estimating the extent of horizontal gene transfer in metagenomic sequences

    Directory of Open Access Journals (Sweden)

    Moya Andrés


    Full Text Available Abstract Background Although the extent of horizontal gene transfer (HGT in complete genomes has been widely studied, its influence in the evolution of natural communities of prokaryotes remains unknown. The availability of metagenomic sequences allows us to address the study of global patterns of prokaryotic evolution in samples from natural communities. However, the methods that have been commonly used for the study of HGT are not suitable for metagenomic samples. Therefore it is important to develop new methods or to adapt existing ones to be used with metagenomic sequences. Results We have created two different methods that are suitable for the study of HGT in metagenomic samples. The methods are based on phylogenetic and DNA compositional approaches, and have allowed us to assess the extent of possible HGT events in metagenomes for the first time. The methods are shown to be compatible and quite precise, although they probably underestimate the number of possible events. Our results show that the phylogenetic method detects HGT in between 0.8% and 1.5% of the sequences, while DNA compositional methods identify putative HGT in between 2% and 8% of the sequences. These ranges are very similar to these found in complete genomes by related approaches. Both methods act with a different sensitivity since they probably target HGT events of different ages: the compositional method mostly identifies recent transfers, while the phylogenetic is more suitable for the detections of older events. Nevertheless, the study of the number of HGT events in metagenomic sequences from different communities shows a consistent trend for both methods: the lower amount is found for the sequences of the Sargasso Sea metagenome, while the higher quantity is found in the whale fall metagenome from the bottom of the ocean. The significance of these observations is discussed. Conclusion The computational approaches that are used to find possible HGT events in complete

  2. Cloning and characterization of the major histone H2A genes completes the cloning and sequencing of known histone genes of Tetrahymena thermophila. (United States)

    Liu, X; Gorovsky, M A


    A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889

  3. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications

    DEFF Research Database (Denmark)

    Yilmaz, Pelin; Kottmann, Renzo; Field, Dawn


    Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences--the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental...... present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere....

  4. Substitution and Haplotype Diversity Analysis on The Partial Sequence of The Mitochondrial DNA Cyt b of Indonesian Swamp Buffalo (Bubalus bubalis

    Directory of Open Access Journals (Sweden)

    Akhmad Sukri


    Full Text Available This research aims to investigate the substitution pattern of nucleotide base and haplotype diversity of Indonesian swamp buffalo (Bubalus bubalis based on the mitochondrial DNA cyt b partial gene sequence. 17 samples were chosen from 7 different regions with each uniquely represents Indonesian biogeography which comprise Aceh, Riau, Madiun, Blitar, Lombok, South Borneo and Tana Toraja. The result of cyt b gene sequence alignment showed the presence of transition and transversion substitutions and the absence of insertion and deletion. The amount of transitions was found to be higher than that of transversions and the amount of substitution in pyrimidine was also higher than that in purine. The highest amount of transitions happened in base TàC which is a silent substitution. The result of median joining network analysis showed that Indonesian haplotype Bubalus bubalis could be classified into 16 haplotypes which form different haplogroups unique to their geographical region. The result of median joining network analysis also indicated that the genetic relationships of swamp buffaloes (Bubalus bubalis in Indonesia are highly influenced by their geographical locations.

  5. Partial mitochondrial gene arrangements support a close relationship between Tardigrada and Arthropoda. (United States)

    Ryu, Shi Hyun; Lee, Ji Min; Jang, Kuem-Hee; Choi, Eun Hwa; Park, Shin Ju; Chang, Cheon Young; Kim, Won; Hwang, Ui Wook


    Regions (about 3.7-3.8 kb) of the mitochondrial genomes (rrnL-cox1) of two tardigrades, a heterotardigrade, Batillipes pennaki, and a eutardigrade, Pseudobiotus spinifer, were sequenced and characterized. The gene order in Batillipes was rrnL-V-rrnS-Q-I-M-nad2-W-C-Y-cox1, and in Pseudobiotus it was rrnL-V-rrnS-Q-M-nad2-W-C-Y-cox1. With the exception of the trnI gene, the two tardigrade regions have the same gene content and order. Their gene orders are strikingly similar to that of the chelicerate Limulus polyphemus (rrnL-V-rrnS-CR-I-Q-M-nad2-W-C-Y-cox1), which is considered to be ancestral for arthropods. Although the tardigrades do not have a distinct control region (CR) within this segment, the trnI gene in Pseudobiotus is located between rrnL-trnL1 and trnL2-nad1, and the trnI gene in Batillipes is located between trnQ and trnM. In addition, the 106-bp region between trnQ and trnM in Batillipes not only contains two plausible trnI genes with opposite orientations, but also exhibits some CR-like characteristics. The mitochondrial gene arrangements of 183 other protostomes were compared. 60 (52.2%) of the 115 arthropods examined have the M-nad2-W-C-Y-cox1 arrangement, and 88 (76.5%) the M-nad2-W arrangement, as found in the tardigrades. In contrast, no such arrangement was seen in the 70 non-arthropod protostomes studied. These are the first non-sequence molecular data that support the close relationship of tardigrades and arthropods.

  6. Identification and partial sequencing of a crocodile poxvirus associated with deeply penetrating skin lesions in farmed Nile crocodiles, Crocodylus niloticus. (United States)

    Huchzermeyer, F W; Wallace, D B; Putterill, J F; Gerdes, G H


    When large numbers of crocodile skins were downgraded because of the presence of small pin prick-like holes, collapsed epidermal cysts were found deep in the dermis of juvenile crocodiles while forming cysts were observed in hatchlings. Histopathology of these forming cysts showed the presence of intracytoplasmic inclusions in proliferating and ballooning epidermal cells. Pox virions were seen in electron microscope preparations made from the scabs of such early lesions. The partial sequencing of virus material from scrapings of these lesions and comparison of it with the published sequence of crocodile poxvirus showed the virus associated with the deep lesions to be closely related, but different. To differentiate between the two forms of crocodile pox infection it is suggested that the previously known form should be called "classical crocodile pox" and the newly discovered form "atypical crocodile pox". The application of strict hygiene measures brought about a decline in the percentage of downgraded skins.

  7. Identification and partial sequencing of a crocodile poxvirus associated with deeply penetrating skin lesions in farmed Nile crocodiles, Crocodylus niloticus

    Directory of Open Access Journals (Sweden)

    F.W. Huchzermeyer


    Full Text Available When large numbers of crocodile skins were downgraded because of the presence of small pin pricklike holes, collapsed epidermal cysts were found deep in the dermis of juvenile crocodiles while forming cysts were observed in hatchlings. Histopathology of these forming cysts showed the presence of intracytoplasmic inclusions in proliferating and ballooning epidermal cells. Pox virions were seen in electron microscope preparations made from the scabs of such early lesions. The partial sequencing of virus material from scrapings of these lesions and comparison of it with the published sequence of crocodile poxvirus showed the virus associated with the deep lesions to be closely related, but different. To differentiate between the two forms of crocodile pox infection it is suggested that the previously known form should be called ''classical crocodile pox'' and the newly discovered form ''atypical crocodile pox''. The application of strict hygiene measures brought about a decline in the percentage of downgraded skins.

  8. Sequencing, characterization, and gene expression analysis of the histidine decarboxylase gene cluster of Morganella morganii. (United States)

    Ferrario, Chiara; Borgo, Francesca; de Las Rivas, Blanca; Muñoz, Rosario; Ricci, Giovanni; Fortina, Maria Grazia


    The histidine decarboxylase gene cluster of Morganella morganii DSM30146(T) was sequenced, and four open reading frames, named hdcT1, hdc, hdcT2, and hisRS were identified. Two putative histidine/histamine antiporters (hdcT1 and hdcT2) were located upstream and downstream the hdc gene, codifying a pyridoxal-P dependent histidine decarboxylase, and followed by hisRS gene encoding a histidyl-tRNA synthetase. This organization was comparable with the gene cluster of other known Gram negative bacteria, particularly with that of Klebsiella oxytoca. Recombinant Escherichia coli strains harboring plasmids carrying the M. morganii hdc gene were shown to overproduce histidine decarboxylase, after IPTG induction at 37 °C for 4 h. Quantitative RT-PCR experiments revealed the hdc and hisRS genes were highly induced under acidic and histidine-rich conditions. This work represents the first description and identification of the hdc-related genes in M. morganii. Results support the hypothesis that the histidine decarboxylation reaction in this prolific histamine producing species may play a role in acid survival. The knowledge of the role and the regulation of genes involved in histidine decarboxylation should improve the design of rational strategies to avoid toxic histamine production in foods.

  9. In vivo gene transfer strategies to achieve partial correction of von Willebrand disease. (United States)

    Wang, Lan; Rosenberg, Jonathan B; De, Bishnu P; Ferris, Barbara; Wang, Rui; Rivella, Stefano; Kaminsky, Stephen M; Crystal, Ronald G


    von Willebrand disease (VWD), the most common hereditary coagulation disorder, results from mutations in the 52-exon gene for von Willebrand factor (VWF), which encodes an 8.4-kB cDNA. Studies with VWF cDNA plasmids have demonstrated that in vivo gene transfer to the liver will correct the coagulation dysfunction in VWF(-/-) mice, but the correction is transient. To develop gene therapy for VWF that would mediate long-term expression of the VWF cDNA in liver, we first evaluated segmental pre-mRNA trans-splicing (SPTS) with two adeno-associated virus (AAV) serotype 8 vectors, each delivering one-half of the VWF cDNA. However, although the two vectors functioned well to generate VWF multimers after infection of cells in vitro, the efficiency of SPTS was insufficient to correct the VWF(-/-) mouse in vivo. As an alternative, we assessed the ability of a lentiviral vector to transfer the intact murine VWF cDNA in vivo directly to the neonatal liver of VWF(-/-) mice, using generation of VWF multimers, bleeding time, and bleeding volume as efficacy parameters. The VWF lentivirus generated VWF multimers and partially or completely corrected the coagulation defect on a persistent basis in 33% of the treated VWF-deficient mice. On the basis of the concept that partial persistent correction with gene transfer could be beneficial in VWD patients, these observations suggest that lentiviral delivery of VWF cDNA should be explored as a candidate for gene therapy in patients with a severe form of VWD.

  10. Variation in the nucleotide sequence of a prolamin gene family in wild rice. (United States)

    Barbier, P; Ishihama, A


    Variation in the DNA sequence of the 10 kDa prolamin gene family within the wild rice species Oryza rufipogon was probed using the direct sequencing of PCR-amplified genes. A comparison of the nucleotide and deduced amino-acid sequences of eight Asian strains of O. rufipogon and one strain of the related African species O. longistaminata is presented.

  11. Molecular Cloning and Sequencing of Hemoglobin-Beta Gene of Channel Catfish, Ictalurus Punctatus Rafinesque (United States)

    : Hemoglobin-y gene of channel catfish , lctalurus punctatus, was cloned and sequenced . Total RNA from head kidneys was isolated, reverse transcribed and amplified . The sequence of the channel catfish hemoglobin-y gene consists of 600 nucleotides . Analysis of the nucleotide sequence reveals one o...

  12. Variation and Characterization Analysis of Partial Fragment of Fibroin Gene From Silkworm, Antheraea pernyi

    Institute of Scientific and Technical Information of China (English)

    Li Wenli(李文利); Jin Liji; An Lijia


    A 1.4Kb DNA fragment containing 3' flanking sequence of fibroin gene of silkworm, Antheraea pernyi, was obtained from the silk gland's mRNA of 5th larva. Analysis of this sequence with another A.pernyi fibroin protein (accession No. D83241) revealed that it consists of a completely open reading frame (ORF), which includes 14 polyalanine-containing units (motifs) and 100bp 3'-UTR. The sequence of the predicted amino acid reveals the highest level of overall identity (90%) with D83241. It was found that it loses a repeat region at the upstream of TAA codon and some mutations. A putative polyadenylation signal AATAAA tail was found in position 1300, which follows the termination codon.

  13. Partial sequencing and phylogenetic analysis of Soybean mosaic virus isolated in Ukraine

    Directory of Open Access Journals (Sweden)

    Polischuk V. P.


    Full Text Available The aim of the present study is to compare the biological and molecular properties of Ukrainian soybean mosaic virus (SMV isolates with those of known strains or isolates from other countries, and to trace their possible origin. The methods of mechanical inoculation, reverse transcription polymerase chain reaction, DNA sequencing and phylogenetic analysis have been used. Results. Five SMV isolates have been collected and biologically purified from breeding plots in Vinnitsa region of Ukraine. It has been found that all these isolates show the same reaction patterns when infecting 11 differential soybean cultivars. Phylogenetic analysis of sequences of the coat protein coding region and P1 coding region revealed strong genetic relationships between representative Ukrainian (UA1Gr and SMV-VA2 isolates which together were sorted in one clade with G2 strain. The investigation of sequence identity showed that different genomic regions of SMV were under different evolutionary constraints. Conclusions. SMV, isolated in Ukraine for the first time, belongs to the G2 strain group that is widespread in North America. The SMV isolates obtained in this work may be employed in the Ukrainian national breeding programs to create soybean with durable virus resistance.

  14. Mutational analysis of DBD*--a unique antileukemic gene sequence. (United States)

    Ji, Yan-shan; Johnson, Betty H; Webb, M Scott; Thompson, E Brad


    DBD* is a novel gene encoding an 89 amino acid peptide that is constitutively lethal to leukemic cells. DBD* was derived from the DNA binding domain of the human glucocorticoid receptor by a frameshift that replaces the final 21 C-terminal amino acids of the domain. Previous studies suggested that DBD* no longer acted as the natural DNA binding domain. To confirm and extend these results, we mutated DBD* in 29 single amino acid positions, critical for the function in the native domain or of possible functional significance in the novel 21 amino acid C-terminal sequence. Steroid-resistant leukemic ICR-27-4 cells were transiently transfected by electroporation with each of the 29 mutants. Cell kill was evaluated by trypan blue dye exclusion, a WST-1 tetrazolium-based assay for cell respiration, propidium iodide exclusion, and Hoechst 33258 staining of chromatin. Eleven of the 29 point mutants increased, whereas four decreased antileukemic activity. The remainder had no effect on activity. The nonconcordances between these effects and native DNA binding domain function strongly suggest that the lethality of DBD* is distinct from that of the glucocorticoid receptor. Transfections of fragments of DBD* showed that optimal activity localized to the sequence for its C-terminal 32 amino acids.

  15. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. (United States)

    Schloss, Patrick D; Jenior, Matthew L; Koumpouras, Charles C; Westcott, Sarah L; Highlander, Sarah K


    Over the past 10 years, microbial ecologists have largely abandoned sequencing 16S rRNA genes by the Sanger sequencing method and have instead adopted highly parallelized sequencing platforms. These new platforms, such as 454 and Illumina's MiSeq, have allowed researchers to obtain millions of high quality but short sequences. The result of the added sequencing depth has been significant improvements in experimental design. The tradeoff has been the decline in the number of full-length reference sequences that are deposited into databases. To overcome this problem, we tested the ability of the PacBio Single Molecule, Real-Time (SMRT) DNA sequencing platform to generate sequence reads from the 16S rRNA gene. We generated sequencing data from the V4, V3-V5, V1-V3, V1-V5, V1-V6, and V1-V9 variable regions from within the 16S rRNA gene using DNA from a synthetic mock community and natural samples collected from human feces, mouse feces, and soil. The mock community allowed us to assess the actual sequencing error rate and how that error rate changed when different curation methods were applied. We developed a simple method based on sequence characteristics and quality scores to reduce the observed error rate for the V1-V9 region from 0.69 to 0.027%. This error rate is comparable to what has been observed for the shorter reads generated by 454 and Illumina's MiSeq sequencing platforms. Although the per base sequencing cost is still significantly more than that of MiSeq, the prospect of supplementing reference databases with full-length sequences from organisms below the limit of detection from the Sanger approach is exciting.

  16. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system

    Directory of Open Access Journals (Sweden)

    Patrick D. Schloss


    Full Text Available Over the past 10 years, microbial ecologists have largely abandoned sequencing 16S rRNA genes by the Sanger sequencing method and have instead adopted highly parallelized sequencing platforms. These new platforms, such as 454 and Illumina’s MiSeq, have allowed researchers to obtain millions of high quality but short sequences. The result of the added sequencing depth has been significant improvements in experimental design. The tradeoff has been the decline in the number of full-length reference sequences that are deposited into databases. To overcome this problem, we tested the ability of the PacBio Single Molecule, Real-Time (SMRT DNA sequencing platform to generate sequence reads from the 16S rRNA gene. We generated sequencing data from the V4, V3–V5, V1–V3, V1–V5, V1–V6, and V1–V9 variable regions from within the 16S rRNA gene using DNA from a synthetic mock community and natural samples collected from human feces, mouse feces, and soil. The mock community allowed us to assess the actual sequencing error rate and how that error rate changed when different curation methods were applied. We developed a simple method based on sequence characteristics and quality scores to reduce the observed error rate for the V1–V9 region from 0.69 to 0.027%. This error rate is comparable to what has been observed for the shorter reads generated by 454 and Illumina’s MiSeq sequencing platforms. Although the per base sequencing cost is still significantly more than that of MiSeq, the prospect of supplementing reference databases with full-length sequences from organisms below the limit of detection from the Sanger approach is exciting.

  17. The Clinical Significance of Unknown Sequence Variants in BRCA Genes

    Energy Technology Data Exchange (ETDEWEB)

    Calò, Valentina; Bruno, Loredana; Paglia, Laura La; Perez, Marco; Margarese, Naomi [Department of Surgery and Oncology, Regional Reference Center for the Biomolecular Characterization and Genetic Screening of Hereditary Tumors, University of Palermo, Via del Vespro 127, 90127 Palermo (Italy); Gaudio, Francesca Di [Department of Medical Biotechnologies and Legal Medicine, University of Palermo, Palermo (Italy); Russo, Antonio, E-mail: [Department of Surgery and Oncology, Regional Reference Center for the Biomolecular Characterization and Genetic Screening of Hereditary Tumors, University of Palermo, Via del Vespro 127, 90127 Palermo (Italy)


    Germline mutations in BRCA1/2 genes are responsible for a large proportion of hereditary breast and/or ovarian cancers. Many highly penetrant predisposition alleles have been identified and include frameshift or nonsense mutations that lead to the translation of a truncated protein. Other alleles contain missense mutations, which result in amino acid substitution and intronic variants with splicing effect. The discovery of variants of uncertain/unclassified significance (VUS) is a result that can complicate rather than improve the risk assessment process. VUSs are mainly missense mutations, but also include a number of intronic variants and in-frame deletions and insertions. Over 2,000 unique BRCA1 and BRCA2 missense variants have been identified, located throughout the whole gene (Breast Cancer Information Core Database (BIC database)). Up to 10–20% of the BRCA tests report the identification of a variant of uncertain significance. There are many methods to discriminate deleterious/high-risk from neutral/low-risk unclassified variants (i.e., analysis of the cosegregation in families of the VUS, measure of the influence of the VUSs on the wild-type protein activity, comparison of sequence conservation across multiple species), but only an integrated analysis of these methods can contribute to a real interpretation of the functional and clinical role of the discussed variants. The aim of our manuscript is to review the studies on BRCA VUS in order to clarify their clinical relevance.

  18. Phylogeographic analysis of African swine fever virus based on the p72 gene sequence. (United States)

    Muangkram, Y; Sukmak, M; Wajjwalku, W


    African swine fever virus (ASFV) outbreak has been considered as an emerging and re-emerging disease for almost a century. Diagnostically, simple polymerase chain reaction and sequencing-based molecular detection could be employed for both viral identification and genotyping. This study established a novel phylogenetic analysis and epidemiology comparison based on 205 bp of p72 gene sequences. Based on this partial p72 fragment, an updated list of 44 different genotypes from a total of 516 ASFV sequences compiled from GenBank was generated. Nucleotide diversity was 0.04325 ± 0.00231. The analysis of spatial genetic variation divided the ASFV populations of the African continent into four clades (clade A: central and upper eastern Africa; clade B: eastern Africa; clade C: eastern and southern Africa; and clade D: southern Africa). These results and the developed protocol could serve as useful molecular tools for ASFV diagnosis from degraded DNA or putrefied samples, and also provide the phylogeographic perspective to identify the origin of viral outbreaks, facilitating the decision planning to limit their spread.

  19. Cloning, sequencing and expression of a xylanase gene from the maize pathogen Helminthosporium turcicum

    DEFF Research Database (Denmark)

    Degefu, Y.; Paulin, L.; Lübeck, Peter Stephensen


    A gene encoding an endoxylanase from the phytopathogenic fungus Helminthosporium turcicum Pass. was cloned and sequenced. The entire nucleotide sequence of a 1991 bp genomic fragment containing an endoxylanase gene was determined. The xylanase gene of 795 bp, interrupted by two introns of 52 and ...

  20. Poly purine.pyrimidine sequences upstream of the beta-galactosidase gene affect gene expression in Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Brahmachari Samir K


    Full Text Available Abstract Background Poly purine.pyrimidine sequences have the potential to adopt intramolecular triplex structures and are overrepresented upstream of genes in eukaryotes. These sequences may regulate gene expression by modulating the interaction of transcription factors with DNA sequences upstream of genes. Results A poly purine.pyrimidine sequence with the potential to adopt an intramolecular triplex DNA structure was designed. The sequence was inserted within a nucleosome positioned upstream of the β-galactosidase gene in yeast, Saccharomyces cerevisiae, between the cycl promoter and gal 10Upstream Activating Sequences (UASg. Upon derepression with galactose, β-galactosidase gene expression is reduced 12-fold in cells carrying single copy poly purine.pyrimidine sequences. This reduction in expression is correlated with reduced transcription. Furthermore, we show that plasmids carrying a poly purine.pyrimidine sequence are not specifically lost from yeast cells. Conclusion We propose that a poly purine.pyrimidine sequence upstream of a gene affects transcription. Plasmids carrying this sequence are not specifically lost from cells and thus no additional effort is needed for the replication of these sequences in eukaryotic cells.

  1. Complete mitochondrial genome sequence and identification of a candidate gene responsible for cytoplasmic male sterility in radish (Raphanus sativus L.) containing DCGMS cytoplasm. (United States)

    Park, Jee Young; Lee, Young-Pyo; Lee, Jonghoon; Choi, Beom-Soon; Kim, Sunggil; Yang, Tae-Jin


    A novel cytoplasmic male sterility (CMS) conferred by Dongbu cytoplasmic and genic male-sterility (DCGMS) cytoplasm and its restorer-of-fertility gene (Rfd1) was previously reported in radish (Raphanus sativus L.). Its inheritance of fertility restoration and profiles of mitochondrial DNA (mtDNA)-based molecular markers were reported to be different from those of Ogura CMS, the first reported CMS in radish. The complete mitochondrial genome sequence (239,186 bp; GenBank accession No. KC193578) of DCGMS mitotype is reported in this study. Thirty-four protein-coding genes and three ribosomal RNA genes were identified. Comparative analysis of a mitochondrial genome sequence of DCGMS and previously reported complete sequences of normal and Ogura CMS mitotypes revealed various recombined structures of seventeen syntenic sequence blocks. Short-repeat sequences were identified in almost all junctions between syntenic sequence blocks. Phylogenetic analysis of three radish mitotypes showed that DCGMS was more closely related to the normal mitotype than to the Ogura mitotype. A single 1,551-bp unique region was identified in DCGMS mtDNA sequences and a novel chimeric gene, designated orf463, consisting of 128-bp partial sequences of cox1 gene and 1,261-bp unidentified sequences were found in the unique region. No other genes with a chimeric structure, a major feature of most characterized CMS-associated genes in other plant species, were found in rearranged junctions of syntenic sequence blocks. Like other known CMS-associated mitochondrial genes, the predicted gene product of orf463 contained 12 transmembrane domains. Thus, this gene product might be integrated into the mitochondrial membrane. In total, the results indicate that orf463 is likely to be a casual factor for CMS induction in radish containing the DCGMS cytoplasm.

  2. Targeted Sequencing Reveals Large-Scale Sequence Polymorphism in Maize Candidate Genes for Biomass Production and Composition.

    Directory of Open Access Journals (Sweden)

    Moses M Muraya

    Full Text Available A major goal of maize genomic research is to identify sequence polymorphisms responsible for phenotypic variation in traits of economic importance. Large-scale detection of sequence variation is critical for linking genes, or genomic regions, to phenotypes. However, due to its size and complexity, it remains expensive to generate whole genome sequences of sufficient coverage for divergent maize lines, even with access to next generation sequencing (NGS technology. Because methods involving reduction of genome complexity, such as genotyping-by-sequencing (GBS, assess only a limited fraction of sequence variation, targeted sequencing of selected genomic loci offers an attractive alternative. We therefore designed a sequence capture assay to target 29 Mb genomic regions and surveyed a total of 4,648 genes possibly affecting biomass production in 21 diverse inbred maize lines (7 flints, 14 dents. Captured and enriched genomic DNA was sequenced using the 454 NGS platform to 19.6-fold average depth coverage, and a broad evaluation of read alignment and variant calling methods was performed to select optimal procedures for variant discovery. Sequence alignment with the B73 reference and de novo assembly identified 383,145 putative single nucleotide polymorphisms (SNPs, of which 42,685 were non-synonymous alterations and 7,139 caused frameshifts. Presence/absence variation (PAV of genes was also detected. We found that substantial sequence variation exists among genomic regions targeted in this study, which was particularly evident within coding regions. This diversification has the potential to broaden functional diversity and generate phenotypic variation that may lead to new adaptations and the modification of important agronomic traits. Further, annotated SNPs identified here will serve as useful genetic tools and as candidates in searches for phenotype-altering DNA variation. In summary, we demonstrated that sequencing of captured DNA is a powerful

  3. Phylogeny of canine coronavirus (CCoV) from Brazilian dogs based on membrane protein partial sequences


    M.P. Guirao; Souza, S. P.; JEREZ, J.A.; Richtzenhain, L.J.; Brandão, P. E.


    Este artigo descreve a anteriormente desconhecida diversidade molecular de amostras brasileiras de Coronavírus canino (CCoV). Vinte e duas amostras foram submetidas à análise da sequência parcial do gene codificador da proteína de membrana, sendo 12 classificadas como CCoV Tipo II e 10 como CCoV Tipo I e uma possível sublinhagem tipicamente brasileira foi encontrada para o CCoV Tipo II.

  4. The equine herpesvirus 1 gene 63 RING finger protein partially complements Vmw110, its herpes simplex virus type 1 counterpart. (United States)

    Everett, R; Orr, A; Elliott, M


    All alpha herpesviruses of known DNA sequence have been found to encode a protein with similarities to immediate early protein Vmw110 (ICP0) of herpes simplex virus type 1 (HSV-1). The conserved portion of this family of proteins is a characteristic zinc binding module, known as a RING finger or C3HC4 domain. Examples of RING finger domains occur in many other proteins of diverse evolutionary origin and function. Recently, the solution structure of the equine herpesvirus 1 (EHV-1) RING finger protein, encoded by gene 63, has been solved. To investigate whether this structure could be considered to be a paradigm of herpesvirus RING domains, we have constructed a recombinant HSV-1 which expresses the EHV-1 gene 63 protein (EHVg63) in place of Vmw110. Comparison of the growth properties of the recombinant with those of wild-type and Vmw110-defective viruses indicates that EHVg63 is able to fulfil partially, but not completely, the roles of Vmw110 during virus growth in tissue culture.

  5. PPARγ partial agonist GQ-16 strongly represses a subset of genes in 3T3-L1 adipocytes

    Energy Technology Data Exchange (ETDEWEB)

    Milton, Flora Aparecida [Faculdade de Ciências da Saúde, Laboratório de Farmacologia Molecular, Universidade de Brasília (Brazil); Genomic Medicine, Houston Methodist Research Institute, Houston, TX (United States); Cvoro, Aleksandra [Genomic Medicine, Houston Methodist Research Institute, Houston, TX (United States); Amato, Angelica A. [Faculdade de Ciências da Saúde, Laboratório de Farmacologia Molecular, Universidade de Brasília (Brazil); Sieglaff, Douglas H.; Filgueira, Carly S.; Arumanayagam, Anithachristy Sigamani [Genomic Medicine, Houston Methodist Research Institute, Houston, TX (United States); Caro Alves de Lima, Maria do; Rocha Pitta, Ivan [Laboratório de Planejamento e Síntese de Fármacos – LPSF, Universidade Federal de Pernambuco (Brazil); Assis Rocha Neves, Francisco de [Faculdade de Ciências da Saúde, Laboratório de Farmacologia Molecular, Universidade de Brasília (Brazil); Webb, Paul, E-mail: [Genomic Medicine, Houston Methodist Research Institute, Houston, TX (United States)


    Thiazolidinediones (TZDs) are peroxisome proliferator-activated receptor gamma (PPARγ) agonists that improve insulin resistance but trigger side effects such as weight gain, edema, congestive heart failure and bone loss. GQ-16 is a PPARγ partial agonist that improves glucose tolerance and insulin sensitivity in mouse models of obesity and diabetes without inducing weight gain or edema. It is not clear whether GQ-16 acts as a partial agonist at all PPARγ target genes, or whether it displays gene-selective actions. To determine how GQ-16 influences PPARγ activity on a gene by gene basis, we compared effects of rosiglitazone (Rosi) and GQ-16 in mature 3T3-L1 adipocytes using microarray and qRT-PCR. Rosi changed expression of 1156 genes in 3T3-L1, but GQ-16 only changed 89 genes. GQ-16 generally showed weak effects upon Rosi induced genes, consistent with partial agonist actions, but a subset of modestly Rosi induced and strongly repressed genes displayed disproportionately strong GQ-16 responses. PPARγ partial agonists MLR24 and SR1664 also exhibit disproportionately strong effects on transcriptional repression. We conclude that GQ-16 displays a continuum of weak partial agonist effects but efficiently represses some negatively regulated PPARγ responsive genes. Strong repressive effects could contribute to physiologic actions of GQ-16. - Highlights: • GQ-16 is an insulin sensitizing PPARγ ligand with reduced harmful side effects. • GQ-16 displays a continuum of weak partial agonist activities at PPARγ-induced genes. • GQ-16 exerts strong repressive effects at a subset of genes. • These inhibitor actions should be evaluated in models of adipose tissue inflammation.

  6. Candidate gene analysis and exome sequencing confirm LBX1 as a susceptibility gene for idiopathic scoliosis. (United States)

    Grauers, Anna; Wang, Jingwen; Einarsdottir, Elisabet; Simony, Ane; Danielsson, Aina; Åkesson, Kristina; Ohlin, Acke; Halldin, Klas; Grabowski, Pawel; Tenne, Max; Laivuori, Hannele; Dahlman, Ingrid; Andersen, Mikkel; Christensen, Steen Bach; Karlsson, Magnus K; Jiao, Hong; Kere, Juha; Gerdhem, Paul


    Idiopathic scoliosis is a spinal deformity affecting approximately 3% of otherwise healthy children or adolescents. The etiology is still largely unknown but has an important genetic component. Genome-wide association studies have identified a number of common genetic variants that are significantly associated with idiopathic scoliosis in Asian and Caucasian populations, rs11190870 close to the LBX1 gene being the most replicated finding. The aim of the present study was to investigate the genetics of idiopathic scoliosis in a Scandinavian cohort by performing a candidate gene study of four variants previously shown to be associated with idiopathic scoliosis and exome sequencing of idiopathic scoliosis patients with a severe phenotype to identify possible novel scoliosis risk variants. This was a case control study. A total of 1,739 patients with idiopathic scoliosis and 1,812 controls were included. The outcome measure was idiopathic scoliosis. The variants rs10510181, rs11190870, rs12946942, and rs6570507 were genotyped in 1,739 patients with idiopathic scoliosis and 1,812 controls. Exome sequencing was performed on pooled samples from 100 surgically treated idiopathic scoliosis patients. Novel or rare missense, nonsense, or splice site variants were selected for individual genotyping in the 1,739 cases and 1,812 controls. In addition, the 5'UTR, noncoding exon and promoter regions of LBX1, not covered by exome sequencing, were Sanger sequenced in the 100 pooled samples. Of the four candidate genes, an intergenic variant, rs11190870, downstream of the LBX1 gene, showed a highly significant association to idiopathic scoliosis in 1,739 cases and 1,812 controls (p=7.0×10(-18)). We identified 20 novel variants by exome sequencing after filtration and an initial genotyping validation. However, we could not verify any association to idiopathic scoliosis in the large cohort of 1,739 cases and 1,812 controls. We did not find any variants in the 5'UTR, noncoding exon and

  7. A new clade, based on partial LSU rDNA sequences, of unarmoured dinoflagellates. (United States)

    Reñé, Albert; de Salas, Miguel; Camp, Jordi; Balagué, Vanessa; Garcés, Esther


    The order Gymnodiniales comprises unarmoured dinoflagellates. However, the lack of sequences hindered determining the phylogenetic positions and systematic relationships of several gymnodinioid taxa. In this study, a monophyletic clade was defined for the species Ceratoperidinium margalefii Loeblich III, Gyrodinium falcatum Kofoid & Swezy, three Cochlodinium species, and two Gymnodinium-like dinoflagellates. Despite their substantial morphotypic differentiation, Cochlodinium cf. helix, G. falcatum and 'Gymnodinium' sp. 1 share a common shape of the acrobase. The phylogenetic data led to the following conclusions: (1) C. margalefii is closely related to several unarmoured dinoflagellates. Its sulcus shape has been observed for the first time. (2) G. falcatum was erroneously assigned to the genus Gyrodinium and is transferred to Ceratoperidinium (C. falcatum (Kofoid & Swezy) Reñé & de Salas comb. nov.). (3) The genus Cochlodinium is polyphyletic and thus artificial; our data support its separation into three different genera. (4) The two Gymnodinium-like species could not be morphologically or phylogenetically related to any other gymnodinioid species sequenced to date. While not all studied species have been definitively transferred to the correct genus, our study is a step forward in the classification of inconspicuous unarmoured dinoflagellates. The family Ceratoperidiniaeceae and the genus Ceratoperidinium are emended.

  8. Plasmodium falciparum antigenic variation. Mapping mosaic var gene sequences onto a network of shared, highly polymorphic sequence blocks. (United States)

    Bull, Peter C; Buckee, Caroline O; Kyes, Sue; Kortok, Moses M; Thathy, Vandana; Guyah, Bernard; Stoute, José A; Newbold, Chris I; Marsh, Kevin


    Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1) is a potentially important family of immune targets, encoded by an extremely diverse gene family called var. Understanding of the genetic organization of var genes is hampered by sequence mosaicism that results from a long history of non-homologous recombination. Here we have used software designed to analyse social networks to visualize the relationships between large collections of short var sequences tags sampled from clinical parasite isolates. In this approach, two sequences are connected if they share one or more highly polymorphic sequence blocks. The results show that the majority of analysed sequences including several var-like sequences from the chimpanzee parasite Plasmodium reichenowi can be either directly or indirectly linked together in a single unbroken network. However, the network is highly structured and contains putative subgroups of recombining sequences. The major subgroup contains the previously described group A var genes, previously proposed to be genetically distinct. Another subgroup contains sequences found to be associated with rosetting, a parasite virulence phenotype. The mosaic structure of the sequences and their division into subgroups may reflect the conflicting problems of maximizing antigenic diversity and minimizing epitope sharing between variants while maintaining their host cell binding functions.

  9. Facilitating genome navigation : survey sequencing and dense radiation-hybrid gene mapping

    NARCIS (Netherlands)

    Hitte, C; Madeoy, J; Kirkness, EF; Priat, C; Lorentzen, TD; Senger, F; Thomas, D; Derrien, T; Ramirez, C; Scott, C; Evanno, G; Pullar, B; Cadieu, E; Oza, [No Value; Lourgant, K; Jaffe, DB; Tacher, S; Dreano, S; Berkova, N; Andre, C; Deloukas, P; Fraser, C; Lindblad-Toh, K; Ostrander, EA; Galibert, F


    Accurate and comprehensive sequence coverage for large genomes has been restricted to only a few species of specific interest. Lower sequence coverage (survey sequencing) of related species can yield a wealth of information about gene content and putative regulatory elements. But survey sequences la

  10. Genome-wide gene-gene interaction analysis for next-generation sequencing. (United States)

    Zhao, Jinying; Zhu, Yun; Xiong, Momiao


    The critical barrier in interaction analysis for next-generation sequencing (NGS) data is that the traditional pairwise interaction analysis that is suitable for common variants is difficult to apply to rare variants because of their prohibitive computational time, large number of tests and low power. The great challenges for successful detection of interactions with NGS data are (1) the demands in the paradigm of changes in interaction analysis; (2) severe multiple testing; and (3) heavy computations. To meet these challenges, we shift the paradigm of interaction analysis between two SNPs to interaction analysis between two genomic regions. In other words, we take a gene as a unit of analysis and use functional data analysis techniques as dimensional reduction tools to develop a novel statistic to collectively test interaction between all possible pairs of SNPs within two genome regions. By intensive simulations, we demonstrate that the functional logistic regression for interaction analysis has the correct type 1 error rates and higher power to detect interaction than the currently used methods. The proposed method was applied to a coronary artery disease dataset from the Wellcome Trust Case Control Consortium (WTCCC) study and the Framingham Heart Study (FHS) dataset, and the early-onset myocardial infarction (EOMI) exome sequence datasets with European origin from the NHLBI's Exome Sequencing Project. We discovered that 6 of 27 pairs of significantly interacted genes in the FHS were replicated in the independent WTCCC study and 24 pairs of significantly interacted genes after applying Bonferroni correction in the EOMI study.

  11. Phylogeny of canine coronavirus (CCoV from Brazilian dogs based on membrane protein partial sequences

    Directory of Open Access Journals (Sweden)

    M.P. Guirao


    Full Text Available Este artigo descreve a anteriormente desconhecida diversidade molecular de amostras brasileiras de Coronavírus canino (CCoV. Vinte e duas amostras foram submetidas à análise da sequência parcial do gene codificador da proteína de membrana, sendo 12 classificadas como CCoV Tipo II e 10 como CCoV Tipo I e uma possível sublinhagem tipicamente brasileira foi encontrada para o CCoV Tipo II.

  12. Nucleotide sequence of the structural gene for tryptophanase of Escherichia coli K-12.


    Deeley, M C; Yanofsky, C


    The tryptophanase structural gene, tnaA, of Escherichia coli K-12 was cloned and sequenced. The size, amino acid composition, and sequence of the protein predicted from the nucleotide sequence agree with protein structure data previously acquired by others for the tryptophanase of E. coli B. Physiological data indicated that the region controlling expression of tnaA was present in the cloned segment. Sequence data suggested that a second structural gene of unknown function was located distal ...

  13. Tandem gene arrays in Trypanosoma brucei: Comparative phylogenomic analysis of duplicate sequence variation

    Directory of Open Access Journals (Sweden)

    Jackson Andrew P


    Full Text Available Abstract Background The genome sequence of the protistan parasite Trypanosoma brucei contains many tandem gene arrays. Gene duplicates are created through tandem duplication and are expressed through polycistronic transcription, suggesting that the primary purpose of long, tandem arrays is to increase gene dosage in an environment where individual gene promoters are absent. This report presents the first account of the tandem gene arrays in the T. brucei genome, employing several related genome sequences to establish how variation is created and removed. Results A systematic survey of tandem gene arrays showed that substantial sequence variation existed across the genome; variation from different regions of an array often produced inconsistent phylogenetic affinities. Phylogenetic relationships of gene duplicates were consistent with concerted evolution being a widespread homogenising force. However, tandem duplicates were not usually identical; therefore, any homogenising effect was coincident with divergence among duplicates. Allelic gene conversion was detected using various criteria and was apparently able to both remove and introduce sequence variation. Tandem arrays containing structural heterogeneity demonstrated how sequence homogenisation and differentiation can occur within a single locus. Conclusion The use of multiple genome sequences in a comparative analysis of tandem gene arrays identified substantial sequence variation among gene duplicates. The distribution of sequence variation is determined by a dynamic balance of conservative and innovative evolutionary forces. Gene trees from various species showed that intraspecific duplicates evolve in concert, perhaps through frequent gene conversion, although this does not prevent sequence divergence, especially where structural heterogeneity physically separates a duplicate from its neighbours. In describing dynamics of sequence variation that have consequences beyond gene dosage, this

  14. Phylogenetic analysis of vibrios and related species by means of atpA gene sequences. (United States)

    Thompson, Cristiane C; Thompson, Fabiano L; Vicente, Ana Carolina P; Swings, Jean


    We investigated the use of atpA gene sequences as alternative phylogenetic and identification markers for vibrios. A fragment of 1322 bp (corresponding to approximately 88% of the coding region) was analysed in 151 strains of vibrios. The relationships observed were in agreement with the phylogeny inferred from 16S rRNA gene sequence analysis. For instance, the Vibrio cholerae, Vibrio halioticoli, Vibrio harveyi and Vibrio splendidus species groups appeared in the atpA gene phylogenetic analyses, suggesting that these groups may be considered as separate genera within the current Vibrio genus. Overall, atpA gene sequences appeared to be more discriminatory for species differentiation than 16S rRNA gene sequences. 16S rRNA gene sequence similarities above 97% corresponded to atpA gene sequences similarities above 80%. The intraspecies variation in the atpA gene sequence was about 99% sequence similarity. The results showed clearly that atpA gene sequences are a suitable alternative for the identification and phylogenetic study of vibrios.

  15. Characterization of sulphonamide-resistant Escherichia coli using comparison of sul2 gene sequences and multilocus sequence typing

    DEFF Research Database (Denmark)

    Trobos, Margarita; Christensen, Henrik; Sunde, Marianne


    The sul2 gene encodes sulphonamide resistance (Sul(R)) and is commonly found in Escherichia coli from different hosts. We typed E coli isolates by multilocus sequence typing (MLST) and compared the results to sequence variation of sul2, in order to investigate the relation to host origin of patho......The sul2 gene encodes sulphonamide resistance (Sul(R)) and is commonly found in Escherichia coli from different hosts. We typed E coli isolates by multilocus sequence typing (MLST) and compared the results to sequence variation of sul2, in order to investigate the relation to host origin...... of pathogenic and commensal E coli strains and to investigate whether transfer of sul2 into different genomic lineages has happened multiple times. Sixty-eight E coli isolated in Denmark and Norway from different hosts and years were MLST typed and sul2 PCR products were sequenced and compared. PFGE...

  16. Defining reference sequences for Nocardia species by similarity and clustering analyses of 16S rRNA gene sequence data.

    Directory of Open Access Journals (Sweden)

    Manal Helal

    Full Text Available BACKGROUND: The intra- and inter-species genetic diversity of bacteria and the absence of 'reference', or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S rRNA gene with a defined species in GenBank, and 110 sequences of 16S rRNA gene with no defined species, all within the genus Nocardia. METHODS: A total of 364 16S rRNA gene sequences of Nocardia species were studied. In addition, 110 16S rRNA gene sequences assigned only to the Nocardia genus level at the time of submission to GenBank were used for machine learning classification experiments. Different clustering algorithms were compared with a novel algorithm or the linear mapping (LM of the distance matrix. Principal Components Analysis was used for the dimensionality reduction and visualization. RESULTS: The LM algorithm achieved the highest performance and classified the set of 364 16S rRNA sequences into 80 clusters, the majority of which (83.52% corresponded with the original species. The most representative 16S rRNA sequences for individual Nocardia species have been identified as 'centroids' in respective clusters from which the distances to all other sequences were minimized; 110 16S rRNA gene sequences with identifications recorded only at the genus level were classified using machine learning methods. Simple kNN machine learning demonstrated the highest performance and classified Nocardia species sequences with an accuracy of 92.7% and a mean frequency of 0.578. CONCLUSION: The identification of centroids of 16S rRNA gene sequence clusters using novel distance matrix clustering enables the identification of the most representative sequences for each individual species of Nocardia and allows the quantitation of inter- and intra

  17. Pathogenesis-related proteins in Brazilian wheat genotypes: protein induction and partial gene sequencing Proteínas relacionadas à patogênese em genótipos brasileiros de trigo: indução e seqüenciamento parcial

    Directory of Open Access Journals (Sweden)

    Loreta Brandão de Freitas


    Full Text Available Leaves from 14 Brazilian genotypes of Triticum aestivum L. were treated with salicylic acid to induce pathogenesis-related (PR proteins. Inter and intracellular extracts were then obtained and investigated through polyacrilamide gel electrophoresis. Seven bands were observed. Material related to two of them (of 40 and 24 kDa occurred in intracellular spaces only. DNA from these same genotypes was then amplified through PCR using primers developed from three sequences encoding PR proteins, and compared with previously described sequences. The fragments presented homologies to PR groups 1, 3 (chitinases, and 5 (thaumatin-like. The PR3-like sequence also showed a site characteristic of PRs induced by ethylene and a portion without homology with previous sequences. No variation among genotypes were observed, either for protein extracts or DNA sequences.Folhas de 14 genótipos brasileiros de Triticum aestivum L. foram tratadas com ácido salicílico para a indução de proteínas relacionadas à patogênese (PR. Extratos inter e intracelulares foram assim obtidos e estudados através de eletroforese em gel de poliacrilamida. Sete bandas foram observadas, sendo que o material referente a duas delas (de 40 e 24 kDa foi detectado somente nos espaços intracelulares. O DNA desses mesmos genótipos foi então amplificado através de PCR, usando iniciadores desenvolvidos a partir de três seqüências que codificam proteínas PR, e comparados com seqüências previamente descritas. Eles apresentaram homologia com os grupos PR 1, PR 3 (quitinases e PR 5 (semelhante à taumatina, sendo que a seqüência do grupo PR 3 apresentou também um sítio característico de PRs induzidas pelo etileno e uma porção sem homologia com seqüências prévias. Não foi observada qualquer variação entre genótipos, seja nos extratos protéicos ou nas seqüências de DNA.

  18. Gene Expression Profiles in Living Donors Immediately After Partial Hepatectomy—The Initial Response of Liver Regeneration

    Directory of Open Access Journals (Sweden)

    Cheng-Maw Ho


    Conclusion: Gene expression profiles immediately after partial hepatectomy were reported first in humans with the techniques of oligo DNA microarray, which were compatible with the initial gene expression patterns of liver regeneration in rats. [J Formos Med Assoc 2007;106(4:288-294

  19. Targeted RNA Sequencing Assay to Characterize Gene Expression and Genomic Alterations (United States)

    Martin, Dorrelyn P.; Miya, Jharna; Reeser, Julie W.; Roychowdhury, Sameek


    RNA sequencing (RNAseq) is a versatile method that can be utilized to detect and characterize gene expression, mutations, gene fusions, and noncoding RNAs. Standard RNAseq requires 30 – 100 million sequencing reads and can include multiple RNA products such as mRNA and noncoding RNAs. We demonstrate how targeted RNAseq (capture) permits a focused study on selected RNA products using a desktop sequencer. RNAseq capture can characterize unannotated, low, or transiently expressed transcripts that may otherwise be missed using traditional RNAseq methods. Here we describe the extraction of RNA from cell lines, ribosomal RNA depletion, cDNA synthesis, preparation of barcoded libraries, hybridization and capture of targeted transcripts and multiplex sequencing on a desktop sequencer. We also outline the computational analysis pipeline, which includes quality control assessment, alignment, fusion detection, gene expression quantification and identification of single nucleotide variants. This assay allows for targeted transcript sequencing to characterize gene expression, gene fusions, and mutations. PMID:27585245

  20. Partial protoporphyrinogen oxidase (PPOX gene deletions, due to different Alu-mediated mechanisms, identified by MLPA analysis in patients with variegate porphyria

    Directory of Open Access Journals (Sweden)

    Barbaro Michela


    Full Text Available Abstract Variegate porphyria (VP is an autosomal dominantly inherited hepatic porphyria. The genetic defect in the PPOX gene leads to a partial defect of protoporphyrinogen oxidase, the penultimate enzyme of heme biosynthesis. Affected individuals can develop cutaneous symptoms in sun-exposed areas of the skin and/or neuropsychiatric acute attacks. The identification of the genetic defect in VP families is of crucial importance to detect the carrier status which allows counseling to prevent potentially life threatening neurovisceral attacks, usually triggered by factors such as certain drugs, alcohol or fasting. In a total of 31 Swedish VP families sequence analysis had identified a genetic defect in 26. In the remaining five families an extended genetic investigation was necessary. After the development of a synthetic probe set, MLPA analysis to screen for single exon deletions/duplications was performed. We describe here, for the first time, two partial deletions within the PPOX gene detected by MLPA analysis. One deletion affects exon 5 and 6 (c.339-197_616+320del1099 and has been identified in four families, most probably after a founder effect. The other extends from exon 5 to exon 9 (c.339-350_987+229del2609 and was found in one family. We show that both deletions are mediated by Alu repeats. Our findings emphasize the usefulness of MLPA analysis as a complement to PPOX gene sequencing analysis for comprehensive genetic diagnostics in patients with VP.

  1. Characterizations of Chinese isolates of Coxiella burnetii in the com1 gene sequence

    Institute of Scientific and Technical Information of China (English)

    YU Quan; ZHANG Guo-quan; FUKUSHI Hideto; YAMAGUCHI Tsuyoshi; HIRAI Katsuya


    Objective: To know some genetical characterizations of Coxiella burnetii Chinese isolates by comparing the com1 gene sequence. Methods: com1 gene sequences of Chinese isolates were amplified, sequenced, and analyzed by comparing our result and the previous published data. Results: Three different com1 sequences were identified in 7 Chinese isolates. Sequence comparison indicated that the isolates harboring the QpRS plasmid could be defined as a new group and, in addition, the isolates carrying the same plasmid type showed similar com1 gene sequence. Conclusion: Study suggests that the classification of the group based on the com1 gene sequence is highly associated with the plasmid type of the isolates and, however, little related to disease forms and geographical origins of the isolates.

  2. Colorimetric biosensing of targeted gene sequence using dual nanoparticle platforms

    Directory of Open Access Journals (Sweden)

    Thavanathan J


    Full Text Available Jeevan Thavanathan,1 Nay Ming Huang,1 Kwai Lin Thong2 1Low Dimension Material Research Center, Department of Physics, 2Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia Abstract: We have developed a colorimetric biosensor using a dual platform of gold nanoparticles and graphene oxide sheets for the detection of Salmonella enterica. The presence of the invA gene in S. enterica causes a change in color of the biosensor from its original pinkish-red to a light purplish solution. This occurs through the aggregation of the primary gold nanoparticles–conjugated DNA probe onto the surface of the secondary graphene oxide–conjugated DNA probe through DNA hybridization with the targeted DNA sequence. Spectrophotometry analysis showed a shift in wavelength from 525 nm to 600 nm with 1 µM of DNA target. Specificity testing revealed that the biosensor was able to detect various serovars of the S. enterica while no color change was observed with the other bacterial species. Sensitivity testing revealed the limit of detection was at 1 nM of DNA target. This proves the effectiveness of the biosensor in the detection of S. enterica through DNA hybridization. Keywords: biosensor, DNA hybridization, DNA probe, gold nanoparticles, graphene oxide, Salmonella enterica

  3. Isolation and characterization of gene sequences expressed in cotton fiber

    Directory of Open Access Journals (Sweden)

    Taciana de Carvalho Coutinho


    Full Text Available ABSTRACT Cotton fiber are tubular cells which develop from the differentiation of ovule epidermis. In addition to being one of the most important natural fiber of the textile group, cotton fiber afford an excellent experimental system for studying the cell wall. The aim of this work was to isolate and characterise the genes expressed in cotton fiber (Gossypium hirsutum L. to be used in future work in cotton breeding. Fiber of the cotton cultivar CNPA ITA 90 II were used to extract RNA for the subsequent generation of a cDNA library. Seventeen sequences were obtained, of which 14 were already described in the NCBI database (National Centre for Biotechnology Information, such as those encoding the lipid transfer proteins (LTPs and arabinogalactans (AGP. However, other cDNAs such as the B05 clone, which displays homology with the glycosyltransferases, have still not been described for this crop. Nevertheless, results showed that several clones obtained in this study are associated with cell wall proteins, wall-modifying enzymes and lipid transfer proteins directly involved in fiber development.

  4. High frequency of additional gene mutations in acute myeloid leukemia with MLL partial tandem duplication: DNMT3A mutation is associated with poor prognosis. (United States)

    Kao, Hsiao-Wen; Liang, D Cherng; Kuo, Ming-Chung; Wu, Jin-Hou; Dunn, Po; Wang, Po-Nan; Lin, Tung-Liang; Shih, Yu-Shu; Liang, Sung-Tzu; Lin, Tung-Huei; Lai, Chen-Yu; Lin, Chun-Hui; Shih, Lee-Yung


    The mutational profiles of acute myeloid leukemia (AML) with partial tandem duplication of mixed-lineage leukemia gene (MLL-PTD) have not been comprehensively studied. We studied 19 gene mutations for 98 patients with MLL-PTD AML to determine the mutation frequency and clinical correlations. MLL-PTD was screened by reverse-transcriptase PCR and confirmed by real-time quantitative PCR. The mutational analyses were performed with PCR-based assays followed by direct sequencing. Gene mutations of signaling pathways occurred in 63.3% of patients, with FLT3-ITD (44.9%) and FLT3-TKD (13.3%) being the most frequent. 66% of patients had gene mutations involving epigenetic regulation, and DNMT3A (32.7%), IDH2 (18.4%), TET2 (18.4%), and IDH1 (10.2%) mutations were most common. Genes of transcription pathways and tumor suppressors accounted for 23.5% and 10.2% of patients. RUNX1 mutation occurred in 23.5% of patients, while none had NPM1 or double CEBPA mutation. 90.8% of MLL-PTD AML patients had at least one additional gene mutation. Of 55 MLL-PTD AML patients who received standard chemotherapy, age older than 50 years and DNMT3A mutation were associated with inferior outcome. In conclusion, gene mutations involving DNA methylation and activated signaling pathway were common co-existed gene mutations. DNMT3A mutation was a poor prognostic factor in MLL-PTD AML.

  5. Isolation and partial sequence of a Kunitz-type elastase specific inhibitor from marama bean (Tylosema esculentum). (United States)

    Nadaraja, Deepa; Weintraub, Susan T; Hakala, Kevin W; Sherman, Nicholas E; Starcher, Barry


    An isolation procedure utilizing ammonium sulfate fractionation and affinity chromatography was used to purify an elastase inhibitor present in large amounts in marama beans (Tylosema esculentum). The protein appeared to be heterogeneous due to carbohydrate differences, demonstrating two bands on SDS gels with molecular weights of 17.8 kDa and 20 kDa. Partial sequence, derived from mass spectrometry, indicated that the protein is a Kunitz-type inhibitor distinct from other known plant serine protease inhibitors. The marama bean inhibitor is specific for elastase, with very low K(i) for both pancreatic and neutrophil elastase. The quantity of elastase inhibitor present in marama beans is many times greater than in soybean or any other bean or nut source reported to date. This raises the question of why a bean found in an arid corner of the Kalahari Desert would be so rich in a very potent elastase inhibitor.

  6. Effects of phenol on physicochemical properties and treatment performances of partial nitrifying granules in sequencing batch reactors

    Directory of Open Access Journals (Sweden)

    Mingming Gao


    Full Text Available This study attempts to investigate the effect of phenol on physicochemical properties and treatment performances of partial nitrifying granules (PNGs. Two sequencing batch reactors (SBRs fed with synthetic ammonium wastewaters were operated in absence (R1 or presence (R2 of phenol. The PNGs in R1 maintained excellent partial nitrification performance and relatively stable physicochemical properties, and exhibited compact and regular shaped structure with a cocci-dominant surface. However, as phenol concentration was stepwise increased from 0 to 300 mg/L in R2, filamentous bacteria appeared and gradually dominated within granules, which in turn resulted in settleability deterioration. Most notably, granules in R2 got easier to agglomerate in the reactor walls and then been washed out with effluent, leading to significant biomass loss, frequent outflow pipe blockage, and eventual system failure. The extracellular polymeric substances (EPS contents including proteins and polysaccharides in R2 reached 1.8 and 1.7 times of that in R1, respectively, indicating that the presence of phenol played an important role on EPS production. Removal efficiency of ammonium and phenol remained high, but dropped sharply when phenol concentration reached 300 mg/L. Moreover, the failed maintenance of partial nitrification was observed due to the revival of nitrite oxidizing bacteria (NOB within granules after phenol exposure, which was confirmed by quantitative fluorescence in situ hybridization (FISH analysis. Overall this study demonstrates that phenol had negative effects on PNGs, and pretreatment to eliminate phenolic substances is recommended when using PNGs for wastewater treatment.

  7. Bacteriological analysis of saliva from partially or fully engorged female adult Rhipicephalus microplus by next-generation sequencing. (United States)

    Xiang, Liangliang; Poźniak, Błażej; Cheng, Tian-Yin


    Tick-borne diseases are a major epidemiological problem worldwide. The aim of this study was to investigate the bacterial composition of saliva obtained from engorged adult Rhipicephalus microplus females. Saliva samples collected from partially or fully engorged adult female ticks were analysed using an ultra-high-throughput Illumina HiSeq 2500 sequencing system. To elucidate the possible routes of bacterial transmission, the bacterial flora from whole ticks were also investigated. Proteobacteria, Firmicutes, and Actinobacteria were the predominant phyla in all samples, and Acinetobacter, Rickettsia, Escherichia and Coxiella were the major genera. Microbial diversity in saliva samples from partially engorged ticks was more complex than that of samples from fully engorged individuals. The comparison of saliva and whole-tick samples suggests that bacteria in saliva also colonize the tick's body. We believe that some bacterial genera, such as Dermacoccus, Achromia, SMB53, Sutterella, Providencia, Mycoplana, Oscillospira, and Agrobacterium, were found and reported in ticks for the first time. The Coxiella and Rickettsia detected in this study might be tick-borne pathogens, suggesting health risks associated with exposure to R. microplus in humans and animals. These findings may serve as the basis for developing strategies to control ticks and tick-borne diseases.

  8. An internal part of the chloroplast atpA gene sequence is present in the mitochondrial genome of Triticum aestivum: molecular organisation and evolutionary aspects. (United States)

    Jubier, M F; Lucas, H; Delcher, E; Hartmann, C; Quétier, F; Lejeune, B


    An internal part of the chloroplast atpA gene has been identified in the mitochondrial DNA of Triticum aestivum. It is located near the 18S-5S ribosomal genes and partially contained within a repeated sequence. Comparison of the transferred sequence with the original ct sequence reveals several nucleotide changes and shows that neither 5' nor 3' ends are present in the mt genome. No transcript of this region could be detected by Northern analysis. This sequence is present in mitochondrial genomes of other tetraploid and diploid species of Triticum, also in the vicinity of the 18S-5S ribosomal genes, suggesting a unique transfer event. The date of this event is discussed.


    Apichat, Vitta; Narongrit, Srisongcram; Jittranuch, Thiproaj; Anucha, Wongma; Wilaiwan, Polsut; Chamaiporn, Fukruksa; Thatcha, Yimthin; Bandid, Mangkit; Aunchalee, Thanwisai; Paron, Dekumyoy


    Angiostrongylus cantonensis is an emerging infectious agent causing eosinophilic meningitis or meningoencephalitis in humans with clinical manifestation of severe headache. Molecular genetic studies on classification and phylogeny of A. cantonensis in Thailand are limited. This study surveyed A. cantonensis larvae prevalence in natural intermediate hosts across Thailand and analyzed their phylogenetic relationships. A total of 14,032 freshwater and land snails were collected from 19 provinces of Thailand. None of Filopaludina sp, Pomacea sp, and Cyclophorus sp were infected with Angiostrongylus larvae, whereas Achatina fulica, Cryptozona siamensis, and Megaustenia siamensis collected from Kalasin, Kamphaeng Phet, Phetchabun, Phitsanulok, and Tak Provinces were infected, with C. siamensis being the common intermediate host. Based on morphology, larvae isolated from 11 samples of these naturally infected snails preliminarily were identified as A. cantonensis. Comparison of partial nucleotide sequences of cytochrome c oxidase subunit I gene revealed that four sequences are identical to A. cantonensis haplotype ac4 from Bangkok and the other seven to that of A. cantonensis isolate AC Thai, indicating two independent lineages of A. cantonensis in Thailand.

  10. Nucleotide sequences related to the transforming gene of avian sarcoma virus are present in DNA of uninfected vertebrates. (United States)

    Spector, D H; Varmus, H E; Bishop, J M


    We have detected nucleotide sequences related to the transforming gene of avian sarcoma vius (ASV) in the DNA of uninfected vertebrates. Purified radioactive DNA (cDNAsarc) complementary to most of all of the gene (src) required for transformation of fibroblasts by ASV was annealed with DNA from a variety of normal species. Under conditions that facilitate pairing of partially matched nucleotide sequences (1.5 M NaCl, 59 degrees), cDNAsarc formed duplexes with chicken, human, calf, mouse, and salmon DNA but not with DNA from sea urchin, Drosophila, or Escherichia coli. The kinetics of duplex formation indicated that cDNAsarc was reacting with nucleotide sequences present in a single copy or at most a few copies per cell. In contrast to the preceding findings, nucleotide sequences complementary to the remainder of the ASV genome were observed only in chicken DNA. Thermal denaturation studies of the duplexes formed with cDNAsarc indicated a high degree of conservation of the nucleotide sequences related to src in vertebrate DNAs; the reductions in melting temperature suggested about 3--4% mismatching of cDNAsarc with chicken DNA and 8--10% mismatching of cDNAsarc with the other vertebrate DNAs.

  11. Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

    Directory of Open Access Journals (Sweden)

    Khan Shafiq A


    Full Text Available Abstract Background Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish genomes. Results 735 distinct sequences were analyzed for their relatedness to known sequences in public databases. Eight of these sequences were derived from previously unidentified genes and expression of these genes in testis was confirmed by Northern blotting. The genomic locations of each sequence were mapped in human, mouse, rat and pufferfish, where applicable, and the structure of their cognate genes was derived using computer-based predictions, genomic comparisons and analysis of uncharacterized cDNA sequences from human and macaque. Conclusion The use of a comparative genomics approach resulted in the identification of eight cDNAs that correspond to previously uncharacterized genes in the human genome. The proteins encoded by these genes included a new member of the kinesin superfamily, a SET/MYND-domain protein, and six proteins for which no specific function could be predicted. Each gene was expressed primarily in testis, suggesting that they may play roles in the development and/or function of testicular cells.

  12. A Tudor protein with multiple SNc domains from pea seedlings: cellular localization, partial characterization, sequence analysis, and phylogenetic relationships. (United States)

    Abe, Shunnosuke; Sakai, Masako; Yagi, Kosaku; Hagino, Takehiko; Ochi, Katsumasa; Shibata, Koichi; Davies, Eric


    A major high molecular weight protein (HMP) in the cytoskeletal fraction from pea has been purified. A combination of chromatographic techniques and protease fragment analysis also facilitated the isolation of the encoding cDNA, disclosing the sequence of the complete open reading frame. The protein possesses four complete N-terminal Staphylococcal nuclease (SNc) domains, a central Tudor domain and a partial SNc domain at the C-terminus, which may act as a coiled-coil cytoskeleton interaction motif. Cell fractionation studies showed that the protein was abundant in the cytoskeleton fraction in dark-grown pea seedlings, but essentially was absent from the nucleus. Gel filtration column chromatography indicated that the native protein exists as a dimer, while isoelectric focusing suggested that there were at least four HMP isotypes. The protein co-eluted with ribosomes from a heparin affinity column in vitro, consistent with ribosome/polysome interactions in vivo. Significantly, sequence analysis of the C-terminal SNc motif may accurately predict nuclear versus cytoplasmic localization resulting in potentially very different functional roles for this protein family in different organisms. An antibody to HMP from peas was also raised and an HMP with a similar molecular mass was detected in the cytoskeleton fractions and to a lesser extent in the nuclear fraction (250 g pellet) from rice and wheat seedlings.

  13. Ancient DNA analyses of museum specimens from selected Presbytis (primate: Colobinae) based on partial Cyt b sequences (United States)

    Aifat, N. R.; Yaakop, S.; Md-Zain, B. M.


    The IUCN Red List of Threatened Species has categorized Malaysian primates from being data deficient to critically endanger. Thus, ancient DNA analyses hold great potential to understand phylogeny, phylogeography and population history of extinct and extant species. Museum samples are one of the alternatives to provide important sources of biological materials for a large proportion of ancient DNA studies. In this study, a total of six museum skin samples from species Presbytis hosei (4 samples) and Presbytis frontata (2 samples), aged between 43 and 124 years old were extracted to obtain the DNA. Extraction was done by using QIAGEN QIAamp DNA Investigator Kit and the ability of this kit to extract museum skin samples was tested by amplification of partial Cyt b sequence using species-specific designed primer. Two primer pairs were designed specifically for P. hosei and P. frontata, respectively. These primer pairs proved to be efficient in amplifying 200bp of the targeted species in the optimized PCR conditions. The performance of the sequences were tested to determine genetic distance of genus Presbytis in Malaysia. From the analyses, P. hosei is closely related to P. chrysomelas and P. frontata with the value of 0.095 and 0.106, respectively. Cyt b gave a clear data in determining relationships among Bornean species. Thus, with the optimized condition, museum specimens can be used for molecular systematic studies of the Malaysian primates.

  14. Partial Rescue of pos5 Mutants by YEF1 and UTR1 Genes in Saccharomyces cerevisiae

    Institute of Scientific and Technical Information of China (English)

    Yong-Fu LI; Feng SHI


    Three NAD kinase homologs, encoded by UTR1, POS5 and YEF1 genes, are found in the yeast Saccharomyces cerevisiae and proven to be important sources of NADPH for the cell. Pos5p, existing in the mitochondrial matrix, is critical for higher temperature endurance and mitochondrial functions, such as glycerol usability and arginine biosynthesis. Through constructing the high-copy expression plasmids of YEF1 and UTR1, which contained the green fluorescent protein reporter tag at their 3' terminus, and introducing them into POS5 gene deletion mutants (i.e. pos5, utr1pos5, yef1pos5 and utr1yef1pos5), the high-copy YEF1 and UTR1 plasmids carrying transformants for pos5 mutants were obtained. Their temperature sensitivity and growth phenotype on media with glycerol as the sole carbon source, or on media without arginine, were checked. Results showed the partial rescue of mitochondrial dysfunctions and temperature sensitivity of pos5 mutants by the high-copy YEF1 gene, and of glycerol growth defect and temperature sensitivity by the high-copy UTR1 gene, which confirmed the potential supplying ability of Yef1p and Utr1p for mitochondrial NADP(H) and implied the weak transport of NADP from cytosol to mitochondria. However, even through the green fluorescent protein reporter label, the subcellular localization of Yef1p and Utr1p in yeast cells could not be observed, which indicated the low expression level of these two NAD kinase homologs.

  15. Function of Partially Duplicated Human α7 Nicotinic Receptor Subunit CHRFAM7A Gene (United States)

    de Lucas-Cerrillo, Ana M.; Maldifassi, M. Constanza; Arnalich, Francisco; Renart, Jaime; Atienza, Gema; Serantes, Rocío; Cruces, Jesús; Sánchez-Pacheco, Aurora; Andrés-Mateos, Eva; Montiel, Carmen


    The neuronal α7 nicotinic receptor subunit gene (CHRNA7) is partially duplicated in the human genome forming a hybrid gene (CHRFAM7A) with the novel FAM7A gene. The hybrid gene transcript, dupα7, has been identified in brain, immune cells, and the HL-60 cell line, although its translation and function are still unknown. In this study, dupα7 cDNA has been cloned and expressed in GH4C1 cells and Xenopus oocytes to study the pattern and functional role of the expressed protein. Our results reveal that dupα7 transcript was natively translated in HL-60 cells and heterologously expressed in GH4C1 cells and oocytes. Injection of dupα7 mRNA into oocytes failed to generate functional receptors, but when co-injected with α7 mRNA at α7/dupα7 ratios of 5:1, 2:1, 1:1, 1:5, and 1:10, it reduced the nicotine-elicited α7 current generated in control oocytes (α7 alone) by 26, 53, 75, 93, and 94%, respectively. This effect is mainly due to a reduction in the number of functional α7 receptors reaching the oocyte membrane, as deduced from α-bungarotoxin binding and fluorescent confocal assays. Two additional findings open the possibility that the dominant negative effect of dupα7 on α7 receptor activity observed in vitro could be extrapolated to in vivo situations. (i) Compared with α7 mRNA, basal dupα7 mRNA levels are substantial in human cerebral cortex and higher in macrophages. (ii) dupα7 mRNA levels in macrophages are down-regulated by IL-1β, LPS, and nicotine. Thus, dupα7 could modulate α7 receptor-mediated synaptic transmission and cholinergic anti-inflammatory response. PMID:21047781

  16. GBF-dependent family genes morphologically suppress the partially active Dictyostelium STATa strain. (United States)

    Shimada, Nao; Kanno-Tanabe, Naoko; Minemura, Kakeru; Kawata, Takefumi


    Transcription factor Dd-STATa, a functional Dictyostelium homologue of metazoan signal transducers and activators of transcription proteins, is necessary for culmination during development. We have isolated more than 18 putative multicopy suppressors of Dd-STATa using genetic screening. One was hssA gene, whose expression is known to be G-box-binding-factor-dependent and which was specific to prestalk A (pstA) cells, where Dd-STATa is activated. Also, hssA mRNA was expressed in pstA cells in the Dd-STATa-null mutant. At least 40 hssA-related genes are present in the genome and constitute a multigene family. The tagged HssA protein was translated; hssA encodes an unusually high-glycine-serine-rich small protein (8.37 kDa), which has strong homology to previously reported cyclic-adenosine-monophosphate-inducible 2C and 7E proteins. Overexpression of hssA mRNA as well as frame-shifted versions of hssA RNA suppressed the phenotype of the partially active Dd-STATa strain, suggesting that translation is not necessary for suppression. Although overexpression of prespore-specific genes among the family did not suppress the parental phenotype, prestalk-specific family members did. Although overexpression of the hssA did not revert the expression of Dd-STATa target genes, and although its suppression mechanism remains unknown, morphological reversion implies functional relationships between Dd-STATa and hssA.

  17. Candidate gene analysis and exome sequencing confirm LBX1 as a susceptibility gene for idiopathic scoliosis

    DEFF Research Database (Denmark)

    Grauers, Anna; Wang, Jingwen; Einarsdottir, Elisabet


    BACKGROUND CONTEXT: Idiopathic scoliosis is a spinal deformity affecting approximately 3% of otherwise healthy children or adolescents. The etiology is still largely unknown but has an important genetic component. Genome-wide association studies have identified a number of common genetic variants...... that are significantly associated with idiopathic scoliosis in Asian and Caucasian populations, rs11190870 close to the LBX1 gene being the most replicated finding. PURPOSE: The aim of the present study was to investigate the genetics of idiopathic scoliosis in a Scandinavian cohort by performing a candidate gene study...... of four variants previously shown to be associated with idiopathic scoliosis and exome sequencing of idiopathic scoliosis patients with a severe phenotype to identify possible novel scoliosis risk variants. STUDY DESIGN: This was a case control study. PATIENT SAMPLE: A total of 1,739 patients...


    Institute of Scientific and Technical Information of China (English)

    马巍; 吴玲; 王德利; 刘淼; 任惠民; 杨广笑; 王全颖


    Objective Molecular cloning and sequencing of the human matured fragment of human nerve growth factor(NGF) gene. Methods Extracting the human genomic DNA from the white blood cells as templates, the gene of NGF was cloned by using PCR and T-vector cloning method. Screening the positive clones and identified by the restriction enzymes, and then the cloned amplified fragment was sequenced and analyzed. Results DNA sequence comparison the cloned gene of NGF with the GenBank (V01511) sequence demonstrated that both of sequences were identical, 354bp length. Conclusion Cloning the NGF gene from the human genomic DNA has paved the way for further study on gene therapy of nerve system injury.

  19. Sequence characterization, polymorphism and chromosomal localizations of the porcine PSME1 and PSME2 genes

    NARCIS (Netherlands)

    Wang, Y.F.; Yu, M.; Pas, te M.F.W.; Yerle, M.; Liu, B.; Fan, B.; Xiong, T.; Li, K.


    The full-length cDNA of porcine genes (PSME1 and PSME2) encoding proteasome activators PA28¿- and ß-subunits were obtained by the rapid amplification of cDNA ends (RACE). The nucleotide sequences and the predicted protein sequences share high sequence identity with their mammalian counterparts. The

  20. Neural network predicts sequence of TP53 gene based on DNA chip

    DEFF Research Database (Denmark)

    Spicker, J.S.; Wikman, F.; Lu, M.L.;


    We have trained an artificial neural network to predict the sequence of the human TP53 tumor suppressor gene based on a p53 GeneChip. The trained neural network uses as input the fluorescence intensities of DNA hybridized to oligonucleotides on the surface of the chip and makes between zero...... and four errors in the predicted 1300 bp sequence when tested on wild-type TP53 sequence....

  1. Sequencing and bacterial expression of a novel murine alpha interferon gene

    Institute of Scientific and Technical Information of China (English)

    王焱; 王征宇; 周鸣南; 蔡菊娥; 孙兰英; 刘新垣; B.L.Daugherty; S.Pestka


    A murine new alpha interferon gene (mIFN-αB) was found by primer-based sequencing method in a murine genomic DNA library. The gene was cloned and its sequence was determined. It was expressed in Escherichia coli under the control of the PL promoter which resulted in antiviral activity on mouse L-cells. The sequence of mlFN-αB has been accepted by GENEBANK.

  2. Isolation of endophytic bacteria from arboreal species of the Amazon and identification by sequencing of the 16S rRNA encoding gene

    Directory of Open Access Journals (Sweden)

    Mariza M. Coêlho


    Full Text Available Endophytic bacteria from three arboreal species native to the Amazon (Carapa guianenses, Ceiba pentandra, and Swietenia macrophylla, were isolated and identified, through partial sequencing of the 16S rRNA encoding gene. From these, 16 isolates were obtained, although, when compared to sequences deposited in GenBank, only seven had produced identifiable fragments. Bacillus, Pantoea and two non-culturable samples were identified. Results obtained through sequence analysis revealed low genetic diversity across the isolates, even when analyzing different species and plant structures. This is the first report concerning the isolation and identification of endophytic bacteria in these plant species.

  3. Tawny owl (Strix aluco) and Hume's Tawny owl (Strix butleri) are distinct species: evidence from nucleotide sequences of the cytochrome b gene. (United States)

    Heidrich, P; Wink, M


    The cytochrome b gene of the Tawny Owl (Strix aluco), Hume's Tawny Owl (Strix butleri) and the African wood owl (Strix woodfordii) was amplified by polymerase chain reaction (PCR) and partially sequenced (300 base pairs). Sequences differ substantially (9 to 12% nucleotide substitutions) between these taxa indicating that they represent distinct species, which is also implicated from morphological and biogeographic differences. Using cytochrome b sequences of S. aluco, S. butleri, S. woodfordii, Athene noctua and Tyto alba phylogenetic relationship were reconstructed using the "maximum parsimony" principal (PAUP 3.1.1) and the neighbour-joining method (MEGA).

  4. Comparison of restriction enzyme pattern analysis and full gene sequencing of 16S rRNA gene for Nocardia species identification, the first report of Nocardia transvalensis isolated of sputum from Iran, and review of the literature. (United States)

    Fatahi-Bafghi, Mehdi; Heidarieh, Parvin; Rasouli-Nasab, Masoumeh; Habibnia, Shadi; Hashemi-Shahraki, Abdorazagh; Eshraghi, Seyyed Saeed


    Nocardial infections occur in different organs of the body and are common in immune disorder diseases of individuals. The aim of this study was to assess Nocardia species identification by phenotypic tests and molecular techniques applied to nocardiosis in Iranian patients. In the current study, various clinical samples were collected and cultured on conventional media and using the paraffin baiting method. Various phenotypic tests were performed. For accurate identification at the species level, restriction fragment length polymorphisms (RFLP) in the hsp65 and partial 16S rRNA genes and full gene sequencing of the 16S rRNA gene were used. Twenty-seven Nocardia spp. were isolated and analysis of phenotypic tests results showed Nocardia asteroides complex, Nocardia otitidiscaviarum, Nocardia nova, and Nocardia spp. New RFLP patterns of Nocardia strains with hsp65 and partial 16S rRNA genes were obtained. Full gene sequencing of the 16S rRNA gene identified Nocardia cyriacigeorgica, N. otitidiscaviarum, Nocardia farcinica, Nocardia transvalensis, and N. nova. Nocardia infections are rarely reported and this genus is the cause of various illnesses. Accurate identification of Nocardia spp. is important for epidemiology studies and treatment. It should also be noted that some species may have similar RFLP patterns; therefore, full gene sequencing of the 16S rRNA gene is necessary for confirmation.

  5. Annotation of a hybrid partial genome of the coffee rust (Hemileia vastatrix) contributes to the gene repertoire catalog of the Pucciniales. (United States)

    Cristancho, Marco A; Botero-Rozo, David Octavio; Giraldo, William; Tabima, Javier; Riaño-Pachón, Diego Mauricio; Escobar, Carolina; Rozo, Yomara; Rivera, Luis F; Durán, Andrés; Restrepo, Silvia; Eilam, Tamar; Anikster, Yehoshua; Gaitán, Alvaro L


    Coffee leaf rust caused by the fungus Hemileia vastatrix is the most damaging disease to coffee worldwide. The pathogen has recently appeared in multiple outbreaks in coffee producing countries resulting in significant yield losses and increases in costs related to its control. New races/isolates are constantly emerging as evidenced by the presence of the fungus in plants that were previously resistant. Genomic studies are opening new avenues for the study of the evolution of pathogens, the detailed description of plant-pathogen interactions and the development of molecular techniques for the identification of individual isolates. For this purpose we sequenced 8 different H. vastatrix isolates using NGS technologies and gathered partial genome assemblies due to the large repetitive content in the coffee rust hybrid genome; 74.4% of the assembled contigs harbor repetitive sequences. A hybrid assembly of 333 Mb was built based on the 8 isolates; this assembly was used for subsequent analyses. Analysis of the conserved gene space showed that the hybrid H. vastatrix genome, though highly fragmented, had a satisfactory level of completion with 91.94% of core protein-coding orthologous genes present. RNA-Seq from urediniospores was used to guide the de novo annotation of the H. vastatrix gene complement. In total, 14,445 genes organized in 3921 families were uncovered; a considerable proportion of the predicted proteins (73.8%) were homologous to other Pucciniales species genomes. Several gene families related to the fungal lifestyle were identified, particularly 483 predicted secreted proteins that represent candidate effector genes and will provide interesting hints to decipher virulence in the coffee rust fungus. The genome sequence of Hva will serve as a template to understand the molecular mechanisms used by this fungus to attack the coffee plant, to study the diversity of this species and for the development of molecular markers to distinguish races/isolates.

  6. Annotation of a hybrid partial genome of the Coffee Rust (Hemileia vastatrix contributes to the gene repertoire catalogue of the Pucciniales

    Directory of Open Access Journals (Sweden)

    Marco Aurelio Cristancho


    Full Text Available Coffee leaf rust caused by the fungus Hemileia vastatrix is the most damaging disease to coffee worldwide. The pathogen has recently appeared in multiple outbreaks in coffee producing countries resulting in significant yield losses and increases in costs related to its control. New races/isolates are constantly emerging as evidenced by the presence of the fungus in plants that were previously resistant. Genomic studies are opening new avenues for the study of the evolution of pathogens, the detailed description of plant-pathogen interactions and the development of molecular techniques for the identification of individual isolates. For this purpose we sequenced 8 different H. vastatrix isolates using NGS technologies and gathered partial genome assemblies due to the large repetitive content in the coffee rust hybrid genome; 74.4% of the assembled contigs harbor repetitive sequences. A hybrid assembly of 333Mb was built based on the 8 isolates; this assembly was used for subsequent analyses.Analysis of the conserved gene space showed that the hybrid H. vastatrix genome, though highly fragmented, had a satisfactory level of completion with 91.94% of core protein-coding orthologous genes present. RNA-Seq from urediniospores was used to guide the de novo annotation of the H. vastatrix gene complement. In total, 14,445 genes organized in 3,921 families were uncovered; a considerable proportion of the predicted proteins (73.8% were homologous to other Pucciniales species genomes. Several gene families related to the fungal lifestyle were identified, particularly 483 predicted secreted proteins that represent candidate effector genes and will provide interesting hints to decipher virulence in the coffee rust fungus. The genome sequence of Hva will serve as a template to understand the molecular mechanisms used by this fungus to attack the coffee plant, to study the diversity of this species and for the development of molecular markers to distinguish

  7. Partial antiviral activities detection of chicken Mx jointing with neuraminidase gene (NA against Newcastle disease virus.

    Directory of Open Access Journals (Sweden)

    Yani Zhang

    Full Text Available As an attempt to increase the resistance to Newcastle Disease Virus (NDV and so further reduction of its risk on the poultry industry. This work aimed to build the eukaryotic gene co-expression plasmid of neuraminidase (NA gene and myxo-virus resistance (Mx and detect the gene expression in transfected mouse fibroblasts (NIH-3T3 cells, it is most important to investigate the influence of the recombinant plasmid on the chicken embryonic fibroblasts (CEF cells. cDNA fragment of NA and mutant Mx gene were derived from pcDNA3.0-NA and pcDNA3.0-Mx plasmid via PCR, respectively, then NA and Mx cDNA fragment were inserted into the multiple cloning sites of pVITRO2 to generate the eukaryotic co-expression plasmid pVITRO2-Mx-NA. The recombinant plasmid was confirmed by restriction endonuclease treatment and sequencing, and it was transfected into the mouse fibroblasts (NIH-3T3 cells. The expression of genes in pVITRO2-Mx-NA were measured by RT-PCR and indirect immunofluorescence assay (IFA. The recombinant plasmid was transfected into CEF cells then RT-PCR and the micro-cell inhibition tests were used to test the antiviral activity for NDV. Our results showed that co-expression vector pVITRO2-Mx-NA was constructed successfully; the expression of Mx and NA could be detected in both NIH-3T3 and CEF cells. The recombinant proteins of Mx and NA protect CEF cells from NDV infection until after 72 h of incubation but the individually mutagenic Mx protein or NA protein protects CEF cells from NDV infection till 48 h post-infection, and co-transfection group decreased significantly NDV infection compared with single-gene transfection group (P<0. 05, indicating that Mx-NA jointing contributed to delaying the infection of NDV in single-cell level and the co-transfection of the jointed genes was more powerful than single one due to their synergistic effects.

  8. Cloning and sequencing of the bovine gastrin gene

    DEFF Research Database (Denmark)

    Lund, T; Rehfeld, J F; Olsen, Jørgen


    In order to deduce the primary structure of bovine preprogastrin we therefore sequenced a gastrin DNA clone isolated from a bovine liver cosmid library. Bovine preprogastrin comprises 104 amino acids and consists of a signal peptide, a 37 amino acid spacer-sequence, the gastrin-34 sequence followed...... by an amidation-site (Gly-Arg-Arg), and a C-terminal nonapeptide. Comparison with human, porcine, and rat cDNA sequences revealed extensive homology in the coding region as well as in short noncoding structures....

  9. Isolation and sequence analysis of Clpg1, a gene coding for an endopolygalacturonase of the phytopathogenic fungus Colletotrichum lindemuthianum. (United States)

    Centis, S; Dumas, B; Fournier, J; Marolda, M; Esquerré-Tugayé, M T


    Oligodeoxyribonucleotide primers designed from the N-terminal amino acid (aa) sequence of the endopolygalacturonase (EndoPG) of Colletotrichum lindemuthianum (Cl) race beta and from an internal sequence conserved among different fungal EndoPG were used in a polymerase chain reaction (PCR) to amplify genomic related sequences of the fungus. A 542-bp fragment, designated pgA, was obtained and used as a probe to screen a partial genomic library of Cl. Among the positive clones, one was further analyzed. Nucleotide sequencing of this clone revealed on ORF encoding a 363-amino-acid (aa) polypeptide beginning with a signal peptide of 26 aa interrupted by an intron of 70 bp, and showing a high degree of homology to ten fungal EndoPG sequences. Consensus sequences were identified in the 5' non-coding region. This genomic clone was thereafter designated Clpg1. Southern analysis, performed with a Clpg1-specific probe, showed that this gene is present as a single copy in the Cl genome.

  10. Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

    Directory of Open Access Journals (Sweden)

    Yang Jie


    Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2 of five species: horse (89%, human (90%, chimpanzee (89%, rhesus monkey (89% and mouse (85%; thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.

  11. A Probabilistic Genome-Wide Gene Reading Frame Sequence Model

    DEFF Research Database (Denmark)

    Have, Christian Theil; Mørk, Søren

    using the probabilistic logic programming language and machine learning system PRISM - a fast and efficient model prototyping environment, using bacterial gene finding performance as a benchmark of signal strength. The model is used to prune a set of gene predictions from an underlying gene finder...

  12. Characterization and phylogenetic analysis of -gliadin gene sequences reveals significant genomic divergence in Triticeae species

    Indian Academy of Sciences (India)

    Guang-Rong Li; Tao Lang; En-Nian Yang; Cheng Liu; Zu-Jun Yang


    Although the unique properties of wheat -gliadin gene family are well characterized, little is known about the evolution and genomic divergence of -gliadin gene family within the Triticeae. We isolated a total of 203 -gliadin gene sequences from 11 representative diploid and polyploid Triticeae species, and found 108 sequences putatively functional. Our results indicate that -gliadin genes may have possibly originated from wild Secale species, where the sequences contain the shortest repetitive domains and display minimum variation. A miniature inverted-repeat transposable element insertion is reported for the first time in -gliadin gene sequence of Thinopyrum intermedium in this study, indicating that the transposable element might have contributed to the diversification of -gliadin genes family among Triticeae genomes. The phylogenetic analyses revealed that the -gliadin gene sequences of Dasypyrum, Australopyrum, Lophopyrum, Eremopyrum and Pseudoroengeria species have amplified several times. A search for four typical toxic epitopes for celiac disease within the Triticeae -gliadin gene sequences showed that the -gliadins of wild Secale, Australopyrum and Agropyron genomes lack all four epitopes, while other Triticeae species have accumulated these epitopes, suggesting that the evolution of these toxic epitopes sequences occurred during the course of speciation, domestication or polyploidization of Triticeae.

  13. Partial nitritation of stored source-separated urine by granular activated sludge in a sequencing batch reactor. (United States)

    Chen, Liping; Yang, Xiaoxiao; Tian, Xiujun; Yao, Song; Li, Jiuyi; Wang, Aimin; Yao, Qian; Peng, Dangcong


    The combination of partial nitritation (PN) and anaerobic ammonium oxidation (anammox) has been proposed as an ideal process for nitrogen removal from source-separated urine, while the high organic matters in urine cause instability of single-stage PN-anammox process. This study aims to remove the organic matters and partially nitrify the nitrogen in urine, producing an ammonium/nitrite solution suitable for anammox. The organic matters in stored urine were used as the electron donors to achieve 40% total nitrogen removal in nitritation-denitrification process in a sequencing batch reactor (SBR). Granular aggregates were observed and high mixed liquor suspended solids (9.5 g/L) were maintained in the SBR. Around 70-75% ammonium was oxidized to nitrite under the volumetric loading rates of 3.23 kg chemical oxygen demand (COD)/(m(3) d) and 1.86 kg N/(m(3) d), respectively. The SBR produced an ammonium/nitrite solution free of biodegradable organic matters, with a NO2(-)-N:NH4(+)-N of 1.24 ± 0.13. Fluorescence in situ hybridization images showed that Nitrosomonas-like ammonium-oxidizing bacteria, accounting for 7.2% of total bacteria, located in the outer layer (25 μm), while heterotrophs distributed homogeneously throughout the granular aggregates. High concentrations of free ammonia and nitrous acids in the reactor severely inhibited the growth of nitrite-oxidizing bacteria, resulting in their absence in the granular sludge. The microbial diversity analysis indicated Proteobacteria was the predominant phylum, in which Pseudomonas was the most abundant genus.

  14. Molecular analysis of 16S rRNA genes identifies potentially periodontal pathogenic bacteria and archaea in the plaque of partially erupted third molars. (United States)

    Mansfield, J M; Campbell, J H; Bhandari, A R; Jesionowski, A M; Vickerman, M M


    Small subunit rRNA sequencing and phylogenetic analysis were used to identify cultivable and uncultivable microorganisms present in the dental plaque of symptomatic and asymptomatic partially erupted third molars to determine the prevalence of putative periodontal pathogens in pericoronal sites. Template DNA prepared from subgingival plaque collected from partially erupted symptomatic and asymptomatic mandibular third molars and healthy incisors was used in polymerase chain reaction with broad-range oligonucleotide primers to amplify 16S rRNA bacterial and archaeal genes. Amplicons were cloned, sequenced, and compared with known nucleotide sequences in online databases to identify the microorganisms present. Two thousand three hundred two clones from the plaque of 12 patients carried bacterial sequences from 63 genera belonging to 11 phyla, including members of the uncultivable TM7, SR1, and Chloroflexi, and difficult-to-cultivate Synergistetes and Spirochaetes. Dialister invisus, Filifactor alocis, Fusobacterium nucleatum, Porphyromonas endodontalis, Prevotella denticola, Tannerella forsythia, and Treponema denticola, which have been associated with periodontal disease, were found in significantly greater abundance in pericoronal compared with incisor sites. Dialister invisus and F nucleatum were found in greater abundance in sites exhibiting clinical symptoms. The archaeal species, Methanobrevibacter oralis, which has been associated with severe periodontitis, was found in 3 symptomatic patients. These findings have provided new insights into the complex microbiota of pericoronitis. Several bacterial and archaeal species implicated in periodontal disease were recovered in greater incidence and abundance from the plaque of partially erupted third molars compared with incisors, supporting the hypothesis that the pericoronal region may provide a favored niche for periodontal pathogens in otherwise healthy mouths. Copyright © 2012 American Association of Oral and

  15. Sequence Length Polymorphisms within Cervid AmelogeninX and AmelogeninY Genes: Use in Sex Determination

    Directory of Open Access Journals (Sweden)

    Benson H. Morrill


    Full Text Available Sequence length polymorphisms between the amelogeninX and amelogeninY genes have been utilized for the design of rapid PCR sex determination assays for various mammalian species, including four cervid species (Capreolus capreolus, Cervus elaphus, Cervus nippon, and Dama dama. Several of these assays have been shown to be able to determine sex from non-invasively collected samples such as hair roots, feces, or skeletal remains. Such assays can be helpful for both zoo and wildlife biologists for sex determination, and could possibly be further developed to aid in species identification and SNP analyses if sequence information is also available. In order to further study the amelogeninX and amelogeninY gene polymorphisms that exist within cervids, partial sequences from these genes were generated and analyzed in multiple amelogeninX and amelogeninY samples of mule deer (Odocoileus hemionus and Rocky Mountain elk (Cervus elaphus nelsoni; and single amelogeninX and amelogeninY samples of sika deer (Cervus nippon, fallow deer (Dama dama and axis deer (Axis axis. Within these five species, three sequence length polymorphism regions (two or more nucleotides inserted or deleted between samples were found. The lengths of these polymorphisms were 45, 9, and 9 nucleotides. In addition, a SNP was found in the amelogeninY sequences of the mule deer samples that were analyzed. Lastly, the primers used to amplify the regions of the amelogeninX and amelogeninY sequences were able to determine sex in all five species that were studied.

  16. Genomic organization and sequences of immunoglobulin light chain genes in a primitive vertebrate suggest coevolution of immunoglobulin gene organization. (United States)

    Shamblott, M J; Litman, G W


    The genomic organization and sequence of immunoglobulin light chain genes in Heterodontus francisci (horned shark), a phylogenetically primitive vertebrate, have been characterized. Light chain variable (VL) and joining (JI) segments are separated by 380 nucleotides and together with the single constant region exon (CI), occupy less than 2.7 kb, the closest linkage described thus far for a rearranging gene system. The VL segment is flanked by a characteristic recombination signal sequence possessing a 12 nucleotide spacer; the recombination signal sequence flanking the JL segment is 23 nucleotides. The VL genes, unlike heavy chain genes, possess a typical upstream regulatory octamer as well as conserved enhancer core sequences in the intervening sequence separating JL and CL. Restriction mapping and genomic Southern blotting are consistent with the presence of multiple light chain gene clusters. There appear to be considerably fewer light than heavy chain genes. Heavy and light chain clusters show no evidence of genomic linkage using field inversion gel electrophoresis. The findings of major differences in the organization and functional rearrangement properties of immunoglobulin genes in species representing different levels of vertebrate evolution, but consistent similarity in the organization of heavy and light chain genes within a species, suggests that these systems may be coevolving. Images PMID:2511000

  17. Correlating Traits of Gene Retention, Sequence Divergence, Duplicability and Essentiality in Vertebrates, Arthropods, and Fungi (United States)

    Waterhouse, Robert M.; Zdobnov, Evgeny M.; Kriventseva, Evgenia V.


    Delineating ancestral gene relations among a large set of sequenced eukaryotic genomes allowed us to rigorously examine links between evolutionary and functional traits. We classified 86% of over 1.36 million protein-coding genes from 40 vertebrates, 23 arthropods, and 32 fungi into orthologous groups and linked over 90% of them to Gene Ontology or InterPro annotations. Quantifying properties of ortholog phyletic retention, copy-number variation, and sequence conservation, we examined correlations with gene essentiality and functional traits. More than half of vertebrate, arthropod, and fungal orthologs are universally present across each lineage. These universal orthologs are preferentially distributed in groups with almost all single-copy or all multicopy genes, and sequence evolution of the predominantly single-copy orthologous groups is markedly more constrained. Essential genes from representative model organisms, Mus musculus, Drosophila melanogaster, and Saccharomyces cerevisiae, are significantly enriched in universal orthologs within each lineage, and essential-gene-containing groups consistently exhibit greater sequence conservation than those without. This study of eukaryotic gene repertoire evolution identifies shared fundamental principles and highlights lineage-specific features, it also confirms that essential genes are highly retained and conclusively supports the “knockout-rate prediction” of stronger constraints on essential gene sequence evolution. However, the distinction between sequence conservation of single- versus multicopy orthologs is quantitatively more prominent than between orthologous groups with and without essential genes. The previously underappreciated difference in the tolerance of gene duplications and contrasting evolutionary modes of “single-copy control” versus “multicopy license” may reflect a major evolutionary mechanism that allows extended exploration of gene sequence space. PMID:21148284

  18. Thermodynamic properties of peptide solutions 19. Partial molar isothermal compressions at T = 298.15 K of some peptides of sequence gly-X-gly in aqueous solution

    Energy Technology Data Exchange (ETDEWEB)

    Hedwig, Gavin R., E-mail: [Institute of Fundamental Sciences - Chemistry, Massey University, Private Bag 11 222, Palmerston North (New Zealand)


    The partial molar isobaric expansions at infinite dilution and T = 298.15 K, E{sub 2}{sup 0}left braceE{sub 2}{sup 0}=(partial derivV{sub 2}{sup 0}/partial derivT){sub p}right brace were determined recently for some tripeptides of sequence glycyl-X-glycine, where X is one of the amino acids alanine, valine, leucine, isoleucine, phenylalanine, methionine, proline, serine, threonine, asparagine, glutamine, histidine, and tyrosine. These quantities, along with partial molar heat capacity data available from the literature, were used to convert the partial molar isentropic compressions at infinite dilution determined previously for these tripeptides into the partial molar isothermal compressions at infinite dilution and T = 298.15 K, K{sub T,2}{sup 0}left braceK{sub T,2}{sup 0}=-(partial derivV{sub 2}{sup 0}/partial derivp){sub T}right brace. These K{sub T,2}{sup 0} results, along with that for triglycine, were used to estimate the contributions of the amino acid side-chains to the partial molar isothermal compressions of peptides. The side-chain contributions have been rationalized in terms of the likely peptide-water interactions.

  19. Cloning and sequencing of the gene encoding thermophilic beta-amylase of Clostridium thermosulfurogenes.


    Kitamoto, N; Yamagata, H; Kato, T.; Tsukagoshi, N; Udaka, S


    A gene coding for thermophilic beta-amylase of Clostridium thermosulfurogenes was cloned into Bacillus subtilis, and its nucleotide sequence was determined. The nucleotide sequence suggested that the thermophilic beta-amylase is translated from monocistronic mRNA as a secretory precursor with a signal peptide of 32 amino acid residues. The deduced amino acid sequence of the mature beta-amylase contained 519 residues with a molecular weight of 57,167. The amino acid sequence of the C. thermosu...

  20. Sequencing of 15 622 gene-bearing BACs clarifies the gene-dense regions of the barley genome. (United States)

    Muñoz-Amatriaín, María; Lonardi, Stefano; Luo, MingCheng; Madishetty, Kavitha; Svensson, Jan T; Moscou, Matthew J; Wanamaker, Steve; Jiang, Tao; Kleinhofs, Andris; Muehlbauer, Gary J; Wise, Roger P; Stein, Nils; Ma, Yaqin; Rodriguez, Edmundo; Kudrna, Dave; Bhat, Prasanna R; Chao, Shiaoman; Condamine, Pascal; Heinen, Shane; Resnik, Josh; Wing, Rod; Witt, Heather N; Alpert, Matthew; Beccuti, Marco; Bozdag, Serdar; Cordero, Francesca; Mirebrahim, Hamid; Ounit, Rachid; Wu, Yonghui; You, Frank; Zheng, Jie; Simková, Hana; Dolezel, Jaroslav; Grimwood, Jane; Schmutz, Jeremy; Duma, Denisa; Altschmied, Lothar; Blake, Tom; Bregitzer, Phil; Cooper, Laurel; Dilbirligi, Muharrem; Falk, Anders; Feiz, Leila; Graner, Andreas; Gustafson, Perry; Hayes, Patrick M; Lemaux, Peggy; Mammadov, Jafar; Close, Timothy J


    Barley (Hordeum vulgare L.) possesses a large and highly repetitive genome of 5.1 Gb that has hindered the development of a complete sequence. In 2012, the International Barley Sequencing Consortium released a resource integrating whole-genome shotgun sequences with a physical and genetic framework. However, because only 6278 bacterial artificial chromosome (BACs) in the physical map were sequenced, fine structure was limited. To gain access to the gene-containing portion of the barley genome at high resolution, we identified and sequenced 15 622 BACs representing the minimal tiling path of 72 052 physical-mapped gene-bearing BACs. This generated ~1.7 Gb of genomic sequence containing an estimated 2/3 of all Morex barley genes. Exploration of these sequenced BACs revealed that although distal ends of chromosomes contain most of the gene-enriched BACs and are characterized by high recombination rates, there are also gene-dense regions with suppressed recombination. We made use of published map-anchored sequence data from Aegilops tauschii to develop a synteny viewer between barley and the ancestor of the wheat D-genome. Except for some notable inversions, there is a high level of collinearity between the two species. The software HarvEST:Barley provides facile access to BAC sequences and their annotations, along with the barley-Ae. tauschii synteny viewer. These BAC sequences constitute a resource to improve the efficiency of marker development, map-based cloning, and comparative genomics in barley and related crops. Additional knowledge about regions of the barley genome that are gene-dense but low recombination is particularly relevant.

  1. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    DEFF Research Database (Denmark)

    Kaas, Rolf Sommer; Rundsten, Carsten Friis; Ussery, David


    more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness...

  2. Sequencing analysis reveals a unique gene organization in the gyrB region of Mycoplasma hominis

    DEFF Research Database (Denmark)

    Ladefoged, Søren; Christiansen, Gunna


    of which showed similarity to that which encodes the LicA protein of Haemophilus influenzae. The organization of the genes in the region showed no resemblance to that in the corresponding regions of other bacteria sequenced so far. The gyrA gene was mapped 35 kb downstream from the gyrB gene....

  3. Cloning, sequence analysis, and characterization of the genes involved in isoprimeverose metabolism in Lactobacillus pentosus

    NARCIS (Netherlands)

    Chaillou, S.; Lokman, B.C.; Leer, R.J.; Posthuma, C.; Postma, P.W.; Pouwels, P.H.


    Two genes, xylP and xylQ, from the xylose regulon of Lactobacillus pentosus were cloned and sequenced. Together with the repressor gene of the regulon, xylR, the xylPQ genes form an operon which is inducible by xylose and which is transcribed from a promoter located 145 bp upstream of xylP. A putati

  4. Update of the Gene Discovery Program in Schistosoma mansoni with the Expressed Sequence Tag Approach

    Directory of Open Access Journals (Sweden)

    Élida ML Rabelo


    Full Text Available Continuing the Schistosoma mansoni Genome Project 363 new templates were sequenced generating 205 more ESTs corresponding to 91 genes. Seventy four of these genes (81% had not previously been described in S. mansoni. Among the newly discovered genes there are several of significant biological interest such as synaptophysin, NIFs-like and rho-GDP dissociation inhibitor

  5. Sequence Analysis of Toxin Gene-Bearing Corynebacterium diphtheriae Strains, Australia. (United States)

    Doyle, Christine J; Mazins, Adam; Graham, Rikki M A; Fang, Ning-Xia; Smith, Helen V; Jennison, Amy V


    By conducting a molecular characterization of Corynebacterium diphtheriae strains in Australia, we identified novel sequences, nonfunctional toxin genes, and 5 recent cases of toxigenic cutaneous diphtheria. These findings highlight the importance of extrapharyngeal infections for toxin gene-bearing (functional or not) and non-toxin gene-bearing C. diphtheriae strains. Continued surveillance is recommended.

  6. Characterization of promoter sequence of toll-like receptor genes in Vechur cattle

    Directory of Open Access Journals (Sweden)

    R. Lakshmi


    Full Text Available Aim: To analyze the promoter sequence of toll-like receptor (TLR genes in Vechur cattle, an indigenous breed of Kerala with the sequence of Bos taurus and access the differences that could be attributed to innate immune responses against bovine mastitis. Materials and Methods: Blood samples were collected from Jugular vein of Vechur cattle, maintained at Vechur cattle conservation center of Kerala Veterinary and Animal Sciences University, using an acid-citrate-dextrose anticoagulant. The genomic DNA was extracted, and polymerase chain reaction was carried out to amplify the promoter region of TLRs. The amplified product of TLR2, 4, and 9 promoter regions was sequenced by Sanger enzymatic DNA sequencing technique. Results: The sequence of promoter region of TLR2 of Vechur cattle with the B. taurus sequence present in GenBank showed 98% similarity and revealed variants for four sequence motifs. The sequence of the promoter region of TLR4 of Vechur cattle revealed 99% similarity with that of B. taurus sequence but not reveals significant variant in motifregions. However, two heterozygous loci were observed from the chromatogram. Promoter sequence of TLR9 gene also showed 99% similarity to B. taurus sequence and revealed variants for four sequence motifs. Conclusion: The results of this study indicate that significant variation in the promoter of TLR2 and 9 genes in Vechur cattle breed and may potentially link the influence the innate immunity response against mastitis diseases.

  7. Population genetic structure of the ark shell Scapharca broughtonii Schrenck from Korea, China, and Russia based on COI gene sequences. (United States)

    Cho, Eun-Seob; Jung, Choon-Goon; Sohn, Sang-Gyu; Kim, Chul-Won; Han, Seock-Jung


    Haplotype distribution, gene flow, and population genetic structure of the ark shell (Scapharca broughtonii) were studied using a partial sequence of a mitochondrial COI gene. The sequence analysis of 100 specimens obtained from a total of seven localities-five in Korea, one in China, and one in Russia- revealed 29 haplotypes, ranging in sequence divergence from 0.1% to 2.1%. Among these, the most frequent haplotype, SB16, was extensively distributed over study areas, especially in all Korean localities. This extensive distribution consequently resulted in the near absence of statistically significant genetic distance. Also, a high rate of gene flow was characteristic among localities in Korea. A test of genetic population structure showed that the ark shell in Korea formed a large genetic group. Moreover, an AMOVA test to determine the allocation of the genetic variance showed that most of the variance was distributed between localities, instead of within localities. However, a significant population differentiation was found between geographic populations [i.e., Jinhae (locality 6) in Korea and Sangdong (locality 5) in China and Vladivostok (locality 7) in Russia] based on geographic distance and population structure. These distinct groups may be associated with geographic characteristics and barriers. The results suggest that most of the ark shell populations in Korea caused considerable distribution to form a genetically homogeneous and intermixing structure, whereas some of the Korean and Chinese and Russian populations had a significantly different genetic structure.

  8. Identification of new genes in Sinorhizobium meliloti using the Genome Sequencer FLX system

    Directory of Open Access Journals (Sweden)

    Jensen Roderick V


    Full Text Available Abstract Background Sinorhizobium meliloti is an agriculturally important model symbiont. There is an ongoing need to update and improve its genome annotation. In this study, we used a high-throughput pyrosequencing approach to sequence the transcriptome of S. meliloti, and search for new bacterial genes missed in the previous genome annotation. This is the first report of sequencing a bacterial transcriptome using the pyrosequencing technology. Results Our pilot sequencing run generated 19,005 reads with an average length of 136 nucleotides per read. From these data, we identified 20 new genes. These new gene transcripts were confirmed by RT-PCR and their possible functions were analyzed. Conclusion Our results indicate that high-throughput sequence analysis of bacterial transcriptomes is feasible and next-generation sequencing technologies will greatly facilitate the discovery of new genes and improve genome annotation.

  9. Mouse mammary tumor virus-like gene sequences are present in lung patient specimens

    Directory of Open Access Journals (Sweden)

    Rodríguez-Padilla Cristina


    Full Text Available Abstract Background Previous studies have reported on the presence of Murine Mammary Tumor Virus (MMTV-like gene sequences in human cancer tissue specimens. Here, we search for MMTV-like gene sequences in lung diseases including carcinomas specimens from a Mexican population. This study was based on our previous study reporting that the INER51 lung cancer cell line, from a pleural effusion of a Mexican patient, contains MMTV-like env gene sequences. Results The MMTV-like env gene sequences have been detected in three out of 18 specimens studied, by PCR using a specific set of MMTV-like primers. The three identified MMTV-like gene sequences, which were assigned as INER6, HZ101, and HZ14, were 99%, 98%, and 97% homologous, respectively, as compared to GenBank sequence accession number AY161347. The INER6 and HZ-101 samples were isolated from lung cancer specimens, and the HZ-14 was isolated from an acute inflammatory lung infiltrate sample. Two of the env sequences exhibited disruption of the reading frame due to mutations. Conclusion In summary, we identified the presence of MMTV-like gene sequences in 2 out of 11 (18% of the lung carcinomas and 1 out of 7 (14% of acute inflamatory lung infiltrate specimens studied of a Mexican Population.

  10. Cloning and sequence analysis of hyaluronoglucosaminidase (nagH gene of Clostridium chauvoei

    Directory of Open Access Journals (Sweden)

    Saroj K. Dangi


    Full Text Available Aim: Blackleg disease is caused by Clostridium chauvoei in ruminants. Although virulence factors such as C. chauvoei toxin A, sialidase, and flagellin are well characterized, hyaluronidases of C. chauvoei are not characterized. The present study was aimed at cloning and sequence analysis of hyaluronoglucosaminidase (nagH gene of C. chauvoei. Materials and Methods: C. chauvoei strain ATCC 10092 was grown in ATCC 2107 media and confirmed by polymerase chain reaction (PCR using the primers specific for 16-23S rDNA spacer region. nagH gene of C. chauvoei was amplified and cloned into pRham-SUMO vector and transformed into Escherichia cloni 10G cells. The construct was then transformed into E. cloni cells. Colony PCR was carried out to screen the colonies followed by sequencing of nagH gene in the construct. Results: PCR amplification yielded nagH gene of 1143 bp product, which was cloned in prokaryotic expression system. Colony PCR, as well as sequencing of nagH gene, confirmed the presence of insert. Sequence was then subjected to BLAST analysis of NCBI, which confirmed that the sequence was indeed of nagH gene of C. chauvoei. Phylogenetic analysis of the sequence showed that it is closely related to Clostridium perfringens and Clostridium paraputrificum. Conclusion: The gene for virulence factor nagH was cloned into a prokaryotic expression vector and confirmed by sequencing.

  11. Identification of expressed genes in regenerating rat liver in 0-4-8-12 h short interval successive partial hepatectomy

    Institute of Scientific and Technical Information of China (English)

    Cun-Shuan Xu; Salman Rahman; Jing-Bo Zhang; Jin-Yun Yuan; Wen-Qiang Li; Hong-Peng Han; Ke-Jin Yang; Cui-Fang Chang; Li-Feng Zhao; Yu-Chang Li; Hui-Yong Zhang


    AIM: To identify the genes differentially expressed in the regenerating rat liver of 0-4-8-12 h short interval successive partial hepatectomy (SISPH) and to analyze their expression profiles.METHODS: Five hundred and fifty-one elements screened from subtractive cDNA libraries were made into a cDNA microarray (cDNA chip). Extensive gene expression analysis following 0-4-8-12 h SISPH was conducted by microarray.RESULTS: One hundred and eighty-three elements were selected, which were either up- or down-regulated more than 2-fold at one or more time points after SISPH. Cluster analysis and generalization analysis showed that there were five distinct temporal patterns of gene expression.Eighty-six genes were unreported, associated with liver regeneration (LR).CONCLUSION: Microarray analysis shows that the down regulated genes are much more than the up-regulated ones in SISPH; the numbers of genes expressed consistently are fewer than that expressed immediately; the genes expressed in high abundance are much fewer than that increased 2-5-fold. The comparison of SISPH with partial hepatectomy (PH) shows that the expression trends of most genes in SISPH and in PH are similar,but the expression of 43 genes is specifically altered in SISPH.

  12. Flow Cytometry-assisted Cloning of Specific Sequence Motifs fromComplex 16S ribosomal RNA Gene Libraries.

    Energy Technology Data Exchange (ETDEWEB)

    Nielsen, J.L.; Schramm, A.; Bernhard, A.E.; van den Engh, G.J.; Stahl, D.A.


    A flow cytometry method was developed for rapid screeningand recovery of cloned DNA containing common sequence motifs. Thisapproach, termed fluorescence-activated cell sorting-assisted cloning,was used to recover sequences affiliated with a unique lineage within theBacteroidetes not abundant in a clone library of environmental 16S rRNAgenes. Retrieval and sequence analysis of phylogenetically informativegenes has become a standard cultivation-independent technique toinvestigate microbial diversity in nature (7, 18). Genes encoding the 16SrRNA, because of the relative ease of their selective amplification, havebeen most frequently employed for general diversity surveys (16).Environmental studies have also focused on specific subpopulationsaffiliated with a phylogenetic group or identified by genes encodingspecific metabolic functions (e.g., ammonia oxidation, sulfaterespiration, and nitrate reduction) (8,15,20). However, specificpopulations may be of low abundance (1,23), or the genes encodingspecific metabolic functions may be insufficiently conserved to providepriming sites for general PCR amplification. Three general approacheshave been used to obtain 16S rRNA sequence information from low-abundancepopulations: screening hundreds to thousands of clones in a general 16SrRNA gene library (21), flow cytometric sorting of a subpopulation ofenvironmentally derived cells labeled by fluorescent in situhybridization (FISH) (27), or selective PCR amplification using primersspecific for the subpopulation (2,23). While the first approach is simplytime-consuming and tedious, the second has been restricted to fairlylarge and strongly fluorescent cells from aquatic samples (5, 27). Thethird approach often generates fragments of only a few hundred bases dueto the limited number of specific priming sites. Partial sequenceinformation often degrades analysis, obscuring or distorting thephylogenetic placement of the new sequences (11, 20). A more robustcharacterization of environ

  13. Identification of true EST alignments and exon regions of gene sequences

    Institute of Scientific and Technical Information of China (English)

    ZHOU Yanhong; JING Hui; LI Yanen; LIU Huailan


    Expressed sequence tags (ESTs), which have piled up considerably so far, provide a valuable resource for finding new genes, disease-relevant genes, and for recognizing alternative splicing variants, SNP sites, etc. The prerequisite for carrying out these researches is to correctly ascertain the gene-sequence-related ESTs. Based on analysis of the alignment results between some known gene sequences and ESTs in public database, several measures including Identity Check, Gap Check, Inclusion Check and Length Check have been introduced to judge whether an EST alignment is related to a gene sequence or not. A computational program EDSAc1.0 has been developed to identify true EST alignments and exon regions of query gene sequences. When tested with human gene sequences in the standard dataset HMR195 and evaluated with the standard measures of gene prediction performance, EDSAc1.0 can identify protein- coding regions with specificity of 0.997 and sensitivity of 0.88 at the nucleotide level, which outperform that of the counterpart TAP. A web server of EDSAc1.0 is available at

  14. Strong association between pseudogenization mechanisms and gene sequence length

    Directory of Open Access Journals (Sweden)

    Harrison Paul M


    Full Text Available Abstract Pseudogenes arise from the decay of gene copies following either RNA-mediated duplication (processed pseudogenes or DNA-mediated duplication (nonprocessed pseudogenes. Here, we show that long protein-coding genes tend to produce more nonprocessed pseudogenes than short genes, whereas the opposite is true for processed pseudogenes. Protein-coding genes longer than 3000 bp are 6 times more likely to produce nonprocessed pseudogenes than processed ones. Reviewers This article was reviewed by Dr. Dan Graur and Dr. Craig Nelson (nominated by Dr. J Peter Gogarten.

  15. Phylogenetic placement of the spider genus Nephila (Araneae: Araneoidea) inferred from rRNA and MaSp1 gene sequences. (United States)

    Pan, Hong-Chun; Zhou, Kai-Ya; Song, Da-Xiang; Qiu, Yang


    The family status of the genus Nephila, which belongs to Tetragnathidae currently but Araneidae formerly, was reexamined based on molecular phylogenetic analyses. In the present study, 12S and 18S rRNA gene fragments of eight species of spiders were amplified and sequenced. In addition, 3'-end partial cDNA of major ampullate spidroin-1 (MaSp1) gene of Argiope amoena was cloned and sequenced, and the 3'-end non-repetitive region's cDNA sequence of MaSp1 gene and the predicted amino acid sequence of C-terminal non-repetitive region of MaSp1 were aligned with some previously known sequences. The resulting phylogeny showed that Araneidae and Tetragnathidae are not a sister group in the superfamily Araneoidea, and the genus Nephila is closer to the genera of the family Araneidae rather than to those of Tetragnathidae. We suggest that the genus Nephila should be transferred back to Araneidae. Or the subfamily Nephilinae might be elevated to family level after it was redefined and redelimited. Furthermore, the study showed that 3'-end non-repetitive region's cDNA sequence of MaSp1 gene and C-terminal non-repetitive region's amino acid sequence of MaSp1 are useful molecular markers for phylogenetic analysis of spiders.

  16. Cloning and Sequence Analysis of Y-box Binding Protein Gene in Min Pig

    Institute of Scientific and Technical Information of China (English)

    Zhang Dong-jie; Liu Di; Wang Liang; He Xin-miao; Wang Wen-tao


    In order to study the gene sequence of Min pig Y-box binding protein (YB-1) gene, the complete coding sequence of Min pig YB-1 gene was cloned by RT-PCR, the sequence features were analyzed by some software and online website. The results showed that the complete CDS of Min pig Y-box was found to be 975 bp long, encoding 324 amino acids. It contained a conserved cold shock domain and several phosphorylation sites, but had no transmembrane domains, and was consistent with a protein found in the cytoplasm. Min pig YB-1 nucleotides shared high similarity (61.37%-97.66%) with other mammals.

  17. Cloning and sequencing of human lambda immunoglobulin genes by the polymerase chain reaction. (United States)

    Songsivilai, S; Bye, J M; Marks, J D; Hughes-Jones, N C


    Universal oligonucleotide primers, designed for amplifying and sequencing genes encoding the rearranged human lambda immunoglobulin variable region, were validated by amplification of the lambda light chain genes from four human heterohybridoma cell lines and in the generation of a cDNA library of human V lambda sequences from Epstein-Barr virus-transformed human peripheral blood lymphocytes. This technique allows rapid cloning and sequencing of human immunoglobulin genes, and has potential applications in the rescue of unstable human antibody-producing cell lines and in the production of human monoclonal antibodies.

  18. The wheat durable, multipathogen resistance gene Lr34 confers partial blast resistance in rice. (United States)

    Krattinger, Simon G; Sucher, Justine; Selter, Liselotte L; Chauhan, Harsh; Zhou, Bo; Tang, Mingzhi; Upadhyaya, Narayana M; Mieulet, Delphine; Guiderdoni, Emmanuel; Weidenbach, Denise; Schaffrath, Ulrich; Lagudah, Evans S; Keller, Beat


    The wheat gene Lr34 confers durable and partial field resistance against the obligate biotrophic, pathogenic rust fungi and powdery mildew in adult wheat plants. The resistant Lr34 allele evolved after wheat domestication through two gain-of-function mutations in an ATP-binding cassette transporter gene. An Lr34-like fungal disease resistance with a similar broad-spectrum specificity and durability has not been described in other cereals. Here, we transformed the resistant Lr34 allele into the japonica rice cultivar Nipponbare. Transgenic rice plants expressing Lr34 showed increased resistance against multiple isolates of the hemibiotrophic pathogen Magnaporthe oryzae, the causal agent of rice blast disease. Host cell invasion during the biotrophic growth phase of rice blast was delayed in Lr34-expressing rice plants, resulting in smaller necrotic lesions on leaves. Lines with Lr34 also developed a typical, senescence-based leaf tip necrosis (LTN) phenotype. Development of LTN during early seedling growth had a negative impact on formation of axillary shoots and spikelets in some transgenic lines. One transgenic line developed LTN only at adult plant stage which was correlated with lower Lr34 expression levels at seedling stage. This line showed normal tiller formation and more importantly, disease resistance in this particular line was not compromised. Interestingly, Lr34 in rice is effective against a hemibiotrophic pathogen with a lifestyle and infection strategy that is different from obligate biotrophic rusts and mildew fungi. Lr34 might therefore be used as a source in rice breeding to improve broad-spectrum disease resistance against the most devastating fungal disease of rice.

  19. A partial gene deletion of SLC45A2 causes oculocutaneous albinism in Doberman pinscher dogs.

    Directory of Open Access Journals (Sweden)

    Paige A Winkler

    Full Text Available The first white Doberman pinscher (WDP dog was registered by the American Kennel Club in 1976. The novelty of the white coat color resulted in extensive line breeding of this dog and her offspring. The WDP phenotype closely resembles human oculocutaneous albinism (OCA and clinicians noticed a seemingly high prevalence of pigmented masses on these dogs. This study had three specific aims: (1 produce a detailed description of the ocular phenotype of WDPs, (2 objectively determine if an increased prevalence of ocular and cutaneous melanocytic tumors was present in WDPs, and (3 determine if a genetic mutation in any of the genes known to cause human OCA is causal for the WDP phenotype. WDPs have a consistent ocular phenotype of photophobia, hypopigmented adnexal structures, blue irides with a tan periphery and hypopigmented retinal pigment epithelium and choroid. WDPs have a higher prevalence of cutaneous melanocytic neoplasms compared with control standard color Doberman pinschers (SDPs; cutaneous tumors were noted in 12/20 WDP (5 years of age: 8/8 and 1/20 SDPs (p<0.00001. Using exclusion analysis, four OCA causative genes were investigated for their association with WDP phenotype; TYR, OCA2, TYRP1 and SLC45A2. SLC45A2 was found to be linked to the phenotype and gene sequencing revealed a 4,081 base pair deletion resulting in loss of the terminus of exon seven of SLC45A2 (chr4∶77,062,968-77,067,051. This mutation is highly likely to be the cause of the WDP phenotype and is supported by a lack of detectable SLC45A2 transcript levels by reverse transcriptase PCR. The WDP provides a valuable model for studying OCA4 visual disturbances and melanocytic neoplasms in a large animal model.

  20. Cloning and Sequence Analysis of Glycoprotein D Gene of Bovine Herpesvirus-1 Strain Luojing

    Institute of Scientific and Technical Information of China (English)

    LI Ji-chang; TONG Guang-zhi; QIU Hua-ji; ZHOU Yan-jun; XUE Qiang


    By means of PCR,the gene encoding gD of bovine herpesvirus-1 (BHV-1) strain Luojing was amplified,cloned and sequenced.The nucleotide sequence of this gD gene was 1 251 bp,encoding 417 amino acids.Comparied with the published P8-2 strain,the homology of the necleotide sequence is 99.92%,and that of the deduced amino acid sequence is 100%.The results indicated that gD of BHV-1 was highly conservative.

  1. Sequencing of neuroblastoma identifies chromothripsis and defects in neuritogenesis genes

    NARCIS (Netherlands)

    J. Molenaar (Jan); J. Koster (Jan); D. Zwijnenburg (Danny); P. van Sluis (Peter); L.J. Valentijn (Linda); I. van der Ploeg (Ida); M. Hamdi (Mohamed); J. van Nes (Johan); B.A. Westerman (Bart); J. van Arkel (Jennemiek); M.E. Ebus; F. Haneveld (Franciska); A. Lakeman (Arjan); L. Schild (Linda); P. Molenaar (Piet); P. Stroeken (Peter); M.M. van Noesel (Max); I. Øra (Ingrid); J.P. di Santo (James); H.N. Caron (Huib); E.M. Westerhout (Ellen); R. Versteeg (Rogier)


    textabstractNeuroblastoma is a childhood tumour of the peripheral sympathetic nervous system. The pathogenesis has for a long time been quite enigmatic, as only very few gene defects were identified in this often lethal tumour. Frequently detected gene alterations are limited to MYCN amplification (

  2. Exome sequencing and arrayCGH detection of gene sequence and copy number variation between ILS and ISS mouse strains. (United States)

    Dumas, Laura; Dickens, C Michael; Anderson, Nathan; Davis, Jonathan; Bennett, Beth; Radcliffe, Richard A; Sikela, James M


    It has been well documented that genetic factors can influence predisposition to develop alcoholism. While the underlying genomic changes may be of several types, two of the most common and disease associated are copy number variations (CNVs) and sequence alterations of protein coding regions. The goal of this study was to identify CNVs and single-nucleotide polymorphisms that occur in gene coding regions that may play a role in influencing the risk of an individual developing alcoholism. Toward this end, two mouse strains were used that have been selectively bred based on their differential sensitivity to alcohol: the Inbred long sleep (ILS) and Inbred short sleep (ISS) mouse strains. Differences in initial response to alcohol have been linked to risk for alcoholism, and the ILS/ISS strains are used to investigate the genetics of initial sensitivity to alcohol. Array comparative genomic hybridization (arrayCGH) and exome sequencing were conducted to identify CNVs and gene coding sequence differences, respectively, between ILS and ISS mice. Mouse arrayCGH was performed using catalog Agilent 1 × 244 k mouse arrays. Subsequently, exome sequencing was carried out using an Illumina HiSeq 2000 instrument. ArrayCGH detected 74 CNVs that were strain-specific (38 ILS/36 ISS), including several ISS-specific deletions that contained genes implicated in brain function and neurotransmitter release. Among several interesting coding variations detected by exome sequencing was the gain of a premature stop codon in the alpha-amylase 2B (AMY2B) gene specifically in the ILS strain. In total, exome sequencing detected 2,597 and 1,768 strain-specific exonic gene variants in the ILS and ISS mice, respectively. This study represents the most comprehensive and detailed genomic comparison of ILS and ISS mouse strains to date. The two complementary genome-wide approaches identified strain-specific CNVs and gene coding sequence variations that should provide strong candidates to

  3. Description and interpretation of various SNPs identified by BRCA2 gene sequencing

    Directory of Open Access Journals (Sweden)

    Anca Negura


    Full Text Available Molecular diagnosis for hereditary breast and ovarian cancer (HBOC involves systematic DNA sequencing of predisposition genes like BRCA1 or BRCA2. Deleterious mutations within such genes are responsible for developing the disease, but other sequence variants can also be identified. Common Single Nucleotide Polymorphisms (SNPs are usually present in human genome, defining alleles whose frequencies widely vary in different populations. Either intragenic or intronic, silent or generating aminoacid substitutions, SNPs cannot be afforded themselves a predisposition status. However, prevalent SNPs can be used to define gene haplotypes, with also various frequencies. Since some mutation can easily be assigned to haplotypes (such is the case for BRCA1 gene, SNPs can therefore provide usual information in interpreting gene mutations effects on hereditary predisposition to cancer. Here we describe 10 BRCA2 SNPs identified by complete gene sequencing

  4. Genotyping of Chlamydophila psittaci using a new DNA microarray assay based on sequence analysis of ompA genes

    Directory of Open Access Journals (Sweden)

    Schubert Evelyn


    Full Text Available Abstract Background The currently used genotyping system for the avian zoonotic pathogen Chlamydophila (C. psittaci has evolved from serology and is based on ompA sequence variations. It includes seven avian and two non-avian genotypes. Restriction enzyme cleavage of the amplified ompA gene and, less frequently, ompA sequencing are being used for examination, but, beside methodological limitations, an increasing number of recently tested strains could not be assigned to any established genotype. Results Comprehensive analysis of all available ompA gene sequences has revealed a remarkable genetic diversity within the species C. psittaci, which is only partially covered by the present genotyping scheme. We suggest adjustments and extensions to the present scheme, which include the introduction of subgroups to the more heterogeneous genotypes A, E/B and D, as well as six provisional genotypes representing so far untypable strains. The findings of sequence analysis have been incorporated in the design of a new DNA microarray. The ArrayTube™ microarray-based ompA genotyping assay has been shown to discriminate among established genotypes and identify so far untyped strains. Its high specificity, which allows detection of single-nucleotide polymorphisms, is due to the parallel approach consisting in the use of 35 hybridization probes derived from variable domains 2 and 4 of the ompA gene. Conclusion The traditional genotyping system does not adequately reflect the extent of intra-species heterogeneity in ompA sequences of C. psittaci. The newly developed DNA microarray-based assay represents a promising diagnostic tool for tracing epidemiological chains, exploring the dissemination of genotypes and identifying non-typical representatives of C. psittaci.

  5. Genome-wide analysis reveals diverged patterns of codon bias, gene expression, and rates of sequence evolution in picea gene families. (United States)

    De La Torre, Amanda R; Lin, Yao-Cheng; Van de Peer, Yves; Ingvarsson, Pär K


    The recent sequencing of several gymnosperm genomes has greatly facilitated studying the evolution of their genes and gene families. In this study, we examine the evidence for expression-mediated selection in the first two fully sequenced representatives of the gymnosperm plant clade (Picea abies and Picea glauca). We use genome-wide estimates of gene expression (>50,000 expressed genes) to study the relationship between gene expression, codon bias, rates of sequence divergence, protein length, and gene duplication. We found that gene expression is correlated with rates of sequence divergence and codon bias, suggesting that natural selection is acting on Picea protein-coding genes for translational efficiency. Gene expression, rates of sequence divergence, and codon bias are correlated with the size of gene families, with large multicopy gene families having, on average, a lower expression level and breadth, lower codon bias, and higher rates of sequence divergence than single-copy gene families. Tissue-specific patterns of gene expression were more common in large gene families with large gene expression divergence than in single-copy families. Recent family expansions combined with large gene expression variation in paralogs and increased rates of sequence evolution suggest that some Picea gene families are rapidly evolving to cope with biotic and abiotic stress. Our study highlights the importance of gene expression and natural selection in shaping the evolution of protein-coding genes in Picea species, and sets the ground for further studies investigating the evolution of individual gene families in gymnosperms.

  6. Whole Blood Transcriptome Sequencing Reveals Gene Expression Differences between Dapulian and Landrace Piglets


    Hu, Jiaqing; Yang, Dandan; Chen, Wei; Li, Chuanhao; Wang, Yandong; Zeng, Yongqing; Wang, Hui


    There is little genomic information regarding gene expression differences at the whole blood transcriptome level of different pig breeds at the neonatal stage. To solve this, we characterized differentially expressed genes (DEGs) in the whole blood of Dapulian (DPL) and Landrace piglets using RNA-seq (RNA-sequencing) technology. In this study, 83 DEGs were identified between the two breeds. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses identified immun...

  7. Comparative genome sequencing of drosophila pseudoobscura: Chromosomal, gene and cis-element evolution

    Energy Technology Data Exchange (ETDEWEB)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Todd, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catherine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenee; Verduzco, Daniel; Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.


    The genome sequence of a second fruit fly, D. pseudoobscura, presents an opportunity for comparative analysis of a primary model organism D. melanogaster. The vast majority of Drosophila genes have remained on the same arm, but within each arm gene order has been extensively reshuffled leading to the identification of approximately 1300 syntenic blocks. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 35 My since divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome wide average consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than control sequences between the species but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a picture of repeat mediated chromosomal rearrangement, and high co-adaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.

  8. Phylogenetic relationships among the family Ommastrephidae (Mollusca: Cephalopoda) inferred from two mitochondrial DNA gene sequences. (United States)

    Wakabayashi, T; Suzuki, N; Sakai, M; Ichii, T; Chow, S


    Squids of the family Ommastrephidae are distributed worldwide, and the family includes many species of commercial importance. To investigate phylogenetic relationships among squid species of the family Ommastrephidae, partial nucleotide sequences of two mitochondrial gene loci (cytochrome c oxidase subunit I [1277bp] and 16S rRNA [443bp]) of 15 ommastrephid species and two outgroup species from the families Loliginidae and Enoploteuthidae were determined and used to construct parsimony and distance based phylogenetic trees. The molecular data provided several new phylogenetic inferences. The monophyletic status of three subfamilies (Illicinae, Todarodinae and Ommastrephinae) was well supported, although phylogenetic relationships between the subfamilies were not resolved. Inclusion of a problematic species, Ornithoteuthis volatilis, to Todarodinae was indicated. Within Todarodinae, the Japanese common squid Todarodes pacificus was observed to have much closer relationship to the species of the genus Nototodarus than to its congener (Todarodes filippovae). These results indicate that re-evaluation of several morphological key characters for ommastrephid taxonomy may be necessary.

  9. Evolution of the RH gene family in vertebrates revealed by brown hagfish (Eptatretus atami) genome sequences. (United States)

    Suzuki, Akinori; Komata, Hidero; Iwashita, Shogo; Seto, Shotaro; Ikeya, Hironobu; Tabata, Mitsutoshi; Kitano, Takashi


    In vertebrates, there are four major genes in the RH (Rhesus) gene family, RH, RHAG, RHBG, and RHCG. These genes are thought to have been formed by the two rounds of whole-genome duplication (2R-WGD) in the common ancestor of all vertebrates. In our previous work, where we analyzed details of the gene duplications process of this gene family, three nucleotide sequences belonging to this family were identified in Far Eastern brook lamprey (Lethenteron reissneri), and the phylogenetic positions of the genes were determined. Lampreys, along with hagfishes, are cyclostomata (jawless fishes), which is a sister group of gnathostomata (jawed vertebrates). Although those results suggested that one gene was orthologous to the gnathostome RHCG genes, we did not identify clear orthologues for other genes. In this study, therefore, we identified three novel cDNA sequences that belong to the RH gene family using de novo transcriptome analysis of another cyclostome: the brown hagfish (Eptatretus atami). We also determined the nucleotide sequences for the RHBG and RHCG genes in a red stingray (Dasyatis akajei), which belongs to the cartilaginous fishes. The phylogenetic tree showed that two brown hagfish genes, which were probably duplicated in the cyclostome lineage, formed a cluster with the gnathostome RHAG genes, whereas another brown hagfish gene formed a cluster with the gnathostome RHCG genes. We estimated that the RH genes had a higher evolutionary rate than the RHAG, RHBG, and RHCG genes. Interestingly, in the RHBG genes, only the bird lineage showed a higher rate of nonsynonymous substitutions. It is likely that this higher rate was caused by a state of relaxed functional constraints rather than positive selection nor by pseudogenization.

  10. Application of next generation sequencing to human gene fusion detection: computational tools, features and perspectives. (United States)

    Wang, Qingguo; Xia, Junfeng; Jia, Peilin; Pao, William; Zhao, Zhongming


    Gene fusions are important genomic events in human cancer because their fusion gene products can drive the development of cancer and thus are potential prognostic tools or therapeutic targets in anti-cancer treatment. Major advancements have been made in computational approaches for fusion gene discovery over the past 3 years due to improvements and widespread applications of high-throughput next generation sequencing (NGS) technologies. To identify fusions from NGS data, existing methods typically leverage the strengths of both sequencing technologies and computational strategies. In this article, we review the NGS and computational features of existing methods for fusion gene detection and suggest directions for future development.

  11. Cloning and nucleotide sequence of the Enterobacter aerogenes signal peptidase II (lsp) gene.


    Isaki, L; Kawakami, M; Beers, R; Hom, R; Wu, H.C.


    In Escherichia coli, prolipoprotein signal peptidase is encoded by the lsp gene, which is organized into an operon consisting of ileS, lsp, and three open reading frames, designated genes x, orf-149, and orf-316. The Enterobacter aerogenes lsp gene was cloned and expressed in E. coli. The nucleotide sequence of the Enterobacter aerogenes lsp gene and a part of its flanking sequences were determined. A high degree of homology was found between the E. coli ileS-lsp operon and the corresponding ...

  12. Gene conversion-like events in the diversification of human rearranged IGHV3-23*01 gene sequences

    Directory of Open Access Journals (Sweden)

    Bhargavi eDuvvuri


    Full Text Available Gene conversion (GCV as a mechanism of immunoglobulin diversification is well established in a few species. However, definitive evidence of GCV-like events in human immunoglobulin genes is scarce. GCV is mediated by activation-induced cytidine deaminase (AID. The lack of evidence of GCV in human rearranged immunoglobulin gene sequences is puzzling given the presence of highly similar germline donors and all the enzymatic machinery required for GCV. In this study, we undertook a computational analysis of rearranged IGHV3-23*01 gene sequences from common variable immunodeficiency (CVID patients and healthy individuals to survey ‘GCV-like’ activities. Our search identified strong evidence of GCV-like patterns. Germline VH sequences were identified as potential donors for clustered mutations in rearranged IGHV3-23*01 gene sequences. We identified minimum and maximum sequence identities between donor and recipient sequences that can serve as targets for GCV and our findings are consistent with those reported in literature. We observed that GCV-like tracts are flanked by activation-induced cytidine deaminase (AID hotspot motifs. Structural modeling of IGHV3-23*01 gene sequence revealed that hypermutable bases flanking GCV-like tracts, are in the single stranded DNA (ssDNA of stable stem-loop structures (SLSs. SsDNA is inherently fragile and also an optimal target for AID. We speculate that GCV could have been initiated by the targeting of hypermutable bases in ssDNA state in stable SLSs, plausibly by AID. We have observed that the frequency of GCV-like events is significantly higher in rearranged IGHV323-*01 sequences from healthy individuals compared to that of CVID patients. GCV, unlike SHM, can result in multiple base substitutions that can alter many amino acids. The extensive changes in antibody affinity by GCV-like events, as identified in this study would be instrumental in protecting humans against pathogens that diversify their genome by

  13. Outbreak tracking of Aleutian mink disease virus (AMDV) using partial NS1 gene sequencing

    DEFF Research Database (Denmark)

    Ryt-Hansen, Pia; Hjulsager, Charlotte Kristiane; Hagberg, E. E.


    described with varying virulence and the severity of infection also depends on the host's genotype and immune status. Clinical signs include respiratory distress in kits and unthriftiness and low quality of the pelts. The infection can also be subclinical. Systematic control of AMDV in Danish mink farms...

  14. Targeting of AID-mediated sequence diversification to immunoglobulin genes. (United States)

    Kothapalli, Naga Rama; Fugmann, Sebastian D


    Activation-induced cytidine deaminase (AID) is a key enzyme for antibody-mediated immune responses. Antibodies are encoded by the immunoglobulin genes and AID acts as a transcription-dependent DNA mutator on these genes to improve antibody affinity and effector functions. An emerging theme in field is that many transcribed genes are potential targets of AID, presenting an obvious danger to genomic integrity. Thus there are mechanisms in place to ensure that mutagenic outcomes of AID activity are specifically restricted to the immunoglobulin loci. Cis-regulatory targeting elements mediate this effect and their mode of action is probably a combination of immunoglobulin gene specific activation of AID and a perversion of faithful DNA repair towards error-prone outcomes.

  15. Sequence analysis of cereal sucrose synthase genes and isolation ...

    African Journals Online (AJOL)



    Oct 18, 2007 ... 1Department of Environmental Biotechnology, Bharathidasan University, ... script and UA cloning vector (QIAGEN PCR Cloning Kit) was used to clone ..... Expression of a Arabidopsis sucrose synthase gene indicates a role.

  16. Isolation and sequencing of the cryIC-like delta endotoxin gene from ...

    African Journals Online (AJOL)



    Feb 16, 2012 ... insecticidal toxin coding gene was cloned from an Egyptian isolate of B. thuringiensis. Sequence analyses of the cryIC ... insecticidal proteins into crop plants to protect them from ..... eggplant, cotton, cucumber and tomato, etc.

  17. Tsukamurella pulmonis Bloodstream Infection Identified by secA1 Gene Sequencing


    Pérez del Molino Bernal, IC; Cano García, María Eliecer; García de la Fuente, C; Martínez Martínez, Luis; López, Monica; Fernández Mazarrasa, Carlos; Agüero Balbín, Jesús


    Recurrent bloodstream infections caused by a Gram-positive bacterium affected an immunocompromised child. Tsukamurella pulmonis was the microorganism identified by secA1 gene sequencing. Antibiotic treatment in combination with removal of the subcutaneous port healed the patient.

  18. De novo transcriptome sequencing of axolotl blastema for identification of differentially expressed genes during limb regeneration (United States)


    Background Salamanders are unique among vertebrates in their ability to completely regenerate amputated limbs through the mediation of blastema cells located at the stump ends. This regeneration is nerve-dependent because blastema formation and regeneration does not occur after limb denervation. To obtain the genomic information of blastema tissues, de novo transcriptomes from both blastema tissues and denervated stump ends of Ambystoma mexicanum (axolotls) 14 days post-amputation were sequenced and compared using Solexa DNA sequencing. Results The sequencing done for this study produced 40,688,892 reads that were assembled into 307,345 transcribed sequences. The N50 of transcribed sequence length was 562 bases. A similarity search with known proteins identified 39,200 different genes to be expressed during limb regeneration with a cut-off E-value exceeding 10-5. We annotated assembled sequences by using gene descriptions, gene ontology, and clusters of orthologous group terms. Targeted searches using these annotations showed that the majority of the genes were in the categories of essential metabolic pathways, transcription factors and conserved signaling pathways, and novel candidate genes for regenerative processes. We discovered and confirmed numerous sequences of the candidate genes by using quantitative polymerase chain reaction and in situ hybridization. Conclusion The results of this study demonstrate that de novo transcriptome sequencing allows gene expression analysis in a species lacking genome information and provides the most comprehensive mRNA sequence resources for axolotls. The characterization of the axolotl transcriptome can help elucidate the molecular mechanisms underlying blastema formation during limb regeneration. PMID:23815514

  19. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing. (United States)

    Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R


    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles

  20. Cloning and analysizing the up-regulated expression of transthyretin-related gene(LR1) in rat liver regeneration following short interval successive partial hepatectomy

    Institute of Scientific and Technical Information of China (English)

    Cun-Shuan Xu; Yu-Chang Li; Jun-Tang Lin; Hui-Yong Zhang; Yun-Han Zhang


    AIM: Cloning and analysizing the up-regulated expressionof transthyretin-related gene following short intervalsuccessive partial hepatectomy (SISPH) to elucidate themechanism of differentiation, division, dedifferentiation andredifferentiation in rat liver regeneration (LR).METHODS: Lobus external sinister and lobus centralissinister, lobus centralis, lobus dexter, lobus candatus wereremoved one by one from rat liver at four different time points4, 36, 36 and 36 hr (total time: 4 hr, 40 hr, 76 hr, 112 hr)respectively. Suppression subtractive hybridization (SSH) wascarried out by using normal rat liver tissue as driver and thetissue following short interval successive partial hepatectomy(SISPH) as tester to construct a highly efficient forward-subtractive cDNA library. After screening, an interested ESTfragment was selected by SSH and primers were designedaccording to the sequence of the EST to clone the full-lengthcDNA fragment using RACE (rapid amplification of cDNAend). Homologous detection was performed between thefull-lenth cDNA and Genbank.RESULTS: Forward suppression subtractive hybridization(FSSH) library between 0 h and 112 h following SISPH wasconstructed and an up-regulated full-length cDNA (namedLR1), which was related with the transthyretin gene, wascloned by rapid amplification of cDNA end. It was suggestedthat the gene is involved in the cellular dedifferentiation inLR following SISPH.CONCLUSION: Some genes were up-regulated in 112 hfollowing SISPH in rat. LR1 is one of these up-regulatedexpression genes which may play an important role in rat LR.

  1. How the sequence of a gene can tune its translation

    DEFF Research Database (Denmark)

    Fredrick, Kurt; Ibba, Michael


    Sixty-one codons specify 20 amino acids, offering cells many options for encoding a polypeptide sequence. Two new studies (Cannarrozzi et al., 2010; Tuller et al., 2010) now foster the idea that patterns of codon usage can control ribosome speed, fine-tuning translation to increase the efficiency...

  2. Catostylus tagi: partial rDNA sequencing and characterisation of nematocyte structures using two improvements in jellyfish sample preparation. (United States)

    Parracho, Tiago; Morais, Zilda


    More than 200 Scyphozoa species have been described, but few have been properly studied regarding their chemical and genetic characteristics. Catostylus tagi, an edible Scyphozoa and the sole European Catostylidae, occurs in summer at Tagus and Sado estuaries. Neither a systematic comparison between the two Catostylus communities nor a chemical approach on their nematocytes had been carried out yet. In order to achieve these purposes, optimisation of DNA extraction and of histochemical staining procedures were developed. Catostylus specimens from Tagus and Sado estuaries were compared by ribosomal 18S, 28S, and ITS1 partial sequencing. The morphochemistry of nematocytes was studied by optical and electronic microscopy. Macroscopic and molecular results indicated that both communities belong to the same species, C. tagi. The hematoxylin and eosin staining allowed the visualisation of nematocyst genesis and indicated a basic character for the macromolecules on the shaft of euryteles and on the tubule of isorhizae and birhopaloids. By Masson's trichrome procedure, the basic properties of the tubules were confirmed and a collagenous profile for the toxins was suggested. Results of the alcian blue staining showed that the outer membrane of nematocyte may consist of macromolecules with acidic polysaccharides, consistent with NOWA and nematogalectin glycoproteins detected in Hydra, but also with poly-gamma-glutamate complex, chitin-like polysaccharides and hyaluronic acids. Through the von Kossa assays, calcium was detected; its position suggested interactions with polysaccharides of the membrane, with proteins of the contractile system or with both. The optimisation of sample preparation for DNA extraction may facilitate further studies on little known jellyfish species. The improvement of the smear procedure simplified the use of stained reactions in zooplankton. Moreover, it was shown that good slide images might be acquired manually. The development of specific

  3. The mitochondrial genome of the stingless bee Melipona bicolor (Hymenoptera, Apidae, Meliponini: sequence, gene organization and a unique tRNA translocation event conserved across the tribe Meliponini

    Directory of Open Access Journals (Sweden)

    Daniela Silvestre


    Full Text Available At present a complete mtDNA sequence has been reported for only two hymenopterans, the Old World honey bee, Apis mellifera and the sawfly Perga condei. Among the bee group, the tribe Meliponini (stingless bees has some distinction due to its Pantropical distribution, great number of species and large importance as main pollinators in several ecosystems, including the Brazilian rain forest. However few molecular studies have been conducted on this group of bees and few sequence data from mitochondrial genomes have been described. In this project, we PCR amplified and sequenced 78% of the mitochondrial genome of the stingless bee Melipona bicolor (Apidae, Meliponini. The sequenced region contains all of the 13 mitochondrial protein-coding genes, 18 of 22 tRNA genes, and both rRNA genes (one of them was partially sequenced. We also report the genome organization (gene content and order, gene translation, genetic code, and other molecular features, such as base frequencies, codon usage, gene initiation and termination. We compare these characteristics of M. bicolor to those of the mitochondrial genome of A. mellifera and other insects. A highly biased A+T content is a typical characteristic of the A. mellifera mitochondrial genome and it was even more extreme in that of M. bicolor. Length and compositional differences between M. bicolor and A. mellifera genes were detected and the gene order was compared. Eleven tRNA gene translocations were observed between these two species. This latter finding was surprising, considering the taxonomic proximity of these two bee tribes. The tRNA Lys gene translocation was investigated within Meliponini and showed high conservation across the Pantropical range of the tribe.

  4. Trichinella pseudospiralis vs. T. spiralis thymidylate synthase gene structure and T. pseudospiralis thymidylate synthase retrogene sequence. (United States)

    Jagielska, Elżbieta; Płucienniczak, Andrzej; Dąbrowska, Magdalena; Dowierciał, Anna; Rode, Wojciech


    Thymidylate synthase is a housekeeping gene, designated ancient due to its role in DNA synthesis and ubiquitous phyletic distribution. The genomic sequences were characterized coding for thymidylate synthase in two species of the genus Trichinella, an encapsulating T. spiralis and a non-encapsulating T. pseudospiralis. Based on the sequence of parasitic nematode Trichinella spiralis thymidylate synthase cDNA, PCR techniques were employed. Each of the respective gene structures encompassed 6 exons and 5 introns located in conserved sites. Comparison with the corresponding gene structures of other eukaryotic species revealed lack of common introns that would be shared among selected fungi, nematodes, mammals and plants. The two deduced amino acid sequences were 96% identical. In addition to the thymidylate synthase gene, the intron-less retrocopy, i.e. a processed pseudogene, with sequence identical to the T. spiralis gene coding region, was found to be present within the T. pseudospiralis genome. This pseudogene, instead of the gene, was confirmed by RT-PCR to be expressed in the parasite muscle larvae. Intron load, as well as distribution of exon and intron phases in thymidylate synthase genes from various sources, point against the theory of gene assembly by the primordial exon shuffling and support the theory of evolutionary late intron insertion into spliceosomal genes. Thymidylate synthase pseudogene expressed in T. pseudospiralis muscle larvae is designated a retrogene.

  5. Phylogeny and identification of Enterococci by atpA gene sequence analysis. (United States)

    Naser, S; Thompson, F L; Hoste, B; Gevers, D; Vandemeulebroecke, K; Cleenwerck, I; Thompson, C C; Vancanneyt, M; Swings, J


    The relatedness among 91 Enterococcus strains representing all validly described species was investigated by comparing a 1,102-bp fragment of atpA, the gene encoding the alpha subunit of ATP synthase. The relationships observed were in agreement with the phylogeny inferred from 16S rRNA gene sequence analysis. However, atpA gene sequences were much more discriminatory than 16S rRNA for species differentiation. All species were differentiated on the basis of atpA sequences with, at a maximum, 92% similarity. Six members of the Enterococcus faecium species group (E. faecium, E. hirae, E. durans, E. villorum, E. mundtii, and E. ratti) showed > 99% 16S rRNA gene sequence similarity, but the highest value of atpA gene sequence similarity was only 89.9%. The intraspecies atpA sequence similarities for all species except E. faecium strains varied from 98.6 to 100%; the E. faecium strains had a lower atpA sequence similarity of 96.3%. Our data clearly show that atpA provides an alternative tool for the phylogenetic study and identification of enterococci.


    Directory of Open Access Journals (Sweden)

    Sanjoy Datta


    Full Text Available Leptin is a 16 kD protein, synthesized by adipose tissue and is involved in regulation of feed intake, energy balance, fertility and immune functions. Present study was undertaken with the objectives of sequence characterization and studying the nucleotide variation in leptin gene in Murrah buffalo. The leptin gene consists of three exons and two introns which spans about 18.9kb, of which the first exon is not transcribed into protein. In buffaloes, the leptin gene is located on chromosome eight and maps to BBU 8q32. The leptin gene was amplified by PCR using oligonucleotide primers to obtain 289 bp fragment comprising of exon 2 and 405 bp fragment containing exon 3 of leptin gene. The amplicons were sequenced to identify variation at nucleotide level. Sequence comparison of buffalo with cattle reveals variation at five nucleotide sequences at positions 983, 1083, 1147, 1152, 1221 and all the SNPs are synonymous resulting no in change in amino acids. Three of these eight nucleotide variations have been reported for the first time in buffalo. The results indicate conservation of DNA sequence between cattle and buffalo. Nucleotide sequence variations observed at leptin gene between Bubalus bubalis and Bos taurus species revealed 97% nucleotide identity.

  7. Sequence and structure of the mouse gene coding for the largest neurofilament subunit.

    NARCIS (Netherlands)

    J-P. Julien (Jean-Pierre); F. Cote; L. Beaudet (Lucille); M. Sidky (Malak); D. Flavell (David); F.G. Grosveld (Frank); W. Mushynski (Walter)


    textabstractWe have determined the complete nucleotide sequence of the mouse gene encoding the neurofilament NF-H protein. The C-terminal domain of NF-H is very rich in charged amino acids (aa) and contains a 3-aa sequence, Lys-Ser-Pro, that is repeated 51 times within a stretch of 368 aa. The

  8. Therapeutic modulation of endogenous gene function by agents with designed DNA-sequence specificities

    NARCIS (Netherlands)

    Uil, T.G.; Haisma, H.J.; Rots, Marianne


    Designer molecules that can specifically target pre-determined DNA sequences provide a means to modulate endogenous gene function. Different classes of sequence-specific DNA-binding agents have been developed, including triplex-forming molecules, synthetic polyamides and designer zinc finger protein

  9. Gene tree discordance of wild and cultivated Asian rice deciphered by genome-wide sequence comparison. (United States)

    Yang, Ching-chia; Sakai, Hiroaki; Numa, Hisataka; Itoh, Takeshi


    Although a large number of genes are expected to correctly solve a phylogenetic relationship, inconsistent gene tree topologies have been observed. This conflicting evidence in gene tree topologies, known as gene tree discordance, becomes increasingly important as advanced sequencing technologies produce an enormous amount of sequence information for phylogenomic studies among closely related species. Here, we aim to characterize the gene tree discordance of the Asian cultivated rice Oryza sativa and its progenitor, O. rufipogon, which will be an ideal case study of gene tree discordance. Using genome and cDNA sequences of O. sativa and O. rufipogon, we have conducted the first in-depth analyses of gene tree discordance in Asian rice. Our comparison of full-length cDNA sequences of O. rufipogon with the genome sequences of the japonica and indica cultivars of O. sativa revealed that 60% of the gene trees showed a topology consistent with the expected one, whereas the remaining genes supported significantly different topologies. Moreover, the proportions of the topologies deviated significantly from expectation, suggesting at least one hybridization event between the two subgroups of O. sativa, japonica and indica. In fact, a genome-wide alignment between japonica and indica indicated that significant portions of the indica genome are derived from japonica. In addition, literature concerning the pedigree of the indica cultivar strongly supported the hybridization hypothesis. Our molecular evolutionary analyses deciphered complicated evolutionary processes in closely related species. They also demonstrated the importance of gene tree discordance in the era of high-speed DNA sequencing.

  10. Cloning and Sequence Analysis of Light Variable Region Gene of Anti-human Retinoblastoma Monoclonal Antibody

    Institute of Scientific and Technical Information of China (English)

    Xiufeng Zhong; Yongping Li; Shuqi Huang; Bo Ning; Chunyan Zhang; Jianliang Zheng; Guanguang Feng


    Purpose: To clone the variable region gene of light chain of monoclonal antibody against human retinoblastoma and to analyze the characterization of its nucleotide sequence as well as amino acid sequence.Methods: Total RNA was extracted from 3C6 hybridoma cells secreting specific monoclonal antibody(McAb)against human retinoblastoma(RB), then transcripted reversely into cDNA with olig-dT primers.The variable region of the light chain (VL) gene fragments was amplified using polymeerase chain reaction(PCR) and further cloned into pGEM(R) -T Easy vector. Then, 3C6 VL cDNA was sequenced by Sanger's method.Homologous analysis was done by NCBI BLAST.Results: The complete nucleotide sequence of 3C6 VL cDNA consisted of 321 bp encoding 107 amino acid residues, containing four workframe regions(FRs)and three complementarity-determining regions (CDRs) as well as the typical structure of two cys residues. The sequence is most homological to a member of the Vk9 gene family, and its chain utilizes the Jkl gene segment.Conclusion: The light chain variable region gene of the McAb against human RB was amplified successfully , which belongs to the Vk9 gene family and utilizes Vk-Jk1 gene rearrangement. This study lays a good basis for constructing a recombinant antibody and for making a new targeted therapeutic agents against retinoblastoma.

  11. Identification of genes in anonymous DNA sequences. Annual performance report, February 1, 1991--January 31, 1992

    Energy Technology Data Exchange (ETDEWEB)

    Fields, C.A.


    The objective of this project is the development of practical software to automate the identification of genes in anonymous DNA sequences from the human, and other higher eukaryotic genomes. A software system for automated sequence analysis, gm (gene modeler) has been designed, implemented, tested, and distributed to several dozen laboratories worldwide. A significantly faster, more robust, and more flexible version of this software, gm 2.0 has now been completed, and is being tested by operational use to analyze human cosmid sequence data. A range of efforts to further understand the features of eukaryoyic gene sequences are also underway. This progress report also contains papers coming out of the project including the following: gm: a Tool for Exploratory Analysis of DNA Sequence Data; The Human THE-LTR(O) and MstII Interspersed Repeats are subfamilies of a single widely distruted highly variable repeat family; Information contents and dinucleotide compostions of plant intron sequences vary with evolutionary origin; Splicing signals in Drosophila: intron size, information content, and consensus sequences; Integration of automated sequence analysis into mapping and sequencing projects; Software for the C. elegans genome project.

  12. The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity.

    Directory of Open Access Journals (Sweden)

    Slavé Petrovski


    Full Text Available Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene's proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS, termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene's regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1 genes that are known to cause disease through haploinsufficiency, 2 genes curated as dosage sensitive in ClinGen's Genome Dosage Map, 3 genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4 genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding

  13. The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity. (United States)

    Petrovski, Slavé; Gussow, Ayal B; Wang, Quanli; Halvorsen, Matt; Han, Yujun; Weir, William H; Allen, Andrew S; Goldstein, David B


    Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene's proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS), termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene's regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1) genes that are known to cause disease through haploinsufficiency, 2) genes curated as dosage sensitive in ClinGen's Genome Dosage Map, 3) genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4) genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding importance, nc

  14. DG-CST (Disease Gene Conserved Sequence Tags), a database of human–mouse conserved elements associated to disease genes (United States)

    Boccia, Angelo; Petrillo, Mauro; di Bernardo, Diego; Guffanti, Alessandro; Mignone, Flavio; Confalonieri, Stefano; Luzi, Lucilla; Pesole, Graziano; Paolella, Giovanni; Ballabio, Andrea; Banfi, Sandro


    The identification and study of evolutionarily conserved genomic sequences that surround disease-related genes is a valuable tool to gain insight into the functional role of these genes and to better elucidate the pathogenetic mechanisms of disease. We created the DG-CST (Disease Gene Conserved Sequence Tags) database for the identification and detailed annotation of human–mouse conserved genomic sequences that are localized within or in the vicinity of human disease-related genes. CSTs are defined as sequences that show at least 70% identity between human and mouse over a length of at least 100 bp. The database contains CST data relative to over 1088 genes responsible for monogenetic human genetic diseases or involved in the susceptibility to multifactorial/polygenic diseases. DG-CST is accessible via the internet at and may be searched using both simple and complex queries. A graphic browser allows direct visualization of the CSTs and related annotations within the context of the relative gene and its transcripts. PMID:15608249

  15. PCR-Internal Transcribed Spacer (ITS) genes sequencing and ...

    African Journals Online (AJOL)

    2. Department of Pure and Applied Zoology, Federal University of Agriculture, Abeokuta, Nigeria. 3. ... Keywords: Internal transcribed spacer genes, phylogenetic, genetic ... ization of fungi by polymerase chain reaction (PCR) am- .... Basic Local Alignment search Tool (BLAST) to establish ..... Population structure and.

  16. Biased distribution of DNA uptake sequences towards genome maintenance genes

    DEFF Research Database (Denmark)

    Davidsen, T.; Rodland, E.A.; Lagesen, K.


    in these organisms. Pasteurella multocida also displayed high frequencies of a putative DUS identical to that previously identified in H. influenzae and with a skewed distribution towards genome maintenance genes, indicating that this bacterium might be transformation competent under certain conditions....

  17. Sequencing and mapping hemoglobin gene clusters in the australian model dasyurid marsupial sminthopsis macroura

    Energy Technology Data Exchange (ETDEWEB)

    De Leo, A.A.; Wheeler, D.; Lefevre, C.; Cheng, Jan-Fang; Hope, R.; Kuliwaba, J.; Nicholas, K.R.; Westermanc, M.; Graves, J.A.M.


    Comparing globin genes and their flanking sequences across many species has allowed globin gene evolution to be reconstructed in great detail. Marsupial globin sequences have proved to be of exceptional significance. A previous finding of a beta-like omega gene in the alpha cluster in the tammar wallaby suggested that the alpha and beta cluster evolved via genome duplication and loss rather than tandem duplication. To confirm and extend this important finding we isolated and sequenced BACs containing the alpha and beta loci from the distantly related Australian marsupial Sminthopsis macroura. We report that the alpha gene lies in the same BAC as the beta-like omega gene, implying that the alpha-omega juxtaposition is likely to be conserved in all marsupials. The LUC7L gene was found 3' of the S. macroura alpha locus, a gene order shared with humans but not mouse, chicken or fugu. Sequencing a BAC contig that contained the S. macroura beta globin and epsilon globin loci showed that the globin cluster is flanked by olfactory genes, demonstrating a gene arrangement conserved for over 180 MY. Analysis of the region 5' to the S. macroura epsilon globin gene revealed a region similar to the eutherian LCR, containing sequences and potential transcription factor binding sites with homology to eutherian hypersensitive sites 1 to 5. FISH mapping of BACs containing S. macroura alpha and beta globin genes located the beta globin cluster on chromosome 3q and the alpha locus close to the centromere on 1q, resolving contradictory map locations obtained by previous radioactive in situ hybridization.

  18. Complete mitochondrial DNA sequence of the endangered frog Odorrana ishikawae (family Ranidae) and unexpected diversity of mt gene arrangements in ranids. (United States)

    Kurabayashi, Atsushi; Yoshikawa, Natsuhiko; Sato, Naoki; Hayashi, Yoko; Oumi, Shohei; Fujii, Tamotsu; Sumida, Masayuki


    We determined the complete nucleotide sequence of the mitochondrial (mt) genome of an endangered Japanese frog, Odorrana ishikawae (family Ranidae). We also sequenced partial mt genomes of three other Odorrana and six ranid species to survey the diversity of genomic organizations and elucidate the phylogenetic problems remaining in this frog family. The O. ishikawae mt genome contained the 37 mt genes and single control region (CR) typically found in vertebrate mtDNAs, but the region of Light-strand replication origin (OL) was triplicated in this species. Four protein-encoding genes (atp6, nd2, nd3, and nd5) were found to have high sequence divergence and to be usable for population genetics studies on this endangered species. Among the surveyed ranids, only two species (Rana and Lithobates) manifested the typical neobatrachian-type mt gene arrangement. In contrast, relatively large gene rearrangements were found in Amolops, Babina, and Staurois species; and translocations of single tRNA genes (trns) were observed in Glandirana and Odorrana species. Though the inter-generic and interspecific relationships of ranid taxa remain to be elucidated based on 12S and 16S rrn sequence data, some of the derived mt gene orders were found to have synapomorphic features useful for solving problematic ranid phylogenies. The tandem duplication and random loss (TDRL) model, the traditional model for mt gene rearrangement, failed to easily explain several of the mt gene rearrangements observed here. Indeed, the recent recombination-based gene rearrangement models seemed to be more suitable for this purpose. The high frequency of gene translocations involving a specific trn block (trnH-trnS1) and several single tRNA genes suggest that there may be a retrotranslocation in ranid mt genomes.

  19. Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus (Digenea): Species differentiation based on mtDNA (Barcode) and partial LSU-rDNA sequences (United States)

    Bergmame, Laura; Huffman, Jane; Cole, Rebecca; Dayanandan, Selvadurai; Tkach, Vasyl; McLaughlin, J. Daniel


    Flukes belonging to Sphaeridiotrema are important parasites of waterfowl, and 2 morphologically similar species Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus, have been implicated in waterfowl mortality in North America. Cytochrome oxidase I (barcode region) and partial LSU-rDNA sequences from specimens of S. globulus and S. pseudoglobulus, obtained from naturally and experimentally infected hosts from New Jersey and Quebec, respectively, confirmed that these species were distinct. Barcode sequences of the 2 species differed at 92 of 590 nucleotide positions (15.6%) and the translated sequences differed by 13 amino acid residues. Partial LSU-rDNA sequences differed at 29 of 1,208 nucleotide positions (2.4%). Additional barcode sequences from specimens collected from waterfowl in Wisconsin and Minnesota and morphometric data obtained from specimens acquired along the north shore of Lake Superior revealed the presence of S. pseudoglobulus in these areas. Although morphometric data suggested the presence of S. globulus in the Lake Superior sample, it was not found among the specimens sequenced from Wisconsin or Minnesota.

  20. Growth Hormone Gene Polymorphism and Its Association with Partial Cumulative Milk Yields of Holstein Friesian Dairy Cattle

    Directory of Open Access Journals (Sweden)

    R. Misrianti


    Full Text Available Growth hormone gene (GH gene plays an important role in regulating body growth and in developing mammary gland, similar with its interaction to specific receptors. The GH gene has been considered as one of candidate gene associated with selection on lactation trait and milk production. This study was aimed to determine genetic polymorphism of the GH-AluI gene and to associate its genotype variants on various 15-d partial cumulative milk yields in Holstein Friesian (HF dairy cows. A number of 370 blood samples were collected from six HF populations, respectively from small dairy farmer under the supervision of the North Bandung Milk Cooperation (NBMC in Cilumber (98 and Pasir Kemis village (96, Dairy Cattle Breeding and Improvement Station (Cikole DCBIS Cikole (88, Lembang Artificial Insemination Center (Lembang AIC (17, Singosari Artificial Insemination Center (Singosari AIC (32, and Cipelang Livestock Embryo Center (Cipelang LEC (40. A polymerase chain reaction - restriction fragment length polymorphism (PCR-RFLP method was used to identify variant genotypes of the GH gene using AluI restriction enzyme. Genotyping results produced only two genotypes, i.e. LL and LV genotypes, without VV genotype. Frequency of the former was dominant, whilst that was low for the latter (89% vs. 11%; leading to the frequency of L allele was very high (94% compared to that of V allele (6%. No significant association between variant genotypes (LL and LV and various 15-d partial cumulative milk yields.

  1. IS21-558 insertion sequences are involved in the mobility of the multiresistance gene cfr

    DEFF Research Database (Denmark)

    Kehrenberg, Corinna; Aarestrup, Frank Møller; Schwarz, Stefan


    exporter gene lsa(B) and the gene cfr for combined resistance to phenicols, lincosamides, oxazolidinones, pleuromutilins, and streptogramin A antibiotics bracketed by IS21-558 insertion sequences orientated in the same direction. A 6-bp target site duplication was detected at the integration site within...

  2. Molecular cloning of the interleukin-1 gene cluster: Construction of an integrated YAC/PAC contig and a partial transcriptional map in the region of chromosome 2q13

    Energy Technology Data Exchange (ETDEWEB)

    Nothwang, H.G.; Strahm, B.; Denich, D. [Freiburg Univ. (Germany)] [and others


    Genes of the interleukin-1 (IL-1) gene cluster localized on chromosome 2q13 are implicated in many physiological and pathophysiological processes. We present here a high-resolution physical map of this region between markers D2S2008 and D2S43/PAX8. An integrated YAC/PAC contig and a partial transcriptional map were constructed by STS-content mapping using the CEPH YAC library and three PAC libraries. A total of 3 YACs, 34 PACs, and 56 STSs were integrated: 33 newly generated probes to PAC end sequences, 9 polymorphic and 4 nonpolymorphic markers, 5 known genes, 4 expressed sequence tags, and 1 pseudogene. Within the map, a complete PAC contig of > 1 Mb encompasses the IL-1 gene cluster and PAX8, a paired-box-containing gene. This allowed us to define the transcriptional orientation of GLVR1, IL1B, and IL1RN and to show that PAX8 is localized outside the IL-1 gene cluster. FISH analysis localized PAC clones containing the IL-1 gene cluster to 2q12-q13. The data provide the basis for further characterization of the IL-1 gene cluster and for the construction of a sequence-ready PAC contig of this region. 45 refs., 2 figs., 2 tabs.

  3. Codon usage in mammalian genes is biased by sequence slippage mechanisms. (United States)

    Bains, W


    The codons for some conserved amino acids are found to be the same between homologous genes from different species when the statistics of codon usage would suggest that they should be different. I examine whether this 'coincidence' of codon usage could be due to genetic mechanisms homogenising the DNA around specific sites. This paper describes the further analysis of the coincident codons in 19 genes (a total of 96 homologues) for slippage. Coincident codons arise in contexts of increased sequence simplicity, and have a high chance of occurring within sequences similar to the recombination-prone minisatellite 'core' sequence. This suggests a role of genetic homogenisation in their generation.

  4. Detecting Sequence Homology at the Gene Cluster Level with MultiGeneBlast

    NARCIS (Netherlands)

    Medema, Marnix H.; Takano, Eriko; Breitling, Rainer; Nowick, Katja


    The genes encoding many biomolecular systems and pathways are genomically organized in operons or gene clusters. With MultiGeneBlast, we provide a user-friendly and effective tool to perform homology searches with operons or gene clusters as basic units, instead of single genes. The contextualizatio

  5. Detecting Sequence Homology at the Gene Cluster Level with MultiGeneBlast

    NARCIS (Netherlands)

    Medema, Marnix H.; Takano, Eriko; Breitling, Rainer; Nowick, Katja

    The genes encoding many biomolecular systems and pathways are genomically organized in operons or gene clusters. With MultiGeneBlast, we provide a user-friendly and effective tool to perform homology searches with operons or gene clusters as basic units, instead of single genes. The

  6. [Characterization of Black and Dichothrix Cyanobacteria Based on the 16S Ribosomal RNA Gene Sequence (United States)

    Ortega, Maya


    My project focuses on characterizing different cyanobacteria in thrombolitic mats found on the island of Highborn Cay, Bahamas. Thrombolites are interesting ecosystems because of the ability of bacteria in these mats to remove carbon dioxide from the atmosphere and mineralize it as calcium carbonate. In the future they may be used as models to develop carbon sequestration technologies, which could be used as part of regenerative life systems in space. These thrombolitic communities are also significant because of their similarities to early communities of life on Earth. I targeted two cyanobacteria in my research, Dichothrix spp. and whatever black is, since they are believed to be important to carbon sequestration in these thrombolitic mats. The goal of my summer research project was to molecularly identify these two cyanobacteria. DNA was isolated from each organism through mat dissections and DNA extractions. I ran Polymerase Chain Reactions (PCR) to amplify the 16S ribosomal RNA (rRNA) gene in each cyanobacteria. This specific gene is found in almost all bacteria and is highly conserved, meaning any changes in the sequence are most likely due to evolution. As a result, the 16S rRNA gene can be used for bacterial identification of different species based on the sequence of their 16S rRNA gene. Since the exact sequence of the Dichothrix gene was unknown, I designed different primers that flanked the gene based on the known sequences from other taxonomically similar cyanobacteria. Once the 16S rRNA gene was amplified, I cloned the gene into specialized Escherichia coli cells and sent the gene products for sequencing. Once the sequence is obtained, it will be added to a genetic database for future reference to and classification of other Dichothrix sp.

  7. Disparate sequence characteristics of the Erysiphe graminis f.sp. hordei glyceraldehyde-3-phosphate dehydrogenase gene

    DEFF Research Database (Denmark)

    Christiansen, S.K.; Justesen, A.F.; Giese, H.


    to be similar for all four genes. The results of the codon-usage analysis suggest that Egh is more flexible than other fungi in the choice of nucleotides at the wobble position. Codon-usage preferences in Egh and barley genes indicate a level of difference which may be exploited to discriminate between fungal...... and plant genes in sequence mixtures. The Egh gpd promoter appears to be superior to that of the Egh beta-tubulin gene (tub2) for driving the E. coli beta-glucuronidase (GUS) gene in transformation experiments....

  8. Chitin synthase 1 (Chs1) gene sequences of Microsporum equinum and Trichophyton equinum. (United States)

    Kano, R; Aihara, S; Nakamura, Y; Watanabe, S; Hasegawa, A


    Chitin synthase 1 (Chs1) genes from Microsporum equinum and Trichophyton equinum were compared with those of the other dermatophytes. The Chs1 nucleotide sequences of these dermatophytes from horses showed more than 80% similarity to those of Arthroderma benhamiae, A. fulvum, A. grubyi, A. gypseum, A. incruvatum, A. otae, A. simii, A. vanbreuseghemii, Epidermophyton floccosum, T. mentagrophytes var. interdigitale (T. interdigitale), T. rubrum and T. violaceum. Especially high degree of nucleotide sequence similarity of more than 99% was noted between the Chs1 gene fragments of M. equinum and A. otae, and those of T. equinum, T. interdigitale and A. vanbreuseghemii, respectively. The phylogenetic analysis of their sequences revealed that M. equinum was genetically very close to A. otae and T. equinum to A. vanbreuseghemii. A molecular analysis of Chs1 genes will provide useful information for the genetic relatedness of M. equinum and T. equinum and confirm the value of DNA sequencing in identification of these two dermatophytes.

  9. A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules. (United States)

    Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried


    A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis.

  10. A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules.

    Directory of Open Access Journals (Sweden)

    Blagovesta Popova

    Full Text Available A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis.

  11. Gene discovery and transcript analyses in the corn smut pathogen Ustilago maydis: expressed sequence tag and genome sequence comparison

    Directory of Open Access Journals (Sweden)

    Saville Barry J


    Full Text Available Abstract Background Ustilago maydis is the basidiomycete fungus responsible for common smut of corn and is a model organism for the study of fungal phytopathogenesis. To aid in the annotation of the genome sequence of this organism, several expressed sequence tag (EST libraries were generated from a variety of U. maydis cell types. In addition to utility in the context of gene identification and structure annotation, the ESTs were analyzed to identify differentially abundant transcripts and to detect evidence of alternative splicing and anti-sense transcription. Results Four cDNA libraries were constructed using RNA isolated from U. maydis diploid teliospores (U. maydis strains 518 × 521 and haploid cells of strain 521 grown under nutrient rich, carbon starved, and nitrogen starved conditions. Using the genome sequence as a scaffold, the 15,901 ESTs were assembled into 6,101 contiguous expressed sequences (contigs; among these, 5,482 corresponded to predicted genes in the MUMDB (MIPS Ustilago maydis database, while 619 aligned to regions of the genome not yet designated as genes in MUMDB. A comparison of EST abundance identified numerous genes that may be regulated in a cell type or starvation-specific manner. The transcriptional response to nitrogen starvation was assessed using RT-qPCR. The results of this suggest that there may be cross-talk between the nitrogen and carbon signalling pathways in U. maydis. Bioinformatic analysis identified numerous examples of alternative splicing and anti-sense transcription. While intron retention was the predominant form of alternative splicing in U. maydis, other varieties were also evident (e.g. exon skipping. Selected instances of both alternative splicing and anti-sense transcription were independently confirmed using RT-PCR. Conclusion Through this work: 1 substantial sequence information has been provided for U. maydis genome annotation; 2 new genes were identified through the discovery of 619

  12. Influences on gene expression in vivo by a Shine-Dalgarno sequence

    DEFF Research Database (Denmark)

    Jin, Haining; Zhao, Qing; Gonzalez de Valdivia, Ernesto I;


    The Shine-Dalgarno (SD+: 5'-AAGGAGG-3') sequence anchors the mRNA by base pairing to the 16S rRNA in the small ribosomal subunit during translation initiation. We have here compared how an SD+ sequence influences gene expression, if located upstream or downstream of an initiation codon....... The positive effect of an upstream SD+ is confirmed. A downstream SD+ gives decreased gene expression. This effect is also valid for appropriately modified natural Escherichia coli genes. If an SD+ is placed between two potential initiation codons, initiation takes place predominantly at the second start site...

  13. Analysis of RAPD and mitochondrial 16S rRNA gene sequences from Trichiurus lepturus and Eupleurogrammus muticus in the Yellow Sea

    Institute of Scientific and Technical Information of China (English)

    MENG Zining; ZHUANG Zhimeng; JIN Xianshi; TANG Qisheng; SU Yongquan


    Random amplified polymorphic DNA (RAPD) technique is applied to 12 individuals from each species of the hairtail fishes Trichiurus lepturus and Eupleurogrammus muticus in the Yellow Sea. The percentage of polymorphic sites, degree of genetic polymorphism and genetic distance are compared and the phylogenetic tree is constructed by Neighbor-joining method. The partial mitochondrial 16S rRNA gene is amplified by polymerase chain reaction (PCR) and the PCR products are directly sequenced after being purified. These sequences, together with the homologous sequences of another Trichiuridae species Lepidopus caudatus obtained from GenBank, are used to analyze nucleotide difference and to construct a UPGMA phylogenetic tree by means of biological informatics. Analysis shows: (1) the RAPD technique is a highly sensitive method for investigating genetic diversity in T. lepturus, and E. muticus. T. lepturus exhibits a lower polymorphism and genetic diversity than E. muticus; (2) according to the analysis of the partial mitochondrial 16S rRNA gene sequences, a very low intraspecific variation and considerably high divergence among species were found, which reveals a dual nature of conservatism and variability in mitochondrial 16S rRNA gene; (3) five primers generate the species-specific RAPD sites and these sites can be served as the molecular markers for species identification and (4) it can be proved at DNA variation level that T. lepturus and E. muticus are of two species respectively pertaining to different genera, which supports the Nelson taxonomic conclusion.

  14. A novel method to discover fluoroquinolone antibiotic resistance (qnr genes in fragmented nucleotide sequences

    Directory of Open Access Journals (Sweden)

    Boulund Fredrik


    Full Text Available Abstract Background Broad-spectrum fluoroquinolone antibiotics are central in modern health care and are used to treat and prevent a wide range of bacterial infections. The recently discovered qnr genes provide a mechanism of resistance with the potential to rapidly spread between bacteria using horizontal gene transfer. As for many antibiotic resistance genes present in pathogens today, qnr genes are hypothesized to originate from environmental bacteria. The vast amount of data generated by shotgun metagenomics can therefore be used to explore the diversity of qnr genes in more detail. Results In this paper we describe a new method to identify qnr genes in nucleotide sequence data. We show, using cross-validation, that the method has a high statistical power of correctly classifying sequences from novel classes of qnr genes, even for fragments as short as 100 nucleotides. Based on sequences from public repositories, the method was able to identify all previously reported plasmid-mediated qnr genes. In addition, several fragments from novel putative qnr genes were identified in metagenomes. The method was also able to annotate 39 chromosomal variants of which 11 have previously not been reported in literature. Conclusions The method described in this paper significantly improves the sensitivity and specificity of identification and annotation of qnr genes in nucleotide sequence data. The predicted novel putative qnr genes in the metagenomic data support the hypothesis of a large and uncharacterized diversity within this family of resistance genes in environmental bacterial communities. An implementation of the method is freely available at

  15. Cloning Sequencing and Structural Manipulation of the Enterotoxin D and E Genes from Staphylococcus aureus (United States)


    time. Further characterization of the plasmid was carried out by restriction mapping of pIB485 was performed. pIB485 DNA was digested with EcoRI...the interruption of the gene by insertion of the phage DNA. To characterize this unique regulation of gene expression, we sequenced the lipase a solution of 0.1% carboxymethylcellulose (added to stabilize the emulsion) by sonication for 7 - 10 minutes at 50w. This suspension was used to

  16. Reanalysis of RNA-sequencing data reveals several additional fusion genes with multiple isoforms.

    Directory of Open Access Journals (Sweden)

    Sara Kangaspeska

    Full Text Available RNA-sequencing and tailored bioinformatic methodologies have paved the way for identification of expressed fusion genes from the chaotic genomes of solid tumors. We have recently successfully exploited RNA-sequencing for the discovery of 24 novel fusion genes in breast cancer. Here, we demonstrate the importance of continuous optimization of the bioinformatic methodology for this purpose, and report the discovery and experimental validation of 13 additional fusion genes from the same samples. Integration of copy number profiling with the RNA-sequencing results revealed that the majority of the gene fusions were promoter-donating events that occurred at copy number transition points or involved high-level DNA-amplifications. Sequencing of genomic fusion break points confirmed that DNA-level rearrangements underlie selected fusion transcripts. Furthermore, a significant portion (>60% of the fusion genes were alternatively spliced. This illustrates the importance of reanalyzing sequencing data as gene definitions change and bioinformatic methods improve, and highlights the previously unforeseen isoform diversity among fusion transcripts.

  17. cDNA cloning and sequence analysis of NIb gene of soybean mosaic virus

    Institute of Scientific and Technical Information of China (English)

    刘俊君; 彭学贤; 莽克强


    cDNA of soybean mosaic virus (Beijing isolate, SMV-BJ) has been synthesized, using viralgenomic RNA as template and random hexanucleotides as primers. Based on the sequences of SMV-BJ coat protein (CP) gene as well as SMV- and WMV-II-related regions, oligonucleotides were made as primers for polymerase chain reaction (PCR). NIb gene of SMV-BJ was amplified by PCR, and cloned into pBluescript SK. The complete sequence was determined. The comparison of NIb genes between SMV-BJ and WMV-II . (USA) shows that similarities for nucleotide sequence reach 80.3%, and the deduced amino acid sequence. 91 3%. In consideration of the high identities in between the CP gene and the 3’-non-coding region between them, WMV-II might be considered as a watermelon strain of SMV Besides, some unexpected sequences were found in the 3’-region of 2 NIb gene clones. Following modification and splicing, a binary vector of NIb gene has been constructed for its expression in higher plant for the purpose of studying the possible repl

  18. RNA-Seq analysis and gene discovery of Andrias davidianus using Illumina short read sequencing.

    Directory of Open Access Journals (Sweden)

    Fenggang Li

    Full Text Available The Chinese giant salamander, Andrias davidianus, is an important species in the course of evolution; however, there is insufficient genomic data in public databases for understanding its immunologic mechanisms. High-throughput transcriptome sequencing is necessary to generate an enormous number of transcript sequences from A. davidianus for gene discovery. In this study, we generated more than 40 million reads from samples of spleen and skin tissue using the Illumina paired-end sequencing technology. De novo assembly yielded 87,297 transcripts with a mean length of 734 base pairs (bp. Based on the sequence similarities, searching with known proteins, 38,916 genes were identified. Gene enrichment analysis determined that 981 transcripts were assigned to the immune system. Tissue-specific expression analysis indicated that 443 of transcripts were specifically expressed in the spleen and skin. Among these transcripts, 147 transcripts were found to be involved in immune responses and inflammatory reactions, such as fucolectin, β-defensins and lymphotoxin beta. Eight tissue-specific genes were selected for validation using real time reverse transcription quantitative PCR (qRT-PCR. The results showed that these genes were significantly more expressed in spleen and skin than in other tissues, suggesting that these genes have vital roles in the immune response. This work provides a comprehensive genomic sequence resource for A. davidianus and lays the foundation for future research on the immunologic and disease resistance mechanisms of A. davidianus and other amphibians.

  19. Complexity of rice Hsp100 gene family: lessons from rice genome sequence data

    Indian Academy of Sciences (India)

    Gaurav Batra; Vineeta Singh Chauhan; Amanjot Singh; Neelam K Sarkar; Anil Grover


    Elucidation of genome sequence provides an excellent platform to understand detailed complexity of the various gene families. Hsp100 is an important family of chaperones in diverse living systems. There are eight putative gene loci encoding for Hsp100 proteins in Arabidopsis genome. In rice, two full-length Hsp100 cDNAs have been isolated and sequenced so far. Analysis of rice genomic sequence by in silico approach showed that two isolated rice Hsp100 cDNAs correspond to Os05g44340 and Os02g32520 genes in the rice genome database. There appears to be three additional proteins (encoded by Os03g31300, Os04g32560 and Os04g33210 gene loci) that are variably homologous to Os05g44340 and Os02g32520 throughout the entire amino acid sequence. The above five rice Hsp100 genes show significant similarities in the signature sequences known to be conserved among Hsp100 proteins. While Os05g44340 encodes cytoplasmic Hsp100 protein, those encoded by the other four genes are predicted to have chloroplast transit peptides.

  20. Haplotypes and Sequence Variation in the Ovine Adiponectin Gene (ADIPOQ

    Directory of Open Access Journals (Sweden)

    Qing-Ming An


    Full Text Available The adiponectin gene (ADIPOQ plays an important role in energy homeostasis. In this study five separate regions (regions 1 to 5 of ovine ADIPOQ were analysed using PCR-SSCP. Four different PCR-SSCP patterns (A1-D1, A2-D2 were detected in region-1 and region-2, respectively, with seven and six SNPs being revealed. In region-3, three different patterns (A3-C3 and three SNPs were observed. Two patterns (A4-B4, A5-B5 and two and one SNPs were observed in region-4 and region-5, respectively. In total, nineteen SNPs were detected, with five of them in the coding region and two (c.46T/C and c.515G/A putatively resulting in amino acid changes (p.Tyr16His and p.Lys172Arg. In region-1, -2 and -3 of 316 sheep from eight New Zealand breeds, variants A1, A2 and A3 were the most common, although variant frequencies differed in the eight breeds. Across region-1 and region-3, nine haplotypes were identified and haplotypes A1-A3, A1-C3, B1-A3 and B1-C3 were most common. These results indicate that the ADIPOQ gene is polymorphic and suggest that further analysis is required to see if the variation in the gene is associated with animal production traits.

  1. Genomic sequence around butterfly wing development genes: annotation and comparative analysis.

    Directory of Open Access Journals (Sweden)

    Inês C Conceição

    Full Text Available BACKGROUND: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes. CONCLUSIONS: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1 the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2 the high

  2. Structural analysis of DNA sequence: evidence for lateral gene transfer in Thermotoga maritima

    DEFF Research Database (Denmark)

    Worning, Peder; Jensen, Lars Juhl; Nelson, K. E.


    The recently published complete DNA sequence of the bacterium Thermotoga maritima provides evidence, based on protein sequence conservation, for lateral gene transfer between Archaea and Bacteria. We introduce a new method of periodicity analysis of DNA sequences, based on structural parameters......, which brings independent evidence for the lateral gene transfer in the genome of T.maritima, The structural analysis relates the Archaea-like DNA sequences to the genome of Pyrococcus horikoshii. Analysis of 24 complete genomic DNA sequences shows different periodicity patterns for organisms...... of different origin, The typical genomic periodicity for Bacteria is 11 bp whilst it is 10 bp for Archaea, Eukaryotes have more complex spectra but the dominant period in the yeast Saccharomyces cerevisiae is 10.2 bp. These periodicities are most likely reflective of differences in chromatin structure....

  3. Genomic organization and sequence analysis of the vomeronasal receptor V2R genes in mouse genome

    Institute of Scientific and Technical Information of China (English)

    YANG Hui; Zhang YaPing


    Two multigene superfamilies, named V1R and V2R, encoding seven-transmembrane-domain G-protein coupled receptors (GPCRs) have been identified as pheromone receptors in mammals. Three V2R gene families have been described in mouse and rat. Here we screened the updated mouse genome sequence database and finally retrieved 63 putative functional V2R genes including three newly identified genes which formed a new additional family. We described the genomic organization of these genes and also characterized the conservation of mouse V2R protein sequences. These genomic and sequence information we described are useful as part of the evidence to speculate the functional domain of V2Rs and should give aid to the functionality study in the future.

  4. Annotation and Re-Sequencing of Genes from De Novo Transcriptome Assembly of Abies alba (Pinaceae

    Directory of Open Access Journals (Sweden)

    Anna M. Roschanski


    Full Text Available Premise of the study: We present a protocol for the annotation of transcriptome sequence data and the identification of candidate genes therein using the example of the nonmodel conifer Abies alba. Methods and Results: A normalized cDNA library was built from an A. alba seedling. The sequencing on a 454 platform yielded more than 1.5 million reads that were de novo assembled into 25 149 contigs. Two complementary approaches were applied to annotate gene fragments that code for (1 well-known proteins and (2 proteins that are potentially adaptively relevant. Primer development and testing yielded 88 amplicons that could successfully be resequenced from genomic DNA. Conclusions: The annotation workflow offers an efficient way to identify potential adaptively relevant genes from the large quantity of transcriptome sequence data. The primer set presented should be prioritized for single-nucleotide polymorphism detection in adaptively relevant genes in A. alba.

  5. Sequence and secondary structure of the mitochondrial 16S ribosomal RNA gene of Ixodes scapularis. (United States)

    Krakowetz, Chantel N; Chilton, Neil B


    The complete DNA sequences and secondary structure of the mitochondrial (mt) 16S ribosomal (r) RNA gene were determined for six Ixodes scapularis adults. There were 44 variable nucleotide positions in the 1252 bp sequence alignment. Most (95%) nucleotide alterations did not affect the integrity of the secondary structure of the gene because they either occurred at unpaired positions or represented compensatory changes that maintained the base pairing in helices. A large proportion (75%) of the intraspecific variation in DNA sequence occurred within Domains I, II and VI of the 16S gene. Therefore, several regions within this gene may be highly informative for studies of the population genetics and phylogeography of I. scapularis, a major vector of pathogens of humans and domestic animals in North America.

  6. Phylogenetic analysis of freshwater mussel corbicula regularis by 18s rRNA gene sequencing

    Directory of Open Access Journals (Sweden)

    Magare V N


    Full Text Available Corbicula regularis is a freshwater mussel found in the Indian sub-continent. In the present study, phylogenetic characterization of this important bivalve was attempted using 18S ribosomal RNA gene markers. Genomic DNA was extracted and 18S rRNA gene was amplified by universal primers. The amplification product was sequenced and compared with the nucleotide databases available online to evaluate phylogenetic relationship of the animal under study. Results indicated that 18S rRNA gene sequences of C. regularis showed high degree of similarity to another freshwater mussel, C. fluminea. This work constitutes the first ever sequence deposition of the C. regularis in the nucleotide databases highlighting the usefulness of 18S ribosomal gene markers for phylogenetic analysis.

  7. Strategy for microbiome analysis using 16S rRNA gene sequence analysis on the Illumina sequencing platform. (United States)

    Ram, Jeffrey L; Karim, Aos S; Sendler, Edward D; Kato, Ikuko


    Understanding the identity and changes of organisms in the urogenital and other microbiomes of the human body may be key to discovering causes and new treatments of many ailments, such as vaginosis. High-throughput sequencing technologies have recently enabled discovery of the great diversity of the human microbiome. The cost per base of many of these sequencing platforms remains high (thousands of dollars per sample); however, the Illumina Genome Analyzer (IGA) is estimated to have a cost per base less than one-fifth of its nearest competitor. The main disadvantage of the IGA for sequencing PCR-amplified 16S rRNA genes is that the maximum read-length of the IGA is only 100 bases; whereas, at least 300 bases are needed to obtain phylogenetically informative data down to the genus and species level. In this paper we describe and conduct a pilot test of a multiplex sequencing strategy suitable for achieving total reads of > 300 bases per extracted DNA molecule on the IGA. Results show that all proposed primers produce products of the expected size and that correct sequences can be obtained, with all proposed forward primers. Various bioinformatic optimization of the Illumina Bustard analysis pipeline proved necessary to extract the correct sequence from IGA image data, and these modifications of the data files indicate that further optimization of the analysis pipeline may improve the quality rankings of the data and enable more sequence to be correctly analyzed. The successful application of this method could result in an unprecedentedly deep description (800,000 taxonomic identifications per sample) of the urogenital and other microbiomes in a large number of samples at a reasonable cost per sample.

  8. Analysis of hepatitis B virus genotyping and drug resistance gene mutations based on massively parallel sequencing. (United States)

    Han, Yingxin; Zhang, Yinxin; Mei, Yanhua; Wang, Yuqi; Liu, Tao; Guan, Yanfang; Tan, Deming; Liang, Yu; Yang, Ling; Yi, Xin


    Drug resistance to nucleoside analogs is a serious problem worldwide. Both drug resistance gene mutation detection and HBV genotyping are helpful for guiding clinical treatment. Total HBV DNA from 395 patients who were treated with single or multiple drugs including Lamivudine, Adefovir, Entecavir, Telbivudine, Tenofovir and Emtricitabine were sequenced using the HiSeq 2000 sequencing system and validated using the 3730 sequencing system. In addition, a mixed sample of HBV plasmid DNA was used to determine the cutoff value for HiSeq-sequencing, and 52 of the 395 samples were sequenced three times to evaluate the repeatability and stability of this technology. Of the 395 samples sequenced using both HiSeq and 3730 sequencing, the results from 346 were consistent, and the results from 49 were inconsistent. Among the 49 inconsistent results, 13 samples were detected as drug-resistance-positive using HiSeq but negative using 3730, and the other 36 samples showed a higher number of drug-resistance-positive gene mutations using HiSeq 2000 than using 3730. Gene mutations had an apparent frequency of 1% as assessed by the plasmid testing. Therefore, a 1% cutoff value was adopted. Furthermore, the experiment was repeated three times, and the same results were obtained in 49/52 samples using the HiSeq sequencing system. HiSeq sequencing can be used to analyze HBV gene mutations with high sensitivity, high fidelity, high throughput and automation and is a potential method for hepatitis B virus gene mutation detection and genotyping.

  9. Expression of the human glucokinase gene: important roles of the 5' flanking and intron 1 sequences.

    Directory of Open Access Journals (Sweden)

    Yi Wang

    Full Text Available BACKGROUND: Glucokinase plays important tissue-specific roles in human physiology, where it acts as a sensor of blood glucose levels in the pancreas, and a few other cells of the gut and brain, and as the rate-limiting step in glucose metabolism in the liver. Liver-specific expression is driven by one of the two tissue-specific promoters, and has an absolute requirement for insulin. The sequences that mediate regulation by insulin are incompletely understood. METHODOLOGY/PRINCIPAL FINDINGS: To better understand the liver-specific expression of the human glucokinase gene we compared the structures of this gene from diverse mammals. Much of the sequence located between the 5' pancreatic beta-cell-specific and downstream liver-specific promoters of the glucokinase genes is composed of repetitive DNA elements that were inserted in parallel on different mammalian lineages. The transcriptional activity of the liver-specific promoter 5' flanking sequences were tested with and without downstream intronic sequences in two human liver cells lines, HepG2 and L-02. While glucokinase liver-specific 5' flanking sequences support expression in liver cell lines, a sequence located about 2000 bases 3' to the liver-specific mRNA start site represses gene expression. Enhanced reporter gene expression was observed in both cell lines when cells were treated with fetal calf serum, but only in the L-02 cells was expression enhanced by insulin. CONCLUSIONS/SIGNIFICANCE: Our results suggest that the normal liver L-02 cell line may be a better model to understand the regulation of the liver-specific expression of the human glucokinase gene. Our results also suggest that sequences downstream of the liver-specific mRNA start site have important roles in the regulation of liver-specific glucokinase gene expression.

  10. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study. (United States)

    Dalquen, Daniel A; Altenhoff, Adrian M; Gonnet, Gaston H; Dessimoz, Christophe


    The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP) as well as two generic approaches (bidirectional best hit and reciprocal smallest distance). We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer) and technological artefacts (ambiguous sequences) on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall), lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.

  11. The impact of gene duplication, insertion, deletion, lateral gene transfer and sequencing error on orthology inference: a simulation study.

    Directory of Open Access Journals (Sweden)

    Daniel A Dalquen

    Full Text Available The identification of orthologous genes, a prerequisite for numerous analyses in comparative and functional genomics, is commonly performed computationally from protein sequences. Several previous studies have compared the accuracy of orthology inference methods, but simulated data has not typically been considered in cross-method assessment studies. Yet, while dependent on model assumptions, simulation-based benchmarking offers unique advantages: contrary to empirical data, all aspects of simulated data are known with certainty. Furthermore, the flexibility of simulation makes it possible to investigate performance factors in isolation of one another.Here, we use simulated data to dissect the performance of six methods for orthology inference available as standalone software packages (Inparanoid, OMA, OrthoInspector, OrthoMCL, QuartetS, SPIMAP as well as two generic approaches (bidirectional best hit and reciprocal smallest distance. We investigate the impact of various evolutionary forces (gene duplication, insertion, deletion, and lateral gene transfer and technological artefacts (ambiguous sequences on orthology inference. We show that while gene duplication/loss and insertion/deletion are well handled by most methods (albeit for different trade-offs of precision and recall, lateral gene transfer disrupts all methods. As for ambiguous sequences, which might result from poor sequencing, assembly, or genome annotation, we show that they affect alignment score-based orthology methods more strongly than their distance-based counterparts.

  12. Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

    Energy Technology Data Exchange (ETDEWEB)

    Su, Y.; Zhang, H.; Madrid, R. [Univ. of California, San Francisco, CA (United States)] [and others


    Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses is to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.

  13. Identification of vernalization responsive genes in the winter wheat cultivar Jing841 by transcriptome sequencing

    Indian Academy of Sciences (India)



    This study aimed to identify vernalization responsive genes in the winter wheat cultivar Jing841 by comparing the transcriptome data with that of a spring wheat cultivar Liaochun10. For each cultivar, seedlings before and after the vernalizationtreatment were sequenced by Solexa/Illumina sequencing. Genes differentially expressed after and before vernalization were identified as differentially expressed genes (DEGs) using false discovery rate (FDR) ≤ 0.001 and |log2 (fold change)| >1 as cutoffs. The Jing841-specific DEGs were screened and subjected to functional annotation using gene ontology (GO) database.Vernalization responsive genes among the specific genes were selected for validation by quantitative reverse transcription polymerase chain reaction (qRT-PCR) and the expression change over the time was investigated for the top 11 genes with the most significant expression differences. A total of 138,062 unigenes were obtained. Overall, 636 DEGs were identified as vernalization responsive genes including some known genes such as VRN-1 and COR14a, and some unknown contigs. The qRT-PCR validated changes in the expression of 18 DEGs that were detected by RNA-seq. Among them, 11 genes displayed four different types of expression patterns over time during the 30-day-long vernalization treatment. Genes or contigs such as VRN-A1, COR14a, IRIP, unigene1806 and Cl18953. Contig2 probably have critical roles in vernalization.

  14. Body fat distribution in women with familial partial lipodystrophy caused by mutation in the lamin A/C gene

    Directory of Open Access Journals (Sweden)

    Luciana Z Monteiro


    Full Text Available Familial partial lipodystrophy (FPLD, Dunnigan variety, is an autosomal dominant disorder caused due to missense mutations in the lamin A/C (LMNA gene encoding nuclear lamina proteins. Patients with FPLD are predisposed to metabolic complications of insulin resistance such as diabetes. We sought to evaluate and compare body fat distribution with dual-emission X-ray absorptiometry in women with and without FPLD and identify densitometric, clinical and metabolic features.

  15. Association and expression analysis of single nucleotide polymorphisms of partial tumor necrosis factor alpha gene with mastitis in crossbred cattle. (United States)

    Ranjan, Sanjeev; Bhushan, Bharat; Panigrahi, Manjit; Kumar, Amit; Deb, Rajib; Kumar, Pushpendra; Sharma, Deepak


    A total of 129 crossbred cows were selected to explore the genotypic and expression profiling of partial TNF-α gene and its association with mastitis susceptibility. Two exon spanning region of TNF-α gene (221 bp and 239 bp) were amplified by Polymerase Chain Reaction (PCR). The different genotypic analysis by SSCP revealed that 221 bp fragment was monomorphic, whereas 239 bp was polymorphic. Association studies revealed that AA genotypes of 239 bp were more prevalent in mastitis group and the mRNA expression of TNF-α was significantly (P mastitis resistance selection in dairy cattle.

  16. Promoter region sequence differences in the A and G gamma globin genes of Brazilian sickle cell anemia patients

    Directory of Open Access Journals (Sweden)

    C.G. Barbosa


    Full Text Available Fetal hemoglobin (HbF, encoded by the HBG2 and HBG1 genes, is the best-known genetic modulator of sickle cell anemia, varying dramatically in concentration in the blood of these patients. This variation is partially associated with polymorphisms located in the promoter region of the HBG2 and HBG1 genes. In order to explore known and unknown polymorphisms in these genes, the sequences of their promoter regions were screened in sickle cell anemia patients and correlated with both their HbF levels and their βS-globin haplotypes. Additionally, the sequences were compared with genes from 2 healthy groups, a reference one (N = 104 and an Afro-descendant one (N = 98, to identify polymorphisms linked to the ethnic background.The reference group was composed by healthy individuals from the general population. Four polymorphisms were identified in the promoter region of HBG2 and 8 in the promoter region of HBG1 among the studied groups. Four novel single nucleotide polymorphisms (SNP located at positions -324, -317, -309 and -307 were identified in the reference group. A deletion located between -396 and -391 in the HBG2 promoter region and the SNP -271 C→T in the HBG1 promoter region were associated with the Central African Republic βS-globin haplotype. In contrast, the -369 C→G and 309 A→G SNPs in the HBG2 promoter region were correlated to the Benin haplotype. The polymorphisms -396_-391 del HBG2, -369 SNP HBG2 and -271 SNP HBG1 correlated with HbF levels. Hence, we suggest an important role of HBG2 and HBG1 gene polymorphisms on the HbF synthesis.

  17. Promoter region sequence differences in the A and G gamma globin genes of Brazilian sickle cell anemia patients. (United States)

    Barbosa, C G; Goncalves-Santos, N J; Souza-Ribeiro, S B; Moura-Neto, J P; Takahashi, D; Silva, D O; Hurtado-Guerrero, A F; Reis, M G; Goncalves, M S


    Fetal hemoglobin (HbF), encoded by the HBG2 and HBG1 genes, is the best-known genetic modulator of sickle cell anemia, varying dramatically in concentration in the blood of these patients. This variation is partially associated with polymorphisms located in the promoter region of the HBG2 and HBG1 genes. In order to explore known and unknown polymorphisms in these genes, the sequences of their promoter regions were screened in sickle cell anemia patients and correlated with both their HbF levels and their betaS-globin haplotypes. Additionally, the sequences were compared with genes from 2 healthy groups, a reference one (N = 104) and an Afro-descendant one (N = 98), to identify polymorphisms linked to the ethnic background.The reference group was composed by healthy individuals from the general population. Four polymorphisms were identified in the promoter region of HBG2 and 8 in the promoter region of HBG1 among the studied groups. Four novel single nucleotide polymorphisms (SNP) located at positions -324, -317, -309 and -307 were identified in the reference group. A deletion located between -396 and -391 in the HBG2 promoter region and the SNP -271 C-->T in the HBG1 promoter region were associated with the Central African Republic betaS-globin haplotype. In contrast, the -369 C-->G and 309 A-->G SNPs in the HBG2 promoter region were correlated to the Benin haplotype. The polymorphisms -396_-391 del HBG2, -369 SNP HBG2 and -271 SNP HBG1 correlated with HbF levels. Hence, we suggest an important role of HBG2 and HBG1 gene polymorphisms on the HbF synthesis.

  18. Sequence diversities of serine-aspartate repeat genes among Staphylococcus aureus isolates from different hosts presumably by horizontal gene transfer.

    Directory of Open Access Journals (Sweden)

    Huping Xue

    Full Text Available BACKGROUND: Horizontal gene transfer (HGT is recognized as one of the major forces for bacterial genome evolution. Many clinically important bacteria may acquire virulence factors and antibiotic resistance through HGT. The comparative genomic analysis has become an important tool for identifying HGT in emerging pathogens. In this study, the Serine-Aspartate Repeat (Sdr family has been compared among different sources of Staphylococcus aureus (S. aureus to discover sequence diversities within their genomes. METHODOLOGY/PRINCIPAL FINDINGS: Four sdr genes were analyzed for 21 different S. aureus strains and 218 mastitis-associated S. aureus isolates from Canada. Comparative genomic analyses revealed that S. aureus strains from bovine mastitis (RF122 and mastitis isolates in this study, ovine mastitis (ED133, pig (ST398, chicken (ED98, and human methicillin-resistant S. aureus (MRSA (TCH130, MRSA252, Mu3, Mu50, N315, 04-02981, JH1 and JH9 were highly associated with one another, presumably due to HGT. In addition, several types of insertion and deletion were found in sdr genes of many isolates. A new insertion sequence was found in mastitis isolates, which was presumably responsible for the HGT of sdrC gene among different strains. Moreover, the sdr genes could be used to type S. aureus. Regional difference of sdr genes distribution was also indicated among the tested S. aureus isolates. Finally, certain associations were found between sdr genes and subclinical or clinical mastitis isolates. CONCLUSIONS: Certain sdr gene sequences were shared in S. aureus strains and isolates from different species presumably due to HGT. Our results also suggest that the distributional assay of virulence factors should detect the full sequences or full functional regions of these factors. The traditional assay using short conserved regions may not be accurate or credible. These findings have important implications with regard to animal husbandry practices that may

  19. Estimating variation within the genes and inferring the phylogeny of 186 sequenced diverse Escherichia coli genomes

    Directory of Open Access Journals (Sweden)

    Kaas Rolf S


    Full Text Available Abstract Background Escherichia coli exists in commensal and pathogenic forms. By measuring the variation of individual genes across more than a hundred sequenced genomes, gene variation can be studied in detail, including the number of mutations found for any given gene. This knowledge will be useful for creating better phylogenies, for determination of molecular clocks and for improved typing techniques. Results We find 3,051 gene clusters/families present in at least 95% of the genomes and 1,702 gene clusters present in 100% of the genomes. The former 'soft core' of about 3,000 gene families is perhaps more biologically relevant, especially considering that many of these genome sequences are draft quality. The E. coli pan-genome for this set of isolates contains 16,373 gene clusters. A core-gene tree, based on alignment and a pan-genome tree based on gene presence/absence, maps the relatedness of the 186 sequenced E. coli genomes. The core-gene tree displays high confidence and divides the E. coli strains into the observed MLST type clades and also separates defined phylotypes. Conclusion The results of comparing a large and diverse E. coli dataset support the theory that reliable and good resolution phylogenies can be inferred from the core-genome. The results further suggest that the resolution at the isolate level may, subsequently be improved by targeting more variable genes. The use of whole genome sequencing will make it possible to eliminate, or at least reduce, the need for several typing steps used in traditional epidemiology.

  20. Sequence and chromosomal localization of the mouse brevican gene

    DEFF Research Database (Denmark)

    Rauch, U; Meyer, H; Brakebusch, C


    Brevican is a brain-specific proteoglycan belonging to the aggrecan family. Phage clones containing the complete mouse brevican open reading frame of 2649 bp and the complete 3'-untranslated region of 341 bp were isolated from a mouse brain cDNA library, and cosmid clones containing the mouse bre...... to an alternative brevican cDNA, coding for a GPI-linked isoform. Single strand conformation polymorphism analysis mapped the brevican gene (Bcan) to chromosome 3 between the microsatellite markers D3Mit22 and D3Mit11....

  1. Application of gene sequencing directly to identify the pathogens in specimens

    Institute of Scientific and Technical Information of China (English)

    LU Xin-xin; YUAN Liang; WAN Xiao-hua; GENG Jia-jing


    Background Accurate identification of bacterial isolates is an essential task in clinical microbiology. This study compared culturing to analyzing 16S rRNA gene sequences as methods to identify bacteria in clinical samples. We developed a key technique to directly identify bacteria in clinical samples via nucleic acid sequences, thus improving the ability to confirm pathogens.Methods We obtained 225 samples from Beijing Tongran Hospital and examined them by conventional culture and 16S rDNA sequencing to identify pathogens. This study made use of a modified sample pre-treatment technique which came from our laboratory to extract DNA. 16S rDNA was amplified by PCR. The amplified product was sequenced on a CEQ8000 capillary sequencer. Sequences were uploaded to the GenBank BLAST database for comparison.Results Among the positively cultivated bacterial strains, seven strains were identified differently by Vitek32 and by 16S rDNA sequencing. Twelve samples that were negative by standard culturing were determined to have pathogens by sequence analysis.Conclusion The use of 16S rRNA gene sequencing can improve clinical microbiology by providing better identification of unidentified bacteria or providing reference identification of unusual strains.

  2. Automated DNA mutation detection using universal conditions direct sequencing: application to ten muscular dystrophy genes

    Directory of Open Access Journals (Sweden)

    Wu Bai-Lin


    Full Text Available Abstract Background One of the most common and efficient methods for detecting mutations in genes is PCR amplification followed by direct sequencing. Until recently, the process of designing PCR assays has been to focus on individual assay parameters rather than concentrating on matching conditions for a set of assays. Primers for each individual assay were selected based on location and sequence concerns. The two primer sequences were then iteratively adjusted to make the individual assays work properly. This generally resulted in groups of assays with different annealing temperatures that required the use of multiple thermal cyclers or multiple passes in a single thermal cycler making diagnostic testing time-consuming, laborious and expensive. These factors have severely hampered diagnostic testing services, leaving many families without an answer for the exact cause of a familial genetic disease. A search of GeneTests for sequencing analysis of the entire coding sequence for genes that are known to cause muscular dystrophies returns only a small list of laboratories that perform comprehensive gene panels. The hypothesis for the study was that a complete set of universal assays can be designed to amplify and sequence any gene or family of genes using computer aided design tools. If true, this would allow automation and optimization of the mutation detection process resulting in reduced cost and increased throughput. Results An automated process has been developed for the detection of deletions, duplications/insertions and point mutations in any gene or family of genes and has been applied to ten genes known to bear mutations that cause muscular dystrophy: DMD; CAV3; CAPN3; FKRP; TRIM32; LMNA; SGCA; SGCB; SGCG; SGCD. Using this process, mutations have been found in five DMD patients and four LGMD patients (one in the FKRP gene, one in the CAV3 gene, and two likely causative heterozygous pairs of variations in the CAPN3 gene of two other

  3. The complete genome sequences of a Peruvian and a Colombian isolate of Andean potato latent virus and partial sequences of further isolates suggest the existence of two distinct potato-infecting tymovirus species. (United States)

    Kreuze, Jan; Koenig, Renate; De Souza, Joao; Vetten, Heinrich Josef; Muller, Giovanna; Flores, Betty; Ziebell, Heiko; Cuellar, Wilmer


    The complete genomic RNA sequences of the tymovirus isolates Hu and Col from potato which originally had been considered to be strains of the same virus species, i.e. Andean potato latent virus (APLV), were determined by siRNA sequencing and assembly, and found to share only c. 65% nt sequence identity. This result together with those of serological tests and comparisons of the coat protein gene sequences of additional tymovirus isolates from potato suggest that the species Andean potato latent virus should be subdivided into two species, i.e. APLV and Andean potato mild mosaic virus (APMMV). Primers were designed for the broad specificity detection of both viruses.

  4. Multiple Cis-Acting Sequences Contribute to Evolved Regulatory Variation for Drosophila Adh Genes (United States)

    Fang, X. M.; Brennan, M. D.


    Drosophila affinidisjuncta and Drosophila hawaiiensis are closely related species that display distinct tissue-specific expression patterns for their homologous alcohol dehydrogenase genes (Adh genes). In Drosophila melanogaster transformants, both genes are expressed at high levels in the larval and adult fat bodies, but the D. affinidisjuncta gene is expressed 10-50-fold more strongly in the larval and adult midguts and Malpighian tubules. The present study reports the mapping of cis-acting sequences contributing to the regulatory differences between these two genes in transformants. Chimeric genes were constructed and introduced into the germ line of D. melanogaster. Stage- and tissue-specific expression patterns were determined by measuring steady-state RNA levels in larvae and adults. Three portions of the promoter region make distinct contributions to the tissue-specific regulatory differences between the native genes. Sequences immediately upstream of the distal promoter have a strong effect in the adult Malpighian tubules, while sequences between the two promoters are relatively important in the larval Malpighian tubules. A third gene segment, immediately upstream of the proximal promoter, influences levels of the proximal Adh transcript in all tissues and developmental stages examined, and largely accounts for the regulatory difference in the larval and adult midguts. However, these as well as other sequences make smaller contributions to various aspects of the tissue-specific regulatory differences. In addition, some chimeric genes display aberrant RNA levels for the whole organism, suggesting close physical association between sequences involved in tissue-specific regulatory differences and those important for Adh expression in the larval and adult fat bodies. PMID:1644276

  5. Stress gene (hsp70) sequences and quantitative expression in Milnesium tardigradum (Tardigrada) during active and cryptobiotic stages. (United States)

    Schill, Ralph O; Steinbrück, Günther H B; Köhler, Heinz-R


    The eutardigrade Milnesium tardigradum can undergo cryptobiosis, i.e. entry into a reversible ametabolic stage induced by dehydration, cooling and, probably, osmotic and anoxic stress. For the first time in tardigrades, we described partial sequences of three heat-shock protein (hsp70 family) genes and examined gene expression on the way from an active to a cryptobiotic and back to an active stage again. Results showed different patterns of gene expression in the hsp70 isoforms. All three isoforms seem to be true heat-shock proteins since transcription could be clearly enhanced by temperature elevation. Isoform 1 and, at a lower level, isoform 3 do not seem to have a specific function for cryptobiosis. By contrast, transcription of isoform 2 is significantly induced in the transitional stage between the active and the cryptobiotic stage, resulting in a comparatively high mRNA copy number also during cryptobiosis. This pattern of induction implies that isoform 2 is the most relevant hsp70 gene for M. tardigradum individuals entering the cryptobiotic stage.

  6. Candidate genes revealed by a genome scan for mosquito resistance to a bacterial insecticide: sequence and gene expression variations

    Directory of Open Access Journals (Sweden)

    David Jean-Philippe


    Full Text Available Abstract Background Genome scans are becoming an increasingly popular approach to study the genetic basis of adaptation and speciation, but on their own, they are often helpless at identifying the specific gene(s or mutation(s targeted by selection. This shortcoming is hopefully bound to disappear in the near future, thanks to the wealth of new genomic resources that are currently being developed for many species. In this article, we provide a foretaste of this exciting new era by conducting a genome scan in the mosquito Aedes aegypti with the aim to look for candidate genes involved in resistance to Bacillus thuringiensis subsp. israelensis (Bti insecticidal toxins. Results The genome of a Bti-resistant and a Bti-susceptible strains was surveyed using about 500 MITE-based molecular markers, and the loci showing the highest inter-strain genetic differentiation were sequenced and mapped on the Aedes aegypti genome sequence. Several good candidate genes for Bti-resistance were identified in the vicinity of these highly differentiated markers. Two of them, coding for a cadherin and a leucine aminopeptidase, were further examined at the sequence and gene expression levels. In the resistant strain, the cadherin gene displayed patterns of nucleotide polymorphisms consistent with the action of positive selection (e.g. an excess of high compared to intermediate frequency mutations, as well as a significant under-expression compared to the susceptible strain. Conclusion Both sequence and gene expression analyses agree to suggest a role for positive selection in the evolution of this cadherin gene in the resistant strain. However, it is unlikely that resistance to Bti is conferred by this gene alone, and further investigation will be needed to characterize other genes significantly associated with Bti resistance in Ae. aegypti. Beyond these results, this article illustrates how genome scans can build on the body of new genomic information (here, full

  7. Sequence Analysis of the Protein Structure Homology Modeling of Growth Hormone Gene from Salmo trutta caspius

    Directory of Open Access Journals (Sweden)

    Abolhasan Rezaei


    Full Text Available In view of the growth hormone protein investigated and characterized from Salmo trutta caspius. Growth hormone gene in the Salmo trutta caspius have six exons in the full length that is translated into a Molecular Weight (kDa: ssDNA: 64.98 and dsDNA: 129.6. There are also 210 amino acid residue. The assembled full length of DNA contains open reading frame of growth hormone gene that contains 15 sequences in the full length. The average GC content is 47% and AT content is 53%. This protein multiple alignment has shown that this peptide is 100% identical to the corresponding homologous protein in the growth hormone protein which including Salmo salar (Accession number: AAA49558.1 and Rainbow trout (Salmo trutta (Accession number: AAA49555.1" sequences. The sequence of protein had deposited in Gene Bank, Accession number: AEK70940. Also we were analyzed second and third structure between sequences reported in Gene Bank Network system. The results are shown, there are homology between second structure in three sequences including: Salmo trutta caspius, Salmo salar and Rainbow trout. Regarding third structure, Salmo trutta caspius and Salmo salar are same type, but Rainbow trout has different homology with Salmo trutta caspius and Salmo salar. However, the sequences were observed three parallel " helix and in second structure there were almost same percent β sheet.

  8. Phylogenetic analysis of Mexican Babesia bovis isolates using msa and ssrRNA gene sequences. (United States)

    Genis, Alma D; Mosqueda, Juan J; Borgonio, Verónica M; Falcón, Alfonso; Alvarez, Antonio; Camacho, Minerva; de Lourdes Muñoz, Maria; Figueroa, Julio V


    Variable merozoite surface antigens of Babesia bovis are exposed glycoproteins having a role in erythrocyte invasion. Members of this gene family include msa-1 and msa-2 (msa-2c, msa-2a(1), msa-2a(2), and msa-2b). Small subunit ribosomal (ssr)RNA gene is subject to evolutive pressure and has been used in phylogenetic studies. To determine the phylogenetic relationship among B. bovis Mexican isolates using different genetic markers, PCR amplicons, corresponding to msa-1, msa-2c, msa-2b, and ssrRNA genes, were cloned and plasmids carrying the corresponding inserts were sequenced. Comparative analysis of nucleotide and deduced amino acid sequences revealed distinct degrees of variability and identity among the coding gene sequences obtained from 12 geographically different B. bovis isolates and a reference strain. Overall sequence identities of 47.7%, 72.3%, 87.7%, and 94% were determined for msa-1, msa-2b, msa-2c, and ssrRNA, respectively. A robust phylogenetic tree was obtained with msa-2b sequences. The phylogenetic analysis suggests that Mexican B. bovis isolates group in clades not concordant with the Mexican geography. However, the Mexican isolates group together in an American clade separated from the Australian clade. Sequence heterogeneity in msa-1, msa-2b, and msa-2c coding regions of Mexican B. bovis isolates present in different geographical regions can be a result of either differential evolutive pressure or cattle movement from commercial trade.

  9. Sequence and organization of coelacanth neurohypophysial hormone genes: Evolutionary history of the vertebrate neurohypophysial hormone gene locus

    Directory of Open Access Journals (Sweden)

    Brenner Sydney


    Full Text Available Abstract Background The mammalian neurohypophysial hormones, vasopressin and oxytocin are involved in osmoregulation and uterine smooth muscle contraction respectively. All jawed vertebrates contain at least one homolog each of vasopressin and oxytocin whereas jawless vertebrates contain a single neurohypophysial hormone called vasotocin. The vasopressin homolog in non-mammalian vertebrates is vasotocin; and the oxytocin homolog is mesotocin in non-eutherian tetrapods, mesotocin and [Phe2]mesotocin in lungfishes, and isotocin in ray-finned fishes. The genes encoding vasopressin and oxytocin genes are closely linked in the human and rodent genomes in a tail-to-tail orientation. In contrast, their pufferfish homologs (vasotocin and isotocin are located on the same strand of DNA with isotocin gene located upstream of vasotocin gene separated by five genes, suggesting that this locus has experienced rearrangements in either mammalian or ray-finned fish lineage, or in both lineages. The coelacanths occupy a unique phylogenetic position close to the divergence of the mammalian and ray-finned fish lineages. Results We have sequenced a coelacanth (Latimeria menadoensis BAC clone encompassing the neurohypophysial hormone genes and investigated the evolutionary history of the vertebrate neurohypophysial hormone gene locus within a comparative genomics framework. The coelacanth contains vasotocin and mesotocin genes like non-mammalian tetrapods. The coelacanth genes are present on the same strand of DNA with no intervening genes, with the vasotocin gene located upstream of the mesotocin gene. Nucleotide sequences of the second exons of the two genes are under purifying selection implying a regulatory function. We have also analyzed the neurohypophysial hormone gene locus in the genomes of opossum, chicken and Xenopus tropicalis. The opossum contains two tandem copies of vasopressin and mesotocin genes. The vasotocin and mesotocin genes in chicken and

  10. Effect of 5'-flanking sequence deletions on expression of the human insulin gene in transgenic mice

    DEFF Research Database (Denmark)

    Fromont-Racine, M; Bucchini, D; Madsen, O;


    Expression of the human insulin gene was examined in transgenic mouse lines carrying the gene with various lengths of DNA sequences 5' to the transcription start site (+1). Expression of the transgene was demonstrated by 1) the presence of human C-peptide in urine, 2) the presence of specific tra...... of the transgene was observed in cell types other than beta-islet cells.......Expression of the human insulin gene was examined in transgenic mouse lines carrying the gene with various lengths of DNA sequences 5' to the transcription start site (+1). Expression of the transgene was demonstrated by 1) the presence of human C-peptide in urine, 2) the presence of specific......, and -168 allowed correct initiation of the transcripts and cell specificity of expression, while quantitative expression gradually decreased. Deletion to -58 completely abolished the expression of the gene. The amount of human product that in mice harboring the longest fragment contributes up to 50...

  11. [Cloning and sequencing the isopenicillin N synthetase(IPNS) gene from Streptomyces cattleya]. (United States)

    Wang, Y; Li, R


    Great homology existed between IPNS genes from surphur-containing beta-lactam antibiotics producers including procaryotes and eucaryotes. A DNA homologous band was confirmed in S. cattleya by Southern blot analysis using IPNS gene from S. lipmanii as a probe. A recombinant plasmid containing the cyclase gene involved in thienamycin biosynthesis and IPNS gene was obtained by complementary cloning with mutant from S. cattleya. DNA sequencing revealed that the IPNS gene of S. cattleya consists of 963 bp encoding a protein of 321 amino acids with ATG as start codon, TGA as stop codon. Pairwise comparison of the predicted amino acid sequences showed 56% and 64% similarity with IPNSs of S. clavuligerus and S. lipmanii, respectively.

  12. Isolation, sequence identification and tissue expression profiles of 3 novel porcine genes: ASPA, NAGA, and HEXA. (United States)

    Shu, Xianghua; Liu, Yonggang; Yang, Liangyu; Song, Chunlian; Hou, Jiafa


    The complete coding sequences of 3 porcine genes - ASPA, NAGA, and HEXA - were amplified by the reverse transcriptase polymerase chain reaction (RT-PCR) based on the conserved sequence information of the mouse or other mammals and referenced pig ESTs. These 3 novel porcine genes were then deposited in the NCBI database and assigned GeneIDs: 100142661, 100142664 and 100142667. The phylogenetic tree analysis revealed that the porcine ASPA, NAGA, and HEXA all have closer genetic relationships with the ASPA, NAGA, and HEXA of cattle. Tissue expression profile analysis was also carried out and results revealed that swine ASPA, NAGA, and HEXA genes were differentially expressed in various organs, including skeletal muscle, the heart, liver, fat, kidney, lung, and small and large intestines. Our experiment is the first one to establish the foundation for further research on these 3 swine genes.

  13. Cloning, sequencing and variability analysis of the gap gene from Mycoplasma hominis

    DEFF Research Database (Denmark)

    Mygind, Tina; Jacobsen, Iben Søgaard; Melkova, Renata


    The gap gene encodes the glycolytic enzyme glyceraldehyde 3-phosphate dehydrogenase (GAPDH). The gene was cloned and sequenced from the Mycoplasma hominis type strain PG21(T). The intraspecies variability was investigated by inspection of restriction fragment length polymorphism (RFLP) patterns...... after polymerase chain reaction (PCR) amplification of the gap gene from 15 strains and furthermore by sequencing of part of the gene in eight strains. The M. hominis gap gene was found to vary more than the Escherichia coli counterpart, but the variation at nucleotide level gave rise to only a few...... to a 104-kDa band in addition to the expected 36-kDa band. The protein reacting at 104 kDa is a M. hominis protein with either an epitope similar to one on GAPDH, or it is an immunoglobulin binding protein...

  14. Nucleotide sequence and transcription of a trypomastigote surface antigen gene of Trypanosoma cruzi. (United States)

    Fouts, D L; Ruef, B J; Ridley, P T; Wrightsman, R A; Peterson, D S; Manning, J E


    In previous studies we identified a 500-bp segment of the gene, TSA-1, which encodes an 85-kDa trypomastigote-specific surface antigen of the Peru strain of Trypanosoma cruzi. TSA-1 was shown to be located at a telomeric site and to contain a 27-bp tandem repeat unit within the coding region. This repeat unit defines a discrete subset of a multigene family and places the TSA-1 gene within this subset. In this study, we present the complete nucleotide sequence of the TSA-1 gene from the Peru strain. By homology matrix analysis, fragments of two other trypomastigote specific surface antigen genes, pTt34 and SA85-1.1, are shown to have extensive sequence homology with TSA-1 indicating that these genes are members of the same gene family as TSA-1. The TSA-1 subfamily was also found to be active in two other strains of T. cruzi, one of which contains multiple telomeric members and one of which contains a single non-telomeric member, suggesting that transcription is not necessarily dependent on the gene being located at a telomeric site. Also, while some of the sequences found in this gene family are present in 2 size classes of poly(A)+ RNA, others appear to be restricted to only 1 of the 2 RNA classes.

  15. Cloning and sequence analysis of a gene encoding polygalacturonase-inhibiting protein from cotton

    Institute of Scientific and Technical Information of China (English)


    Polygalacturonase-inhibiting proteins (PGIP) play important roles in plant defense of pathogen, especially fungi. A pair of degenerated primers is designed based on the conserved sequence of 20 other known pgip genes and used to amplify Gossypium barbadense cultivation 7124 cDNA library by touch-down PCR. A 561 bp internal fragment of the pgip gene is obtained and used to design the primers for rapid amplification of cDNA ends. A composite pgip gene sequence is constructed from the products of 5′ and 3′ RACE, which are 666 bp and 906 bp respectively. Analysis of nucleic acid sequence shows 69.2% and 68.7% similarity to Citrus and Poncirus pgip genes, respectively. Its open reading frame of the gene encodes a polypeptide of 330 amino acids, in which 10 leucine-rich repeats arrange tandemly. A new set of primers is designed to the 5′ and 3′ ends of the gene, which allows amplification of the full-length gene from the cotton cDNA library. Genomic DNA analysis reveals that this gene has no intron.

  16. Sequence analysis of the phage 21 genes for prohead assembly and head completion. (United States)

    Smith, M P; Feiss, M


    Phage 21 is a temperate lambdoid coliphage, and its head-encoding genes, as well as those of phage lambda, are descended from a common ancestral phage. The head protein-encoding genes of phage 21 have been sequenced, confirming earlier genetic studies indicating that the head-encoding genes of 21 and lambda are analogous in location, size, and function. The phage 21 head-encoding genes identified (and their lambda analogues) include: 3(W), 4(B), 5(C), 6(Nu3), shp (D), 7(E), and 8(FII), respectively. An open reading frame, orf1, is analogous in position and shares some sequence identity with FI, a phage lambda gene involved in DNA packaging. The phage 21 major head protein, gp7, is predicted to have strong sequence identity (65%) with the lambda major capsid protein, gpE, including amino acids known to be important for capsid form determination. The nested genes 5/6 of phage 21 and C/Nu3 of lambda differ by several rearrangements including deletions and a triplication. The possibility that lambda genes C/Nu3 evolved from ancestal nested genes containing a triplication is discussed.

  17. Targeted Next-Generation Sequencing Successfully Detects Causative Genes in Chinese Patients with Hereditary Hearing Loss. (United States)

    Chen, Siqi; Dong, Cheng; Wang, Qi; Zhong, Zhen; Qi, Yu; Ke, Xiaomei; Liu, Yuhe


    We attempted to identify the genetic epidemiology of hereditary hearing loss among the Chinese Han population using next-generation sequencing (NGS). The entire length of the genes GJB2, SLC26A4, and GJB3, as well as exons of 57 additional candidate genes were sequenced from 116 individuals suffering from hearing loss. Thirty potentially causative mutations from these 60 genes were identified as the likely etiologies of hearing loss in 67 of the cases. In our study, SLC26A4 and GJB2 were the most frequently affected genes among the Chinese Han population with hearing loss. Collectively, they account for 52.8% of the cases, followed by MTRNR1, PCDH15, and TECTA. These data also illustrate that NGS can be used to identify rare alleles responsible for hereditary hearing loss: 22 of the 30 (73.3%) genes identified with mutations are rarely mutated in hereditary hearing loss and only account for 21.5% (42/195) of the total mutation frequency, explaining no more than 2% for each gene. These rarely mutated genes would be missed by conventional diagnostic sequencing approaches. NGS can be used effectively to identify both the common and rare genes causing hereditary hearing loss.

  18. Distant horizontal gene transfer is rare for multiple families of prokaryotic insertion sequences. (United States)

    Wagner, Andreas; de la Chaux, Nicole


    Horizontal gene transfer in prokaryotes is rampant on short and intermediate evolutionary time scales. It poses a fundamental problem to our ability to reconstruct the evolutionary tree of life. Is it also frequent over long evolutionary distances? To address this question, we analyzed the evolution of 2,091 insertion sequences from all 20 major families in 438 completely sequenced prokaryotic genomes. Specifically, we mapped insertion sequence occurrence on a 16S rDNA tree of the genomes we analyzed, and we also constructed phylogenetic trees of the insertion sequence transposase coding sequences. We found only 30 cases of likely horizontal transfer among distantly related prokaryotic clades. Most of these horizontal transfer events are ancient. Only seven events are recent. Almost all of these transfer events occur between pairs of human pathogens or commensals. If true also for other, non-mobile DNA, the rarity of distant horizontal transfer increases the odds of reliable phylogenetic inference from sequence data.

  19. TGM6 identified as a novel causative gene of spinocerebellar ataxias using exome sequencing. (United States)

    Wang, Jun Ling; Yang, Xu; Xia, Kun; Hu, Zheng Mao; Weng, Ling; Jin, Xin; Jiang, Hong; Zhang, Peng; Shen, Lu; Guo, Ji Feng; Li, Nan; Li, Ying Rui; Lei, Li Fang; Zhou, Jie; Du, Juan; Zhou, Ya Fang; Pan, Qian; Wang, Jian; Wang, Jun; Li, Rui Qiang; Tang, Bei Sha


    Autosomal-dominant spinocerebellar ataxias constitute a large, heterogeneous group of progressive neurodegenerative diseases with multiple types. To date, classical genetic studies have revealed 31 distinct genetic forms of spinocerebellar ataxias and identified 19 causative genes. Traditional positional cloning strategies, however, have limitations for finding causative genes of rare Mendelian disorders. Here, we used a combined strategy of exome sequencing and linkage analysis to identify a novel spinocerebellar ataxia causative gene, TGM6. We sequenced the whole exome of four patients in a Chinese four-generation spinocerebellar ataxia family and identified a missense mutation, c.1550T-G transition (L517W), in exon 10 of TGM6. This change is at a highly conserved position, is predicted to have a functional impact, and completely cosegregated with the phenotype. The exome results were validated using linkage analysis. The mutation we identified using exome sequencing was located in the same region (20p13-12.2) as that identified by linkage analysis, which cross-validated TGM6 as the causative spinocerebellar ataxia gene in this family. We also showed that the causative gene could be mapped by a combined method of linkage analysis and sequencing of one sample from the family. We further confirmed our finding by identifying another missense mutation c.980A-G transition (D327G) in exon seven of TGM6 in an additional spinocerebellar ataxia family, which also cosegregated with the phenotype. Both mutations were absent in 500 normal unaffected individuals of matched geographical ancestry. The finding of TGM6 as a novel causative gene of spinocerebellar ataxia illustrates whole-exome sequencing of affected individuals from one family as an effective and cost efficient method for mapping genes of rare Mendelian disorders and the use of linkage analysis and exome sequencing for further improving efficiency.

  20. Citrus plastid-related gene profiling based on expressed sequence tag analyses

    Directory of Open Access Journals (Sweden)

    Tercilio Calsa Jr.


    Full Text Available Plastid-related sequences, derived from putative nuclear or plastome genes, were searched in a large collection of expressed sequence tags (ESTs and genomic sequences from the Citrus Biotechnology initiative in Brazil. The identified putative Citrus chloroplast gene sequences were compared to those from Arabidopsis, Eucalyptus and Pinus. Differential expression profiling for plastid-directed nuclear-encoded proteins and photosynthesis-related gene expression variation between Citrus sinensis and Citrus reticulata, when inoculated or not with Xylella fastidiosa, were also analyzed. Presumed Citrus plastome regions were more similar to Eucalyptus. Some putative genes appeared to be preferentially expressed in vegetative tissues (leaves and bark or in reproductive organs (flowers and fruits. Genes preferentially expressed in fruit and flower may be associated with hypothetical physiological functions. Expression pattern clustering analysis suggested that photosynthesis- and carbon fixation-related genes appeared to be up- or down-regulated in a resistant or susceptible Citrus species after Xylella inoculation in comparison to non-infected controls, generating novel information which may be helpful to develop novel genetic manipulation strategies to control Citrus variegated chlorosis (CVC.

  1. Candida famata (Debaryomyces hansenii) DNA sequences containing genes involved in riboflavin synthesis. (United States)

    Voronovsky, Andriy Y; Abbas, Charles A; Dmytruk, Kostyantyn V; Ishchuk, Olena P; Kshanovska, Barbara V; Sybirna, Kateryna A; Gaillardin, Claude; Sibirny, Andriy A


    Previously cloned Candida famata (Debaryomyces hansenii) strain VKM Y-9 genomic DNA fragments containing genes RIB1 (codes for GTP cyclohydrolase II), RIB2 (encodes specific reductase), RIB5 (codes for dimethylribityllumazine synthase), RIB6 (encodes dihydroxybutanone phosphate synthase) and RIB7 (codes for riboflavin synthase) were sequenced. The derived amino acid sequences of C. famata RIB genes showed extensive homology to the corresponding sequences of riboflavin synthesis enzymes of other yeast species. The highest identity was observed to homologues of D. hansenii CBS767, as C. famata is the anamorph of this hemiascomycetous yeast. The D. hansenii CBS767 RIB3 gene encoding specific deaminase was cloned. This gene successfully complemented riboflavin auxotrophy of the rib3 mutant of flavinogenic yeast, Pichia guilliermondii. Putative iron-responsive elements (potential sites for binding of the transcription factors Fep1p or Aft1p and Aft2p) were found in the upstream regions of some C. famata and D. hansenii RIB genes. The sequences of C. famata RIB genes have been submitted to the EMBL data library under Accession Nos AJ810169-AJ810173.

  2. Infective Endocarditis: Identification of Catalase-Negative, Gram-Positive Cocci from Blood Cultures by Partial 16S rRNA Gene Analysis and by Vitek 2 Examination

    DEFF Research Database (Denmark)

    Abdul-Redha, Rawaa Jalil; Kemp, Michael; Bangsborg, Jette M


    Streptococci, enterococci and Streptococcus-like bacteria are frequent etiologic agents of infective endocarditis and correct species identification can be a laboratory challenge. Viridans streptococci (VS) not seldomly cause contamination of blood cultures. Vitek 2 and partial sequencing of the ...

  3. Molecular characterization of human T-cell lymphotropic virus type 1 full and partial genomes by Illumina massively parallel sequencing technology.

    Directory of Open Access Journals (Sweden)

    Rodrigo Pessôa

    Full Text Available BACKGROUND: Here, we report on the partial and full-length genomic (FLG variability of HTLV-1 sequences from 90 well-characterized subjects, including 48 HTLV-1 asymptomatic carriers (ACs, 35 HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP and 7 adult T-cell leukemia/lymphoma (ATLL patients, using an Illumina paired-end protocol. METHODS: Blood samples were collected from 90 individuals, and DNA was extracted from the PBMCs to measure the proviral load and to amplify the HTLV-1 FLG from two overlapping fragments. The amplified PCR products were subjected to deep sequencing. The sequencing data were assembled, aligned, and mapped against the HTLV-1 genome with sufficient genetic resemblance and utilized for further phylogenetic analysis. RESULTS: A high-throughput sequencing-by-synthesis instrument was used to obtain an average of 3210- and 5200-fold coverage of the partial (n = 14 and FLG (n = 76 data from the HTLV-1 strains, respectively. The results based on the phylogenetic trees of consensus sequences from partial and FLGs revealed that 86 (95.5% individuals were infected with the transcontinental sub-subtypes of the cosmopolitan subtype (aA and that 4 individuals (4.5% were infected with the Japanese sub-subtypes (aB. A comparison of the nucleotide and amino acids of the FLG between the three clinical settings yielded no correlation between the sequenced genotype and clinical outcomes. The evolutionary relationships among the HTLV sequences were inferred from nucleotide sequence, and the results are consistent with the hypothesis that there were multiple introductions of the transcontinental subtype in Brazil. CONCLUSIONS: This study has increased the number of subtype aA full-length genomes from 8 to 81 and HTLV-1 aB from 2 to 5 sequences. The overall data confirmed that the cosmopolitan transcontinental sub-subtypes were the most prevalent in the Brazilian population. It is hoped that this valuable genomic data

  4. The complete mitochondrial genome sequence and gene organization of Tridentiger trigonocephalus (Gobiidae: Gobionellinae) with phylogenetic consideration. (United States)

    Wei, Hongqing; Ma, Hongyu; Ma, Chunyan; Zhang, Fengying; Wang, Wei; Chen, Wei; Ma, Lingbo


    The complete mitochondrial genome plays an important role in studies of genome-level characteristics and phylogenetic relationships. Here we determined the complete mitogenome sequence of Tridentiger trigonocephalus (Perciformes, Gobiidae), and discovered its phylogenetic relationship. This circular genome was 16 662 bp in length, and consisted of 37 typical genes, including 13 protein-coding genes, 22 tRNA genes, and two rRNA genes. The gene order of T. trigonocephalus mitochondrial genome was identical to those observed in most other vertebrates. Of 37 genes, 28 were encoded by heavy strand, while the others were encoded by light strand. The phylogenetic tree constructed by 13 concatenated protein-coding genes showed that T. trigonocephalus was closest to T. bifasciatus, and then to T. barbatus among the 20 species within suborder Gobioidei. This work should facilitate the studies on population genetic diversity, and molecular evolution in Gobioidei fishes.

  5. Rapid evolution of the sequences and gene repertoires of secreted proteins in bacteria.

    Directory of Open Access Journals (Sweden)

    Teresa Nogueira

    Full Text Available Proteins secreted to the extracellular environment or to the periphery of the cell envelope, the secretome, play essential roles in foraging, antagonistic and mutualistic interactions. We hypothesize that arms races, genetic conflicts and varying selective pressures should lead to the rapid change of sequences and gene repertoires of the secretome. The analysis of 42 bacterial pan-genomes shows that secreted, and especially extracellular proteins, are predominantly encoded in the accessory genome, i.e. among genes not ubiquitous within the clade. Genes encoding outer membrane proteins might engage more frequently in intra-chromosomal gene conversion because they are more often in multi-genic families. The gene sequences encoding the secretome evolve faster than the rest of the genome and in particular at non-synonymous positions. Cell wall proteins in Firmicutes evolve particularly fast when compared with outer membrane proteins of Proteobacteria. Virulence factors are over-represented in the secretome, notably in outer membrane proteins, but cell localization explains more of the variance in substitution rates and gene repertoires than sequence homology to known virulence factors. Accordingly, the repertoires and sequences of the genes encoding the secretome change fast in the clades of obligatory and facultative pathogens and also in the clades of mutualists and free-living bacteria. Our study shows that cell localization shapes genome evolution. In agreement with our hypothesis, the repertoires and the sequences of genes encoding secreted proteins evolve fast. The particularly rapid change of extracellular proteins suggests that these public goods are key players in bacterial adaptation.

  6. Complete nucleotide sequences of two adjacent early vaccinia virus genes located within the inverted terminal repetition. (United States)

    Venkatesan, S; Gershowitz, A; Moss, B


    The proximal part of the 10,000-base pair (bp) inverted terminal repetition of vaccinia virus DNA encodes at least three early mRNAs. A 2,236-bp segment of the repetition was sequenced to characterize two of the genes. This task was facilitated by constructing a series of recombinants containing overlapping deletions; oligonucleotide linkers with synthetic restriction sites provided points for radioactive labeling before sequencing by the chemical degradation method of Maxam and Gilbert (Methods Enzymol. 65:499-560, 1980). The ends of the transcripts were mapped by hybridizing labeled DNA fragments to early viral RNA and resolving nuclease S1-protected fragments in sequencing gels, by sequencing cDNA clones, and from the lengths of the RNAs. The nucleotide sequences for at least 60 bp upstream of both transcriptional initiation sites are more than 80% adenine . thymine rich and contain long runs of adenines and thymines with some homology to procaryotic and eucaryotic consensus sequences. The gene transcribed in the rightward direction encodes an RNA of approximately 530 nucleotides with a single open reading frame of 420 nucleotides. Preceding the first AUG, there is a heptanucleotide that can hybridize to the 3' end of 18S rRNA with only one mismatch. The derived amino acid sequence of the protein indicated a molecular weight of 15,500. The gene transcribed in the leftward direction encodes an RNA 1,000 to 1,100 nucleotides long with an open reading frame of 996 nucleotides and a leader sequence of only 5 to 6 nucleotides. The derived amino acid sequence of this protein indicated a molecular weight of 38,500. The 3' ends of the two transcripts were located within 100 bp of each other. Although there are adenine . thymine-rich clusters near the putative transcriptional termination sites, specific AATAAA polyadenylic acid signal sequences are absent.

  7. Versatile Cosmid Vectors for the Isolation, Expression, and Rescue of Gene Sequences: Studies with the Human α -globin Gene Cluster (United States)

    Lau, Yun-Fai; Kan, Yuet Wai


    We have developed a series of cosmids that can be used as vectors for genomic recombinant DNA library preparations, as expression vectors in mammalian cells for both transient and stable transformations, and as shuttle vectors between bacteria and mammalian cells. These cosmids were constructed by inserting one of the SV2-derived selectable gene markers-SV2-gpt, SV2-DHFR, and SV2-neo-in cosmid pJB8. High efficiency of genomic cloning was obtained with these cosmids and the size of the inserts was 30-42 kilobases. We isolated recombinant cosmids containing the human α -globin gene cluster from these genomic libraries. The simian virus 40 DNA in these selectable gene markers provides the origin of replication and enhancer sequences necessary for replication in permissive cells such as COS 7 cells and thereby allows transient expression of α -globin genes in these cells. These cosmids and their recombinants could also be stably transformed into mammalian cells by using the respective selection systems. Both of the adult α -globin genes were more actively expressed than the embryonic zeta -globin genes in these transformed cell lines. Because of the presence of the cohesive ends of the Charon 4A phage in the cosmids, the transforming DNA sequences could readily be rescued from these stably transformed cells into bacteria by in vitro packaging of total cellular DNA. Thus, these cosmid vectors are potentially useful for direct isolation of structural genes.

  8. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data. (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit


    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  9. Cloning, sequencing, and functional analysis of the 5'-flanking region of the rat 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase gene. (United States)

    Lin, H K; Penning, T M


    Rat liver 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase (3 alpha-HSD/DD) is a member of the aldo-keto reductase gene superfamily. It displays high constitutive expression and inactivates circulating steroid hormones and suppresses the formation of polycyclic aromatic hydrocarbon anti- and syn-diol-epoxides (ultimate carcinogens). To elucidate mechanisms responsible for constitutive expression of the 3 alpha-HSD/DD gene a rat genomic library obtained from adult Sprague-Dawley female liver (HaeIII partial digest) was screened, using a probe corresponding to the 5'-end of the cDNA (-15 to +250), and a 15.8-kb genomic clone was isolated. Sequencing revealed that 6.3 kb contained exon 1 (+16 to +138 bp) plus additional introns and exons. The transcription start site (+1) was located by primer extension analysis, and the initiation codon, ATG, was located at +55 bp. The remaining 9.5 kb represented the 5'-flanking region of the rat 3 alpha-HSD/DD gene. A 1.6-kb fragment of this region was sequenced. A TATTTAA sequence (TATA box) was found at 33 bp upstream from the major transcription start site. cis-acting elements responsible for the constitutive expression of the rat 3 alpha-HSD/DD gene were located on the 5'-flanking region by transient transfection of reporter-gene (chloramphenicol acetyl transferase, CAT) constructs into human hepatoma cells (HepG2). CAT assays identified the basal promoter between (-199 and +55 bp), the presence of a proximal enhancer (-498 to -199 bp) which stimulated CAT activity 6-fold, the existence of a powerful silencer (-755 to -498 bp), and a strong distal enhancer (-4.0 to -2.0 kb) which increased CAT activity by 20-40-fold. A computer search of available consensus sequences for trans-acting factors revealed that a cluster of Oct-sites were uniquely located in the silencer region. Using the negative response element (-797 to -498 bp) as a probe and nuclear extracts from HepG2 cells, three bands were identified by gel mobility shift

  10. Isolation of laccase gene-specific sequences from white rot and brown rot fungi by PCR

    Energy Technology Data Exchange (ETDEWEB)

    D`Souza, T.M.; Boominathan, K.; Reddy, C.A. [Michigan State Univ., East Lansing, MI (United States)


    Degenerate primers corresponding to the consensus sequences of the copper-binding regions in the N-terminal domains of known basidiomycete laccases were used to isolate laccase gene-specific sequences from strains representing nine genera of wood rot fungi. All except three gave the expected PCR product of about 200 bp. Computer searches of the databases identified the sequences of each of the PCR product of about 200 bp. Computer searches of the databases identified the sequence of each of the PCR products analyzed as a laccase gene sequence, suggesting the specificity of the primers. PCR products of the white rot fungi Ganoderma lucidum, Phlebia brevispora, and Trametes versicolor showed 65 to 74% nucleotide sequence similarity to each other; the similarity in deduced amino acid sequences was 83 to 91%. The PCR products of Lentinula edodes and Lentinus tigrinus, on the other hand, showed relatively low nucleotide and amino acid similarities (58 to 64 and 62 to 81%, respectively); however, these similarities were still much higher than when compared with the corresponding regions in the laccases of the ascomycete fungi Aspergillus nidulans and Neurospora crassa. A few of the white rot fungi, as well as Gloeophyllum trabeum, a brown rot fungus, gave a 144-bp PCR fragment which had a nucleotide sequence similarity of 60 to 71%. Demonstration of laccase activity in G. trabeum and several other brown rot fungi was of particular interest because these organisms were not previously shown to produce laccases. 36 refs., 6 figs., 2 tabs.

  11. Resequencing of the common marmoset genome improves genome assemblies and gene-coding sequence analysis. (United States)

    Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi


    The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.

  12. Phasic-like stimulation of the medial forebrain bundle augments striatal gene expression despite methamphetamine-induced partial dopamine denervation. (United States)

    Howard, Christopher D; Pastuzyn, Elissa D; Barker-Haliski, Melissa L; Garris, Paul A; Keefe, Kristen A


    Methamphetamine-induced partial dopamine depletions are associated with impaired basal ganglia function, including decreased preprotachykinin mRNA expression and impaired transcriptional activation of activity-regulated, cytoskeleton-associated (Arc) gene in striatum. Recent work implicates deficits in phasic dopamine signaling as a potential mechanism linking methamphetamine-induced dopamine loss to impaired basal ganglia function. This study thus sought to establish a causal link between phasic dopamine transmission and altered basal ganglia function by determining whether the deficits in striatal neuron gene expression could be restored by increasing phasic dopamine release. Three weeks after pretreatment with saline or a neurotoxic regimen of methamphetamine, rats underwent phasic- or tonic-like stimulation of ascending dopamine neurons. Striatal gene expression was examined using in situ hybridization histochemistry. Phasic-like, but not tonic-like, stimulation induced immediate-early genes Arc and zif268 in both groups, despite the partial striatal dopamine denervation in methamphetamine-pretreated rats, with the Arc expression occurring in presumed striatonigral efferent neurons. Phasic-like stimulation also restored preprotachykinin mRNA expression. These results suggest that disruption of phasic dopamine signaling likely underlies methamphetamine-induced impairments in basal ganglia function, and that restoring phasic dopamine signaling may be a viable approach to manage long-term consequences of methamphetamine-induced dopamine loss on basal ganglia functions.

  13. Preliminary study on mitochondrial 16S rRNA gene sequences and phylogeny of flatfishes (Pleuronectiformes)

    Institute of Scientific and Technical Information of China (English)


    A 605 bp section of mitochondrial 16S rRNA gene from Paralichthys olivaceus, Pseudorhombus cinnamomeus, Psetta maxima and Kareius bicoloratus, which represent 3 families of Order Pleuronectiformes was amplified by PCR and sequenced to show the molecular systematics of Pleuronectiformes for comparison with related gene sequences of other 6 flatfish downloaded from GenBank. Phylogenetic analysis based on genetic distance from related gene sequences of 10 flatfish showed that this method was ideal to explore the relationship between species, genera and families. Phylogenetic trees set-up is based on neighbor-joining, maximum parsimony and maximum likelihood methods that accords to the general rule of Pleuronectiformes evolution. But they also resulted in some confusion. Unlike data from morphological characters, P. olivaceus clustered with K.bicoloratus, but P. cinnamomeus did not cluster with P. olivaceus, which is worth further studying.

  14. Cloning and sequencing of the beta-glucosidase gene from Acetobacter xylinum ATCC 23769. (United States)

    Tajima, K; Nakajima, K; Yamashita, H; Shiba, T; Munekata, M; Takai, M


    The beta-glucosidase gene (bglxA) was cloned from the genomic DNA of Acetobacter xylinum ATCC 23769 and its nucleotide sequence (2200 bp) was determined. This bglxA gene was present downstream of the cellulose synthase operon and coded for a polypeptide of molecular mass 79 kDa. The overexpression of the beta-glucosidase in A. xylinum caused a tenfold increase in activity compared to the wild-type strain. In addition, the action pattern of the enzyme was identified as G3ase activity. The deduced amino acid sequence of the bglxA gene showed 72.3%, 49.6%, and 45.1% identity with the beta-glucosidases from A. xylinum subsp. sucrofermentans, Cellvibrio gilvus, and Mycobacterium tuberculosis, respectively. Based on amino acid sequence similarities, the beta-glucosidase (BglxA) was assigned to family 3 of the glycosyl hydrolases.

  15. Shine-Dalgarno sequence of bacteriophage T4: GAGG prevails in early genes. (United States)

    Malys, Naglis


    Translation initiation is governed by a limited number of mRNA sequence motifs within the translation initiation region (TIR). In bacteria and bacteriophages, one of the most important determinants is a Shine-Dalgarno (SD) sequence that base pairs with the anti-SD sequence GAUCACCUCCUUA localized in the 3' end of 16S rRNA. This work assesses a diversity of TIR features in phage T4, focusing on the SD sequence, its spacing to the start codon and relationship to gene expression and essentiality patterns. Analysis shows that GAGG is predominant of all core SD motifs in T4 and its related phages, particularly in early genes. Possible implication of the RegB activity is discussed.

  16. A Primer for Disease Gene Prioritization Using Next-Generation Sequencing Data

    Directory of Open Access Journals (Sweden)

    Shuoguo Wang


    Full Text Available High-throughput next-generation sequencing (NGS technology produces a tremendous amount of raw sequence data. The challenges for researchers are to process the raw data, to map the sequences to genome, to discover variants that are different from the reference genome, and to prioritize/rank the variants for the question of interest. The recent development of many computational algorithms and programs has vastly improved the ability to translate sequence data into valuable information for disease gene identification. However, the NGS data analysis is complex and could be overwhelming for researchers who are not familiar with the process. Here, we outline the analysis pipeline and describe some of the most commonly used principles and tools for analyzing NGS data for disease gene identification.

  17. Natural variation in CBF gene sequence, gene expression and freezing tolerance in the Versailles core collection of Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Brunel Dominique


    Full Text Available Abstract Background Plants from temperate regions are able to withstand freezing temperatures due to a process known as cold acclimation, which is a prior exposure to low, but non-freezing temperatures. During acclimation, a large number of genes are induced, bringing about biochemical changes in the plant, thought to be responsible for the subsequent increase in freezing tolerance. Key regulatory proteins in this process are the CBF1, 2 and 3 transcription factors which control the expression of a set of target genes referred to as the "CBF regulon". Results To assess the role of the CBF genes in cold acclimation and freezing tolerance of Arabidopsis thaliana, the CBF genes and their promoters were sequenced in the Versailles core collection, a set of 48 accessions that maximizes the naturally-occurring genetic diversity, as well as in the commonly used accessions Col-0 and WS. Extensive polymorphism was found in all three genes. Freezing tolerance was measured in all accessions to assess the variability in acclimated freezing tolerance. The effect of sequence polymorphism was investigated by evaluating the kinetics of CBF gene expression, as well as that of a subset of the target COR genes, in a set of eight accessions with contrasting freezing tolerance. Our data indicate that CBF genes as well as the selected COR genes are cold induced in all accessions, irrespective of their freezing tolerance. Although we observed different levels of expression in different accessions, CBF or COR gene expression was not closely correlated with freezing tolerance. Conclusion Our results indicate that the Versailles core collection contains significant natural variation with respect to freezing tolerance, polymorphism in the CBF genes and CBF and COR gene expression. Although there tends to be more CBF and COR gene expression in tolerant accessions, there are exceptions, reinforcing the idea that a complex network of genes is involved in freezing tolerance

  18. Comparative sequence analysis of cis elements present in Glycine max L. leghemoglobin lba and lbc3 genes

    DEFF Research Database (Denmark)

    She, Q; Sandal, N N; Stougaard, J


    The soybean leghemoglobin lba gene promoter sequence was determined and aligned with the promoter sequence of the soybean lbc3 gene from the same gene family. Five highly conserved regions were found. There are two large conserved regions, one of which overlaps the basic promoter while the other ...

  19. Putative and unique gene sequence utilization for the design of species specific probes as modeled by Lactobacillus plantarum (United States)

    The concept of utilizing putative and unique gene sequences for the design of species specific probes was tested. The abundance profile of assigned functions within the Lactobacillus plantarum genome was used for the identification of the putative and unique gene sequence, csh. The targeted gene (cs...

  20. Sequencing and comparative analysis of fugu protocadherin clusters reveal diversity of protocadherin genes among teleosts

    Directory of Open Access Journals (Sweden)

    Rajasegaran Vikneswari


    Full Text Available Abstract Background The synaptic cell adhesion molecules, protocadherins, are a vertebrate innovation that accompanied the emergence of the neural tube and the elaborate central nervous system. In mammals, the protocadherins are encoded by three closely-linked clusters (α, β and γ of tandem genes and are hypothesized to provide a molecular code for specifying the remarkably-diverse neural connections in the central nervous system. Like mammals, the coelacanth, a lobe-finned fish, contains a single protocadherin locus, also arranged into α, β and γ clusters. Zebrafish, however, possesses two protocadherin loci that contain more than twice the number of genes as the coelacanth, but arranged only into α and γ clusters. To gain further insight into the evolutionary history of protocadherin clusters, we have sequenced and analyzed protocadherin clusters from the compact genome of the pufferfish, Fugu rubripes. Results Fugu contains two unlinked protocadherin loci, Pcdh1 and Pcdh2, that collectively consist of at least 77 genes. The fugu Pcdh1 locus has been subject to extensive degeneration, resulting in the complete loss of Pcdh1γ cluster. The fugu Pcdh genes have undergone lineage-specific regional gene conversion processes that have resulted in a remarkable regional sequence homogenization among paralogs in the same subcluster. Phylogenetic analyses show that most protocadherin genes are orthologous between fugu and zebrafish either individually or as paralog groups. Based on the inferred phylogenetic relationships of fugu and zebrafish genes, we have reconstructed the evolutionary history of protocadherin clusters in the teleost fish lineage. Conclusion Our results demonstrate the exceptional evolutionary dynamism of protocadherin genes in vertebrates in general, and in teleost fishes in particular. Besides the 'fish-specific' whole genome duplication, the evolution of protocadherin genes in teleost fishes is influenced by lineage

  1. Comparison of the aflR gene sequences of strains in Aspergillus section Flavi. (United States)

    Lee, Chao-Zong; Liou, Guey-Yuh; Yuan, Gwo-Fang


    Aflatoxins are polyketide-derived secondary metabolites produced by Aspergillus parasiticus, Aspergillus flavus, Aspergillus nomius and a few other species. The toxic effects of aflatoxins have adverse consequences for human health and agricultural economics. The aflR gene, a regulatory gene for aflatoxin biosynthesis, encodes a protein containing a zinc-finger DNA-binding motif. Although Aspergillus oryzae and Aspergillus sojae, which are used in fermented foods and in ingredient manufacture, have no record of producing aflatoxin, they have been shown to possess an aflR gene. This study examined 34 strains of Aspergillus section Flavi. The aflR gene of 23 of these strains was successfully amplified and sequenced. No aflR PCR products were found in five A. sojae strains or six strains of A. oryzae. These PCR results suggested that the aflR gene is absent or significantly different in some A. sojae and A. oryzae strains. The sequenced aflR genes from the 23 positive strains had greater than 96.6 % similarity, which was particularly conserved in the zinc-finger DNA-binding domain. The aflR gene of A. sojae has two obvious characteristics: an extra CTCATG sequence fragment and a C to T transition that causes premature termination of AFLR protein synthesis. Differences between A. parasiticus/A. sojae and A. flavus/A. oryzae aflR genes were also identified. Some strains of A. flavus as well as A. flavus var. viridis, A. oryzae var. viridis and A. oryzae var. effuses have an A. oryzae-type aflR gene. For all strains with the A. oryzae-type aflR gene, there was no evidence of aflatoxin production. It is suggested that for safety reasons, the aflR gene could be examined to assess possible aflatoxin production by Aspergillus section Flavi strains.

  2. p21WAF1/CIP1 gene DNA sequencing and its expression in human osteosarcoma

    Institute of Scientific and Technical Information of China (English)

    廖威明; 张春林; 李佛保; 曾炳芳; 曾益新


    Background Mutation and expression change of p21WAF1/CIP1 may play a role in the growth of osteosarcoma. This study was to investigate the expression of the p21WAF1/CIP1 gene in human osteosarcoma, p21WAF1/CIP1 gene DNA sequence change and their relationships with the phenotype and clinical prognosis.Methods p21WAF1/CIP1 gene in 10 normal people and the tumours of 45 osteosarcoma patients were examined using polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) with silver staining. The PCR product with an abnormal strand was sequenced directly. The p21WAF1/CIP1 gene mRNA and P21 protein of 45 cases of osteosarcoma were investigated by using in situ hybridization and immunohistochemistry, respectively. Results The occurrence of P21 protein in osteosarcoma was 17.78% (8/45), and p21WAF1/CIP1 mRNA expression in osteosarcoma was 42.22% (19/45). The p21WAF1/CIP1 gene DNA sequencing of amplified production showed that in p21WAF1/CIP1 gene exon 3 of 36 cases of human osteosarcoma, there were 17 cases (47.22%) with C→T at position 609; 10 normal blood samp