genetic sequences encoding: Topics by WorldWideScience.org

Sample records for genetic sequences encoding

Towards predicting the encoding capability of MR fingerprinting sequences.

Science.gov (United States)

Sommer, K; Amthor, T; Doneva, M; Koken, P; Meineke, J; Börnert, P

2017-09-01

Sequence optimization and appropriate sequence selection remain an unmet need in magnetic resonance fingerprinting (MRF). The main challenge in MRF sequence design is the lack of an appropriate measure of the sequence's encoding capability. To find such a measure, three different candidates for judging the encoding capability have been investigated: local and global dot-product-based measures judging dictionary entry similarity as well as a Monte Carlo method that evaluates the noise propagation properties of an MRF sequence. Consistency of these measures for different sequence lengths as well as the capability to predict actual sequence performance in both phantom and in vivo measurements was analyzed. While the dot-product-based measures yielded inconsistent results for different sequence lengths, the Monte Carlo method was in good agreement with phantom experiments. In particular, the Monte Carlo method could accurately predict the performance of different flip angle patterns in actual measurements. The proposed Monte Carlo method provides an appropriate measure of MRF sequence encoding capability and may be used for sequence optimization. Copyright © 2017 Elsevier Inc. All rights reserved.
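The two kinds of measure named in this abstract can be illustrated with a toy sketch. It is not the authors' implementation: the "dictionary" below is built from simple exponential decays rather than a Bloch simulation, and the names `make_dictionary`, `flip_weights`, `max_neighbor_similarity` and `monte_carlo_rmse` are hypothetical. The sketch only shows the general idea of comparing a dot-product similarity score with a Monte Carlo estimate of how noise propagates into the matched parameter.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit Euclidean norm."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]


def make_dictionary(t1_values, flip_weights):
    """Toy dictionary: one normalized signal per T1 value.

    Assumption: an exponential decay sampled every 10 ms and weighted by
    `flip_weights` stands in for a real MRF signal evolution.
    """
    return [
        normalize([w * math.exp(-(i + 1) * 10.0 / t1)
                   for i, w in enumerate(flip_weights)])
        for t1 in t1_values
    ]


def match(signal, dictionary):
    """Return the index of the entry with the largest dot product."""
    scores = [sum(s * d for s, d in zip(signal, entry)) for entry in dictionary]
    return max(range(len(dictionary)), key=scores.__getitem__)


def max_neighbor_similarity(dictionary):
    """Local dot-product measure: largest similarity of adjacent entries."""
    return max(
        sum(a * b for a, b in zip(d1, d2))
        for d1, d2 in zip(dictionary, dictionary[1:])
    )


def monte_carlo_rmse(t1_values, dictionary, sigma, trials, seed=0):
    """Add Gaussian noise to random entries, re-match, and report T1 RMSE."""
    rng = random.Random(seed)
    sq_err = 0.0
    for _ in range(trials):
        k = rng.randrange(len(dictionary))
        noisy = [x + rng.gauss(0.0, sigma) for x in dictionary[k]]
        est = match(noisy, dictionary)
        sq_err += (t1_values[est] - t1_values[k]) ** 2
    return math.sqrt(sq_err / trials)


if __name__ == "__main__":
    t1s = [200.0, 400.0, 600.0, 800.0, 1000.0]
    dictionary = make_dictionary(t1s, [1.0] * 20)
    print("max neighbour similarity:", max_neighbor_similarity(dictionary))
    print("RMSE at sigma=0.05:", monte_carlo_rmse(t1s, dictionary, 0.05, 500))
```

In this toy setting a lower RMSE at a fixed noise level would indicate a weighting pattern that separates the dictionary entries better; whether that carries over to real sequences is exactly what the study tests with phantom and in vivo data.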
Sequence-to-Sequence Prediction of Vehicle Trajectory via LSTM Encoder-Decoder Architecture

OpenAIRE

Park, Seong Hyeon; Kim, ByeongDo; Kang, Chang Mook; Chung, Chung Choo; Choi, Jun Won

2018-01-01

In this paper, we propose a deep learning based vehicle trajectory prediction technique which can generate the future trajectory sequence of surrounding vehicles in real time. We employ the encoder-decoder architecture which analyzes the pattern underlying in the past trajectory using the long short-term memory (LSTM) based encoder and generates the future trajectory sequence using the LSTM based decoder. This structure produces the $K$ most likely trajectory candidates over occupancy grid ma...
Cyclic Concatenated Genetic Encoder: A mathematical proposal for biological inferences.

Science.gov (United States)

Duarte-González, M E; Echeverri, O Y; Guevara, J M; Palazzo, R

2018-01-01

The organization of the genetic information and its ability to be conserved and translated to proteins with low error rates have been the subject of study by scientists from different disciplines. Recently, it has been proposed that living organisms display an intra-cellular transmission system of genetic information, similar to a model of digital communication system, in which there is the ability to detect and correct errors. In this work, the concept of Concatenated Genetic Encoder is introduced and applied to the analysis of protein sequences as a tool for exploring evolutionary relationships. For such purposes Error Correcting Codes (ECCs) are used to represent proteins. A methodology for representing or identifying proteins by use of BCH codes over ℤ₂₀ and F₄×ℤ₅ is proposed and cytochrome b6-f complex subunit 6-OS sequences, corresponding to different plant species, are analyzed according to the proposed methodology and results are contrasted to phylogenetic and taxonomic analyses. Through the analyses, it was observed that using BCH codes only some sequences are identified, all of which differ in one amino acid from the original sequence. In addition, mathematical relationships among identified sequences are established by considering minimal polynomials, where such sequences showed a close relationship as revealed in the phylogenetic reconstruction. The results shown here indicate that communication theory may provide biology with interesting and useful tools to identify biological relationships among proteins; however, the proposed methodology needs to be improved and rigorously tested to become an applicable tool for biological analysis. Copyright © 2017 Elsevier B.V. All rights reserved.
Sequence variation in the alpha-toxin encoding plc gene of Clostridium perfringens strains isolated from diseased and healthy chickens

DEFF Research Database (Denmark)

Abildgaard, L; Engberg, RM; Pedersen, Karl

2009-01-01

The aim of the present study was to analyse the genetic diversity of the alpha-toxin encoding plc gene and the variation in α-toxin production of Clostridium perfringens type A strains isolated from presumably healthy chickens and chickens suffering from either necrotic enteritis (NE) or cholangio-hepatitis. The α-toxin encoding plc genes from 60 different pulsed-field gel electrophoresis (PFGE) types (strains) of C. perfringens were sequenced and translated in silico to amino acid sequences and the α-toxin production was investigated in batch cultures of 45 of the strains using an enzyme...
Toward a Better Compression for DNA Sequences Using Huffman Encoding.

Science.gov (United States)

Al-Okaily, Anas; Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi

2017-04-01

Due to the significant amount of DNA data being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data so that they take less space and can be transmitted faster. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (Al-Okaily, 2016).
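For orientation, the baseline that this abstract compares against, the standard Huffman tree algorithm, can be sketched as follows. This is a minimal illustration, not the authors' software: the function names are hypothetical, and the `kmers` helper is only an assumed stand-in for the idea of treating longer repeated strings as single symbols, which makes the symbol frequencies more skewed.

```python
import heapq
from collections import Counter


def huffman_code(symbols):
    """Build a standard Huffman code; returns {symbol: bitstring}."""
    freq = Counter(symbols)
    if len(freq) == 1:
        # Degenerate case: a single distinct symbol still needs one bit.
        return {next(iter(freq)): "0"}
    # Heap entries are (weight, tie_breaker, partial_code_table); the unique
    # tie_breaker ensures the dictionaries are never compared.
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    counter = len(heap)
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (w1 + w2, counter, merged))
        counter += 1
    return heap[0][2]


def encode(symbols, code):
    """Concatenate the codeword of every symbol."""
    return "".join(code[s] for s in symbols)


def decode(bits, code):
    """Decode a bitstring produced by `encode` (prefix-free, so greedy works)."""
    inverse = {v: k for k, v in code.items()}
    out, buf = [], ""
    for b in bits:
        buf += b
        if buf in inverse:
            out.append(inverse[buf])
            buf = ""
    return out


def kmers(seq, k):
    """Split a sequence into non-overlapping tokens of length k."""
    return [seq[i:i + k] for i in range(0, len(seq), k)]


if __name__ == "__main__":
    seq = "AAAAAAAACCGT"
    code = huffman_code(seq)
    bits = encode(seq, code)
    print(code, len(bits), "bits vs", 2 * len(seq), "bits at 2 bits/base")
```

A fixed 2-bit-per-base encoding is itself a prefix code, so a Huffman code over the four nucleotides never uses more bits than that and uses fewer when the base composition is skewed.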
Nucleotide sequences of two genomic DNAs encoding peroxidase of Arabidopsis thaliana.

Science.gov (United States)

Intapruk, C; Higashimura, N; Yamamoto, K; Okada, N; Shinmyo, A; Takano, M

1991-02-15

The peroxidase (EC 1.11.1.7)-encoding gene of Arabidopsis thaliana was screened from a genomic library using a cDNA encoding a neutral isozyme of horseradish, Armoracia rusticana, peroxidase (HRP) as a probe, and two positive clones were isolated. From the comparison with the sequences of the HRP-encoding genes, we concluded that two clones contained peroxidase-encoding genes, and they were named prxCa and prxEa. Both genes consisted of four exons and three introns; the introns had consensus nucleotides, GT and AG, at the 5' and 3' ends, respectively. The lengths of each putative exon of the prxEa gene were the same as those of the HRP-basic-isozyme-encoding gene, prxC3, and coded for 349 amino acids (aa) with a sequence homology of 89% to that encoded by prxC3. The prxCa gene was very close to the HRP-neutral-isozyme-encoding gene, prxC1b, and coded for 354 aa with 91% homology to that encoded by prxC1b. The aa sequence homology was 64% between the two peroxidases encoded by prxCa and prxEa.
Nonlinear inversion of potential-field data using a hybrid-encoding genetic algorithm

Science.gov (United States)

Chen, C.; Xia, J.; Liu, J.; Feng, G.

2006-01-01

Using a genetic algorithm to solve an inverse problem of complex nonlinear geophysical equations is advantageous because it does not require computing gradients of models or "good" initial models. The multi-point search of a genetic algorithm makes it easier to find the globally optimal solution while avoiding falling into a local extremum. As is the case in other optimization approaches, the search efficiency for a genetic algorithm is vital in finding desired solutions successfully in a multi-dimensional model space. A binary-encoding genetic algorithm is hardly ever used to resolve an optimization problem such as a simple geophysical inversion with only three unknowns. The encoding mechanism, genetic operators, and population size of the genetic algorithm greatly affect search processes in the evolution. It is clear that improved operators and proper population size promote the convergence. Nevertheless, not all genetic operations perform perfectly while searching under either a uniform binary or a decimal encoding system. With the binary encoding mechanism, the crossover scheme may produce more new individuals than with the decimal encoding. On the other hand, the mutation scheme in a decimal encoding system will create new genes larger in scope than those in the binary encoding. This paper discusses approaches of exploiting the search potential of genetic operations in the two encoding systems and presents an approach with a hybrid-encoding mechanism, multi-point crossover, and dynamic population size for geophysical inversion. We present a method that is based on the routine in which the mutation operation is conducted in the decimal code and multi-point crossover operation in the binary code. The mix-encoding algorithm is called the hybrid-encoding genetic algorithm (HEGA). HEGA provides better genes with a higher probability by a mutation operator and improves genetic algorithms in resolving complicated geophysical inverse problems. Another significant
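The core idea stated in this abstract, mutation carried out on the decimal (real-valued) code and multi-point crossover on the binary code, can be sketched roughly as below. This is an illustrative toy under stated assumptions and not the published HEGA: the 16-bit quantization, the uniform-redraw mutation, the elitist selection, the fixed population size (the paper uses a dynamic one), and all function names are hypothetical choices made for the example.

```python
import random

NBITS = 16  # assumed resolution of the binary code per parameter


def to_bits(x, lo, hi):
    """Quantize a real value in [lo, hi] to a list of NBITS zeros and ones."""
    q = round((x - lo) / (hi - lo) * (2 ** NBITS - 1))
    return [(q >> i) & 1 for i in reversed(range(NBITS))]


def from_bits(bits, lo, hi):
    """Map a bit list back to a real value in [lo, hi]."""
    q = int("".join(map(str, bits)), 2)
    return lo + q / (2 ** NBITS - 1) * (hi - lo)


def multipoint_crossover(a, b, n_points, rng):
    """Swap alternating segments of two equal-length bit lists."""
    cuts = sorted(rng.sample(range(1, len(a)), n_points))
    child1, child2, swap, prev = [], [], False, 0
    for cut in cuts + [len(a)]:
        src1, src2 = (b, a) if swap else (a, b)
        child1 += src1[prev:cut]
        child2 += src2[prev:cut]
        swap, prev = not swap, cut
    return child1, child2


def mutate_decimal(x, lo, hi, rate, rng):
    """Decimal-code mutation: redraw the value in [lo, hi] with probability rate."""
    return rng.uniform(lo, hi) if rng.random() < rate else x


def hybrid_ga(misfit, bounds, pop_size=30, generations=50, seed=0):
    """Minimize `misfit` over box `bounds` with binary crossover, decimal mutation."""
    rng = random.Random(seed)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    best = min(pop, key=misfit)
    for _ in range(generations):
        pop.sort(key=misfit)
        parents = pop[: pop_size // 2]
        children = []
        while len(children) < pop_size - 1:
            p1, p2 = rng.sample(parents, 2)
            c1, c2 = [], []
            for (lo, hi), g1, g2 in zip(bounds, p1, p2):
                # Crossover acts on the binary code of each parameter.
                b1, b2 = multipoint_crossover(
                    to_bits(g1, lo, hi), to_bits(g2, lo, hi), 2, rng)
                c1.append(from_bits(b1, lo, hi))
                c2.append(from_bits(b2, lo, hi))
            for child in (c1, c2):
                # Mutation acts on the decoded real values.
                children.append([mutate_decimal(g, lo, hi, 0.1, rng)
                                 for g, (lo, hi) in zip(child, bounds)])
        pop = [best] + children[: pop_size - 1]  # keep the best model so far
        best = min(pop, key=misfit)
    return best


if __name__ == "__main__":
    target = (1.0, -2.0, 0.5)  # toy three-unknown "inversion"
    f = lambda m: sum((a - b) ** 2 for a, b in zip(m, target))
    print(hybrid_ga(f, [(-5.0, 5.0)] * 3))
```

Because the best model is carried over unchanged each generation, the misfit of the returned model cannot be worse than that of the best model in the initial population.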
Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

Science.gov (United States)

Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

2002-07-01

Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not present on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Extraordinarily Adaptive Properties of the Genetically Encoded Amino Acids

Science.gov (United States)

Ilardo, Melissa; Meringer, Markus; Freeland, Stephen; Rasulev, Bakhtiyor; Cleaves II, H. James

2015-01-01

Using novel advances in computational chemistry, we demonstrate that the set of 20 genetically encoded amino acids, used nearly universally to construct all coded terrestrial proteins, has been highly influenced by natural selection. We defined an adaptive set of amino acids as one whose members thoroughly cover relevant physico-chemical properties, or “chemistry space.” Using this metric, we compared the encoded amino acid alphabet to random sets of amino acids. These random sets were drawn from a computationally generated compound library containing 1913 alternative amino acids that lie within the molecular weight range of the encoded amino acids. Sets that cover chemistry space better than the genetically encoded alphabet are extremely rare and energetically costly. Further analysis of more adaptive sets reveals common features and anomalies, and we explore their implications for synthetic biology. We present these computations as evidence that the set of 20 amino acids found within the standard genetic code is the result of considerable natural selection. The amino acids used for constructing coded proteins may represent a largely global optimum, such that any aqueous biochemistry would use a very similar set. PMID:25802223
Extraordinarily adaptive properties of the genetically encoded amino acids.

Science.gov (United States)

Ilardo, Melissa; Meringer, Markus; Freeland, Stephen; Rasulev, Bakhtiyor; Cleaves, H James

2015-03-24

Using novel advances in computational chemistry, we demonstrate that the set of 20 genetically encoded amino acids, used nearly universally to construct all coded terrestrial proteins, has been highly influenced by natural selection. We defined an adaptive set of amino acids as one whose members thoroughly cover relevant physico-chemical properties, or "chemistry space." Using this metric, we compared the encoded amino acid alphabet to random sets of amino acids. These random sets were drawn from a computationally generated compound library containing 1913 alternative amino acids that lie within the molecular weight range of the encoded amino acids. Sets that cover chemistry space better than the genetically encoded alphabet are extremely rare and energetically costly. Further analysis of more adaptive sets reveals common features and anomalies, and we explore their implications for synthetic biology. We present these computations as evidence that the set of 20 amino acids found within the standard genetic code is the result of considerable natural selection. The amino acids used for constructing coded proteins may represent a largely global optimum, such that any aqueous biochemistry would use a very similar set.
Imaging dynamic redox processes with genetically encoded probes.

Science.gov (United States)

Ezeriņa, Daria; Morgan, Bruce; Dick, Tobias P

2014-08-01

Redox signalling plays an important role in many aspects of physiology, including that of the cardiovascular system. Perturbed redox regulation has been associated with numerous pathological conditions; nevertheless, the causal relationships between redox changes and pathology often remain unclear. Redox signalling involves the production of specific redox species at specific times in specific locations. However, until recently, the study of these processes has been impeded by a lack of appropriate tools and methodologies that afford the necessary redox species specificity and spatiotemporal resolution. Recently developed genetically encoded fluorescent redox probes now allow dynamic real-time measurements, of defined redox species, with subcellular compartment resolution, in intact living cells. Here we discuss the available genetically encoded redox probes in terms of their sensitivity and specificity and highlight where uncertainties or controversies currently exist. Furthermore, we outline major goals for future probe development and describe how progress in imaging methodologies will improve our ability to employ genetically encoded redox probes in a wide range of situations. This article is part of a special issue entitled "Redox Signalling in the Cardiovascular System." Copyright © 2014 Elsevier Ltd. All rights reserved.
Whole-exome sequencing reveals genetic variants associated with chronic kidney disease characterized by tubulointerstitial damages in North Central Region, Sri Lanka.

Science.gov (United States)

Nanayakkara, Shanika; Senevirathna, S T M L D; Parahitiyawa, Nipuna B; Abeysekera, Tilak; Chandrajith, Rohana; Ratnatunga, Neelakanthi; Hitomi, Toshiaki; Kobayashi, Hatasu; Harada, Kouji H; Koizumi, Akio

2015-09-01

The familial clustering observed in chronic kidney disease of uncertain etiology (CKDu) characterized by tubulointerstitial damages in the North Central Region of Sri Lanka strongly suggests the involvement of genetic factors in its pathogenesis. The objective of the present study is to use whole-exome sequencing to identify the genetic variants associated with CKDu. Whole-exome sequencing of eight CKDu cases and eight controls was performed, followed by direct sequencing of candidate loci in 301 CKDu cases and 276 controls. Association study revealed rs34970857 (c.658G > A/p.V220M) located in the KCNA10 gene encoding a voltage-gated K channel as the most promising SNP with the highest odds ratio of 1.74. Four rare variants were identified in gene encoding Laminin beta2 (LAMB2) which is known to cause congenital nephrotic syndrome. Three out of four variants in LAMB2 were novel variants found exclusively in cases. Genetic investigations provide strong evidence on the presence of genetic susceptibility for CKDu. Possibility of presence of several rare variants associated with CKDu in this population is also suggested.
A novel β-xylosidase, nucleotide sequence encoding it and use thereof.

NARCIS (Netherlands)

Graaff, de L.H.; Peij, van N.N.M.E.; Broeck, van den H.C.; Visser, J.

1996-01-01

A nucleotide sequence is provided which encodes a peptide having beta-xylosidase activity and exhibits at least 30% amino acid identity with the amino acid sequence shown in SEQ ID NO. 1 or hybridises under stringent conditions with a nucleotide sequence shown in SEQ ID NO. 1, or a part thereof having
Cloning, sequencing and expression of cDNA encoding growth ...

Indian Academy of Sciences (India)

Unknown

of medicine, animal husbandry, fish farming and animal ..... northern pike (Esox lucius) growth hormone; Mol. Mar. Biol. ... prolactin 1-luciferase fusion gene in African catfish and ... 1988 Cloning and sequencing of cDNA that encodes goat.
On the edge of language acquisition: inherent constraints on encoding multisyllabic sequences in the neonate brain.

Science.gov (United States)

Ferry, Alissa L; Fló, Ana; Brusini, Perrine; Cattarossi, Luigi; Macagno, Francesco; Nespor, Marina; Mehler, Jacques

2016-05-01

To understand language, humans must encode information from rapid, sequential streams of syllables - tracking their order and organizing them into words, phrases, and sentences. We used Near-Infrared Spectroscopy (NIRS) to determine whether human neonates are born with the capacity to track the positions of syllables in multisyllabic sequences. After familiarization with a six-syllable sequence, the neonate brain responded to the change (as shown by an increase in oxy-hemoglobin) when the two edge syllables switched positions but not when two middle syllables switched positions (Experiment 1), indicating that they encoded the syllables at the edges of sequences better than those in the middle. Moreover, when a 25 ms pause was inserted between the middle syllables as a segmentation cue, neonates' brains were sensitive to the change (Experiment 2), indicating that subtle cues in speech can signal a boundary, with enhanced encoding of the syllables located at the edges of that boundary. These findings suggest that neonates' brains can encode information from multisyllabic sequences and that this encoding is constrained. Moreover, subtle segmentation cues in a sequence of syllables provide a mechanism with which to accurately encode positional information from longer sequences. Tracking the order of syllables is necessary to understand language and our results suggest that the foundations for this encoding are present at birth. © 2015 John Wiley & Sons Ltd.
Isolation of endophytic bacteria from arboreal species of the Amazon and identification by sequencing of the 16S rRNA encoding gene

Directory of Open Access Journals (Sweden)

Mariza M. Coêlho

2011-01-01

Endophytic bacteria from three arboreal species native to the Amazon (Carapa guianensis, Ceiba pentandra, and Swietenia macrophylla) were isolated and identified through partial sequencing of the 16S rRNA encoding gene. From these, 16 isolates were obtained, although, when compared to sequences deposited in GenBank, only seven had produced identifiable fragments. Bacillus, Pantoea and two non-culturable samples were identified. Results obtained through sequence analysis revealed low genetic diversity across the isolates, even when analyzing different species and plant structures. This is the first report concerning the isolation and identification of endophytic bacteria in these plant species.
Multilocus sequence typing and rtxA toxin gene sequencing analysis of Kingella kingae isolates demonstrates genetic diversity and international clones.

Directory of Open Access Journals (Sweden)

Romain Basmaci

BACKGROUND: Kingella kingae, a normal component of the upper respiratory flora, is being increasingly recognized as an important invasive pathogen in young children. Genetic diversity of this species has not been studied. METHODS: We analyzed 103 strains from different countries and clinical origins by a new multilocus sequence-typing (MLST) schema. Putative virulence gene rtxA, encoding an RTX toxin, was also sequenced, and experimental virulence of representative strains was assessed in a juvenile-rat model. RESULTS: Thirty-six sequence-types (ST) and nine ST-complexes (STc) were detected. The main STc 6, 14 and 23 comprised 23, 17 and 20 strains respectively, and were internationally distributed. rtxA sequencing results were mostly congruent with MLST, and showed horizontal transfer events. Of interest, all members of the distantly related ST-6 (n = 22) and ST-5 (n = 4) harboured a 33 bp duplication or triplication in their rtxA sequence, suggesting that this genetic trait arose through selective advantage. The animal model revealed significant differences in virulence among strains of the species. CONCLUSION: MLST analysis reveals international spread of ST-complexes and will help to decipher acquisition and evolution of virulence traits and diversity of pathogenicity among K. kingae strains, for which an experimental animal model is now available.
Typing of Panton-Valentine Leukocidin-Encoding Phages and lukSF-PV Gene Sequence Variation in Staphylococcus aureus from China.

Science.gov (United States)

Zhao, Huanqiang; Hu, Fupin; Jin, Shu; Xu, Xiaogang; Zou, Yuhan; Ding, Baixing; He, Chunyan; Gong, Fang; Liu, Qingzhong

2016-01-01

Panton-Valentine leukocidin (PVL, encoded by lukSF-PV genes), a bi-component and pore-forming toxin, is carried by different staphylococcal bacteriophages. The prevalence of PVL in Staphylococcus aureus has been reported around the globe. However, the data on PVL-encoding phage types, lukSF-PV gene variation and chromosomal phage insertion sites for PVL-positive S. aureus are limited, especially in China. In order to obtain a more complete understanding of the molecular epidemiology of PVL-positive S. aureus, an integrated and modified PCR-based scheme was applied to detect the PVL-encoding phage types. Phage insertion locus and the lukSF-PV variant were determined by PCR and sequencing. Meanwhile, the genetic background was characterized by staphylococcal cassette chromosome mec (SCCmec) typing, staphylococcal protein A (spa) gene polymorphisms typing, pulsed-field gel electrophoresis (PFGE) typing, accessory gene regulator (agr) locus typing and multilocus sequence typing (MLST). Seventy-eight (78/1175, 6.6%) isolates possessed the lukSF-PV genes and 59.0% (46/78) of PVL-positive strains belonged to CC59 lineage. Eight known different PVL-encoding phage types were detected, and Φ7247PVL/ΦST5967PVL (n = 13) and ΦPVL (n = 12) were the most prevalent among them. However, 25 (25/78, 32.1%) isolates, belonging to ST30 and ST59 clones, could not be typed by the modified PCR-based scheme. Single nucleotide polymorphisms (SNPs) were identified at five locations in the lukSF-PV genes, two of which were non-synonymous. Maximum-likelihood tree analysis of attachment site sequences detected six SNP profiles for attR and eight for attL, respectively. In conclusion, the PVL-positive S. aureus mainly harbored Φ7247PVL/ΦST5967PVL and ΦPVL in the regions studied. lukSF-PV gene sequences, PVL-encoding phages, and phage insertion locus generally varied with lineages. Moreover, PVL-positive clones that have emerged worldwide likely carry distinct phages.
Typing of Panton-Valentine Leukocidin-encoding Phages and lukSF-PV Gene Sequence Variation in Staphylococcus aureus from China

Directory of Open Access Journals (Sweden)

Huanqiang Zhao

2016-08-01

Panton-Valentine leucocidin (PVL, encoded by lukSF-PV genes), a bi-component and pore-forming toxin, is carried by different staphylococcal bacteriophages. The prevalence of PVL in Staphylococcus aureus (S. aureus) has been reported around the globe. However, the data on PVL-encoding phage types, lukSF-PV gene variation and chromosomal phage insertion sites for PVL-positive S. aureus are limited, especially in China. In order to obtain a more complete understanding of the molecular epidemiology of PVL-positive S. aureus, an integrated and modified PCR-based scheme was applied to detect the PVL-encoding phage types. Phage insertion locus and the lukSF-PV variant were determined by PCR and sequencing. Meanwhile, the genetic background was characterized by staphylococcal cassette chromosome mec (SCCmec) typing, staphylococcal protein A (spa) gene polymorphisms typing, pulsed-field gel electrophoresis (PFGE) typing, accessory gene regulator (agr) locus typing and multilocus sequence typing (MLST). Seventy-eight (78/1175, 6.6%) isolates possessed the lukSF-PV genes and 59.0% (46/78) of PVL-positive strains belonged to CC59 lineage. Eight known different PVL-encoding phage types were detected, and Φ7247PVL/ΦST5967PVL (n=13) and ΦPVL (n=12) were the most prevalent among them. However, 25 (25/78, 32.1%) isolates, belonging to ST30 and ST59 clones, could not be typed by the modified PCR-based scheme. Single nucleotide polymorphisms (SNPs) were identified at five locations in the lukSF-PV genes, two of which were non-synonymous. Maximum-likelihood tree analysis of attachment site sequences detected six SNP profiles for attR and eight for attL, respectively. In conclusion, the PVL-positive S. aureus mainly harbored Φ7247PVL/ΦST5967PVL and ΦPVL in the regions studied. lukSF-PV gene sequences, PVL-encoding phages and phage insertion locus generally varied with lineages. Moreover, PVL-positive clones that have emerged worldwide likely carry distinct phages.
Polymeric peptide pigments with sequence-encoded properties

Energy Technology Data Exchange (ETDEWEB)

Lampel, Ayala; McPhee, Scott A.; Park, Hang-Ah; Scott, Gary G.; Humagain, Sunita; Hekstra, Doeke R.; Yoo, Barney; Frederix, Pim W. J. M.; Li, Tai-De; Abzalimov, Rinat R.; Greenbaum, Steven G.; Tuttle, Tell; Hu, Chunhua; Bettinger, Christopher J.; Ulijn, Rein V.

2017-06-08

Melanins are a family of heterogeneous polymeric pigments that provide ultraviolet (UV) light protection, structural support, coloration, and free radical scavenging. Formed by oxidative oligomerization of catecholic small molecules, the physical properties of melanins are influenced by covalent and noncovalent disorder. We report the use of tyrosine-containing tripeptides as tunable precursors for polymeric pigments. In these structures, phenols are presented in a (supra-)molecular context dictated by the positions of the amino acids in the peptide sequence. Oxidative polymerization can be tuned in a sequence-dependent manner, resulting in peptide sequence–encoded properties such as UV absorbance, morphology, coloration, and electrochemical properties over a considerable range. Short peptides have low barriers to application and can be easily scaled, suggesting near-term applications in cosmetics and biomedicine.

Genetic analysis of the VP2-encoding gene of canine parvovirus strains from Africa.

Science.gov (United States)

Dogonyaro, Banenat B; Bosman, Anna-Mari; Sibeko, Kgomotso P; Venter, Estelle H; van Vuuren, Moritz

2013-08-30

Since the emergence of canine parvovirus type-2 (CPV-2) in the early 1970s, it has been evolving into novel genetic and antigenic variants (CPV-2a, 2b and 2c) that are unevenly distributed throughout the world. Genetic characterization of CPV-2 has not been documented in Africa since 1998, apart from a study carried out in Tunisia in 2009. A total of 139 field samples were collected from South Africa and Nigeria and screened by PCR, and the full-length VP2-encoding gene of 27 positive samples was sequenced and genetically analyzed. Samples from Nigeria (n=6) and South Africa (n=19) and vaccine strains (n=2) were compared with existing sequences obtained from GenBank. The results showed the presence of both CPV-2a and 2b in South Africa and only CPV-2a in Nigeria. No CPV-2c strain was detected during this study. Phylogenetic analysis showed a clustering not strictly associated with the geographical origin of the analyzed strains, although most of the South African strains tended to cluster together and the viral strains analyzed in this study were not completely distinct from CPV-2 strains from other parts of the world. Amino acid analysis showed predicted amino acid changes. Copyright © 2013 Elsevier B.V. All rights reserved.
Transfection of genetically encoded photoswitchable probes for STORM imaging.

Science.gov (United States)

Bates, Mark; Jones, Sara A; Zhuang, Xiaowei

2013-06-01

Conventional fluorescence microscopy is limited by its spatial resolution, leaving many biological structures too small to be studied in detail. Stochastic optical reconstruction microscopy (STORM) is a method for superresolution fluorescence imaging based on the high accuracy localization of individual fluorophores. It uses optically switchable fluorophores: molecules that can be switched between a nonfluorescent and a fluorescent state by exposure to light. This protocol describes the transfection of genetically encoded photoswitchable probes for STORM imaging. It includes a discussion of how to choose a photoswitchable fluorescent protein; standard molecular biology techniques should be used to generate a plasmid containing the sequence of the photoswitchable protein linked to the gene of interest. Once the plasmid has been generated and has been verified, it can be introduced into cells via any standard means of gene delivery, such as lipofection or electroporation. Optimal conditions will vary considerably for different cell lines and plasmids. Here, we present an example protocol for the transfection of BS-C-1 cells with an mEos2-vimentin plasmid using the lipid-based reagent FuGENE6.
Forward Genetics by Sequencing EMS Variation-Induced Inbred Lines

Directory of Open Access Journals (Sweden)

Charles Addo-Quaye

2017-02-01

In order to leverage novel sequencing techniques for cloning genes in eukaryotic organisms with complex genomes, the false positive rate of variant discovery must be controlled for by experimental design and informatics. We sequenced five lines from three pedigrees of ethyl methanesulfonate (EMS)-mutagenized Sorghum bicolor, including a pedigree segregating a recessive dwarf mutant. Comparing the sequences of the lines, we were able to identify and eliminate error-prone positions. One genomic region contained EMS mutant alleles in dwarfs that were homozygous reference sequences in wild-type siblings and heterozygous in segregating families. This region contained a single nonsynonymous change that cosegregated with dwarfism in a validation population and caused a premature stop codon in the Sorghum ortholog encoding the gibberellic acid (GA) biosynthetic enzyme ent-kaurene oxidase. Application of exogenous GA rescued the mutant phenotype. Our method for mapping did not require outcrossing and introduced no segregation variance. This enables work when line crossing is complicated by life history, permitting gene discovery outside of genetic models. This inverts the historical approach of first using recombination to define a locus and then sequencing genes. Our formally identical approach first sequences all the genes and then seeks cosegregation with the trait. Mutagenized lines lacking obvious phenotypic alterations are available for an extension of this approach: mapping with a known marker set in a line that is phenotypically identical to starting material for EMS mutant generation.
Dynamic encoding of speech sequence probability in human temporal cortex.

Science.gov (United States)

Leonard, Matthew K; Bouchard, Kristofer E; Tang, Claire; Chang, Edward F

2015-05-06

Sensory processing involves identification of stimulus features, but also integration with the surrounding sensory and cognitive context. Previous work in animals and humans has shown fine-scale sensitivity to context in the form of learned knowledge about the statistics of the sensory environment, including relative probabilities of discrete units in a stream of sequential auditory input. These statistics are a defining characteristic of one of the most important sequential signals humans encounter: speech. For speech, extensive exposure to a language tunes listeners to the statistics of sound sequences. To address how speech sequence statistics are neurally encoded, we used high-resolution direct cortical recordings from human lateral superior temporal cortex as subjects listened to words and nonwords with varying transition probabilities between sound segments. In addition to their sensitivity to acoustic features (including contextual features, such as coarticulation), we found that neural responses dynamically encoded the language-level probability of both preceding and upcoming speech sounds. Transition probability first negatively modulated neural responses, followed by positive modulation of neural responses, consistent with coordinated predictive and retrospective recognition processes, respectively. Furthermore, transition probability encoding was different for real English words compared with nonwords, providing evidence for online interactions with high-order linguistic knowledge. These results demonstrate that sensory processing of deeply learned stimuli involves integrating physical stimulus features with their contextual sequential structure. Despite not being consciously aware of phoneme sequence statistics, listeners use this information to process spoken input and to link low-level acoustic representations with linguistic information about word identity and meaning. Copyright © 2015 the authors 0270-6474/15/357203-12$15.00/0.
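The transition probabilities between sound segments mentioned in this abstract can be illustrated with a small counting sketch. It estimates P(next | current) from adjacent pairs in a toy corpus; the corpus, the function name `transition_probabilities`, and the use of simple relative frequencies are illustrative assumptions, not the estimation procedure used in the study.

```python
from collections import Counter


def transition_probabilities(sequences):
    """Estimate P(next | current) from adjacent pairs of segments."""
    pair_counts = Counter()   # how often `current` is followed by `nxt`
    first_counts = Counter()  # how often `current` is followed by anything
    for seq in sequences:
        for current, nxt in zip(seq, seq[1:]):
            pair_counts[(current, nxt)] += 1
            first_counts[current] += 1
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


# Toy corpus of segment sequences (hypothetical, not data from the study).
corpus = [["k", "a", "t"], ["k", "a", "p"], ["t", "a", "p"]]
probs = transition_probabilities(corpus)

for (current, nxt), p in sorted(probs.items()):
    print(f"P({nxt} | {current}) = {p:.3f}")
```

In this toy corpus "k" is always followed by "a", so that transition is fully predictable, while "a" is followed by "p" twice as often as by "t".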
Designing universal primers for the isolation of DNA sequences encoding Proanthocyanidins biosynthetic enzymes in Crataegus aronia

Directory of Open Access Journals (Sweden)

Zuiter Afnan

2012-08-01

Background: Hawthorn is the common name of all plant species in the genus Crataegus, which belongs to the Rosaceae family. Crataegus are considered useful medicinal plants because of their high content of proanthocyanidins (PAs) and other related compounds. To improve PAs production in Crataegus tissues, the sequences of genes encoding PAs biosynthetic enzymes are required. Findings: Different bioinformatics tools, including BLAST, multiple sequence alignment and alignment PCR analysis, were used to design primers suitable for the amplification of DNA fragments from 10 candidate genes encoding enzymes involved in PAs biosynthesis in C. aronia. DNA sequencing results proved the utility of the designed primers. The primers were used successfully to amplify DNA fragments of different PAs biosynthesis genes in different Rosaceae plants. Conclusion: To the best of our knowledge, this is the first use of the alignment PCR approach to isolate DNA sequences encoding PAs biosynthetic enzymes in Rosaceae plants.
Analysis of intra-host genetic diversity of Prunus necrotic ringspot virus (PNRSV) using amplicon next generation sequencing.

Science.gov (United States)

Kinoti, Wycliff M; Constable, Fiona E; Nancarrow, Narelle; Plummer, Kim M; Rodoni, Brendan

2017-01-01

PCR amplicon next generation sequencing (NGS) analysis offers a broadly applicable and targeted approach to detect populations of both high- or low-frequency virus variants in one or more plant samples. In this study, amplicon NGS was used to explore the diversity of the tripartite genome virus, Prunus necrotic ringspot virus (PNRSV) from 53 PNRSV-infected trees using amplicons from conserved gene regions of each of PNRSV RNA1, RNA2 and RNA3. Sequencing of the amplicons from 53 PNRSV-infected trees revealed differing levels of polymorphism across the three different components of the PNRSV genome with a total number of 5040, 2083 and 5486 sequence variants observed for RNA1, RNA2 and RNA3 respectively. The RNA2 had the lowest diversity of sequences compared to RNA1 and RNA3, reflecting the lack of flexibility tolerated by the replicase gene that is encoded by this RNA component. Distinct PNRSV phylo-groups, consisting of closely related clusters of sequence variants, were observed in each of PNRSV RNA1, RNA2 and RNA3. Most plant samples had a single phylo-group for each RNA component. Haplotype network analysis showed that smaller clusters of PNRSV sequence variants were genetically connected to the largest sequence variant cluster within a phylo-group of each RNA component. Some plant samples had sequence variants occurring in multiple PNRSV phylo-groups in at least one of each RNA and these phylo-groups formed distinct clades that represent PNRSV genetic strains. Variants within the same phylo-group of each Prunus plant sample had ≥97% similarity and phylo-groups within a Prunus plant sample and between samples had ≤97% similarity. Based on the analysis of diversity, a definition of a PNRSV genetic strain was proposed. The proposed definition was applied to determine the number of PNRSV genetic strains in each of the plant samples and the complexity in defining genetic strains in multipartite genome viruses was explored.
Heterogeneous genetic diversity pattern in Plasmodium vivax genes encoding merozoite surface proteins (MSP) -7E, -7F and -7L.

Science.gov (United States)

Garzón-Ospina, Diego; Forero-Rodríguez, Johanna; Patarroyo, Manuel A

2014-12-13

The msp-7 gene has become differentially expanded in the Plasmodium genus; Plasmodium vivax has the highest copy number of this gene, several of which encode antigenic proteins in merozoites. DNA sequences from thirty-six Colombian clinical isolates from P. vivax (pv) msp-7E, -7F and -7L genes were analysed for characterizing and studying the genetic diversity of these pvmsp-7 members which are expressed during the intra-erythrocyte stage; natural selection signals producing the variation pattern so observed were evaluated. The pvmsp-7E gene was highly polymorphic compared to pvmsp-7F and pvmsp-7L which were seen to have limited genetic diversity; pvmsp-7E polymorphism was seen to have been maintained by different types of positive selection. Even though these copies seemed to be species-specific duplications, a search in the Plasmodium cynomolgi genome (P. vivax sister taxon) showed that both species shared the whole msp-7 repertoire. This led to exploring the long-term effect of natural selection by comparing the orthologous sequences which led to finding signatures for lineage-specific positive selection. The results confirmed that the P. vivax msp-7 family has a heterogeneous genetic diversity pattern; some members are highly conserved whilst others are highly diverse. The results suggested that the 3'-end of these genes encode MSP-7 proteins' functional region whilst the central region of pvmsp-7E has evolved rapidly. The lineage-specific positive selection signals found suggested that mutations occurring in msp-7s genes during host switch may have succeeded in adapting the ancestral P. vivax parasite population to humans.
Exome sequencing and genetic testing for MODY.

Directory of Open Access Journals (Sweden)

Stefan Johansson

Genetic testing for monogenic diabetes is important for patient care. Given the extensive genetic and clinical heterogeneity of diabetes, exome sequencing might provide additional diagnostic potential when standard Sanger sequencing-based diagnostics is inconclusive. The aim of the study was to examine the performance of exome sequencing for a molecular diagnosis of MODY in patients who have undergone conventional diagnostic sequencing of candidate genes with negative results. We performed exome enrichment followed by high-throughput sequencing in nine patients with suspected MODY. They were Sanger sequencing-negative for mutations in the HNF1A, HNF4A, GCK, HNF1B and INS genes. We excluded common, non-coding and synonymous gene variants, and performed in-depth analysis on filtered sequence variants in a pre-defined set of 111 genes implicated in glucose metabolism. On average, we obtained 45X median coverage of the entire targeted exome and found 199 rare coding variants per individual. We identified 0-4 rare non-synonymous and nonsense variants per individual in our a priori list of 111 candidate genes. Three of the variants were considered pathogenic (in ABCC8, HNF4A and PPARG, respectively); thus exome sequencing led to a genetic diagnosis in at least three of the nine patients. Approximately 91% of known heterozygous SNPs in the target exomes were detected, but we also found low coverage in some key diabetes genes using our current exome sequencing approach. Novel variants in the genes ARAP1, GLIS3, MADD, NOTCH2 and WFS1 need further investigation to reveal their possible role in diabetes. Our results demonstrate that exome sequencing can improve molecular diagnostics of MODY when used as a complement to Sanger sequencing. However, improvements will be needed, especially concerning coverage, before the full potential of exome sequencing can be realized.
A Synthetic Oligo Library and Sequencing Approach Reveals an Insulation Mechanism Encoded within Bacterial σ54 Promoters

Directory of Open Access Journals (Sweden)

Lior Levy

2017-10-01

We use an oligonucleotide library of >10,000 variants to identify an insulation mechanism encoded within a subset of σ54 promoters. Insulation manifests itself as reduced protein expression for a downstream gene that is expressed by transcriptional readthrough. It is strongly associated with the presence of short CT-rich motifs (3–5 bp), positioned within 25 bp upstream of the Shine-Dalgarno (SD) motif of the silenced gene. We provide evidence that insulation is triggered by binding of the ribosome binding site (RBS) to the upstream CT-rich motif. We also show that, in E. coli, insulator sequences are preferentially encoded within σ54 promoters, suggesting an important regulatory role for these sequences in natural contexts. Our findings imply that sequence-specific regulatory effects that are sparsely encoded by short motifs may not be easily detected by lower throughput studies. Such sequence-specific phenomena can be uncovered with a focused oligo library (OL) design that mitigates sequence-related variance, as exemplified herein.
Translating working memory into action: behavioral and neural evidence for using motor representations in encoding visuo-spatial sequences.

Science.gov (United States)

Langner, Robert; Sternkopf, Melanie A; Kellermann, Tanja S; Grefkes, Christian; Kurth, Florian; Schneider, Frank; Zilles, Karl; Eickhoff, Simon B

2014-07-01

The neurobiological organization of action-oriented working memory is not well understood. To elucidate the neural correlates of translating visuo-spatial stimulus sequences into delayed (memory-guided) sequential actions, we measured brain activity using functional magnetic resonance imaging while participants encoded sequences of four to seven dots appearing on fingers of a left or right schematic hand. After variable delays, sequences were to be reproduced with the corresponding fingers. Recall became less accurate with longer sequences and was initiated faster after long delays. Across both hands, encoding and recall activated bilateral prefrontal, premotor, superior and inferior parietal regions as well as the basal ganglia, whereas hand-specific activity was found (albeit to a lesser degree during encoding) in contralateral premotor, sensorimotor, and superior parietal cortex. Activation differences after long versus short delays were restricted to motor-related regions, indicating that rehearsal during long delays might have facilitated the conversion of the memorandum into concrete motor programs at recall. Furthermore, basal ganglia activity during encoding selectively predicted correct recall. Taken together, the results suggest that to-be-reproduced visuo-spatial sequences are encoded as prospective action representations (motor intentions), possibly in addition to retrospective sensory codes. Overall, our study supports and extends multi-component models of working memory, highlighting the notion that sensory input can be coded in multiple ways depending on what the memorandum is to be used for. Copyright © 2013 Wiley Periodicals, Inc.
Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

Science.gov (United States)

Sugimura; Sawabe; Ezura

2000-01-01

The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acids (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.
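The ORF lengths and residue counts quoted in this abstract can be cross-checked with the three-nucleotides-per-codon rule. The sketch below is only an arithmetic check on the reported figures; the dictionary and function names are illustrative, and the abstract does not say whether the stop codon is included in the stated ORF lengths.

```python
# Figures as reported in the abstract: ORF length (bp) and encoded residues.
orf_length_bp = {"alyVG1": 1056, "alyVG2": 993, "alyVG3": 705}
reported_residues = {"alyVG1": 352, "alyVG2": 331, "alyVG3": 235}


def codons_in_orf(length_bp):
    """Number of codons in an ORF; the length must be a multiple of three."""
    if length_bp % 3 != 0:
        raise ValueError(f"{length_bp} bp is not a whole number of codons")
    return length_bp // 3


codon_counts = {gene: codons_in_orf(bp) for gene, bp in orf_length_bp.items()}

for gene, codons in codon_counts.items():
    status = "matches" if codons == reported_residues[gene] else "differs from"
    print(f"{gene}: {codons} codons {status} the reported residue count")
```

All three reported residue counts equal the ORF length divided by three.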
Nucleotide sequence of the Agrobacterium tumefaciens octopine Ti plasmid-encoded tmr gene

NARCIS (Netherlands)

Heidekamp, F.; Dirkse, W.G.; Hille, J.; Ormondt, H. van

1983-01-01

The nucleotide sequence of the tmr gene, encoded by the octopine Ti plasmid from Agrobacterium tumefaciens (pTiAch5), was determined. The T-DNA, which encompasses this gene, is involved in tumor formation and maintenance, and probably mediates the cytokinin-independent growth of transformed plant
Genetic variation in KCNA5

DEFF Research Database (Denmark)

Christophersen, Ingrid E; Olesen, Morten S; Liang, Bo

2012-01-01

Aims: Genetic factors may be important in the development of atrial fibrillation (AF) in the young. KCNA5 encodes the potassium channel α-subunit K(V)1.5, which underlies the voltage-gated atrial-specific potassium current I(Kur). KCNAB2 encodes K(V)β2, a β-subunit of K(V)1.5, which increases I......(Kur). Three studies have identified loss-of-function mutations in KCNA5 in patients with idiopathic AF. We hypothesized that early-onset lone AF is associated with high prevalence of genetic variants in KCNA5 and KCNAB2. Methods and results: The coding sequences of KCNA5 and KCNAB2 were sequenced in 307 patients...
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

1993-02-16

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a pu GOVERNMENT RIGHTS This application was funded under Department of Energy Contract DE-AC02-76ER01338. The U.S. Government has certain rights under this application and any patent issuing thereon.
A genetically encoded ratiometric sensor to measure extracellular pH in microdomains bounded by basolateral membranes of epithelial cells.

Science.gov (United States)

Urra, Javier; Sandoval, Moisés; Cornejo, Isabel; Barros, L Felipe; Sepúlveda, Francisco V; Cid, L Pablo

2008-10-01

Extracellular pH, especially in relatively inaccessible microdomains between cells, affects transport membrane protein activity and might have an intercellular signaling role. We have developed a genetically encoded extracellular pH sensor capable of detecting pH changes in basolateral spaces of epithelial cells. It consists of a chimerical membrane protein displaying concatenated enhanced variants of cyan fluorescence protein (ECFP) and yellow fluorescence protein (EYFP) at the external aspect of the cell surface. The construct, termed pHCECSensor01, was targeted to basolateral membranes of Madin-Darby canine kidney (MDCK) cells by means of a sequence derived from the aquaporin AQP4. The fusion of pH-sensitive EYFP with pH-insensitive ECFP allows ratiometric pH measurements. The titration curve of pHCECSensor01 in vivo had a pKa value of 6.5 ± 0.04. Only minor effects of extracellular chloride on pHCECSensor01 were observed around the physiological concentrations of this anion. In MDCK cells, the sensor was able to detect changes in pH secondary to H+ efflux into the basolateral spaces elicited by an ammonium prepulse or lactate load. This genetically encoded sensor has the potential to serve as a noninvasive tool for monitoring changes in extracellular pH microdomains in epithelial and other tissues in vivo.
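To give a feel for what a pKa of 6.5 implies, the sketch below evaluates a textbook single-site titration (Henderson-Hasselbalch) curve. The abstract does not state which model was fitted to the sensor's titration data, so the single-site assumption, the function name `deprotonated_fraction`, and the sample pH values are all illustrative.

```python
PKA = 6.5  # pKa reported in the abstract


def deprotonated_fraction(ph, pka=PKA):
    """Fraction of a single titratable site that is deprotonated at a given pH.

    Assumes an ideal single-site titration (Henderson-Hasselbalch); this is a
    textbook simplification, not the fitting model reported by the authors.
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


half_point = deprotonated_fraction(6.5)

for ph in (5.5, 6.5, 7.5):
    print(f"pH {ph:.1f}: deprotonated fraction = {deprotonated_fraction(ph):.3f}")
```

Under this assumption the response is steepest around the pKa, so the sensor would be most sensitive to pH changes near 6.5.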
The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. sequence evaluation and plastome evolution.

Science.gov (United States)

Greiner, Stephan; Wang, Xi; Rauwolf, Uwe; Silber, Martina V; Mayer, Klaus; Meurer, Jörg; Haberer, Georg; Herrmann, Reinhold G

2008-04-01

The flowering plant genus Oenothera is uniquely suited for studying molecular mechanisms of speciation. It assembles an intriguing combination of genetic features, including permanent translocation heterozygosity, biparental transmission of plastids, and a general interfertility of well-defined species. This allows an exchange of plastids and nuclei between species often resulting in plastome-genome incompatibility. For evaluation of its molecular determinants we present the complete nucleotide sequences of the five basic, genetically distinguishable plastid chromosomes of subsection Oenothera (=Euoenothera) of the genus, which are associated in distinct combinations with six basic genomes. Sizes of the chromosomes range from 163 365 bp (plastome IV) to 165 728 bp (plastome I), display between 96.3% and 98.6% sequence similarity and encode a total of 113 unique genes. Plastome diversification is caused by an abundance of nucleotide substitutions, small insertions, deletions and repetitions. The five plastomes deviate from the general ancestral design of plastid chromosomes of vascular plants by a subsection-specific 56 kb inversion within the large single-copy segment. This inversion disrupted operon structures and predates the divergence of the subsection presumably 1 My ago. Phylogenetic relationships suggest plastomes I-III in one clade, while plastome IV appears to be closest to the common ancestor.
The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. Sequence evaluation and plastome evolution

Science.gov (United States)

Greiner, Stephan; Wang, Xi; Rauwolf, Uwe; Silber, Martina V.; Mayer, Klaus; Meurer, Jörg; Haberer, Georg; Herrmann, Reinhold G.

2008-01-01

The flowering plant genus Oenothera is uniquely suited for studying molecular mechanisms of speciation. It assembles an intriguing combination of genetic features, including permanent translocation heterozygosity, biparental transmission of plastids, and a general interfertility of well-defined species. This allows an exchange of plastids and nuclei between species often resulting in plastome–genome incompatibility. For evaluation of its molecular determinants we present the complete nucleotide sequences of the five basic, genetically distinguishable plastid chromosomes of subsection Oenothera (=Euoenothera) of the genus, which are associated in distinct combinations with six basic genomes. Sizes of the chromosomes range from 163 365 bp (plastome IV) to 165 728 bp (plastome I), display between 96.3% and 98.6% sequence similarity and encode a total of 113 unique genes. Plastome diversification is caused by an abundance of nucleotide substitutions, small insertions, deletions and repetitions. The five plastomes deviate from the general ancestral design of plastid chromosomes of vascular plants by a subsection-specific 56 kb inversion within the large single-copy segment. This inversion disrupted operon structures and predates the divergence of the subsection presumably 1 My ago. Phylogenetic relationships suggest plastomes I–III in one clade, while plastome IV appears to be closest to the common ancestor. PMID:18299283
AIB1 gene amplification and the instability of polyQ encoding sequence in breast cancer cell lines

Directory of Open Access Journals (Sweden)

Clarke Robert

2006-05-01

Background: The poly Q polymorphism in the AIB1 (amplified in breast cancer) gene is usually assessed by fragment length analysis, which does not reveal the actual sequence variation. The purpose of this study is to investigate the sequence variation of the poly Q encoding region in breast cancer cell lines at the single molecule level, and to determine if the sequence variation is related to AIB1 gene amplification. Methods: The polymorphic poly Q encoding region of the AIB1 gene was investigated at the single molecule level by PCR cloning/sequencing. The amplification of the AIB1 gene in various breast cancer cell lines was studied by real-time quantitative PCR. Results: Significant amplifications (5–23 fold) of the AIB1 gene were found in 2 out of 9 (22%) ER-positive cell lines (in BT-474 and MCF-7 but not in BT-20, ZR-75-1, T47D, BT483, MDA-MB-361, MDA-MB-468 and MDA-MB-330). The AIB1 gene was not amplified in any of the ER-negative cell lines. Different passages of MCF-7 cell lines and their derivatives maintained the feature of AIB1 amplification. When the cells were selected for hormone independence (LCC1) and resistance to 4-hydroxy tamoxifen (4-OH TAM) (LCC2 and R27), ICI 182,780 (LCC9) or 4-OH TAM, KEO and LY 117018 (LY-2), AIB1 copy number decreased but still remained highly amplified. Sequencing analysis of the poly Q encoding region of the AIB1 gene did not reveal specific patterns that could be correlated with AIB1 gene amplification. However, about 72% of the breast cancer cell lines had at least one under represented (3CAA(CAG9(CAACAG3(CAACAGCAG2CAA of the original cell line, a number of altered poly Q encoding sequences were found in the derivatives of MCF-7 cell lines. Conclusion: These data suggest that the poly Q encoding region of the AIB1 gene is somatically unstable in breast cancer cell lines. The instability and the sequence characteristics, however, do not appear to be associated with the level of the gene amplification.
Sequence of a cloned cDNA encoding human ribosomal protein S11

Energy Technology Data Exchange (ETDEWEB)

Lott, J B; Mackie, G A

1988-02-11

The authors have isolated a cloned cDNA that encodes human ribosomal protein (rp) S11 by screening a human fibroblast cDNA library with a labelled 204 bp DNA fragment encompassing residues 212-416 of pRS11, a rat rp S11 cDNA clone. The human rp S11 cloned cDNA consists of 15 residues of the 5' leader, the entire coding sequence and all 51 residues of the 3' untranslated region. The predicted amino acid sequence of 158 residues is identical to rat rp S11. The nucleotide sequence in the coding region differs, however, from that in rat in the first position in two codons and in the third position in 44 codons.

Nucleic acids encoding phloem small RNA-binding proteins and transgenic plants comprising them

Science.gov (United States)

Lucas, William J.; Yoo, Byung-Chun; Lough, Tony J.; Varkonyi-Gasic, Erika

2007-03-13

The present invention provides a polynucleotide sequence encoding a component of the protein machinery involved in small RNA trafficking, Cucurbita maxima phloem small RNA-binding protein (CmPSRB 1), and the corresponding polypeptide sequence. The invention also provides genetic constructs and transgenic plants comprising the polynucleotide sequence encoding a phloem small RNA-binding protein to alter (e.g., prevent, reduce or elevate) non-cell autonomous signaling events in the plants involving small RNA metabolism. These signaling events are involved in a broad spectrum of plant physiological and biochemical processes, including, for example, systemic resistance to pathogens, responses to environmental stresses, e.g., heat, drought, salinity, and systemic gene silencing (e.g., viral infections).
Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

KAUST Repository

Doan, Ryan; Cohen, Noah D; Sawyer, Jason; Ghaffari, Noushin; Johnson, Charlie D; Dindot, Scott V

2012-01-01

BACKGROUND: The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. RESULTS: Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. CONCLUSIONS: This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
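The sequencing yield and average coverage quoted in this abstract are linked by the usual relation coverage = total sequenced bases / genome length. The sketch below inverts that relation to show the genome length implied by the two reported numbers. The variable names are illustrative, and the result is only an approximation, since the abstract does not say whether the average was computed over all reads or only over mapped reads.

```python
TOTAL_SEQUENCE_GB = 59.6   # DNA sequence generated, as reported (Gb)
AVERAGE_COVERAGE = 24.7    # average sequence coverage, as reported (X)


def implied_genome_size_gb(total_gb, coverage):
    """Genome length implied by coverage = total bases / genome length."""
    return total_gb / coverage


genome_gb = implied_genome_size_gb(TOTAL_SEQUENCE_GB, AVERAGE_COVERAGE)
print(f"Implied genome length: about {genome_gb:.2f} Gb")
```

The two reported figures are consistent with a reference genome of roughly 2.4 Gb.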
Whole-genome sequencing and genetic variant analysis of a Quarter Horse mare.

KAUST Repository

Doan, Ryan

2012-02-17

BACKGROUND: The catalog of genetic variants in the horse genome originates from a few select animals, the majority originating from the Thoroughbred mare used for the equine genome sequencing project. The purpose of this study was to identify genetic variants, including single nucleotide polymorphisms (SNPs), insertion/deletion polymorphisms (INDELs), and copy number variants (CNVs) in the genome of an individual Quarter Horse mare sequenced by next-generation sequencing. RESULTS: Using massively parallel paired-end sequencing, we generated 59.6 Gb of DNA sequence from a Quarter Horse mare resulting in an average of 24.7X sequence coverage. Reads were mapped to approximately 97% of the reference Thoroughbred genome. Unmapped reads were de novo assembled resulting in 19.1 Mb of new genomic sequence in the horse. Using a stringent filtering method, we identified 3.1 million SNPs, 193 thousand INDELs, and 282 CNVs. Genetic variants were annotated to determine their impact on gene structure and function. Additionally, we genotyped this Quarter Horse for mutations of known diseases and for variants associated with particular traits. Functional clustering analysis of genetic variants revealed that most of the genetic variation in the horse's genome was enriched in sensory perception, signal transduction, and immunity and defense pathways. CONCLUSIONS: This is the first sequencing of a horse genome by next-generation sequencing and the first genomic sequence of an individual Quarter Horse mare. We have increased the catalog of genetic variants for use in equine genomics by the addition of novel SNPs, INDELs, and CNVs. The genetic variants described here will be a useful resource for future studies of genetic variation regulating performance traits and diseases in equids.
Chemical fingerprints encode mother–offspring similarity, colony membership, relatedness, and genetic quality in fur seals

Science.gov (United States)

Stoffel, Martin A.; Caspers, Barbara A.; Forcada, Jaume; Giannakara, Athina; Baier, Markus; Eberhart-Phillips, Luke; Müller, Caroline; Hoffman, Joseph I.

2015-01-01

Chemical communication underpins virtually all aspects of vertebrate social life, yet remains poorly understood because of its highly complex mechanistic basis. We therefore used chemical fingerprinting of skin swabs and genetic analysis to explore the chemical cues that may underlie mother–offspring recognition in colonially breeding Antarctic fur seals. By sampling mother–offspring pairs from two different colonies, using a variety of statistical approaches and genotyping a large panel of microsatellite loci, we show that colony membership, mother–offspring similarity, heterozygosity, and genetic relatedness are all chemically encoded. Moreover, chemical similarity between mothers and offspring reflects a combination of genetic and environmental influences, the former partly encoded by substances resembling known pheromones. Our findings reveal the diversity of information contained within chemical fingerprints and have implications for understanding mother–offspring communication, kin recognition, and mate choice. PMID:26261311
Human genome and genetic sequencing research and informed consent

International Nuclear Information System (INIS)

Iwakawa, Mayumi

2003-01-01

On March 29, 2001, the Ethical Guidelines for Human Genome and Genetic Sequencing Research were established. They are intended to serve as ethical guidelines for all human genome and genetic sequencing research practice, for the purpose of upholding respect for human dignity and rights and enforcing use of proper methods in the pursuit of human genome and genetic sequencing research, with the understanding and cooperation of the public. The RadGenomics Project has prepared a research protocol and informed consent document that follow these ethical guidelines. We have endeavored to protect the privacy of individual information, and have established a procedure for examination of research practices by an ethics committee. Here we report our procedure in order to offer this concept to the patients. (authors)
The Phytophthora sojae avirulence locus Avr3c encodes a multi-copy RXLR effector with sequence polymorphisms among pathogen strains.

Directory of Open Access Journals (Sweden)

Suomeng Dong

Root and stem rot disease of soybean is caused by the oomycete Phytophthora sojae. The avirulence (Avr) genes of P. sojae control race-cultivar compatibility. In this study, we identify the P. sojae Avr3c gene and show that it encodes a predicted RXLR effector protein of 220 amino acids. Sequence and transcriptional data were compared for predicted RXLR effectors occurring in the vicinity of Avr4/6, as genetic linkage of Avr3c and Avr4/6 was previously suggested. Mapping of DNA markers in a F(2) population was performed to determine whether selected RXLR effector genes co-segregate with the Avr3c phenotype. The results pointed to one RXLR candidate gene as likely to encode Avr3c. This was verified by testing selected genes by a co-bombardment assay on soybean plants with Rps3c, thus demonstrating functionality and confirming the identity of Avr3c. The Avr3c gene together with eight other predicted genes are part of a repetitive segment of 33.7 kb. Three near-identical copies of this segment occur in a tandem array. In P. sojae strain P6497, two identical copies of Avr3c occur within the repeated segments whereas the third copy of this RXLR effector has diverged in sequence. The Avr3c gene is expressed during the early stages of infection in all P. sojae strains examined. Virulent alleles of Avr3c that differ in amino acid sequence were identified in other strains of P. sojae. Gain of virulence was acquired through mutation and subsequent sequence exchanges between the two copies of Avr3c. The results illustrate the importance of segmental duplications and RXLR effector evolution in the control of race-cultivar compatibility in the P. sojae and soybean interaction.
Sequence of a cDNA encoding turtle high mobility group 1 protein.

Science.gov (United States)

Zheng, Jifang; Hu, Bi; Wu, Duansheng

2005-07-01

To obtain sequence information about the turtle HMG1 gene, a cDNA encoding the HMG1 protein of the Chinese soft-shell turtle (Pelodiscus sinensis) was amplified by RT-PCR from kidney total RNA, and was cloned, sequenced and analyzed. The results revealed that the open reading frame (ORF) of turtle HMG1 cDNA is 606 bp long. The ORF encodes 202 amino acid residues, from which two DNA-binding domains and one polyacidic region are derived. The DNA-binding domains share higher amino acid identity with the homologous sequences of chicken (96.5%) and mammals (74%) than with the homologous sequence of rainbow trout (67%). The polyacidic region shows 84.6% amino acid homology with the equivalent region of chicken HMG1 cDNA. Turtle HMG1 protein contains 3 Cys residues located at completely conserved positions. Conservation in sequence and structure suggests that the functions of turtle HMG1 cDNA may be highly conserved during evolution. To our knowledge, this is the first report of an HMG1 cDNA sequence in any reptile.
CDNA encoding a polypeptide including a hevein sequence

Science.gov (United States)

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
Encoding and recall of finger sequences in experienced pianists compared with musically naïve controls: a combined behavioral and functional imaging study.

Science.gov (United States)

Pau, S; Jahn, G; Sakreida, K; Domin, M; Lotze, M

2013-01-01

Long-term intensive sensorimotor training alters functional representation of the motor and sensory system and might even result in structural changes. However, little is known about how previous training impacts learning transfer and functional representation. We tested 14 amateur pianists and 15 musically naïve participants in a short-term finger sequence training procedure that differed considerably from piano playing, and measured the associated functional representation with functional magnetic resonance imaging. The conditions consisted of encoding a finger sequence indicated by hand symbols ("sequence encoding") and subsequently replaying the sequence from memory, both with and without auditory feedback ("sequence retrieval"). Piano players activated motor areas and the mirror neuron system more strongly than musically naïve participants during encoding. When retrieving the sequence, musically naïve participants showed higher activation in similar brain areas. Thus, retrieval activations of naïve participants were comparable to encoding activations of piano players, who during retrieval performed the sequences more accurately despite lower motor activations. Interestingly, both groups showed primary auditory activation even during sequence retrieval without auditory feedback, supporting previous reports about coactivation of the auditory cortex after learned association with motor performance. When playing with auditory feedback, only pianists lateralized to the left auditory cortex. During encoding, activation in the left primary somatosensory cortex at the level of the finger representations had a predictive value for increased motor performance later on (error rates). Conversely, decreased performance was associated with increased visual cortex activation during encoding. Our study extends previous reports about training transfer of motor knowledge resulting in superior training effects in musicians. Performance increase went along with activity in
Genetic selection and DNA sequences of 4.5S RNA homologs

DEFF Research Database (Denmark)

Brown, S; Thon, G; Tolentino, E

1989-01-01

A general strategy for cloning the functional homologs of an Escherichia coli gene was used to clone homologs of 4.5S RNA from other bacteria. The genes encoding these homologs were selected by their ability to complement a deletion of the gene for 4.5S RNA. DNA sequences of the regions encoding...
Genetically encoded lipid-polypeptide hybrid biomaterials that exhibit temperature-triggered hierarchical self-assembly

Science.gov (United States)

Mozhdehi, Davoud; Luginbuhl, Kelli M.; Simon, Joseph R.; Dzuricky, Michael; Berger, Rüdiger; Varol, H. Samet; Huang, Fred C.; Buehne, Kristen L.; Mayne, Nicholas R.; Weitzhandler, Isaac; Bonn, Mischa; Parekh, Sapun H.; Chilkoti, Ashutosh

2018-05-01

Post-translational modification of proteins is a strategy widely used in biological systems. It expands the diversity of the proteome and allows for tailoring of both the function and localization of proteins within cells as well as the material properties of structural proteins and matrices. Despite their ubiquity in biology, with a few exceptions, the potential of post-translational modifications in biomaterials synthesis has remained largely untapped. As a proof of concept to demonstrate the feasibility of creating a genetically encoded biohybrid material through post-translational modification, we report here the generation of a family of three stimulus-responsive hybrid materials—fatty-acid-modified elastin-like polypeptides—using a one-pot recombinant expression and post-translational lipidation methodology. These hybrid biomaterials contain an amphiphilic domain, composed of a β-sheet-forming peptide that is post-translationally functionalized with a C14 alkyl chain, fused to a thermally responsive elastin-like polypeptide. They exhibit temperature-triggered hierarchical self-assembly across multiple length scales with varied structure and material properties that can be controlled at the sequence level.
RNA-DNA sequence differences spell genetic code ambiguities

DEFF Research Database (Denmark)

Bentin, Thomas; Nielsen, Michael L

2013-01-01

A recent paper in Science by Li et al. 2011(1) reports widespread sequence differences in the human transcriptome between RNAs and their encoding genes termed RNA-DNA differences (RDDs). The findings could add a new layer of complexity to gene expression but the study has been criticized. ...
Serine Protease Variants Encoded by Echis ocellatus Venom Gland cDNA: Cloning and Sequencing Analysis

Directory of Open Access Journals (Sweden)

S. S. Hasson

2010-01-01

Envenoming by Echis saw-scaled viper is the leading cause of death and morbidity in Africa due to snake bite. Despite its medical importance, there have been few investigations into the toxin composition of the venom of this viper. Here, we report the cloning of cDNA sequences encoding four groups or isoforms of the haemostasis-disruptive Serine protease proteins (SPs) from the venom glands of Echis ocellatus. All these SP sequences encoded the cysteine residues scaffold that form the 6-disulphide bonds responsible for the characteristic tertiary structure of venom serine proteases. All the Echis ocellatus EoSP groups showed varying degrees of sequence similarity to published viper venom SPs. However, these groups also showed marked intercluster sequence conservation across them which were significantly different from that of previously published viper SPs. Because viper venom SPs exhibit a high degree of sequence similarity and yet exert profoundly different effects on the mammalian haemostatic system, no attempt was made to assign functionality to the new Echis ocellatus EoSPs on the basis of sequence alone. The extraordinary level of interspecific and intergeneric sequence conservation exhibited by the Echis ocellatus EoSPs and analogous serine proteases from other viper species leads us to speculate that antibodies to representative molecules should neutralise (that we will exploit, by epidermal DNA immunization) the biological function of this important group of venom toxins in vipers that are distributed throughout Africa, the Middle East, and the Indian subcontinent.
cDNAs encoding [D-Ala2]deltorphin precursors from skin of Phyllomedusa bicolor also contain genetic information for three dermorphin-related opioid peptides.

OpenAIRE

Richter, K; Egger, R; Negri, L; Corsi, R; Severini, C; Kreil, G

1990-01-01

We present the structure of four precursors for [D-Ala2]deltorphins I and II as deduced from cDNAs cloned from skin of the frog Phyllomedusa bicolor. These contain the genetic information for one copy of [D-Ala2]deltorphin II and zero, one, or three copies of [D-Ala2]deltorphin I. In each case, the D-alanine of the end product is encoded by a normal GCG codon for L-alanine. In addition, the existence of three peptides related to dermorphin was predicted from the amino acid sequence of the pre...
Genetically encoded probes for NAD+/NADH monitoring.

Science.gov (United States)

Bilan, Dmitry S; Belousov, Vsevolod V

2016-11-01

NAD+ and NADH participate in many metabolic reactions. The NAD+/NADH ratio is an important parameter reflecting the general metabolic and redox state of different types of cells. For a long time, in situ and in vivo NAD+/NADH monitoring has been hampered by the lack of suitable tools. The recent development of genetically encoded indicators based on fluorescent proteins linked to specific nucleotide-binding domains has already helped to address this monitoring problem. In this review, we will focus on four available indicators: Peredox, Frex family probes, RexYFP and SoNar. Each indicator has advantages and limitations. We will also discuss the most important points that should be considered when selecting a suitable indicator for certain experimental conditions. Copyright © 2016 Elsevier Inc. All rights reserved.
Quantifying population genetic differentiation from next-generation sequencing data

DEFF Research Database (Denmark)

Fumagalli, Matteo; Garrett Vieira, Filipe Jorge; Korneliussen, Thorfinn Sand

2013-01-01

Over the last few years, new high-throughput DNA sequencing technologies have dramatically increased speed and reduced sequencing costs. However, the use of these sequencing technologies is often challenged by errors and biases associated with the bioinformatical methods used for analyzing the data... method for quantifying population genetic differentiation from next-generation sequencing data. In addition, we present a strategy to investigate population structure via Principal Components Analysis. Through extensive simulations, we compare the new method herein proposed to approaches based on genotype calling and demonstrate a marked improvement in estimation accuracy for a wide range of conditions. We apply the method to a large-scale genomic data set of domesticated and wild silkworms sequenced at low coverage. We find that we can infer the fine-scale genetic structure of the sampled...
Enhanced immunogenicity of DNA fusion vaccine encoding secreted hepatitis B surface antigen and chemokine RANTES

International Nuclear Information System (INIS)

Kim, Seung Jo; Suh, Dongchul; Park, Sang Eun; Park, Jeong-Sook; Byun, Hyang-Min; Lee, Chan; Lee, Sun Young; Kim, Inho; Oh, Yu-Kyoung

2003-01-01

To increase the potency of DNA vaccines, we constructed genetic fusion vaccines encoding antigen, secretion signal, and/or chemokine RANTES. The DNA vaccines encoding secreted hepatitis B surface antigen (HBsAg) were constructed by inserting the HBsAg gene into an expression vector with an endoplasmic reticulum (ER)-targeting secretory signal sequence. The plasmid encoding secretory HBsAg (pER/HBs) was fused to cDNA of RANTES, generating pER/HBs/R. For comparison, HBsAg genes were cloned into pVAX1 vector with no signal sequence (pHBs), and further linked to the N-terminus of RANTES (pHBs/R). Immunofluorescence study showed the cytoplasmic localization of HBsAg protein expressed from pHBs and pHBs/R, but not from pER/HBs and pER/HBs/R at 48 h after transfection. In mice, RANTES-fused DNA vaccines elicited higher levels of HBsAg-specific IgG antibodies than pHBs. All the DNA vaccines induced higher levels of IgG2a than IgG1 antibodies. Of the RANTES-fused vaccines, pER/HBs/R encoding the secreted fusion protein produced much higher humoral and CD8+ T cell-stimulating responses than pHBs/R. These results suggest that the immunogenicity of DNA vaccines could be enhanced by genetic fusion to a secretory signal peptide sequence and RANTES.
Nucleotide sequence and genetic organization of Hungarian grapevine chrome mosaic nepovirus RNA2.

Science.gov (United States)

Brault, V; Hibrand, L; Candresse, T; Le Gall, O; Dunez, J

1989-10-11

The complete nucleotide sequence of Hungarian grapevine chrome mosaic nepovirus (GCMV) RNA2 has been determined. The RNA sequence is 4441 nucleotides in length, excluding the poly(A) tail. A polyprotein of 1324 amino acids with a calculated molecular weight of 146 kDa is encoded in a single long open reading frame extending from nucleotides 218 to 4190. This polyprotein is homologous with the protein encoded by the S strain of tomato black ring virus (TBRV) RNA2, the only other nepovirus sequenced so far. Direct sequencing of the viral coat protein and in vitro translation of transcripts derived from cDNA sequences demonstrate that, as for comoviruses, the coat protein is located at the carboxy terminus of the polyprotein. A model for the expression of GCMV RNA2 is presented.
Engineering a genetically-encoded SHG chromophore by electrostatic targeting to the membrane

Directory of Open Access Journals (Sweden)

Yuka Jinno

2014-11-01

Although second harmonic generation (SHG) microscopy provides unique imaging advantages for voltage imaging and other biological applications, genetically-encoded SHG chromophores remain relatively unexplored. SHG only arises from non-centrosymmetric media, so an anisotropic arrangement of chromophores is essential to provide strong SHG signals. Here, inspired by the mechanism by which K-Ras4B associates with plasma membranes, we sought to achieve asymmetric arrangements of chromophores at the membrane-cytoplasm interface using the fluorescent protein mVenus. After adding a farnesylation motif to the C-terminus of mVenus, nine amino acids composing its β-barrel surface were replaced by lysine, forming an electrostatic patch. This protein (mVe9Knus-CVIM) was efficiently targeted to the plasma membrane in a geometrically defined manner and exhibited SHG in HEK293 cells. In agreement with its design, mVe9Knus-CVIM hyperpolarizability was oriented at a small angle (~7.3°) from the membrane normal. Genetically-encoded SHG chromophores could serve as a molecular platform for imaging membrane potential.
SCALCE: boosting sequence compression algorithms using locally consistent encoding.

Science.gov (United States)

Hach, Faraz; Numanagic, Ibrahim; Alkan, Can; Sahinalp, S Cenk

2012-12-01

provides up to 2.01 times better compression while improving the running time by a factor of 5.17. SCALCE also provides the option to compress the quality scores as well as the read names, in addition to the reads themselves. This is achieved by compressing the quality scores through order-3 Arithmetic Coding (AC) and the read names through gzip, using the reordering SCALCE provides on the reads. This way, in comparison with gzip compression of the unordered FASTQ files (including reads, read names and quality scores), SCALCE (together with gzip and arithmetic encoding) can provide up to 3.34 improvement in the compression rate and 1.26 improvement in running time. Our algorithm, SCALCE (Sequence Compression Algorithm using Locally Consistent Encoding), is implemented in C++ with both gzip and bzip2 compression options. It also supports multithreading when the gzip option is selected and the pigz binary is available. It is available at http://scalce.sourceforge.net. Contact: fhach@cs.sfu.ca or cenk@cs.sfu.ca. Supplementary data are available at Bioinformatics online.

The Effects of Meiosis/Genetics Integration and Instructional Sequence on College Biology Student Achievement in Genetics.

Science.gov (United States)

Browning, Mark

The purpose of the research was to manipulate two aspects of genetics instruction in order to measure their effects on college, introductory biology students' achievement in genetics. One instructional sequence that was used dealt first with monohybrid autosomal inheritance patterns, then sex-linkage. The alternate sequence was the reverse.…
Experimental evolution, genetic analysis and genome re-sequencing reveal the mutation conferring artemisinin resistance in an isogenic lineage of malaria parasites

KAUST Repository

Hunt, Paul

2010-09-16

Background: Classical and quantitative linkage analyses of genetic crosses have traditionally been used to map genes of interest, such as those conferring chloroquine or quinine resistance in malaria parasites. Next-generation sequencing technologies now present the possibility of determining genome-wide genetic variation at single base-pair resolution. Here, we combine in vivo experimental evolution, a rapid genetic strategy and whole genome re-sequencing to identify the precise genetic basis of artemisinin resistance in a lineage of the rodent malaria parasite, Plasmodium chabaudi. Such genetic markers will further the investigation of resistance and its control in natural infections of the human malaria, P. falciparum. Results: A lineage of isogenic in vivo drug-selected mutant P. chabaudi parasites was investigated. By measuring the artemisinin responses of these clones, the appearance of an in vivo artemisinin resistance phenotype within the lineage was defined. The underlying genetic locus was mapped to a region of chromosome 2 by Linkage Group Selection in two different genetic crosses. Whole-genome deep coverage short-read re-sequencing (Illumina Solexa) defined the point mutations, insertions, deletions and copy-number variations arising in the lineage. Eight point mutations arise within the mutant lineage, only one of which appears on chromosome 2. This missense mutation arises contemporaneously with artemisinin resistance and maps to a gene encoding a de-ubiquitinating enzyme. Conclusions: This integrated approach facilitates the rapid identification of mutations conferring selectable phenotypes, without prior knowledge of biological and molecular mechanisms. For malaria, this model can identify candidate genes before resistant parasites are commonly observed in natural human malaria populations. © 2010 Hunt et al; licensee BioMed Central Ltd.
Systematic Dissection of Sequence Elements Controlling σ70 Promoters Using a Genomically-Encoded Multiplexed Reporter Assay in E. coli.

Science.gov (United States)

Urtecho, Guillaume; Tripp, Arielle D; Insigne, Kimberly; Kim, Hwangbeom; Kosuri, Sriram

2018-02-01

Promoters are the key drivers of gene expression and are largely responsible for the regulation of cellular responses to time and environment. In E. coli, decades of studies have revealed most, if not all, of the sequence elements necessary to encode promoter function. Despite our knowledge of these motifs, it is still not possible to predict the strength and regulation of a promoter from primary sequence alone. Here we develop a novel multiplexed assay to study promoter function in E. coli by building a site-specific genomic recombination-mediated cassette exchange (RMCE) system that allows for the facile construction and testing of large libraries of genetic designs integrated into precise genomic locations. We build and test a library of 10,898 σ70 promoter variants consisting of all combinations of a set of eight -35 elements, eight -10 elements, three UP elements, eight spacers, and eight backgrounds. We find that the -35 and -10 sequence elements can explain approximately 74% of the variance in promoter strength within our dataset using a simple log-linear statistical model. Neural network models can explain greater than 95% of the variance in our dataset, and show the increased power is due to nonlinear interactions of other elements such as the spacer, background, and UP elements.
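As a rough illustration of what a log-linear model of promoter strength can look like, the sketch below fits additive main effects of the -35 and -10 element identities to log-scale strength values and reports the fraction of variance explained. Everything here is an assumption for illustration: the data are synthetic random numbers, the function names are hypothetical, and the fit relies on a balanced two-factor design (every -35 variant paired with every -10 variant once), in which least-squares main effects reduce to marginal means. It is not the authors' model or data.

```python
import random
from itertools import product

random.seed(0)
N35, N10 = 8, 8  # eight -35 and eight -10 variants, as in the abstract

# Synthetic, made-up element effects on log strength (NOT real measurements).
eff35 = [random.gauss(0, 1) for _ in range(N35)]
eff10 = [random.gauss(0, 1) for _ in range(N10)]
log_strength = {
    (i, j): eff35[i] + eff10[j] + random.gauss(0, 0.5)
    for i, j in product(range(N35), range(N10))
}

def fit_additive(y):
    """Least-squares additive fit for a balanced two-factor design.

    Model: y[i, j] ~ grand + a[i] + b[j]; with balanced data the
    estimates are the grand mean and the centred marginal means.
    """
    grand = sum(y.values()) / len(y)
    a = [sum(y[i, j] for j in range(N10)) / N10 - grand for i in range(N35)]
    b = [sum(y[i, j] for i in range(N35)) / N35 - grand for j in range(N10)]
    return grand, a, b

def r_squared(y, grand, a, b):
    """Fraction of variance in y explained by the additive fit."""
    ss_tot = sum((v - grand) ** 2 for v in y.values())
    ss_res = sum((y[i, j] - (grand + a[i] + b[j])) ** 2 for (i, j) in y)
    return 1 - ss_res / ss_tot

grand, a, b = fit_additive(log_strength)
r2 = r_squared(log_strength, grand, a, b)
print(f"variance explained by -35 and -10 main effects: {r2:.2f}")
```

Whatever variance the additive fit leaves unexplained corresponds to noise or to interactions between elements, which is the kind of structure the abstract attributes to the neural network models.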
Genetic programs can be compressed and autonomously decompressed in live cells

Science.gov (United States)

Lapique, Nicolas; Benenson, Yaakov

2018-04-01

Fundamental computer science concepts have inspired novel information-processing molecular systems in test tubes and genetically encoded circuits in live cells. Recent research has shown that digital information storage in DNA, implemented using deep sequencing and conventional software, can approach the maximum Shannon information capacity of two bits per nucleotide. In nature, DNA is used to store genetic programs, but the information content of the encoding rarely approaches this maximum. We hypothesize that the biological function of a genetic program can be preserved while reducing the length of its DNA encoding and increasing the information content per nucleotide. Here we support this hypothesis by describing an experimental procedure for compressing a genetic program and its subsequent autonomous decompression and execution in human cells. As a test-bed we choose an RNAi cell classifier circuit that comprises redundant DNA sequences and is therefore amenable for compression, as are many other complex gene circuits. In one example, we implement a compressed encoding of a ten-gene four-input AND gate circuit using only four genetic constructs. The compression principles applied to gene circuits can enable fitting complex genetic programs into DNA delivery vehicles with limited cargo capacity, and storing compressed and biologically inert programs in vivo for on-demand activation.
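The two-bits-per-nucleotide ceiling mentioned in this abstract follows from a four-letter alphabet: log2(4) = 2. A minimal sketch of that arithmetic is given below, using a zeroth-order (composition-only) Shannon entropy. The function name and the toy sequences are hypothetical, and this simple estimate ignores repeats and longer-range redundancy, so it is not the measure used in the paper.

```python
import math
from collections import Counter

def bits_per_nucleotide(seq: str) -> float:
    """Zeroth-order Shannon entropy of a DNA string, in bits per nucleotide.

    Only base composition is considered; redundancy between positions
    (repeats, duplicated modules) is not captured by this estimate.
    """
    counts = Counter(seq.upper())
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

# Toy sequences, invented for illustration.
uniform = "ACGT" * 25                                 # all four bases equally frequent
skewed = "A" * 70 + "C" * 10 + "G" * 10 + "T" * 10    # strongly biased composition

h_uniform = bits_per_nucleotide(uniform)
h_skewed = bits_per_nucleotide(skewed)
print(h_uniform)  # reaches the 2-bit maximum for a four-letter alphabet
print(h_skewed)   # falls below 2 bits because the composition is biased
```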
Plastid, nuclear and reverse transcriptase sequences in the mitochondrial genome of Oenothera: is genetic information transferred between organelles via RNA?

Science.gov (United States)

Schuster, W; Brennicke, A

1987-01-01

We describe an open reading frame (ORF) with high homology to reverse transcriptase in the mitochondrial genome of Oenothera. This ORF displays all the characteristics of an active plant mitochondrial gene with a possible ribosome binding site and 39% T in the third codon position. It is located between a sequence fragment from the plastid genome and one of nuclear origin downstream from the gene encoding subunit 5 of the NADH dehydrogenase. The nuclear derived sequence consists of 528 nucleotides from the small ribosomal RNA and contains an expansion segment unique to nuclear rRNAs. The plastid sequence contains part of the ribosomal protein S4 and the complete tRNA(Ser). The observation that only transcribed sequences have been found in more than one subcellular compartment in higher plants suggests that interorganellar transfer of genetic information may occur via RNA and subsequent local reverse transcription and genomic integration. PMID:14650433
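A statistic such as "39% T in the third codon position" can be computed by sampling every third base of an in-frame ORF. The sketch below shows one way to do that; the function name and the toy ORF are hypothetical and are not taken from the Oenothera sequence.

```python
def third_position_fraction(orf: str, base: str = "T") -> float:
    """Fraction of codons in an in-frame ORF whose third base equals `base`.

    The ORF is assumed to start at the first base of a codon; any
    trailing partial codon is ignored.
    """
    orf = orf.upper()
    usable = len(orf) - len(orf) % 3   # drop an incomplete final codon
    thirds = orf[2:usable:3]           # bases at codon positions 3, 6, 9, ...
    return thirds.count(base.upper()) / len(thirds) if thirds else 0.0

# Toy ORF, invented for illustration: ATG GCT TTT GGA CCT TAA
toy_orf = "ATGGCTTTTGGACCTTAA"
t3 = third_position_fraction(toy_orf)
print(t3)  # three of the six codons end in T
```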
Mutagenesis in sequence encoding of human factor VII for gene therapy of hemophilia

Directory of Open Access Journals (Sweden)

B Kazemi

2009-12-01

Background: Current treatment of hemophilia, which is one of the most common bleeding disorders, involves replacement therapy using concentrates of FVIII and FIX. However, these concentrates have been associated with viral infections, thromboembolic complications and the development of antibodies. The use of recombinant human factor VII (rhFVII) is effective for the treatment of patients with hemophilia A or B who develop antibodies (referred to as inhibitors) against replacement therapy, because it induces coagulation independent of FVIII and FIX. However, its short half-life and high cost have limited its use. One potential solution to this problem may be the use of FVIIa gene transfer, which would attain continuing therapeutic levels of expression from a single injection. The aim of this study was to engineer a novel hFVII (human FVII) gene containing a cleavage site for the intracellular protease furin, by PCR mutagenesis. Methods: The sequences encoding the light and heavy chains of hFVII were amplified separately, using hFVII/pTZ57R and specific primers. The PCR products were cloned in the pTZ57R vector. Results and discussion: Cloning was confirmed by restriction analysis or PCR amplification using specific primers and plasmid universal primers. Mutagenesis of the sequences encoding the light and heavy chains was confirmed by restriction enzyme digestion. Conclusion: In the present study, recombinant plasmids based on mutant forms of the DNA encoding the light and heavy chains were produced. Joining the mutant DNA encoding the light chain with the mutant heavy chain led to a new variant of hFVII. This variant can be activated by furin, increasing the proportion of the activated form of FVII. This mutant form of hFVII may be used for gene therapy of hemophilia.
Genomic multiple sequence alignments: refinement using a genetic algorithm

Directory of Open Access Journals (Sweden)

Lefkowitz Elliot J

2005-08-01

Background: Genomic sequence data cannot be fully appreciated in isolation. Comparative genomics – the practice of comparing genomic sequences from different species – plays an increasingly important role in understanding the genotypic differences between species that result in phenotypic differences as well as in revealing patterns of evolutionary relationships. One of the major challenges in comparative genomics is producing a high-quality alignment between two or more related genomic sequences. In recent years, a number of tools have been developed for aligning large genomic sequences. Most utilize heuristic strategies to identify a series of strong sequence similarities, which are then used as anchors to align the regions between the anchor points. The resulting alignment is globally correct, but in many cases is suboptimal locally. We describe a new program, GenAlignRefine, which improves the overall quality of global multiple alignments by using a genetic algorithm to improve local regions of alignment. Regions of low quality are identified, realigned using the program T-Coffee, and then refined using a genetic algorithm. Because a better COFFEE (Consistency based Objective Function For alignmEnt Evaluation) score generally reflects greater alignment quality, the algorithm searches for an alignment that yields a better COFFEE score. To improve the intrinsic slowness of the genetic algorithm, GenAlignRefine was implemented as a parallel, cluster-based program. Results: We tested the GenAlignRefine algorithm by running it on a Linux cluster to refine sequences from a simulation, as well as refine a multiple alignment of 15 Orthopoxvirus genomic sequences approximately 260,000 nucleotides in length that initially had been aligned by Multi-LAGAN. It took approximately 150 minutes for a 40-processor Linux cluster to optimize some 200 fuzzy (poorly aligned) regions of the orthopoxvirus alignment. Overall sequence identity increased only
Genetic engineering of syringyl-enriched lignin in plants

Science.gov (United States)

Chiang, Vincent Lee; Li, Laigeng

2004-11-02

The present invention relates to a novel DNA sequence, which encodes a previously unidentified lignin biosynthetic pathway enzyme, sinapyl alcohol dehydrogenase (SAD) that regulates the biosynthesis of syringyl lignin in plants. Also provided are methods for incorporating this novel SAD gene sequence or substantially similar sequences into a plant genome for genetic engineering of syringyl-enriched lignin in plants.
Next generation sequencing and its applications in forensic genetics.

Science.gov (United States)

Børsting, Claus; Morling, Niels

2015-09-01

It has been almost a decade since the first next generation sequencing (NGS) technologies emerged and quickly changed the way genetic research is conducted. Today, full genomes are mapped and published almost weekly and with ever increasing speed and decreasing costs. NGS methods and platforms have matured during the last 10 years, and the quality of the sequences has reached a level where NGS is used in clinical diagnostics of humans. Forensic genetic laboratories have also explored NGS technologies and especially in the last year, there has been a small explosion in the number of scientific articles and presentations at conferences with forensic aspects of NGS. These contributions have demonstrated that NGS offers new possibilities for forensic genetic case work. More information may be obtained from unique samples in a single experiment by analyzing combinations of markers (STRs, SNPs, insertion/deletions, mRNA) that cannot be analyzed simultaneously with the standard PCR-CE methods used today. The true variation in core forensic STR loci has been uncovered, and previously unknown STR alleles have been discovered. The detailed sequence information may aid mixture interpretation and will increase the statistical weight of the evidence. In this review, we will give an introduction to NGS and single-molecule sequencing, and we will discuss the possible applications of NGS in forensic genetics. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Human α2-HS-glycoprotein: the A and B chains with a connecting sequence are encoded by a single mRNA transcript

International Nuclear Information System (INIS)

Lee, C.C.; Bowman, B.H.; Yang, F.

1987-01-01

The α2-HS-glycoprotein (AHSG) is a plasma protein reported to play roles in bone mineralization and in the immune response. It is composed of two subunits, the A and B chains. Recombinant plasmids containing human cDNA AHSG have been isolated by screening an adult human liver library with a mixed oligonucleotide probe. The cDNA clones containing AHSG inserts span approximately 1.5 kilobase pairs and include the entire AHSG coding sequence, demonstrating that the A and B chains are encoded by a single mRNA transcript. The cDNA sequence predicts an 18-amino-acid signal peptide, followed by the A-chain sequence of AHSG. A heretofore unseen connecting sequence of 40 amino acids was deduced between the A- and B-chain sequences. The connecting sequence demonstrates the unique amino acid doublets and collagen triplets found in the A and B chains; it is not homologous with other reported amino acid sequences. The connecting sequence may be cleaved in a posttranslational step by limited proteolysis before mature AHSG is released into the circulation or may vary in its presence because of alternative processing. The AHSG cDNA was utilized for mapping the AHSG gene to the 3q21→qter region of human chromosome 3. The availability of the AHSG cDNA clone will facilitate the analysis of its genetic control and gene expression during development and bone formation.
Screening for genetically modified organisms sequences in food ...

African Journals Online (AJOL)

We used the Allin 2.0 GMO screening system from Biosmart, Switzerland to screen for the presence of genetically modified food sequences in maize meal samples, fresh fruit and vegetables from some retailers around Gaborone, Botswana. The Allin 2.0 is a multiplex PCR system for the detection of genetically modified ...
Modeling genetic imprinting effects of DNA sequences with multilocus polymorphism data

Directory of Open Access Journals (Sweden)

Staud Roland

2009-08-01

Single nucleotide polymorphisms (SNPs) represent the most widespread type of DNA sequence variation in the human genome and they have recently emerged as valuable genetic markers for revealing the genetic architecture of complex traits in terms of nucleotide combination and sequence. Here, we extend an algorithmic model for the haplotype analysis of SNPs to estimate the effects of genetic imprinting expressed at the DNA sequence level. The model provides a general procedure for identifying the number and types of optimal DNA sequence variants that are expressed differently due to their parental origin. The model is used to analyze a genetic data set collected from a pain genetics project. We find that DNA haplotype GAC from three SNPs, OPRKG36T (with two alleles G and T), OPRKA843G (with alleles A and G), and OPRKC846T (with alleles C and T), at the kappa-opioid receptor, triggers a significant effect on pain sensitivity, but with expression significantly depending on the parent from which it is inherited (p = 0.008). With a tremendous advance in SNP identification and automated screening, the model founded on haplotype discovery and statistical inference may provide a useful tool for genetic analysis of any quantitative trait with complex inheritance.
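As a small illustration of the haplotype space this record refers to, the sketch below enumerates every allele combination across the three SNPs named in the abstract. It only lists candidate haplotypes; it is not the authors' imprinting model, and the dictionary layout and function name are illustrative assumptions.

```python
from itertools import product

# Alleles per SNP as listed in the abstract; the data layout is an assumption.
SNP_ALLELES = {
    "OPRKG36T": ("G", "T"),
    "OPRKA843G": ("A", "G"),
    "OPRKC846T": ("C", "T"),
}


def enumerate_haplotypes(snp_alleles):
    """Return every allele combination across the SNPs, in SNP order."""
    return ["".join(combo) for combo in product(*snp_alleles.values())]


haplotypes = enumerate_haplotypes(SNP_ALLELES)
print(len(haplotypes))      # number of possible three-SNP haplotypes
print("GAC" in haplotypes)  # the haplotype highlighted in the abstract
```

Three biallelic SNPs give 2 × 2 × 2 possible haplotypes, of which GAC is one; a statistical model such as the one described would then estimate effects for these candidates.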
Multilocus sequence analysis of Treponema denticola strains of diverse origin

Directory of Open Access Journals (Sweden)

Mo Sisu

2013-02-01

Background: The oral spirochete bacterium Treponema denticola is associated with both the incidence and severity of periodontal disease. Although the biological or phenotypic properties of a significant number of T. denticola isolates have been reported in the literature, their genetic diversity or phylogeny has never been systematically investigated. Here, we describe a multilocus sequence analysis (MLSA) of 20 of the most highly studied reference strains and clinical isolates of T. denticola, which were originally isolated from subgingival plaque samples taken from subjects from China, Japan, the Netherlands, Canada and the USA. Results: The sequences of the 16S ribosomal RNA gene and 7 conserved protein-encoding genes (flaA, recA, pyrH, ppnK, dnaN, era and radC) were successfully determined for each strain. Sequence data was analyzed using a variety of bioinformatic and phylogenetic software tools. We found no evidence of positive selection or DNA recombination within the protein-encoding genes, where levels of intraspecific sequence polymorphism varied from 18.8% (flaA) to 8.9% (dnaN). Phylogenetic analysis of the concatenated protein-encoding gene sequence data (ca. 6,513 nucleotides for each strain) using Bayesian and maximum likelihood approaches indicated that the T. denticola strains were monophyletic, and formed 6 well-defined clades. All analyzed T. denticola strains appeared to have a genetic origin distinct from that of ‘Treponema vincentii’ or Treponema pallidum. No specific geographical relationships could be established; but several strains isolated from different continents appear to be closely related at the genetic level. Conclusions: Our analyses indicate that previous biological and biophysical investigations have predominantly focused on a subset of T. denticola strains with a relatively narrow range of genetic diversity. Our methodology and results establish a genetic framework for the discrimination and phylogenetic
Genetically encoded proton sensors reveal activity-dependent pH changes in neurons

Directory of Open Access Journals (Sweden)

Joseph Valentino Raimondo

2012-05-01

The regulation of hydrogen ion concentration (pH) is fundamental to cell viability, metabolism and enzymatic function. Within the nervous system, the control of pH is also involved in diverse and dynamic processes including development, synaptic transmission and the control of network excitability. As pH affects neuronal activity, and can also itself be altered by neuronal activity, the existence of tools to accurately measure hydrogen ion fluctuations is important for understanding the role pH plays under physiological and pathological conditions. Outside of their use as a marker of synaptic release, genetically encoded pH sensors have not been utilised to study hydrogen ion fluxes associated with network activity. By combining whole-cell patch clamp with simultaneous two-photon or confocal imaging, we quantified the amplitude and time course of neuronal, intracellular, acidic transients evoked by epileptiform activity in two separate in vitro models of temporal lobe epilepsy. In doing so, we demonstrate the suitability of three genetically encoded pH sensors: deGFP4, E2GFP and Cl-sensor for investigating activity-dependent pH changes at the level of single neurons.
Genetically encoded proton sensors reveal activity-dependent pH changes in neurons.

Science.gov (United States)

Raimondo, Joseph V; Irkle, Agnese; Wefelmeyer, Winnie; Newey, Sarah E; Akerman, Colin J

2012-01-01

The regulation of hydrogen ion concentration (pH) is fundamental to cell viability, metabolism, and enzymatic function. Within the nervous system, the control of pH is also involved in diverse and dynamic processes including development, synaptic transmission, and the control of network excitability. As pH affects neuronal activity, and can also itself be altered by neuronal activity, the existence of tools to accurately measure hydrogen ion fluctuations is important for understanding the role pH plays under physiological and pathological conditions. Outside of their use as a marker of synaptic release, genetically encoded pH sensors have not been utilized to study hydrogen ion fluxes associated with network activity. By combining whole-cell patch clamp with simultaneous two-photon or confocal imaging, we quantified the amplitude and time course of neuronal, intracellular, acidic transients evoked by epileptiform activity in two separate in vitro models of temporal lobe epilepsy. In doing so, we demonstrate the suitability of three genetically encoded pH sensors: deGFP4, E(2)GFP, and Cl-sensor for investigating activity-dependent pH changes at the level of single neurons.
Hypermutability of CpG dinucleotides in the propeptide-encoding sequence of the human albumin gene

International Nuclear Information System (INIS)

Brennan, S.O.; Peach, R.; Myles, T.; George, P.; Arai, Kunio; Madison, J.; Watkins, S.; Putnam, F.W.; Laurell, C.B.; Galliano, M.

1990-01-01

An electrophoretically slow albumin variant was detected with a phenotype frequency of about 1:1,000 in Sweden and was also found in a family of Scottish descent from Kaikoura, New Zealand, and in five families in Tradate, Italy. Structural study established that the major variant component was arginyl-albumin, in which arginine at the -1 position of the propeptide is still attached to the processed albumin. A minor component with the amino-terminal sequence of proalbumin was also present as 3-6% of the total albumin. After amplification of the gene segment encoding the prepro sequence of albumin, specific hybridization of DNA to an oligonucleotide probe encoding cysteine at position -2 indicated the mutation of arginine at the -2 position to cysteine (-2 Arg → Cys). This produced the propeptide sequence Arg-Gly-Val-Phe-Cys-Arg. This was confirmed by sequence analysis after pyridylethylation of the cysteine. This mutation produces an alternate signal peptidase cleavage site in the variant proalbumin precursor of arginyl-albumin giving rise to two possible products, arginyl-albumin and the variant proalbumin. Another plasma from Bremen had an alloalbumin with a previously described substitution (1 Asp → Val), which also affects propeptide cleavage. Hypermutability of two CpG dinucleotides in the codons for the diarginyl sequence may account for the frequency of mutations in the propeptide. Mutation at these two sites results in a series of recurrent proalbumin variants that have arisen independently in diverse populations
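The connection between CpG hypermutability and the Arg → Cys substitution can be illustrated with the standard genetic code: each of the four CGN arginine codons begins with a CpG, and a C→T transition at that site (or G→A, if the change arose on the opposite strand) alters the encoded residue. The abstract does not state which codons are used in the albumin gene, so the codons below are hypothetical examples, and the function name is an assumption.

```python
# Subset of the standard genetic code needed for this illustration.
CODON_TABLE = {
    "CGT": "Arg", "CGC": "Arg", "CGA": "Arg", "CGG": "Arg",
    "TGT": "Cys", "TGC": "Cys", "TGA": "Stop", "TGG": "Trp",
    "CAT": "His", "CAC": "His", "CAA": "Gln", "CAG": "Gln",
}


def cpg_transitions(codon):
    """Return codons produced by a transition at each CpG within the codon.

    For every "CG" found, two outcomes are listed: C->T on the coding
    strand, and G->A (a C->T change on the opposite strand).
    """
    results = []
    for i in range(len(codon) - 1):
        if codon[i:i + 2] == "CG":
            results.append(codon[:i] + "T" + codon[i + 1:])
            results.append(codon[:i + 1] + "A" + codon[i + 2:])
    return results


# Hypothetical arginine codons; the real albumin codons are not given above.
for arg_codon in ("CGT", "CGC", "CGA", "CGG"):
    outcomes = [(c, CODON_TABLE[c]) for c in cpg_transitions(arg_codon)]
    print(arg_codon, "->", outcomes)
```

Under this sketch a CGT or CGC arginine codon can become a cysteine codon through a single transition, which is consistent with the -2 Arg → Cys change reported in the record.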
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

2000-07-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
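The lengths quoted in this record are internally consistent, which a short check makes explicit. The variable names are illustrative, and the non-coding estimate assumes the cDNA contains the stop codon, which the abstract does not state.

```python
# Lengths quoted in the abstract (amino acids unless noted otherwise).
signal_peptide = 17
mature_polypeptide = 187
hevein_region = 43        # amino-terminal region identical to hevein
c_terminal_region = 144   # carboxyl-terminal portion
orf_amino_acids = 204
cdna_length_nt = 1018     # nucleotides

# Signal sequence plus polypeptide should account for the whole ORF.
orf_matches = signal_peptide + mature_polypeptide == orf_amino_acids

# The two described regions should account for the whole polypeptide.
polypeptide_matches = hevein_region + c_terminal_region == mature_polypeptide

# Coding nucleotides for 204 residues, plus 3 for an assumed stop codon.
coding_nt = orf_amino_acids * 3
non_coding_nt = cdna_length_nt - coding_nt - 3

print(orf_matches, polypeptide_matches, coding_nt, non_coding_nt)
```

On these assumptions roughly 400 of the 1018 nucleotides lie outside the open reading frame.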
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

1999-05-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 12 figs.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

1999-05-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Chua, N.H.; Kush, A.

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli. 11 figures.

Genetic and functional analysis of the gene encoding GAP-43 in schizophrenia.

Science.gov (United States)

Shen, Yu-Chih; Tsai, Ho-Min; Cheng, Min-Chih; Hsu, Shih-Hsin; Chen, Shih-Fen; Chen, Chia-Hsiang

2012-02-01

In earlier reports, growth-associated protein 43 (GAP-43) has been shown to be critical for initial establishment or reorganization of synaptic connections, a process thought to be disrupted in schizophrenia. Additionally, abnormal GAP-43 expression in different brain regions has been linked to this disorder in postmortem brain studies. In this study, we investigated the involvement of the gene encoding GAP-43 in the susceptibility to schizophrenia. We searched for genetic variants in the promoter region and 3 exons (including both UTR ends) of the GAP-43 gene using direct sequencing in a sample of patients with schizophrenia (n=586) and non-psychotic controls (n=576), both being Han Chinese from Taiwan, and conducted an association and functional study. We identified 11 common polymorphisms in the GAP-43 gene. SNP and haplotype-based analyses displayed no associations with schizophrenia. Additionally, we identified 4 rare variants in 5 out of 586 patients, including 1 variant located at the promoter region (c.-258-4722G>T) and 1 synonymous (V110V) and 2 missense (G150R and P188L) variants located at exon 2. No rare variants were found in the control subjects. The results of the reporter gene assay demonstrated that the regulatory activity of construct containing c.-258-4722T was significantly lower as compared to the wild type construct (c.-258-4722G; panalysis also demonstrated the functional relevance of other rare variants. Our study lends support to the hypothesis of multiple rare mutations in schizophrenia, and it provides genetic clues that indicate the involvement of GAP-43 in this disorder. Copyright © 2011 Elsevier B.V. All rights reserved.
Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

Directory of Open Access Journals (Sweden)

Martin Mascher

The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS), a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL) population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data: the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, a paucity in the number of SNPs that were jointly discovered by the different pipelines, indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new
Methods of combined bioprocessing and related microorganisms, thermophilic and/or acidophilic enzymes, and nucleic acids encoding said enzymes

Energy Technology Data Exchange (ETDEWEB)

Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Ward, Thomas E.

2017-08-15

A genetically modified organism comprising: at least one nucleic acid sequence and/or at least one recombinant nucleic acid isolated from Alicyclobacillus acidocaldarius and encoding a polypeptide involved in at least partially degrading, cleaving, transporting, metabolizing, or removing polysaccharides, cellulose, lignocellulose, hemicellulose, lignin, starch, sugars, sugar oligomers, carbohydrates, complex carbohydrates, chitin, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups; and at least one nucleic acid sequence and/or at least one recombinant nucleic acid encoding a polypeptide involved in fermenting sugar molecules to a product. Additionally, enzymatic and/or proteinaceous extracts may be isolated from one or more genetically modified organisms. The extracts are utilized to convert biomass into a product. Further provided are methods of converting biomass into products comprising: placing the genetically modified organism and/or enzymatic extracts thereof in fluid contact with polysaccharides, cellulose, lignocellulose, hemicellulose, lignin, starch, sugars, sugar oligomers, carbohydrates, complex carbohydrates, chitin, heteroxylans, glycosides, and/or xylan-, glucan-, galactan-, or mannan-decorating groups.
Methods of combined bioprocessing and related microorganisms, thermophilic and/or acidophilic enzymes, and nucleic acids encoding said enzymes

Science.gov (United States)

Thompson, David N; Apel, William A; Thompson, Vicki S; Ward, Thomas E

2013-07-23

A genetically modified organism comprising: at least one nucleic acid sequence and/or at least one recombinant nucleic acid isolated from Alicyclobacillus acidocaldarius and encoding a polypeptide involved in at least partially degrading, cleaving, transporting, metabolizing, or removing polysaccharides, cellulose, lignocellulose, hemicellulose, lignin, starch, sugars, sugar oligomers, carbohydrates, complex carbohydrates, chitin, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups; and at least one nucleic acid sequence and/or at least one recombinant nucleic acid encoding a polypeptide involved in fermenting sugar molecules to a product. Additionally, enzymatic and/or proteinaceous extracts may be isolated from one or more genetically modified organisms. The extracts are utilized to convert biomass into a product. Further provided are methods of converting biomass into products comprising: placing the genetically modified organism and/or enzymatic extracts thereof in fluid contact with polysaccharides, cellulose, lignocellulose, hemicellulose, lignin, starch, sugars, sugar oligomers, carbohydrates, complex carbohydrates, chitin, heteroxylans, glycosides, and/or xylan-, glucan-, galactan-, or mannan-decorating groups.
Methods of combined bioprocessing and related microorganisms, thermophilic and/or acidophilic enzymes, and nucleic acids encoding said enzymes

Energy Technology Data Exchange (ETDEWEB)

Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Ward, Thomas E.

2016-03-22

A genetically modified organism comprising: at least one nucleic acid sequence and/or at least one recombinant nucleic acid isolated from Alicyclobacillus acidocaldarius and encoding a polypeptide involved in at least partially degrading, cleaving, transporting, metabolizing, or removing polysaccharides, cellulose, lignocellulose, hemicellulose, lignin, starch, sugars, sugar oligomers, carbohydrates, complex carbohydrates, chitin, heteroxylans, glycosides, xylan-, glucan-, galactan-, or mannan-decorating groups; and at least one nucleic acid sequence and/or at least one recombinant nucleic acid encoding a polypeptide involved in fermenting sugar molecules to a product. Additionally, enzymatic and/or proteinaceous extracts may be isolated from one or more genetically modified organisms. The extracts are utilized to convert biomass into a product. Further provided are methods of converting biomass into products comprising: placing the genetically modified organism and/or enzymatic extracts thereof in fluid contact with polysaccharides, cellulose, lignocellulose, hemicellulose, lignin, starch, sugars, sugar oligomers, carbohydrates, complex carbohydrates, chitin, heteroxylans, glycosides, and/or xylan-, glucan-, galactan-, or mannan-decorating groups.
a permutation encoding te algorithm solution of reso tation encoding

African Journals Online (AJOL)

eobe

Keywords: Genetic algorithm, resource constrained. 1. INTRODUCTION. 1. .... Nigerian Journal of Technology. Vol. 34, No. 1, January 2015. 128 ... 4. ENCODING OF CHROMOSOME. ENCODING OF CHROMOSOME .... International Multi conference of Engineers and ... method”, Naval Research Logistics, vol 48, issue 2,.
Nucleotide sequences of the genes encoding fructosebisphosphatase and phosphoribulokinase from Xanthobacter flavus H4-14

NARCIS (Netherlands)

Meijer, Wilhelmus; Enequist, H.G.; Terpstra, Peter; Dijkhuizen, L.

The genes encoding fructosebisphosphatase and phosphoribulokinase present on a 2.5 kb SalI fragment from Xanthobacter flavus H4-14 were sequenced. Two large open reading frames (ORFs) were identified, preceded by plausible ribosome-binding sites. The ORFs were transcribed in the same direction and
Hierarchical assembly of viral nanotemplates with encoded microparticles via nucleic acid hybridization.

Science.gov (United States)

Tan, Wui Siew; Lewis, Christina L; Horelik, Nicholas E; Pregibon, Daniel C; Doyle, Patrick S; Yi, Hyunmin

2008-11-04

We demonstrate hierarchical assembly of tobacco mosaic virus (TMV)-based nanotemplates with hydrogel-based encoded microparticles via nucleic acid hybridization. TMV nanotemplates possess a highly defined structure and a genetically engineered high density thiol functionality. The encoded microparticles are produced in a high throughput microfluidic device via stop-flow lithography (SFL) and consist of spatially discrete regions containing encoded identity information, an internal control, and capture DNAs. For the hybridization-based assembly, partially disassembled TMVs were programmed with linker DNAs that contain sequences complementary to both the virus 5' end and a selected capture DNA. Fluorescence microscopy, atomic force microscopy (AFM), and confocal microscopy results clearly indicate facile assembly of TMV nanotemplates onto microparticles with high spatial and sequence selectivity. We anticipate that our hybridization-based assembly strategy could be employed to create multifunctional viral-synthetic hybrid materials in a rapid and high-throughput manner. Additionally, we believe that these viral-synthetic hybrid microparticles may find broad applications in high capacity, multiplexed target sensing.
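The linker design described in this record rests on base-pair complementarity, which can be sketched as follows. The sequences are invented placeholders (not the real TMV 5' end or any capture probe from the study), and the segment order and the absence of a spacer are assumptions made only for illustration.

```python
# Base-pairing partners written as DNA; "U" is accepted so RNA targets work.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def complementary_dna(target):
    """Return the DNA strand (5'->3') that base-pairs with a DNA or RNA target."""
    return "".join(COMPLEMENT[base] for base in reversed(target.upper()))


# Hypothetical placeholder sequences, written 5'->3'.
virus_5prime_rna = "GUAUUUAC"
capture_dna = "ATCGGCTT"

# A linker carrying one segment complementary to each partner.
linker = complementary_dna(capture_dna) + complementary_dna(virus_5prime_rna)
print(linker)
```

One segment of such a linker would hybridize to the exposed viral sequence and the other to the capture DNA on the microparticle, which is the pairing logic the abstract describes.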
Increasing the reach of forensic genetics with massively parallel sequencing.

Science.gov (United States)

Budowle, Bruce; Schmedes, Sarah E; Wendt, Frank R

2017-09-01

The field of forensic genetics has made great strides in the analysis of biological evidence related to criminal and civil matters. More so, the discipline has set a standard of performance and quality in the forensic sciences. The advent of massively parallel sequencing will allow the field to expand its capabilities substantially. This review describes the salient features of massively parallel sequencing and how it can impact forensic genetics. The features of this technology offer increased number and types of genetic markers that can be analyzed, higher throughput of samples, and the capability of targeting different organisms, all by one unifying methodology. While there are many applications, three are described where massively parallel sequencing will have immediate impact: molecular autopsy, microbial forensics and differentiation of monozygotic twins. The intent of this review is to expose the forensic science community to the potential enhancements that have arrived or are soon to arrive and to demonstrate the continued expansion of the field of forensic genetics and its service in the investigation of legal matters.
Isolation and sequence of complementary DNA encoding human extracellular superoxide dismutase

International Nuclear Information System (INIS)

Hjalmarsson, K.; Marklund, S.L.; Engstroem, A.; Edlund, T.

1987-01-01

A complementary DNA (cDNA) clone from a human placenta cDNA library encoding extracellular superoxide dismutase has been isolated and the nucleotide sequence determined. The cDNA has a very high G + C content. EC-SOD is synthesized with a putative 18-amino acid signal peptide, preceding the 222 amino acids in the mature enzyme, indicating that the enzyme is a secretory protein. The first 95 amino acids of the mature enzyme show no sequence homology with other sequenced proteins and there is one possible N-glycosylation site (Asn-89). The amino acid sequence from residues 96-193 shows strong homology (∼ 50%) with the final two-thirds of the sequences of all known eukaryotic CuZn SODs, whereas the homology with the P. leiognathi CuZn SOD is clearly lower. The ligands to Cu and Zn, the cysteines forming the intrasubunit disulfide bridge in the CuZn SODs, and the arginine found in all CuZn SODs in the entrance to the active site can all be identified in EC-SOD. A comparison with bovine CuZn SOD, the three-dimensional structure of which is known, reveals that the homologies occur in the active site and the divergencies are in the part constituting the subunit contact area in CuZn SOD. Amino acid sequence 194-222 in the carboxyl-terminal end of EC-SOD is strongly hydrophilic and contains nine amino acids with a positive charge. This sequence probably confers the affinity of EC-SOD for heparin and heparan sulfate. An analysis of the amino acid sequence homologies with CuZn SODs from various species indicates that the EC-SODs may have evolved from the CuZn SODs before the evolution of fungi and plants.
Genetic diagnosis of Mendelian disorders via RNA sequencing.

Science.gov (United States)

Kremer, Laura S; Bader, Daniel M; Mertes, Christian; Kopajtich, Robert; Pichler, Garwin; Iuso, Arcangela; Haack, Tobias B; Graf, Elisabeth; Schwarzmayr, Thomas; Terrile, Caterina; Koňaříková, Eliška; Repp, Birgit; Kastenmüller, Gabi; Adamski, Jerzy; Lichtner, Peter; Leonhardt, Christoph; Funalot, Benoit; Donati, Alice; Tiranti, Valeria; Lombes, Anne; Jardel, Claude; Gläser, Dieter; Taylor, Robert W; Ghezzi, Daniele; Mayr, Johannes A; Rötig, Agnes; Freisinger, Peter; Distelmaier, Felix; Strom, Tim M; Meitinger, Thomas; Gagneur, Julien; Prokisch, Holger

2017-06-12

Across a variety of Mendelian disorders, ∼50-75% of patients do not receive a genetic diagnosis by exome sequencing indicating disease-causing variants in non-coding regions. Although genome sequencing in principle reveals all genetic variants, their sizeable number and poorer annotation make prioritization challenging. Here, we demonstrate the power of transcriptome sequencing to molecularly diagnose 10% (5 of 48) of mitochondriopathy patients and identify candidate genes for the remainder. We find a median of one aberrantly expressed gene, five aberrant splicing events and six mono-allelically expressed rare variants in patient-derived fibroblasts and establish disease-causing roles for each kind. Private exons often arise from cryptic splice sites providing an important clue for variant prioritization. One such event is found in the complex I assembly factor TIMMDC1 establishing a novel disease-associated gene. In conclusion, our study expands the diagnostic tools for detecting non-exonic variants and provides examples of intronic loss-of-function variants with pathological relevance.
Exome sequencing identifies variants in two genes encoding the LIM-proteins NRAP and FHL1 in an Italian patient with BAG3 myofibrillar myopathy.

Science.gov (United States)

D'Avila, Francesca; Meregalli, Mirella; Lupoli, Sara; Barcella, Matteo; Orro, Alessandro; De Santis, Francesca; Sitzia, Clementina; Farini, Andrea; D'Ursi, Pasqualina; Erratico, Silvia; Cristofani, Riccardo; Milanesi, Luciano; Braga, Daniele; Cusi, Daniele; Poletti, Angelo; Barlassina, Cristina; Torrente, Yvan

2016-06-01

Myofibrillar myopathies (MFMs) are genetically heterogeneous dystrophies characterized by the disintegration of Z-disks and myofibrils and are associated with mutations in genes encoding Z-disk or Z-disk-related proteins. The c.626 C > T (p.P209L) mutation in the BAG3 gene has been described as causative of a subtype of MFM. We report a sporadic case of a 26-year-old Italian woman, affected by MFM with axonal neuropathy, cardiomyopathy, rigid spine, who carries the c.626 C > T mutation in the BAG3 gene. The patient and her non-consanguineous healthy parents and brother were studied with whole exome sequencing (WES) to further investigate the genetic basis of this complex phenotype. In the patient, we found that the BAG3 mutation is associated with variants in the NRAP and FHL1 genes that encode muscle-specific, LIM domain containing proteins. Quantitative real time PCR, immunohistochemistry and Western blot analysis of the patient's muscular biopsy showed the absence of NRAP expression and FHL1 accumulation in aggregates in the affected skeletal muscle tissue. Molecular dynamic analysis of the mutated FHL1 domain showed a modification in its surface charge, which could affect its capability to bind its target proteins. To our knowledge this is the first study reporting, in a BAG3 MFM, the simultaneous presence of genetic variants in the BAG3 and FHL1 genes (previously described as independently associated with MFMs) and linking the NRAP gene to MFM for the first time.
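The variant notation in this record, c.626 C > T (p.P209L), can be checked with simple arithmetic: coding position 626 should fall inside codon 209, and a C-to-T change at that position should turn a proline codon into a leucine codon. The following is a minimal sketch of that check; the function name and the small codon sets are introduced here for illustration only and do not come from the study.

```python
# Hypothetical helper: map a 1-based coding-DNA (c.) position to its codon.
def codon_of_cdna_position(pos):
    """Return (codon number, 1-based position within that codon)."""
    if pos < 1:
        raise ValueError("coding positions are 1-based")
    return (pos - 1) // 3 + 1, (pos - 1) % 3 + 1

# Standard genetic code, restricted to the two amino acids involved.
PROLINE_CODONS = {"CCT", "CCC", "CCA", "CCG"}
LEUCINE_CODONS = {"CTT", "CTC", "CTA", "CTG", "TTA", "TTG"}

codon_number, offset = codon_of_cdna_position(626)
print(codon_number, offset)

# Replacing the base at that offset with T in every proline codon
# always yields a leucine codon, consistent with p.P209L.
mutated = {c[:offset - 1] + "T" + c[offset:] for c in PROLINE_CODONS}
print(mutated <= LEUCINE_CODONS)
```

The same mapping applies to any substitution written in coding-DNA coordinates, as long as the position is counted from the A of the start codon.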
Acral peeling skin syndrome resulting from a homozygous nonsense mutation in the CSTA gene encoding cystatin A.

Science.gov (United States)

Krunic, Aleksandar L; Stone, Kristina L; Simpson, Michael A; McGrath, John A

2013-01-01

Acral peeling skin syndrome (APSS) is a clinically and genetically heterogeneous disorder. We used whole-exome sequencing to identify the molecular basis of APSS in a consanguineous Jordanian-American pedigree. We identified a homozygous nonsense mutation (p.Lys22X) in the CSTA gene, encoding cystatin A, that was confirmed using Sanger sequencing. Cystatin A is a protease inhibitor found in the cornified cell envelope, and loss-of-function mutations have previously been reported in two cases of exfoliative ichthyosis. Our study expands the molecular pathology of APSS and demonstrates the value of next-generation sequencing in the genetic characterization of inherited skin diseases. © 2013 Wiley Periodicals, Inc.
Role of Virus-Encoded microRNAs in Avian Viral Diseases

Directory of Open Access Journals (Sweden)

Yongxiu Yao

2014-03-01

With total dependence on the host cell, several viruses have adopted strategies to modulate the host cellular environment, including the modulation of the microRNA (miRNA) pathway through virus-encoded miRNAs. Several avian viruses, mostly herpesviruses, have been shown to encode a number of novel miRNAs. These include the highly oncogenic Marek’s disease virus-1 (26 miRNAs), avirulent Marek’s disease virus-2 (36 miRNAs), herpesvirus of turkeys (28 miRNAs), infectious laryngotracheitis virus (10 miRNAs), duck enteritis virus (33 miRNAs) and avian leukosis virus (2 miRNAs). Despite the closer antigenic and phylogenetic relationship among some of the herpesviruses, miRNAs encoded by different viruses showed no sequence conservation, although locations of some of the miRNAs were conserved within the repeat regions of the genomes. However, some of the virus-encoded miRNAs showed significant sequence homology with host miRNAs demonstrating their ability to serve as functional orthologs. For example, mdv1-miR-M4-5p, a functional ortholog of gga-miR-155, is critical for the oncogenicity of Marek’s disease virus. Additionally, we also describe the potential association of the recently described avian leukosis virus subgroup J encoded E (XSR) miRNA in the induction of myeloid tumors in certain genetically-distinct chicken lines. In this review, we describe the advances in our understanding of the role of virus-encoded miRNAs in avian diseases.
Whole Genome Sequences of Three Treponema pallidum ssp. pertenue Strains: Yaws and Syphilis Treponemes Differ in Less than 0.2% of the Genome Sequence

Science.gov (United States)

Chen, Lei; Pospíšilová, Petra; Strouhal, Michal; Qin, Xiang; Mikalová, Lenka; Norris, Steven J.; Muzny, Donna M.; Gibbs, Richard A.; Fulton, Lucinda L.; Sodergren, Erica; Weinstock, George M.; Šmajs, David

2012-01-01

Background: The yaws treponemes, Treponema pallidum ssp. pertenue (TPE) strains, are closely related to syphilis-causing strains of Treponema pallidum ssp. pallidum (TPA). Both yaws and syphilis are distinguished on the basis of epidemiological characteristics, clinical symptoms, and several genetic signatures of the corresponding causative agents. Methodology/Principal Findings: To precisely define genetic differences between TPA and TPE, high-quality whole genome sequences of three TPE strains (Samoa D, CDC-2, Gauthier) were determined using next-generation sequencing techniques. TPE genome sequences were compared to four genomes of TPA strains (Nichols, DAL-1, SS14, Chicago). The genome structure was identical in all three TPE strains with similar length ranging between 1,139,330 bp and 1,139,744 bp. No major genome rearrangements were found when compared to the four TPA genomes. The whole genome nucleotide divergence (dA) between TPA and TPE subspecies was 4.7 and 4.8 times higher than the observed nucleotide diversity (π) among TPA and TPE strains, respectively, corresponding to 99.8% identity between TPA and TPE genomes. A set of 97 (9.9%) TPE genes encoded proteins containing two or more amino acid replacements or other major sequence changes. The TPE divergent genes were mostly from the group encoding potential virulence factors and genes encoding proteins with unknown function. Conclusions/Significance: Hypothetical genes, with genetic differences, consistently found between TPE and TPA strains are candidates for syphilitic treponemes virulence factors. Seventeen TPE genes were predicted under positive selection, and eleven of them coded either for predicted exported proteins or membrane proteins suggesting their possible association with the cell surface. Sequence changes between TPE and TPA strains and changes specific to individual strains represent suitable targets for subspecies- and strain-specific molecular diagnostics. PMID:22292095
Genetic diversity in Breonadia salicina based on intra-species sequence variation of chloroplast DNA spacer sequence

International Nuclear Information System (INIS)

Qurainy, F.A.; Gaafar, A.R.Z.

2014-01-01

Assessment and knowledge of the genetic diversity and variation within and between populations of rare and endangered plants is very important for effective conservation. Intergenic spacer sequence variation at the psbA-trnH locus of the chloroplast genome was assessed within Breonadia salicina (Rubiaceae), a critically endangered plant species endemic to the southwestern part of the Kingdom of Saudi Arabia. The sequence data obtained from 19 individuals in three populations revealed nine haplotypes, and the aligned sequences from the Saudi accessions extended to 355 bp. A high level of haplotype diversity (Hd = 0.842) and low level of nucleotide diversity (Pi = 0.0058) were detected. Consistently, both hierarchical analysis of molecular variance (AMOVA) and the constructed neighbor-joining tree indicated null genetic differentiation among populations. This level of differentiation between populations or between regions in psbA-trnH sequences may be due to effects of the abundance of ancestral haplotype sharing and the presence of private haplotypes fixed for each population. Furthermore, the results revealed almost the same level of genetic diversity in comparison with Yemeni accessions, with Saudi accessions sharing three of the four haplotypes found in Yemeni accessions. (author)
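The haplotype diversity (Hd) reported in this record is a sample statistic computed from haplotype counts. The abstract does not state which estimator was used; the sketch below assumes the widely used Nei estimator, Hd = n/(n-1) × (1 - Σ p_i²), where n is the number of sampled individuals and p_i the frequency of haplotype i. The function name and the example counts are hypothetical and are not the study's data.

```python
# Minimal sketch of Nei's haplotype diversity, assuming that estimator applies.
def haplotype_diversity(counts):
    """counts: number of sampled individuals carrying each haplotype."""
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two sampled individuals are required")
    sum_sq_freq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1 - sum_sq_freq)

# Hypothetical counts for illustration only (not taken from the record).
print(round(haplotype_diversity([2, 2]), 4))
print(haplotype_diversity([4]))
```

A sample in which every individual carries the same haplotype gives 0, and a sample in which every individual carries a different haplotype gives 1.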
Molecular cloning and sequence of cDNA encoding the plasma membrane proton pump (H+-ATPase) of Arabidopsis thaliana

International Nuclear Information System (INIS)

Harper, J.F.; Surowy, T.K.; Sussman, M.R.

1989-01-01

In plants, the transport of solutes across the plasma membrane is driven by a proton pump (H+-ATPase) that produces an electric potential and pH gradient. The authors isolated and sequenced a full-length cDNA clone that encodes this enzyme in Arabidopsis thaliana. The protein predicted from its nucleotide sequence encodes 959 amino acids and has a molecular mass of 104,207 Da. The plant protein shows structural features common to a family of cation-translocating ATPases found in the plasma membrane of prokaryotic and eukaryotic cells, with the greatest overall identity in amino acid sequence (36%) to the H+-ATPase observed in the plasma membrane of fungi. The structure predicted from a hydropathy plot contains at least eight transmembrane segments, with most of the protein (73%) extending into the cytoplasm and only 5% of the residues exposed on the external surface. Unique features of the plant enzyme include diverged sequences at the amino and carboxyl termini as well as greater hydrophilic character in three extracellular loops.
Using "Arabidopsis" Genetic Sequences to Teach Bioinformatics

Science.gov (United States)

Zhang, Xiaorong

2009-01-01

This article describes a new approach to teaching bioinformatics using "Arabidopsis" genetic sequences. Several open-ended and inquiry-based laboratory exercises have been designed to help students grasp key concepts and gain practical skills in bioinformatics, using "Arabidopsis" leucine-rich repeat receptor-like kinase (LRR…
Applying Next Generation Sequencing to Skeletal Development and Disease

OpenAIRE

Bowen, Margot Elizabeth

2013-01-01

Next Generation Sequencing (NGS) technologies have dramatically increased the throughput and lowered the cost of DNA sequencing. In this thesis, I apply these technologies to unresolved questions in skeletal development and disease. Firstly, I use targeted re-sequencing of genomic DNA to identify the genetic cause of the cartilage tumor syndrome, metachondromatosis (MC). I show that the majority of MC patients carry heterozygous loss-of-function mutations in the PTPN11 gene, which encodes a p...
Genetic variants in nuclear-encoded mitochondrial genes influence AIDS progression.

Directory of Open Access Journals (Sweden)

Sher L Hendrickson

2010-09-01

The human mitochondrial genome includes only 13 coding genes while nuclear-encoded genes account for 99% of proteins responsible for mitochondrial morphology, redox regulation, and energetics. Mitochondrial pathogenesis occurs in HIV patients and genetically, mitochondrial DNA haplogroups with presumed functional differences have been associated with differential AIDS progression. Here we explore whether single nucleotide polymorphisms (SNPs) within 904 of the estimated 1,500 genes that specify nuclear-encoded mitochondrial proteins (NEMPs) influence AIDS progression among HIV-1 infected patients. We examined NEMPs for association with the rate of AIDS progression using genotypes generated by an Affymetrix 6.0 genotyping array of 1,455 European American patients from five US AIDS cohorts. Successfully genotyped SNPs gave 50% or better haplotype coverage for 679 of known NEMP genes. With a Bonferroni adjustment for the number of genes and tests examined, multiple SNPs within two NEMP genes showed significant association with AIDS progression: acyl-CoA synthetase medium-chain family member 4 (ACSM4) on chromosome 12 and peroxisomal D3,D2-enoyl-CoA isomerase (PECI) on chromosome 6. Our previous studies on mitochondrial DNA showed that European haplogroups with presumed functional differences were associated with AIDS progression and HAART mediated adverse events. The modest influences of nuclear-encoded mitochondrial genes found in the current study add support to the idea that mitochondrial function plays a role in AIDS pathogenesis.

Next generation sequencing and its applications in forensic genetics

DEFF Research Database (Denmark)

Børsting, Claus; Morling, Niels

2015-01-01

It has been almost a decade since the first next generation sequencing (NGS) technologies emerged and quickly changed the way genetic research is conducted. Today, full genomes are mapped and published almost weekly and with ever increasing speed and decreasing costs. NGS methods and platforms have matured during the last 10 years, and the quality of the sequences has reached a level where NGS is used in clinical diagnostics of humans. Forensic genetic laboratories have also explored NGS technologies and especially in the last year, there has been a small explosion in the number of scientific articles and presentations at conferences with forensic aspects of NGS. These contributions have demonstrated that NGS offers new possibilities for forensic genetic case work. More information may be obtained from unique samples in a single experiment by analyzing combinations of markers (STRs, SNPs...
Genetic Code Analysis Toolkit: A novel tool to explore the coding properties of the genetic code and DNA sequences

Science.gov (United States)

Kraljić, K.; Strüngmann, L.; Fimmel, E.; Gumbel, M.

2018-01-01

The genetic code is degenerate, and it is assumed that redundancy provides error detection and correction mechanisms in the translation process. However, the biological meaning of the code's structure is still under investigation. This paper presents a Genetic Code Analysis Toolkit (GCAT) which provides workflows and algorithms for the analysis of the structure of nucleotide sequences. In particular, sets or sequences of codons can be transformed and tested for circularity, comma-freeness, dichotomic partitions and others. GCAT comes with a fertile editor custom-built to work with the genetic code and a batch mode for multi-sequence processing. With the ability to read FASTA files or load sequences from GenBank, the tool can be used for the mathematical and statistical analysis of existing sequence data. GCAT is Java-based and provides a plug-in concept for extensibility. Availability: Open source. Homepage: http://www.gcat.bio/
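One of the properties named in this record, comma-freeness, lends itself to a short brute-force illustration. A set of codons is comma-free if, whenever any two of its codons are written next to each other, neither of the two shifted reading frames that straddle them spells a codon from the set. The sketch below is not GCAT's implementation (GCAT is Java-based); the function name and the example codon sets are hypothetical.

```python
from itertools import product

# Brute-force comma-freeness test for a set of trinucleotides (a sketch only).
def is_comma_free(codons):
    code = set(codons)
    if any(len(c) != 3 for c in code):
        raise ValueError("all codons must have length 3")
    for first, second in product(code, repeat=2):
        joined = first + second
        # Frames shifted by one and by two nucleotides across the junction.
        if joined[1:4] in code or joined[2:5] in code:
            return False
    return True

print(is_comma_free({"ACG", "TCG"}))   # no shifted frame hits the set
print(is_comma_free({"ACG", "CGA"}))   # ACGACG contains CGA in a shifted frame
print(is_comma_free({"AAA"}))          # AAAAAA reads AAA in every frame
```

The check runs over all ordered pairs, including a codon paired with itself, which is why a homopolymer codon such as AAA can never belong to a comma-free code.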
A genetically-encoded chloride and pH sensor for dissociating ion dynamics in the nervous system.

Science.gov (United States)

Raimondo, Joseph V; Joyce, Bradley; Kay, Louise; Schlagheck, Theresa; Newey, Sarah E; Srinivas, Shankar; Akerman, Colin J

2013-01-01

Within the nervous system, intracellular Cl(-) and pH regulate fundamental processes including cell proliferation, metabolism, synaptic transmission, and network excitability. Cl(-) and pH are often co-regulated, and network activity results in the movement of both Cl(-) and H(+). Tools to accurately measure these ions are crucial for understanding their role under physiological and pathological conditions. Although genetically-encoded Cl(-) and pH sensors have been described previously, these either lack ion specificity or are unsuitable for neuronal use. Here we present ClopHensorN-a new genetically-encoded ratiometric Cl(-) and pH sensor that is optimized for the nervous system. We demonstrate the ability of ClopHensorN to dissociate and simultaneously quantify Cl(-) and H(+) concentrations under a variety of conditions. In addition, we establish the sensor's utility by characterizing activity-dependent ion dynamics in hippocampal neurons.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, Natasha V. (Okemos, MI); Broekaert, Willem F. (Dilbeek, BE); Chua, Nam-Hai (Scarsdale, NY); Kush, Anil (New York, NY)

2000-07-04

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
Whole Exome Sequencing Reveals Genetic Predisposition in a Large Family with Retinitis Pigmentosa

Directory of Open Access Journals (Sweden)

Juan Wu

2014-01-01

Next-generation sequencing has become more widely used to reveal genetic defects in monogenic disorders. Retinitis pigmentosa (RP), the leading cause of hereditary blindness worldwide, has been attributed to more than 67 disease-causing genes. Due to the extreme genetic heterogeneity, using general molecular screening alone is inadequate for identifying genetic predispositions in susceptible individuals. In order to identify the underlying mutation rapidly, we utilized next-generation sequencing in a four-generation Chinese family with RP. Two affected patients and an unaffected sibling were subjected to whole exome sequencing. Through bioinformatics analysis and direct sequencing confirmation, we identified the p.R135W transition in the rhodopsin gene. The mutation was subsequently confirmed to cosegregate with the disease in the family. In this study, our results suggest that whole exome sequencing is a robust method in diagnosing familial hereditary disease.
Evidence for Human Fronto-Central Gamma Activity during Long-Term Memory Encoding of Word Sequences

Science.gov (United States)

Meeuwissen, Esther Berendina; Takashima, Atsuko; Fernández, Guillén; Jensen, Ole

2011-01-01

Although human gamma activity (30–80 Hz) associated with visual processing is often reported, it is not clear to what extent gamma activity can be reliably detected non-invasively from frontal areas during complex cognitive tasks such as long term memory (LTM) formation. We conducted a memory experiment composed of 35 blocks each having three parts: LTM encoding, working memory (WM) maintenance and LTM retrieval. In the LTM encoding and WM maintenance parts, participants had to respectively encode or maintain the order of three sequentially presented words. During LTM retrieval subjects had to reproduce these sequences. Using magnetoencephalography (MEG) we identified significant differences in the gamma and beta activity. Robust gamma activity (55–65 Hz) in left BA6 (supplementary motor area (SMA)/pre-SMA) was stronger during LTM rehearsal than during WM maintenance. The gamma activity was sustained throughout the 3.4 s rehearsal period during which a fixation cross was presented. Importantly, the difference in gamma band activity correlated with memory performance over subjects. Further we observed a weak gamma power difference in left BA6 during the first half of the LTM rehearsal interval larger for successfully than unsuccessfully reproduced word triplets. In the beta band, we found a power decrease in left anterior regions during LTM rehearsal compared to WM maintenance. Also this suppression of beta power correlated with memory performance over subjects. Our findings show that an extended network of brain areas, characterized by oscillatory activity in different frequency bands, supports the encoding of word sequences in LTM. Gamma band activity in BA6 possibly reflects memory processes associated with language and timing, and suppression of beta activity at left frontal sensors is likely to reflect the release of inhibition directly associated with the engagement of language functions. PMID:21738641
Evidence for human fronto-central gamma activity during long-term memory encoding of word sequences.

Directory of Open Access Journals (Sweden)

Esther Berendina Meeuwissen

Although human gamma activity (30-80 Hz) associated with visual processing is often reported, it is not clear to what extent gamma activity can be reliably detected non-invasively from frontal areas during complex cognitive tasks such as long term memory (LTM) formation. We conducted a memory experiment composed of 35 blocks each having three parts: LTM encoding, working memory (WM) maintenance and LTM retrieval. In the LTM encoding and WM maintenance parts, participants had to respectively encode or maintain the order of three sequentially presented words. During LTM retrieval subjects had to reproduce these sequences. Using magnetoencephalography (MEG) we identified significant differences in the gamma and beta activity. Robust gamma activity (55-65 Hz) in left BA6 (supplementary motor area (SMA)/pre-SMA) was stronger during LTM rehearsal than during WM maintenance. The gamma activity was sustained throughout the 3.4 s rehearsal period during which a fixation cross was presented. Importantly, the difference in gamma band activity correlated with memory performance over subjects. Further we observed a weak gamma power difference in left BA6 during the first half of the LTM rehearsal interval larger for successfully than unsuccessfully reproduced word triplets. In the beta band, we found a power decrease in left anterior regions during LTM rehearsal compared to WM maintenance. Also this suppression of beta power correlated with memory performance over subjects. Our findings show that an extended network of brain areas, characterized by oscillatory activity in different frequency bands, supports the encoding of word sequences in LTM. Gamma band activity in BA6 possibly reflects memory processes associated with language and timing, and suppression of beta activity at left frontal sensors is likely to reflect the release of inhibition directly associated with the engagement of language functions.
Complete Genome Sequence of Staphylococcus epidermidis 1457.

Science.gov (United States)

Galac, Madeline R; Stam, Jason; Maybank, Rosslyn; Hinkle, Mary; Mack, Dietrich; Rohde, Holger; Roth, Amanda L; Fey, Paul D

2017-06-01

Staphylococcus epidermidis 1457 is a frequently utilized strain that is amenable to genetic manipulation and has been widely used for biofilm-related research. We report here the whole-genome sequence of this strain, which encodes 2,277 protein-coding genes and 81 RNAs within its 2.4-Mb genome and plasmid. Copyright © 2017 Galac et al.
A genetically-encoded chloride and pH sensor for dissociating ion dynamics in the nervous system

Science.gov (United States)

Raimondo, Joseph V.; Joyce, Bradley; Kay, Louise; Schlagheck, Theresa; Newey, Sarah E.; Srinivas, Shankar; Akerman, Colin J.

2013-01-01

Within the nervous system, intracellular Cl− and pH regulate fundamental processes including cell proliferation, metabolism, synaptic transmission, and network excitability. Cl− and pH are often co-regulated, and network activity results in the movement of both Cl− and H+. Tools to accurately measure these ions are crucial for understanding their role under physiological and pathological conditions. Although genetically-encoded Cl− and pH sensors have been described previously, these either lack ion specificity or are unsuitable for neuronal use. Here we present ClopHensorN—a new genetically-encoded ratiometric Cl− and pH sensor that is optimized for the nervous system. We demonstrate the ability of ClopHensorN to dissociate and simultaneously quantify Cl− and H+ concentrations under a variety of conditions. In addition, we establish the sensor's utility by characterizing activity-dependent ion dynamics in hippocampal neurons. PMID:24312004
A genetically-encoded chloride and pH sensor for dissociating ion dynamics in the nervous system

Directory of Open Access Journals (Sweden)

Joseph Valentino Raimondo

2013-11-01

Within the nervous system, intracellular Cl- and pH regulate fundamental processes including cell proliferation, metabolism, synaptic transmission and network excitability. Cl- and pH are often co-regulated, and network activity results in the movement of both Cl- and H+. Tools to accurately measure these ions are crucial for understanding their role under physiological and pathological conditions. Although genetically-encoded Cl- and pH sensors have been described previously, these either lack ion specificity or are unsuitable for neuronal use. Here we present ClopHensorN - a new genetically-encoded ratiometric Cl- and pH sensor that is optimized for the nervous system. We demonstrate the ability of ClopHensorN to dissociate and simultaneously quantify Cl- and H+ concentrations under a variety of conditions. In addition, we establish the sensor’s utility by characterizing activity-dependent ion dynamics in hippocampal neurons.
On the Edge of Language Acquisition: Inherent Constraints on Encoding Multisyllabic Sequences in the Neonate Brain

Science.gov (United States)

Ferry, Alissa L.; Fló, Ana; Brusini, Perrine; Cattarossi, Luigi; Macagno, Francesco; Nespor, Marina; Mehler, Jacques

2016-01-01

To understand language, humans must encode information from rapid, sequential streams of syllables--tracking their order and organizing them into words, phrases, and sentences. We used Near-Infrared Spectroscopy (NIRS) to determine whether human neonates are born with the capacity to track the positions of syllables in multisyllabic sequences.…
GENETIC POLYMORPHISM IN GYMNODINIUM GALATHEANUM CHLOROPLAST DNA SEQUENCES AND DEVELOPMENT OF A MOLECULAR DETECTION ASSAY. (R827084)

Science.gov (United States)

Nuclear and chloroplast-encoded small subunit ribosomal DNA sequences were obtained from several strains of the toxic dinoflagellate Gymnodinium galatheanum. Phylogenetic analyses and comparison of sequences indicate that the chloroplast sequences show a higher degree of se...
Impact of exome sequencing in inflammatory bowel disease

Science.gov (United States)

Cardinale, Christopher J; Kelsen, Judith R; Baldassano, Robert N; Hakonarson, Hakon

2013-01-01

Approaches to understanding the genetic contribution to inflammatory bowel disease (IBD) have continuously evolved from family- and population-based epidemiology, to linkage analysis, and most recently, to genome-wide association studies (GWAS). The next stage in this evolution seems to be the sequencing of the exome, that is, the regions of the human genome which encode proteins. The GWAS approach has been very fruitful in identifying at least 163 loci as being associated with IBD, and now, exome sequencing promises to take our genetic understanding to the next level. In this review we will discuss the possible contributions that can be made by an exome sequencing approach both at the individual patient level to aid with disease diagnosis and future therapies, as well as in advancing knowledge of the pathogenesis of IBD. PMID:24187447
Simple sequence repeat marker development and genetic mapping ...

Indian Academy of Sciences (India)

polymorphic SSR (simple sequence repeats) markers from libraries enriched for GA, CAA and AAT repeats, as well as 6 ... ers for quinoa was the development of a genetic linkage map ...... Weber J. L. 1990 Informativeness of human (dC-dA)n.
Codon size reduction as the origin of the triplet genetic code.

Directory of Open Access Journals (Sweden)

Pavel V Baranov

The genetic code appears to be optimized in its robustness to missense errors and frameshift errors. In addition, the genetic code is near-optimal in terms of its ability to carry information in addition to the sequences of encoded proteins. As evolution has no foresight, optimality of the modern genetic code suggests that it evolved from less optimal code variants. The length of codons in the genetic code is also optimal, as three is the minimal nucleotide combination that can encode the twenty standard amino acids. The apparent impossibility of transitions between codon sizes in a discontinuous manner during evolution has resulted in an unbending view that the genetic code was always triplet. Yet, recent experimental evidence on quadruplet decoding, as well as the discovery of organisms with ambiguous and dual decoding, suggest that the possibility of the evolution of triplet decoding from living systems with non-triplet decoding merits reconsideration and further exploration. To explore this possibility we designed a mathematical model of the evolution of primitive digital coding systems which can decode nucleotide sequences into protein sequences. These coding systems can evolve their nucleotide sequences via genetic events of Darwinian evolution, such as point-mutations. The replication rates of such coding systems depend on the accuracy of the generated protein sequences. Computer simulations based on our model show that decoding systems with codons of length greater than three spontaneously evolve into predominantly triplet decoding systems. Our findings suggest a plausible scenario for the evolution of the triplet genetic code in a continuous manner. This scenario suggests an explanation of how protein synthesis could be accomplished by means of long RNA-RNA interactions prior to the emergence of the complex decoding machinery, such as the ribosome, that is required for stabilization and discrimination of otherwise weak triplet codon
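The claim in this record that three is the minimal codon length able to encode the twenty standard amino acids follows from counting: with four nucleotides, codons of length L give 4^L distinct words, so 4² = 16 is too few and 4³ = 64 is enough. A minimal sketch of that arithmetic is given below; the function name is hypothetical and the code is not part of the authors' evolutionary model.

```python
# Smallest word length L such that alphabet_size ** L >= n_symbols.
def minimal_codon_length(alphabet_size, n_symbols):
    if alphabet_size < 2 or n_symbols < 1:
        raise ValueError("need an alphabet of at least 2 letters and 1 symbol")
    length = 1
    while alphabet_size ** length < n_symbols:
        length += 1
    return length

print(minimal_codon_length(4, 20))  # twenty amino acids
print(minimal_codon_length(4, 21))  # twenty amino acids plus a stop signal
print(4 ** 2, 4 ** 3)               # doublet versus triplet capacity
```

The surplus of 64 triplets over 20 amino acids is also what makes the code redundant.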
Nucleotide sequence of Phaseolus vulgaris L. alcohol dehydrogenase encoding cDNA and three-dimensional structure prediction of the deduced protein.

Science.gov (United States)

Amelia, Kassim; Khor, Chin Yin; Shah, Farida Habib; Bhore, Subhash J

2015-01-01

Common beans (Phaseolus vulgaris L.) are widely consumed as a source of proteins and natural products. However, their yield needs to be increased. In line with the agenda of Phaseomics (an international consortium), work on generating expressed sequence tags (ESTs) from bean pods was initiated. Altogether, 5972 ESTs have been isolated. Alcohol dehydrogenase (AD) encoding gene cDNA was a noticeable transcript among the generated ESTs. This AD is an important enzyme; therefore, this study was undertaken to understand more about it. The objective of this study was to elucidate the P. vulgaris L. AD (PvAD) gene cDNA sequence and to predict the three-dimensional (3D) structure of the deduced protein. Positive and negative strands of the PvAD cDNA clone were sequenced using M13 forward and M13 reverse primers to elucidate the nucleotide sequence. Deduced PvAD cDNA and protein sequences were analyzed for their basic features using online bioinformatics tools. Sequence comparison was carried out using the bl2seq program, and the tree-view program was used to construct a phylogenetic tree. The secondary structures and 3D structure of PvAD protein were predicted by using the PHYRE automatic fold recognition server. The sequencing results analysis showed that PvAD cDNA is 1294 bp in length. Its open reading frame encodes a protein that contains 371 amino acids. Deduced protein sequence analysis showed the presence of putative substrate binding, catalytic Zn binding, and NAD binding sites. Results indicate that the predicted 3D structure of PvAD protein is analogous to the experimentally determined crystal structure of s-nitrosoglutathione reductase from an Arabidopsis species. The 1294 bp long PvAD cDNA encodes a 371 amino acid long protein that contains conserved domains required for biological functions of AD. The predicted deduced PvAD protein's 3D structure reflects the analogy with the crystal structure of Arabidopsis thaliana s-nitrosoglutathione reductase. Further study is required
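The step from a cDNA sequence to the length of its encoded protein, as reported in this record, rests on locating an open reading frame (ORF). The sketch below shows one simple way to do that on the forward strand only, assuming an ORF runs from the first ATG in a frame to the next in-frame stop codon. It is an illustration, not the tool the authors used; the function name and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Longest ATG-to-stop ORF on the forward strand (a sketch; no reverse strand).
def longest_orf(seq):
    """Return (start, end, n_amino_acids) with 0-based start and exclusive end
    that includes the stop codon, or None if no complete ORF is found."""
    seq = seq.upper()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                n_amino_acids = (i - start) // 3  # stop codon not counted
                if best is None or n_amino_acids > best[2]:
                    best = (start, i + 3, n_amino_acids)
                start = None
    return best

# Toy sequence for illustration only: ATG AAA TTT GGG TAA flanked by CC.
print(longest_orf("CCATGAAATTTGGGTAACC"))
```

By the same counting, a 371-residue protein corresponds to 371 × 3 = 1113 coding nucleotides plus a 3-nucleotide stop codon, which fits inside a 1294 bp cDNA with room left for untranslated regions.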
Determination of genetic relatedness from low-coverage human genome sequences using pedigree simulations.

Science.gov (United States)

Martin, Michael D; Jay, Flora; Castellano, Sergi; Slatkin, Montgomery

2017-08-01

We develop and evaluate methods for inferring relatedness among individuals from low-coverage DNA sequences of their genomes, with particular emphasis on sequences obtained from fossil remains. We suggest the major factors complicating the determination of relatedness among ancient individuals are sequencing depth, the number of overlapping sites, the sequencing error rate and the presence of contamination from present-day genetic sources. We develop a theoretical model that facilitates the exploration of these factors and their relative effects, via measurement of pairwise genetic distances, without calling genotypes, and determine the power to infer relatedness under various scenarios of varying sequencing depth, present-day contamination and sequencing error. The model is validated by a simulation study as well as the analysis of aligned sequences from present-day human genomes. We then apply the method to the recently published genome sequences of ancient Europeans, developing a statistical treatment to determine confidence in assigned relatedness that is, in some cases, more precise than previously reported. As the majority of ancient specimens are from animals, this method would be applicable to investigate kinship in nonhuman remains. The developed software grups (Genetic Relatedness Using Pedigree Simulations) is implemented in Python and freely available. © 2017 John Wiley & Sons Ltd.
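The idea of measuring pairwise genetic distance without calling genotypes can be sketched as follows. This is an illustrative sketch only, not the grups implementation: the function name, the pileup data layout (site mapped to the string of observed bases), and the one-read-per-site sampling scheme are assumptions made for the example.

```python
import random


def pairwise_mismatch_rate(pileup_a, pileup_b, rng=None):
    """Estimate a pairwise mismatch rate from two low-coverage pileups.

    Each pileup maps a site identifier to the string of bases observed there.
    At every site covered in both individuals, one observed base is drawn at
    random from each individual and the two draws are compared, so no
    genotype is ever called.
    """
    rng = rng or random.Random(0)  # fixed seed keeps the sketch reproducible
    compared = 0
    mismatches = 0
    for site in sorted(pileup_a.keys() & pileup_b.keys()):
        reads_a, reads_b = pileup_a[site], pileup_b[site]
        if not reads_a or not reads_b:
            continue
        compared += 1
        if rng.choice(reads_a) != rng.choice(reads_b):
            mismatches += 1
    if compared == 0:
        return None, 0
    return mismatches / compared, compared


# Hypothetical pileups: sites 1 and 2 overlap, sites 3 and 4 do not.
ind_a = {1: "A", 2: "C", 3: "G"}
ind_b = {1: "A", 2: "T", 4: "G"}
rate, n_sites = pairwise_mismatch_rate(ind_a, ind_b)
```

In an approach of this kind, the observed rate would then be compared with rates expected for known relationships (for example from pedigree simulations), with the number of overlapping sites determining how much confidence the comparison deserves.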
Somatic Genetic Variation in Solid Pseudopapillary Tumor of the Pancreas by Whole Exome Sequencing

Directory of Open Access Journals (Sweden)

Meng Guo

2017-01-01

Solid pseudopapillary tumor of the pancreas (SPT) is a rare pancreatic disease with a unique clinical manifestation. Although CTNNB1 gene mutations have been universally reported, genetic variation profiles of SPT are largely unidentified. We conducted whole exome sequencing in nine SPT patients to probe the SPT-specific insertions and deletions (indels) and single nucleotide polymorphisms (SNPs). In total, 54 SNPs and 41 indels of prominent variations were demonstrated through parallel exome sequencing. We detected CTNNB1 mutations in all patients studied (100%), and a higher count of SNPs was particularly detected in patients with older age, larger tumor, and metastatic disease. By aggregating 95 detected variation events and viewing the interconnections among each of the genes with variations, CTNNB1 was identified as the core portion in the network, which might collaborate with other events such as variations of USP9X, EP400, HTT, MED12, and PKD1 to regulate tumorigenesis. Pathway analysis showed that the events involved in other cancers had the potential to influence the progression of the SNPs count. Our study revealed an insight into the variation of the gene encoding region underlying solid-pseudopapillary neoplasm tumorigenesis. The detection of these variations might partly reflect the potential molecular mechanism.
A High Resolution Genetic Map Anchoring Scaffolds of the Sequenced Watermelon Genome

Science.gov (United States)

Kou, Qinghe; Jiang, Jiao; Guo, Shaogui; Zhang, Haiying; Hou, Wenju; Zou, Xiaohua; Sun, Honghe; Gong, Guoyi; Levi, Amnon; Xu, Yong

2012-01-01

As part of our ongoing efforts to sequence and map the watermelon (Citrullus spp.) genome, we have constructed a high density genetic linkage map. The map positioned 234 watermelon genome sequence scaffolds (an average size of 1.41 Mb) that cover about 330 Mb and account for 93.5% of the 353 Mb of the assembled genomic sequences of the elite Chinese watermelon line 97103 (Citrullus lanatus var. lanatus). The genetic map was constructed using an F8 population of 103 recombinant inbred lines (RILs). The RILs are derived from a cross between the line 97103 and the United States Plant Introduction (PI) 296341-FR (C. lanatus var. citroides) that contains resistance to fusarium wilt (races 0, 1, and 2). The genetic map consists of eleven linkage groups that include 698 simple sequence repeat (SSR), 219 insertion-deletion (InDel) and 36 structure variation (SV) markers and spans ∼800 cM with a mean marker interval of 0.8 cM. Using fluorescent in situ hybridization (FISH) with 11 BACs that produced chromosome-specific signals, we have depicted watermelon chromosomes that correspond to the eleven linkage groups constructed in this study. The high resolution genetic map developed here should be a useful platform for the assembly of the watermelon genome, for the development of sequence-based markers used in breeding programs, and for the identification of genes associated with important agricultural traits. PMID:22247776
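The summary figures in this abstract are mutually consistent, which a few lines of arithmetic make explicit. The variable names below are illustrative; all numbers are taken from the abstract itself.

```python
# Marker counts reported for the linkage map.
ssr, indel, sv = 698, 219, 36
total_markers = ssr + indel + sv            # 953 markers in total

# Map length divided by marker count approximates the mean marker interval.
map_length_cm = 800                          # reported as roughly 800 cM
mean_interval_cm = map_length_cm / total_markers

# Anchored scaffolds: count times average size, and share of the assembly.
anchored_mb = 234 * 1.41                     # about 330 Mb
anchored_fraction = 330 / 353                # share of the 353 Mb assembly

print(f"{total_markers} markers, {mean_interval_cm:.1f} cM mean interval")
print(f"{anchored_mb:.0f} Mb anchored, {anchored_fraction:.1%} of assembly")
```

Dividing by the marker count rather than by the number of intervals within each linkage group is a simplification; at this marker density the difference does not change the rounded value.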
Towards rationally redesigning bacterial signaling systems using information encoded in abundant sequence data

Science.gov (United States)

Cheng, Ryan; Morcos, Faruck; Levine, Herbert; Onuchic, Jose

2014-03-01

An important challenge in biology is to distinguish the subset of residues that allow bacterial two-component signaling (TCS) proteins to preferentially interact with their correct TCS partner such that they can bind and transfer signal. Detailed knowledge of this information would allow one to search sequence-space for mutations that can systematically tune the signal transmission between TCS partners as well as re-encode a TCS protein to preferentially transfer signals to a non-partner. Motivated by the notion that this detailed information is found in sequence data, we explore the mutual sequence co-evolution between signaling partners to infer how mutations can positively or negatively alter their interaction. Using Direct Coupling Analysis (DCA) for determining evolutionarily conserved interprotein interactions, we apply a DCA-based metric to quantify mutational changes in the interaction between TCS proteins and demonstrate that it accurately correlates with experimental mutagenesis studies probing the mutational change in the in vitro phosphotransfer. Our methodology serves as a potential framework for the rational design of TCS systems as well as a framework for the system-level study of protein-protein interactions in sequence-rich systems. This research has been supported by the NSF INSPIRE award MCB-1241332 and by the CTBP sponsored by the NSF (Grant PHY-1308264).

Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.

Science.gov (United States)

Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich

2012-02-01

The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Characterization and immunological identification of cDNA clones encoding two human DNA topoisomerase II isozymes

International Nuclear Information System (INIS)

Chung, T.D.Y.; Drake, F.H.; Tan, K.B.; Per, S.R.; Crooke, S.T.; Mirabelli, C.K.

1989-01-01

Several DNA topoisomerase II partial cDNA clones obtained from a human Raji-HN2 cDNA library were sequenced and two classes of nucleotide sequences were found. One member of the first class, SP1, was identical to an internal fragment of human HeLa cell Topo II cDNA described earlier. A member of the second class, SP11, shared extensive nucleotide (75%) and predicted peptide (92%) sequence similarities with the first two-thirds of HeLa Topo II. Each class of cDNAs hybridized to unique, nonoverlapping restriction enzyme fragments of genomic DNA from several human cell lines. Synthetic 24-mer oligonucleotide probes specific for each cDNA class hybridized to 6.5-kilobase mRNAs; furthermore, hybridization of probe specific for one class was not blocked by probe specific for the other. Antibodies raised against a synthetic SP1-encoded dodecapeptide specifically recognized the 170-kDa form of Topo II, while antibodies raised against the corresponding SP11-encoded dodecapeptide, or a second unique SP11-encoded tridecapeptide, selectively recognized the 180-kDa form of Topo II. These data provide genetic and immunochemical evidence for two Topo II isozymes
A weighted U-statistic for genetic association analyses of sequencing data.

Science.gov (United States)

Wei, Changshuai; Li, Ming; He, Zihuai; Vsevolozhskaya, Olga; Schaid, Daniel J; Lu, Qing

2014-12-01

With advancements in next-generation sequencing technology, a massive amount of sequencing data is generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used sequence kernel association test (SKAT) method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol. © 2014 WILEY PERIODICALS, INC.
Imaging of Intracellular pH in Tumor Spheroids Using Genetically Encoded Sensor SypHer2.

Science.gov (United States)

Zagaynova, Elena V; Druzhkova, Irina N; Mishina, Natalia M; Ignatova, Nadezhda I; Dudenkova, Varvara V; Shirmanova, Marina V

2017-01-01

Intracellular pH (pHi) is one of the most important parameters that regulate the physiological state of cells and tissues. pHi homeostasis is crucial for normal cell functioning. Cancer cells are characterized by having a higher (neutral to slightly alkaline) pHi and lower (acidic) extracellular pH (pHe) compared to normal cells. This is referred to as a "reversed" pH gradient, and is essential in supporting their accelerated growth rate, invasion and migration, in suppressing anti-tumor immunity, in promoting metabolic coupling with fibroblasts and in preventing apoptosis. Moreover, abnormal pH, both pHi and pHe, contributes to drug resistance in cancers. Therefore, the development of methods for measuring pH in living tumor cells is likely to lead to better understanding of tumor biology and to open new ways for cancer treatment. Genetically encoded, fluorescent, pH-sensitive probes represent promising instruments enabling the subcellular measurement of pHi with unrivaled specificity and high accuracy. Here, we describe a protocol for pHi imaging at a microscopic level in HeLa tumor spheroids, using the genetically encoded ratiometric (dual-excitation) pHi indicator, SypHer2.
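The core computation behind any dual-excitation ratiometric readout is a pixelwise ratio of two background-corrected images. The sketch below illustrates that step only and is not the published protocol: the function name, the nested-list image representation, and the background values are assumptions, and converting a ratio into a pH value additionally requires a calibration curve that is not shown here.

```python
def ratio_image(channel_a, channel_b, bg_a=0.0, bg_b=0.0, eps=1e-9):
    """Pixelwise ratio of channel B over channel A after background subtraction.

    channel_a and channel_b are equally sized 2D lists of emission intensities
    recorded under the two excitation wavelengths. Pixels whose corrected
    denominator is not above eps are returned as NaN instead of raising.
    """
    ratios = []
    for row_a, row_b in zip(channel_a, channel_b):
        out_row = []
        for a, b in zip(row_a, row_b):
            a_corr = max(a - bg_a, 0.0)
            b_corr = max(b - bg_b, 0.0)
            out_row.append(b_corr / a_corr if a_corr > eps else float("nan"))
        ratios.append(out_row)
    return ratios


# Hypothetical one-row images with a background level of 10 in both channels.
example = ratio_image([[110, 210]], [[210, 110]], bg_a=10, bg_b=10)
```

Because the ratio cancels factors common to both channels, such as local sensor concentration and optical path length, it is less sensitive to expression level than a single-channel intensity.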
StrigoQuant: A genetically encoded biosensor for quantifying strigolactone activity and specificity

KAUST Repository

Samodelov, S. L.

2016-11-05

Strigolactones are key regulators of plant development and interaction with symbiotic fungi; however, quantitative tools for strigolactone signaling analysis are lacking. We introduce a genetically encoded hormone biosensor used to analyze strigolactone-mediated processes, including the study of the components involved in the hormone perception/signaling complex and the structural specificity and sensitivity of natural and synthetic strigolactones in Arabidopsis, providing quantitative insights into the stereoselectivity of strigolactone perception. Given the high specificity, sensitivity, dynamic range of activity, modular construction, ease of implementation, and wide applicability, the biosensor StrigoQuant will be useful in unraveling multiple levels of strigolactone metabolic and signaling networks.
Monitoring Human-Induced Pluripotent Stem Cell-Derived Cardiomyocytes with Genetically Encoded Calcium and Voltage Fluorescent Reporters

Directory of Open Access Journals (Sweden)

Rami Shinnawi

2015-10-01

The advent of the human-induced pluripotent stem cell (hiPSC) technology has transformed biomedical research, providing new tools for human disease modeling, drug development, and regenerative medicine. To fulfill its unique potential in the cardiovascular field, efficient methods should be developed for high-resolution, large-scale, long-term, and serial functional cellular phenotyping of hiPSC-derived cardiomyocytes (hiPSC-CMs). To achieve this goal, we combined the hiPSC technology with genetically encoded voltage (ArcLight) and calcium (GCaMP5G) fluorescent indicators. Expression of ArcLight and GCaMP5G in hiPSC-CMs made it possible to reliably follow changes in transmembrane potential and intracellular calcium levels, respectively. This allowed monitoring of short- and long-term changes in action-potential and calcium-handling properties and of the development of arrhythmias in response to several pharmaceutical agents and in hiPSC-CMs derived from patients with different inherited arrhythmogenic syndromes. Combining genetically encoded fluorescent reporters with hiPSC-CMs may bring a unique value to the study of inherited disorders, developmental biology, and drug development and testing.
Can a single-shot black-blood T2-weighted spin-echo echo-planar imaging sequence with sensitivity encoding replace the respiratory-triggered turbo spin-echo sequence for the liver? An optimization and feasibility study.

Science.gov (United States)

Hussain, Shahid M; De Becker, Jan; Hop, Wim C J; Dwarkasing, Soendersing; Wielopolski, Piotr A

2005-03-01

To optimize and assess the feasibility of a single-shot black-blood T2-weighted spin-echo echo-planar imaging (SSBB-EPI) sequence for MRI of the liver using sensitivity encoding (SENSE), and compare the results with those obtained with a T2-weighted turbo spin-echo (TSE) sequence. Six volunteers and 16 patients were scanned at 1.5T (Philips Intera). In the volunteer study, we optimized the SSBB-EPI sequence by interactively changing the parameters (i.e., the resolution, echo time (TE), diffusion weighting with low b-values, and polarity of the phase-encoding gradient) with regard to distortion, suppression of the blood signal, and sensitivity to motion. The influence of each change was assessed. The optimized SSBB-EPI sequence was applied in patients (N = 16). A number of items, including the overall image quality (on a scale of 1-5), were used for graded evaluation. In addition, the signal-to-noise ratio (SNR) of the liver was calculated. Statistical analysis was carried out with the use of Wilcoxon's signed rank test for comparison of the SSBB-EPI and TSE sequences, with P = 0.05 considered the limit for significance. The SSBB-EPI sequence was improved by the following steps: 1) fewer frequency points than phase-encoding steps, 2) a b-factor of 20, and 3) a reversed polarity of the phase-encoding gradient. In patients, the mean overall image quality scores for the optimized SSBB-EPI (3.5 (range: 1-4)) and TSE (3.6 (range: 3-4)), and the SNR of the liver on SSBB-EPI (mean ± SD = 7.6 ± 4.0) and TSE (8.9 ± 4.6), were not significantly different (P > .05). Optimized SSBB-EPI with SENSE proved to be feasible in patients, and the overall image quality and SNR of the liver were comparable to those achieved with the standard respiratory-triggered T2-weighted TSE sequence. © 2005 Wiley-Liss, Inc.
Genetic encoding of a bicyclo[6.1.0]nonyne-charged amino acid enables fast cellular protein imaging by metal-free ligation.

Science.gov (United States)

Borrmann, Annika; Milles, Sigrid; Plass, Tilman; Dommerholt, Jan; Verkade, Jorge M M; Wiessler, Manfred; Schultz, Carsten; van Hest, Jan C M; van Delft, Floris L; Lemke, Edward A

2012-09-24

Visualizing biomolecules by fluorescent tagging is a powerful method for studying their behaviour and function inside cells. We prepared and genetically encoded an unnatural amino acid (UAA) that features a bicyclononyne moiety. This UAA offered exceptional reactivity in strain-promoted azide-alkyne cycloadditions. Kinetic measurements revealed that the UAA reacted also remarkably fast in the inverse-electron-demand Diels-Alder cycloaddition with tetrazine-conjugated dyes. Genetic encoding of the new UAA inside mammalian cells and its subsequent selective labeling at low dye concentrations demonstrate the usefulness of the new amino acid for future imaging studies. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Exome Sequencing Fails to Identify the Genetic Cause of Aicardi Syndrome.

Science.gov (United States)

Lund, Caroline; Striano, Pasquale; Sorte, Hanne Sørmo; Parisi, Pasquale; Iacomino, Michele; Sheng, Ying; Vigeland, Magnus D; Øye, Anne-Marte; Møller, Rikke Steensbjerre; Selmer, Kaja K; Zara, Federico

2016-09-01

Aicardi syndrome (AS) is a well-characterized neurodevelopmental disorder with an unknown etiology. In this study, we performed whole-exome sequencing in 11 female patients with the diagnosis of AS, in order to identify the disease-causing gene. In particular, we focused on detecting variants in the X chromosome, including the analysis of variants with a low number of sequencing reads, in case of somatic mosaicism. For 2 of the patients, we also sequenced the exome of the parents to search for de novo mutations. We did not identify any genetic variants likely to be damaging. Only one single missense variant was identified by the de novo analyses of the 2 trios, and this was considered benign. The failure to identify a disease gene in this study may be due to technical limitations of our study design, including the possibility that the genetic aberration leading to AS is situated in a non-exonic region or that the mutation is somatic and not detectable by our approach. Alternatively, it is possible that AS is genetically heterogeneous and that 11 patients are not sufficient to reveal the causative genes. Future studies of AS should consider designs where also non-exonic regions are explored and apply a sequencing depth so that also low-grade somatic mosaicism can be detected.
Molecular genetics and epigenetics of CACTA elements

KAUST Repository

Fedoroff, Nina V.

2013-08-21

The CACTA transposons, so named for a highly conserved motif at element ends, comprise one of the most abundant superfamilies of Class 2 (cut-and-paste) plant transposons. CACTA transposons characteristically include subterminal sequences of several hundred nucleotides containing closely spaced direct and inverted repeats of a short, conserved sequence of 14-15 bp. The Suppressor-mutator (Spm) transposon, identified and subjected to detailed genetic analysis by Barbara McClintock, remains the paradigmatic element of the CACTA family. The Spm transposon encodes two proteins required for transposition, the transposase (TnpD) and a regulatory protein (TnpA) that binds to the subterminal repeats. Spm expression is subject to both genetic and epigenetic regulation. The Spm-encoded TnpA serves as an activator of the epigenetically inactivated, methylated Spm, stimulating both transient and heritable activation of the transposon. TnpA also serves as a negative regulator of the demethylated active element promoter and is required, in addition to the TnpD, for transposition. © Springer Science+Business Media, New York 2013.
Sugarcane expressed sequence tags (ESTs) encoding enzymes involved in lignin biosynthesis pathways

Directory of Open Access Journals (Sweden)

Ramos Rose Lucia Braz

2001-01-01

Lignins are phenolic polymers found in the secondary wall of plant conductive systems where they play an important role by reducing the permeability of the cell wall to water. Lignins are also responsible for the rigidity of the cell wall and are involved in mechanisms of resistance to pathogens. The metabolic routes and enzymes involved in synthesis of lignins have been largely characterized and representative genes that encode enzymes involved in these processes have been cloned from several plant species. The synthesis of lignins is linked to the general metabolism of the phenylpropanoids in plants, having enzymes (e.g. phenylalanine ammonia-lyase (PAL), cinnamate 4-hydroxylase (C4H) and caffeic acid O-methyltransferase (COMT)) common to other processes as well as specific enzymes such as cinnamoyl-CoA reductase (CCR) and cinnamyl alcohol dehydrogenase (CAD). Some maize and sorghum mutants, shown to be defective in CAD and/or COMT activity, are easier to digest because they have a reduced lignin content, which has motivated different research groups to alter the lignin content and composition of model plants by genetic engineering to try to improve, for example, the efficiency of paper pulping and digestibility. In the work reported in this paper, we have made an inventory of the sugarcane expressed sequence tags (ESTs) coding for enzymes involved in lignin metabolism which are present in the sugarcane EST genome project (SUCEST) database. Our analysis focused on the key enzymes ferulate-5-hydroxylase (F5H), caffeic acid O-methyltransferase (COMT), caffeoyl CoA O-methyltransferase (CCoAOMT), hydroxycinnamate CoA ligase (4CL), cinnamoyl-CoA reductase (CCR) and cinnamyl alcohol dehydrogenase (CAD). Comparative analysis of these genes with those described in other species could yield molecular markers for breeding and inform the manipulation of lignin metabolism in sugarcane.
Sequence-Based Appraisal of the Genes Encoding Neck and Carbohydrate Recognition Domain of Conglutinin in Blackbuck (Antilope cervicapra) and Goat (Capra hircus)

Directory of Open Access Journals (Sweden)

Sasmita Barik

2014-01-01

Conglutinin, a collagenous C-type lectin, acts as a soluble pattern recognition receptor (PRR) in recognition of pathogens. In the present study, genes encoding the neck and carbohydrate recognition domain (NCRD) of conglutinin in goat and blackbuck were amplified, cloned, and sequenced. The obtained 488 bp ORFs encoding NCRD were submitted to NCBI with accession numbers KC505182 and KC505183. Both nucleotide and predicted amino acid sequences were analysed with sequences of other ruminants retrieved from NCBI GenBank using DNAstar and Megalign5.2 software. Sequence analysis revealed maximum similarity of the blackbuck sequence with wild ruminants like nilgai and buffalo, whereas the goat sequence displayed maximum similarity with the sheep sequence at both nucleotide and amino acid level. Phylogenetic analysis further indicated clear divergence of wild ruminants from the domestic ruminants in separate clusters. The predicted secondary structures of NCRD protein in goat and blackbuck using SWISSMODEL ProtParam online software were found to possess 6 beta-sheets and 3 alpha-helices, identical to the results obtained for sheep, cattle, buffalo, and nilgai. However, quaternary structure in goat, sheep, and cattle was found to differ from that of buffalo, nilgai, and blackbuck, suggesting a probable variation in the efficiency of antimicrobial activity among wild and domestic ruminants.
Systems genetics of complex diseases using RNA-sequencing methods

DEFF Research Database (Denmark)

Mazzoni, Gianluca; Kogelman, Lisette; Suravajhala, Prashanth

2015-01-01

Next generation sequencing technologies have enabled the generation of huge quantities of biological data, and nowadays extensive datasets at different 'omics levels have been generated. Systems genetics is a powerful approach that allows one to integrate different 'omics levels and understand the bio...
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

Science.gov (United States)

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
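The percentages and the gene-flow estimate quoted in this abstract can be reproduced with a short calculation. The sketch assumes the widely used estimator Nm = 0.5(1 − GST)/GST; the abstract does not state which formula was applied, so the estimator and the helper names are assumptions.

```python
def percent(part, whole):
    """Share of `part` in `whole`, expressed as a percentage."""
    return 100.0 * part / whole


def gene_flow_from_gst(gst):
    """Gene flow estimate Nm from GST (assumed estimator: 0.5 * (1 - GST) / GST)."""
    return 0.5 * (1.0 - gst) / gst


ssr_ests = percent(133, 5548)      # microsatellite-containing ESTs
hexa = percent(56, 133)            # hexanucleotide repeat motifs
tri = percent(50, 133)             # trinucleotide repeat motifs
polymorphic = percent(7, 24)       # primer pairs revealing polymorphism
nm = gene_flow_from_gst(0.2500)    # compare with the reported Nm = 1.4996

print(f"{ssr_ests:.1f}% {hexa:.1f}% {tri:.1f}% {polymorphic:.1f}% Nm={nm:.4f}")
```

Under this estimator a GST of exactly 0.2500 gives Nm = 1.5; the small gap to the reported 1.4996 is what one would expect if GST was rounded to four decimals before publication.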
Genetic counselors' views and experiences with the clinical integration of genome sequencing.

Science.gov (United States)

Machini, Kalotina; Douglas, Jessica; Braxton, Alicia; Tsipis, Judith; Kramer, Kate

2014-08-01

In recent years, new sequencing technologies known as next generation sequencing (NGS) have provided scientists the ability to rapidly sequence all known coding as well as non-coding sequences in the human genome. As the two emerging approaches, whole exome (WES) and whole genome (WGS) sequencing, have started to be integrated in the clinical arena, we sought to survey health care professionals who are likely to be involved in the implementation process now and/or in the future (e.g., genetic counselors, geneticists and nurse practitioners). Two hundred twenty-one genetic counselors, one third of whom currently offer WES/WGS, participated in an anonymous online survey. The aims of the survey were first, to identify barriers to the implementation of WES/WGS, as perceived by survey participants; second, to provide the first systematic report of current practices regarding the integration of WES/WGS in clinic and/or research across the US and Canada and to illuminate the roles and challenges of genetic counselors participating in this process; and third, to evaluate the impact of WES/WGS on patient care. Our results showed that genetic counseling practices with respect to WES/WGS are consistent with the criteria set forth in the ACMG 2012 policy statement, which highlights indications for testing, reporting, and pre/post test considerations. Our respondents described challenges related to offering WES/WGS, which included billing issues, the duration and content of the consent process, result interpretation and disclosure of incidental findings and variants of unknown significance. In addition, respondents indicated that specialty area (i.e., prenatal and cancer), lack of clinical utility of WES/WGS and concerns about interpretation of test results were factors that prevented them from offering this technology to patients. Finally, study participants identified the aspects of their professional training which have been most beneficial in aiding with the integration of
High-precision, whole-genome sequencing of laboratory strains facilitates genetic studies.

Directory of Open Access Journals (Sweden)

Anjana Srivatsan

2008-08-01

Whole-genome sequencing is a powerful technique for obtaining the reference sequence information of multiple organisms. Its use can be dramatically expanded to rapidly identify genomic variations, which can be linked with phenotypes to obtain biological insights. We explored these potential applications using the emerging next-generation sequencing platform Solexa Genome Analyzer, and the well-characterized model bacterium Bacillus subtilis. Combining sequencing with experimental verification, we first improved the accuracy of the published sequence of the B. subtilis reference strain 168, then obtained sequences of multiple related laboratory strains and different isolates of each strain. This provides a framework for comparing the divergence between different laboratory strains and between their individual isolates. We also demonstrated the power of Solexa sequencing by using its results to predict a defect in the citrate signal transduction pathway of a common laboratory strain, which we verified experimentally. Finally, we examined the molecular nature of spontaneously generated mutations that suppress the growth defect caused by deletion of the stringent response mediator relA. Using whole-genome sequencing, we rapidly mapped these suppressor mutations to two small homologs of relA. Interestingly, stable suppressor strains had mutations in both genes, with each mutation alone partially relieving the relA growth defect. This supports an intriguing three-locus interaction module that is not easily identifiable through traditional suppressor mapping. We conclude that whole-genome sequencing can drastically accelerate the identification of suppressor mutations and complex genetic interactions, and it can be applied as a standard tool to investigate the genetic traits of model organisms.
Extended genetic analysis of Brazilian isolates of Bacillus cereus and Bacillus thuringiensis

Directory of Open Access Journals (Sweden)

Viviane Zahner

2013-02-01

Multiple locus sequence typing (MLST) was undertaken to extend the genetic characterization of 29 isolates of Bacillus cereus and Bacillus thuringiensis previously characterized in terms of presence/absence of sequences encoding virulence factors and via variable number tandem repeat (VNTR). Additional analysis involved polymerase chain reaction for the presence of sequences (be, cytK, inA, pag, lef, cya and cap) encoding putative virulence factors not investigated in the earlier study. MLST analysis ascribed novel and unique sequence types to each of the isolates. A phylogenetic tree was constructed from a single sequence of 2,838 bp of concatenated loci sequences. The strains were not monophyletic by analysis of any specific housekeeping gene or virulence characteristic. No clear association in relation to source of isolation or to genotypic profile based on the presence or absence of putative virulence genes could be identified. Comparison of VNTR profiling with MLST data suggested a correlation between these two methods of genetic analysis. In common with the majority of previous studies, MLST was unable to provide clarification of the basis for pathogenicity among members of the B. cereus complex. Nevertheless, our application of MLST served to reinforce the notion that B. cereus and B. thuringiensis should be considered as the same species.
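In MLST, each isolate is described by the allele number observed at every typed locus, and each distinct combination of alleles defines a sequence type (ST). The sketch below illustrates that bookkeeping only; the isolate names, allele numbers, and locally assigned ST numbers are hypothetical and do not correspond to any public MLST database or to the isolates in this study.

```python
def assign_sequence_types(profiles):
    """Map each isolate to a sequence type based on its allelic profile.

    `profiles` maps an isolate name to a sequence of allele numbers, one per
    locus and in a fixed locus order. Identical profiles share an ST; new
    profiles receive the next unused local ST number.
    """
    st_of_profile = {}
    st_of_isolate = {}
    for isolate, alleles in profiles.items():
        key = tuple(alleles)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_isolate[isolate] = st_of_profile[key]
    return st_of_isolate


# Hypothetical seven-locus profiles for three isolates.
profiles = {
    "isolate_A": (1, 4, 2, 7, 3, 5, 1),
    "isolate_B": (1, 4, 2, 7, 3, 5, 2),  # differs from A at the last locus
    "isolate_C": (1, 4, 2, 7, 3, 5, 1),  # same profile as A
}
sequence_types = assign_sequence_types(profiles)
```

When every isolate carries a distinct profile, as reported above, each one receives its own ST; phylogenetic analysis then works on the concatenated locus sequences rather than on the ST labels.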
Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

Directory of Open Access Journals (Sweden)

Nordlund Henri R

2005-03-01

Full Text Available Abstract Background: A chicken egg contains several biotin-binding proteins (BBPs), whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results: Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin, suggesting that chicken BBPs are paralogous to the avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion: We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with the avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins.
Genetic architecture of vitamin B12 and folate levels uncovered applying deeply sequenced large datasets

DEFF Research Database (Denmark)

Grarup, Niels; Sulem, Patrick; Sandholt, Camilla H

2013-01-01

of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B12 (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined...... in serum B12 or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations....
Identification of human microRNA-like sequences embedded within the protein-encoding genes of the human immunodeficiency virus.

Directory of Open Access Journals (Sweden)

Bryan Holland

Full Text Available BACKGROUND: MicroRNAs (miRNAs) are highly conserved, short (18-22 nts), non-coding RNA molecules that regulate gene expression by binding to the 3' untranslated regions (3'UTRs) of mRNAs. While numerous cellular microRNAs have been associated with the progression of various diseases including cancer, miRNAs associated with retroviruses have not been well characterized. Herein we report identification of microRNA-like sequences in coding regions of several HIV-1 genomes. RESULTS: Based on our earlier proteomics and bioinformatics studies, we have identified 8 cellular miRNAs that are predicted to bind to the mRNAs of multiple proteins that are dysregulated during HIV-infection of CD4+ T-cells in vitro. In silico analysis of the full length and mature sequences of these 8 miRNAs and comparisons with all the genomic and subgenomic sequences of HIV-1 strains in global databases revealed that the first 18/18 sequences of the mature hsa-miR-195 sequence (including the short seed sequence), matched perfectly (100%), or with one nucleotide mismatch, within the envelope (env) genes of five HIV-1 genomes from Africa. In addition, we have identified 4 other miRNA-like sequences (hsa-miR-30d, hsa-miR-30e, hsa-miR-374a and hsa-miR-424) within the env and the gag-pol encoding regions of several HIV-1 strains, albeit with reduced homology. Mapping of the miRNA-homologues of env within HIV-1 genomes localized these sequences to the functionally significant variable regions of the env glycoprotein gp120 designated V1, V2, V4 and V5. CONCLUSIONS: We conclude that microRNA-like sequences are embedded within the protein-encoding regions of several HIV-1 genomes. Given that the V1 to V5 regions of HIV-1 envelopes contain specific, well-characterized domains that are critical for immune responses, virus neutralization and disease progression, we propose that the newly discovered miRNA-like sequences within the HIV-1 genomes may have evolved to self-regulate survival of the

A method for partitioning the information contained in a protein sequence between its structure and function.

Science.gov (United States)

Possenti, Andrea; Vendruscolo, Michele; Camilloni, Carlo; Tiana, Guido

2018-05-23

Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that takes into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what is needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.
Genetics and Molecular Biology of Epstein-Barr Virus-Encoded BART MicroRNA: A Paradigm for Viral Modulation of Host Immune Response Genes and Genome Stability

Directory of Open Access Journals (Sweden)

David H. Dreyfus

2017-01-01

Full Text Available Epstein-Barr virus, a ubiquitous human herpesvirus, is associated through epidemiologic evidence with common autoimmune syndromes and cancers. However, specific genetic mechanisms of pathogenesis have been difficult to identify. In this review, the author summarizes evidence that recently discovered noncoding RNAs termed microRNA encoded by Epstein-Barr virus BARF (BamHI A right frame) termed BART (BamHI A right transcripts) are modulators of human immune response genes and genome stability in infected and bystander cells. BART expression is apparently regulated by complex feedback loops with the host immune response regulatory NF-κB transcription factors. EBV-encoded BZLF-1 (ZEBRA) protein could also regulate BART since ZEBRA contains a terminal region similar to ankyrin proteins such as IκBα that regulate host NF-κB. BALF-2 (BamHI A left frame transcript), a viral homologue of the immunoglobulin and T cell receptor gene recombinase RAG-1 (recombination-activating gene-1), may also be coregulated with BART since BALF-2 regulatory sequences are located near the BART locus. Viral-encoded microRNA and viral mRNA transferred to bystander cells through vesicles, defective viral particles, or other mechanisms suggest a new paradigm in which bystander or hit-and-run mechanisms enable the virus to transiently or chronically alter human immune response genes as well as the stability of the human genome.
Draft genome sequence of Actinotignum schaalii DSM 15541T: Genetic insights into the lifestyle, cell fitness and virulence.

Directory of Open Access Journals (Sweden)

Atteyet F Yassin

Full Text Available The permanent draft genome sequence of Actinotignum schaalii DSM 15541T is presented. The annotated genome includes 2,130,987 bp, with 1777 protein-coding and 58 rRNA-coding genes. Genome sequence analysis revealed the absence of genes encoding components of the PTS systems and enzymes of the TCA cycle, glyoxylate shunt and gluconeogenesis. Genomic data revealed that A. schaalii is able to oxidize carbohydrates via glycolysis, the nonoxidative pentose phosphate and the Entner-Doudoroff pathways. Besides, the genome harbors genes encoding enzymes involved in the conversion of pyruvate to lactate, acetate and ethanol, which are found to be the end products of carbohydrate fermentation. The genome contained the gene encoding Type I fatty acid synthase required for de novo FAS biosynthesis. The plsY and plsX genes encoding the acyltransferases necessary for phosphatidic acid biosynthesis were absent from the genome. The genome harbors genes encoding enzymes responsible for isoprene biosynthesis via the mevalonate (MVA) pathway. Genes encoding enzymes that confer resistance to reactive oxygen species (ROS) were identified. In addition, A. schaalii harbors genes that protect the genome against viral infections. These include restriction-modification (RM) systems, type II toxin-antitoxin (TA), CRISPR-Cas and abortive infection systems. The A. schaalii genome also encodes several virulence factors that contribute to adhesion and internalization of this pathogen, such as the tad genes encoding proteins required for pili assembly, the nanI gene encoding exo-alpha-sialidase, genes encoding heat shock proteins and genes encoding the type VII secretion system. These features are consistent with anaerobic and pathogenic lifestyles. Finally, resistance to ciprofloxacin occurs by mutation in chromosomal genes that encode the subunits of DNA gyrase (GyrA) and topoisomerase IV (ParC) enzymes, while resistance to metronidazole was due to the frxA gene, which encodes NADPH
Inter-population differences in otolith morphology are genetically encoded in the killifish Aphanius fasciatus (Cyprinodontiformes)

Directory of Open Access Journals (Sweden)

Ali Annabi

2013-06-01

Full Text Available Inter-population differences in otolith shape, morphology and chemistry have been used effectively as indicators for stock assessment or for recognizing environmental adaptation in fishes. However, the precise parameters that affect otolith morphology remain incompletely understood. Here we provide the first direct support for the hypothesis that inter-population differences in otolith morphology are genetically encoded. The study is based on otolith morphology and two mitochondrial markers (D-loop, 16S rRNA) of three natural populations of Aphanius fasciatus (Teleostei: Cyprinodontidae) from Southeast Tunisia. Otolith and genetic data yielded congruent tree topologies. Divergence of populations likely results from isolation events in the course of the Pleistocene sea level drops. We propose that otolith morphology is a valuable tool for resolving genetic diversity also within other teleost species, which may be important for ecosystem management and conservation of genetic diversity. As reconstructions of ancient teleost fish faunas are often solely based on fossil otoliths, our discoveries may also lead to a new approach to research in palaeontology.
Metal resistance sequences and transgenic plants

Science.gov (United States)

Meagher, Richard Brian; Summers, Anne O.; Rugh, Clayton L.

1999-10-12

The present invention provides nucleic acid sequences encoding a metal ion resistance protein, which are expressible in plant cells. The metal resistance protein provides for the enzymatic reduction of metal ions including but not limited to divalent Cu, divalent mercury, trivalent gold, divalent cadmium, lead ions and monovalent silver ions. Transgenic plants which express these coding sequences exhibit increased resistance to metal ions in the environment as compared with plants which have not been so genetically modified. Transgenic plants with improved resistance to organometals including alkylmercury compounds, among others, are provided by the further inclusion of plant-expressible organometal lyase coding sequences, as specifically exemplified by the plant-expressible merB coding sequence. Furthermore, these transgenic plants which have been genetically modified to express the metal resistance coding sequences of the present invention can participate in the bioremediation of metal contamination via the enzymatic reduction of metal ions. Transgenic plants resistant to organometals can further mediate remediation of organic metal compounds, for example, alkylmetal compounds including but not limited to methyl mercury, methyl lead compounds, methyl cadmium and methyl arsenic compounds, in the environment by causing the freeing of mercuric or other metal ions and the reduction of the ionic mercury or other metal ions to the less toxic elemental mercury or other metals.
Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22

International Nuclear Information System (INIS)

Tsai-Pflugfelder, M.; Liu, L.F.; Liu, A.A.; Tewey, K.M.; Whang-Peng, J.; Knutsen, T.; Huebner, K.; Croce, C.M.; Wang, J.C.

1988-01-01

Two overlapping cDNA clones encoding human DNA topoisomerase II were identified by two independent methods. In one, a human cDNA library in phage λ was screened by hybridization with a mixed oligonucleotide probe encoding a stretch of seven amino acids found in yeast and Drosophila DNA topoisomerase II; in the other, a different human cDNA library in a λgt11 expression vector was screened for the expression of antigenic determinants that are recognized by rabbit antibodies specific to human DNA topoisomerase II. The entire coding sequences of the human DNA topoisomerase II gene were determined from these and several additional clones, identified through the use of the cloned human TOP2 gene sequences as probes. Hybridization between the cloned sequences and mRNA and genomic DNA indicates that the human enzyme is encoded by a single-copy gene. The location of the gene was mapped to chromosome 17q21-22 by in situ hybridization of a cloned fragment to metaphase chromosomes and by hybridization analysis with a panel of mouse-human hybrid cell lines, each retaining a subset of human chromosomes
Crystallization and preliminary X-ray characterization of the genetically encoded fluorescent calcium indicator protein GCaMP2

International Nuclear Information System (INIS)

Rodríguez Guilbe, María M.; Alfaro Malavé, Elisa C.; Akerboom, Jasper; Marvin, Jonathan S.; Looger, Loren L.; Schreiter, Eric R.

2008-01-01

The genetically encoded fluorescent calcium-indicator protein GCaMP2 was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution and the structure was solved by molecular replacement. Fluorescent proteins and their engineered variants have played an important role in the study of biology. The genetically encoded calcium-indicator protein GCaMP2 comprises a circularly permuted fluorescent protein coupled to the calcium-binding protein calmodulin and a calmodulin target peptide, M13, derived from the intracellular calmodulin target myosin light-chain kinase and has been used to image calcium transients in vivo. To aid rational efforts to engineer improved variants of GCaMP2, this protein was crystallized in the calcium-saturated form. X-ray diffraction data were collected to 2.0 Å resolution. The crystals belong to space group C2, with unit-cell parameters a = 126.1, b = 47.1, c = 68.8 Å, β = 100.5° and one GCaMP2 molecule in the asymmetric unit. The structure was phased by molecular replacement and refinement is currently under way
Engineering of a genetically encodable fluorescent voltage sensor exploiting fast Ci-VSP voltage-sensing movements.

Science.gov (United States)

Lundby, Alicia; Mutoh, Hiroki; Dimitrov, Dimitar; Akemann, Walther; Knöpfel, Thomas

2008-06-25

Ci-VSP contains a voltage-sensing domain (VSD) homologous to that of voltage-gated potassium channels. Using charge displacement ('gating' current) measurements we show that voltage-sensing movements of this VSD can occur within 1 ms in mammalian membranes. Our analysis led to the development of a genetically encodable fluorescent protein voltage sensor (VSFP) in which the fast, voltage-dependent conformational changes of the Ci-VSP voltage sensor are transduced to similarly fast fluorescence read-outs.
A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

Science.gov (United States)

Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

2008-12-01

A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerate primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BnPA (GenBank Accession No. AY423440, named as podC). The full-length cDNA is 1167 bp long and contains a 1077 bp open reading frame (ORF) encoding a deduced peroxidase polypeptide of 358 amino acids. The putative peroxidase (BnPA) showed a calculated Mr of 34 kDa and an isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA, namely AtP29a (84%) and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.
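The lengths reported for the turnip peroxidase cDNA can be cross-checked with simple codon arithmetic. The sketch below assumes that the reported 1077 bp ORF length includes the stop codon (the abstract does not say so explicitly); the function name is illustrative.

```python
def residues_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of the given length in bp.

    Assumes the ORF is in frame; if the length counts the stop codon,
    one codon is subtracted because it encodes no residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    cdna_bp = 1167  # full-length cDNA, from the abstract
    orf_bp = 1077   # open reading frame, from the abstract

    print(residues_from_orf(orf_bp))  # 359 codons minus the stop codon
    print(cdna_bp - orf_bp)           # bp left for the 5' and 3' untranslated regions
```

Under that assumption, 1077 bp corresponds to 359 codons, i.e. 358 residues plus a stop codon, which agrees with the 358-residue polypeptide stated in the abstract and leaves 90 bp of untranslated sequence.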
Genetic mutation analysis of human gastric adenocarcinomas using ion torrent sequencing platform.

Directory of Open Access Journals (Sweden)

Zhi Xu

Full Text Available Gastric cancer is one of the major causes of cancer-related death, especially in Asia. Gastric adenocarcinoma, the most common type of gastric cancer, is heterogeneous, and its incidence and cause vary widely with geographical region, gender, ethnicity, and diet. Since unique mutations have been observed in individual human cancer samples, identification and characterization of the molecular alterations underlying individual gastric adenocarcinomas is a critical step for developing more effective, personalized therapies. Until recently, identifying genetic mutations on an individual basis by DNA sequencing remained a daunting task. Recent advances in next-generation DNA sequencing technologies, such as the semiconductor-based Ion Torrent sequencing platform, make DNA sequencing cheaper, faster, and more reliable. In this study, we aim to identify genetic mutations in the genes which are targeted by drugs in clinical use or are under development in individual human gastric adenocarcinoma samples using Ion Torrent sequencing. We sequenced 737 loci from 45 cancer-related genes in 238 human gastric adenocarcinoma samples using the Ion Torrent Ampliseq Cancer Panel. The sequencing analysis revealed a high occurrence of mutations along the TP53 locus (9.7%) in our sample set. Thus, this study indicates the utility of a cost- and time-efficient tool such as Ion Torrent sequencing to screen cancer mutations for the development of personalized cancer therapy.
cDNAs encoding [D-Ala2]deltorphin precursors from skin of Phyllomedusa bicolor also contain genetic information for three dermorphin-related opioid peptides.

Science.gov (United States)

Richter, K; Egger, R; Negri, L; Corsi, R; Severini, C; Kreil, G

1990-06-01

We present the structure of four precursors for [D-Ala2]deltorphins I and II as deduced from cDNAs cloned from skin of the frog Phyllomedusa bicolor. These contain the genetic information for one copy of [D-Ala2]deltorphin II and zero, one, or three copies of [D-Ala2]deltorphin I. In each case, the D-alanine of the end product is encoded by a normal GCG codon for L-alanine. In addition, the existence of three peptides related to dermorphin was predicted from the amino acid sequence of the precursors. These peptides were synthesized with a D-alanine in position 2 and their pharmacological properties were tested. Two of them, [Lys7]dermorphin-OH and [Trp4,Asn7]dermorphin-OH, were found to have roughly the same affinity and selectivity for mu-type opioid receptors as dermorphin.
Rapid evolution of the sequences and gene repertoires of secreted proteins in bacteria.

Directory of Open Access Journals (Sweden)

Teresa Nogueira

Full Text Available Proteins secreted to the extracellular environment or to the periphery of the cell envelope, the secretome, play essential roles in foraging, antagonistic and mutualistic interactions. We hypothesize that arms races, genetic conflicts and varying selective pressures should lead to the rapid change of sequences and gene repertoires of the secretome. The analysis of 42 bacterial pan-genomes shows that secreted, and especially extracellular proteins, are predominantly encoded in the accessory genome, i.e. among genes not ubiquitous within the clade. Genes encoding outer membrane proteins might engage more frequently in intra-chromosomal gene conversion because they are more often in multi-genic families. The gene sequences encoding the secretome evolve faster than the rest of the genome and in particular at non-synonymous positions. Cell wall proteins in Firmicutes evolve particularly fast when compared with outer membrane proteins of Proteobacteria. Virulence factors are over-represented in the secretome, notably in outer membrane proteins, but cell localization explains more of the variance in substitution rates and gene repertoires than sequence homology to known virulence factors. Accordingly, the repertoires and sequences of the genes encoding the secretome change fast in the clades of obligatory and facultative pathogens and also in the clades of mutualists and free-living bacteria. Our study shows that cell localization shapes genome evolution. In agreement with our hypothesis, the repertoires and the sequences of genes encoding secreted proteins evolve fast. The particularly rapid change of extracellular proteins suggests that these public goods are key players in bacterial adaptation.
Cloning and characterization of cDNAs encoding the complete sequence of decay-accelerating factor of human complement

International Nuclear Information System (INIS)

Medof, M.E.; Lublin, D.M.; Holers, V.M.; Ayers, D.J.; Getty, R.R.; Leykam, J.F.; Atkinson, J.P.; Tykocinski, M.L.

1987-01-01

cDNAs encoding the complement decay-accelerating factor (DAF) were isolated from HeLa and differentiated HL-60 λgt cDNA libraries by screening with a codon preference oligonucleotide corresponding to DAF NH2-terminal amino acids 3-14. The composite cDNA sequence showed a 347-amino acid protein preceded by an NH2-terminal leader peptide sequence. The translated sequence beginning at the DAF NH2 terminus encodes four contiguous ≅61-amino acid long repetitive units of internal homology. The repetitive regions contain four conserved cysteines, one proline, one glycine, one glycine/alanine, four leucines/isoleucines/valines, one serine, three tyrosines/phenylalanines, and one tryptophan and show striking homology to similar regions previously identified in factor B, C2, C4 binding protein, factor H, C1r, factor XIII, interleukin 2 receptor, and serum β2-glycoprotein I. The consensus repeats are attached to a 70-amino acid long segment rich in serine and threonine (potential O-glycosylation sites), which is in turn followed by a stretch of hydrophobic amino acids. RNA blot analysis of HeLa and HL-60 RNA revealed three DAF mRNA species of 3.1, 2.7, and 2.0 kilobases. The results indicate that portions of the DAF gene may have evolved from a DNA element common to the above proteins, that DAF cDNA predicts a COOH-terminal anchoring polypeptide, and that distinct species of DAF message are elaborated in cells.
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease.

Science.gov (United States)

Dilliott, Allison A; Farhan, Sali M K; Ghani, Mahdi; Sato, Christine; Liang, Eric; Zhang, Ming; McIntyre, Adam D; Cao, Henian; Racacho, Lemuel; Robinson, John F; Strong, Michael J; Masellis, Mario; Bulman, Dennis E; Rogaeva, Ekaterina; Lang, Anthony; Tartaglia, Carmela; Finger, Elizabeth; Zinman, Lorne; Turnbull, John; Freedman, Morris; Swartz, Rick; Black, Sandra E; Hegele, Robert A

2018-04-04

Next-generation sequencing (NGS) is quickly revolutionizing how research into the genetic determinants of constitutional disease is performed. The technique is highly efficient with millions of sequencing reads being produced in a short time span and at relatively low cost. Specifically, targeted NGS is able to focus investigations to genomic regions of particular interest based on the disease of study. Not only does this further reduce costs and increase the speed of the process, but it lessens the computational burden that often accompanies NGS. Although targeted NGS is restricted to certain regions of the genome, preventing identification of potential novel loci of interest, it can be an excellent technique when faced with a phenotypically and genetically heterogeneous disease, for which there are previously known genetic associations. Because of the complex nature of the sequencing technique, it is important to closely adhere to protocols and methodologies in order to achieve sequencing reads of high coverage and quality. Further, once sequencing reads are obtained, a sophisticated bioinformatics workflow is utilized to accurately map reads to a reference genome, to call variants, and to ensure the variants pass quality metrics. Variants must also be annotated and curated based on their clinical significance, which can be standardized by applying the American College of Medical Genetics and Genomics Pathogenicity Guidelines. The methods presented herein will display the steps involved in generating and analyzing NGS data from a targeted sequencing panel, using the ONDRISeq neurodegenerative disease panel as a model, to identify variants that may be of clinical significance.
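The quality-control step mentioned above, in which called variants must pass quality metrics before annotation, might look roughly like the following. This is only a sketch under stated assumptions: the `VariantCall` record, the field names and both thresholds are hypothetical placeholders and are not the ONDRISeq workflow's actual criteria.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class VariantCall:
    """Minimal, hypothetical representation of one called variant."""
    chrom: str
    pos: int
    ref: str
    alt: str
    depth: int    # number of reads covering the site
    qual: float   # Phred-scaled call quality


def passes_quality(v: VariantCall, min_depth: int = 30, min_qual: float = 30.0) -> bool:
    """Return True if the call meets both (placeholder) thresholds."""
    return v.depth >= min_depth and v.qual >= min_qual


def filter_calls(
    calls: Iterable[VariantCall], min_depth: int = 30, min_qual: float = 30.0
) -> List[VariantCall]:
    """Keep only calls that pass the quality thresholds, preserving order."""
    return [v for v in calls if passes_quality(v, min_depth, min_qual)]


if __name__ == "__main__":
    # Toy calls with made-up coordinates, for illustration only.
    calls = [
        VariantCall("chr1", 1000, "A", "G", depth=120, qual=55.0),
        VariantCall("chr1", 2000, "C", "T", depth=8, qual=60.0),    # low coverage
        VariantCall("chr2", 3000, "G", "A", depth=200, qual=12.0),  # low quality
    ]
    for v in filter_calls(calls):
        print(v.chrom, v.pos, v.ref, v.alt)
```

Variants retained by a filter of this kind would then go on to annotation and curation against clinical-significance guidelines, as the abstract describes.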
Using inter simple sequence repeat (ISSR) markers to study genetic ...

African Journals Online (AJOL)

enoh

2012-04-10

Apr 10, 2012 ... Genetic relationships among the cultivars was assessed by using six inter simple sequence ... polymorphism breeders of this species in order to find the ..... well as the high level of heterozygosity due to the cross- pollinating ...
Defining the Sequence Elements and Candidate Genes for the Coloboma Mutation.

Directory of Open Access Journals (Sweden)

Elizabeth A. Robb

Full Text Available The chicken coloboma mutation exhibits features similar to human congenital developmental malformations such as ocular coloboma, cleft-palate, dwarfism, and polydactyly. The coloboma-associated region and encoded genes were investigated using advanced genomic, genetic, and gene expression technologies. Initially, the mutation was linked to a 990 kb region encoding 11 genes; the application of the genetic and genomic tools led to a reduction of the linked region to 176 kb and the elimination of 7 genes. Furthermore, bioinformatics analyses of capture array-next generation sequence data identified genetic elements including SNPs, insertions, deletions, gaps, chromosomal rearrangements, and miRNA binding sites within the introgressed causative region relative to the reference genome sequence. Coloboma-specific variants within exons, UTRs, and splice sites were studied for their contribution to the mutant phenotype. Our compiled results suggest three genes for future studies. The three candidate genes, SLC30A5 (a zinc transporter), CENPH (a centromere protein), and CDK7 (a cyclin-dependent kinase), are differentially expressed (compared to normal embryos) at stages and in tissues affected by the coloboma mutation. Of these genes, two (SLC30A5 and CENPH) are considered high-priority candidates based upon studies in other vertebrate model systems.
Characterisation of the genetic diversity of Brucella by multilocus sequencing

Directory of Open Access Journals (Sweden)

MacMillan Alastair P

2007-04-01

Full Text Available Abstract Background Brucella species include economically important zoonotic pathogens that can infect a wide range of animals. There are currently six classically recognised species of Brucella although, as yet unnamed, isolates from various marine mammal species have been reported. In order to investigate genetic relationships within the group and identify potential diagnostic markers we have sequenced multiple genetic loci from a large sample of Brucella isolates representing the known diversity of the genus. Results Nine discrete genomic loci corresponding to 4,396 bp of sequence were examined from 160 Brucella isolates. By assigning each distinct allele at a locus an arbitrary numerical designation the population was found to represent 27 distinct sequence types (STs. Diversity at each locus ranged from 1.03–2.45% while overall genetic diversity equated to 1.5%. Most loci examined represent housekeeping gene loci and, in all but one case, the ratio of non-synonymous to synonymous change was substantially Brucella species, B. abortus, B. melitensis, B. ovis and B. neotomae correspond to well-separated clusters. With the exception of biovar 5, B. suis isolates cluster together, although they form a more diverse group than other classical species with a number of distinct STs corresponding to the remaining four biovars. B. canis isolates are located on the same branch very closely related to, but distinguishable from, B. suis biovar 3 and 4 isolates. Marine mammal isolates represent a distinct, though rather weakly supported, cluster within which individual STs display one of three clear host preferences. Conclusion The sequence database provides a powerful dataset for addressing ongoing controversies in Brucella taxonomy and a tool for unambiguously placing atypical, phenotypically discordant or newly emerging Brucella isolates. Furthermore, by using the phylogenetic backbone described here, robust and rationally selected markers for use in
pHlash: a new genetically encoded and ratiometric luminescence sensor of intracellular pH.

Science.gov (United States)

Zhang, Yunfei; Xie, Qiguang; Robertson, J Brian; Johnson, Carl Hirschie

2012-01-01

We report the development of a genetically encodable and ratiometric pH probe named "pHlash" that utilizes Bioluminescence Resonance Energy Transfer (BRET) rather than fluorescence excitation. The pHlash sensor, composed of a donor luciferase that is genetically fused to a Venus fluorophore, exhibits pH dependence of its spectral emission in vitro. When expressed in either yeast or mammalian cells, pHlash reports basal pH and cytosolic acidification in vivo. Its spectral ratio response is H(+) specific; neither Ca(++), Mg(++), Na(+), nor K(+) changes the spectral form of its luminescence emission. Moreover, it can be used to image pH in single cells. This is the first BRET-based sensor of H(+) ions, and it should allow the approximation of pH in cytosolic and organellar compartments in applications where current pH probes are inadequate.
pHlash: a new genetically encoded and ratiometric luminescence sensor of intracellular pH.

Directory of Open Access Journals (Sweden)

Yunfei Zhang

Full Text Available We report the development of a genetically encodable and ratiometric pH probe named "pHlash" that utilizes Bioluminescence Resonance Energy Transfer (BRET) rather than fluorescence excitation. The pHlash sensor, composed of a donor luciferase that is genetically fused to a Venus fluorophore, exhibits pH dependence of its spectral emission in vitro. When expressed in either yeast or mammalian cells, pHlash reports basal pH and cytosolic acidification in vivo. Its spectral ratio response is H(+) specific; neither Ca(++), Mg(++), Na(+), nor K(+) changes the spectral form of its luminescence emission. Moreover, it can be used to image pH in single cells. This is the first BRET-based sensor of H(+) ions, and it should allow the approximation of pH in cytosolic and organellar compartments in applications where current pH probes are inadequate.
A sequence-based genetic map of Medicago truncatula and comparison of marker colinearity with M. sativa

NARCIS (Netherlands)

Choi, H.K.; Kim, D.; Uhm, T.; Limpens, E.H.M.; Lim, H.; Mun, J.H.; Kalo, P.; Penmetsa, R.V.; Seres, A.; Kulikova, O.; Roe, B.A.; Bisseling, T.; Kiss, G.B.; Cook, D.R.

2004-01-01

A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F2 population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene

The genetic basis of DOORS syndrome: an exome-sequencing study

NARCIS (Netherlands)

Campeau, Philippe M.; Kasperaviciute, Dalia; Lu, James T.; Burrage, Lindsay C.; Kim, Choel; Hori, Mutsuki; Powell, Berkley R.; Stewart, Fiona; Felix, Temis Maria; van den Ende, Jenneke; Wisniewska, Marzena; Kayserili, Huelya; Rump, Patrick; Nampoothiri, Sheela; Aftimos, Salim; Mey, Antje; Nair, Lal D. V.; Begleiter, Michael L.; De Bie, Isabelle; Meenakshi, Girish; Murray, Mitzi L.; Repetto, Gabriela M.; Golabi, Mahin; Blair, Edward; Male, Alison; Giuliano, Fabienne; Kariminejad, Ariana; Newman, William G.; Bhaskar, Sanjeev S.; Dickerson, Jonathan E.; Kerr, Bronwyn; Banka, Siddharth; Giltay, Jacques C.; Wieczorek, Dagmar; Tostevin, Anna; Wiszniewska, Joanna; Cheung, Sau Wai; Hennekam, Raoul C.; Gibbs, Richard A.; Lee, Brendan H.; Sisodiya, Sanjay M.

Background: Deafness, onychodystrophy, osteodystrophy, mental retardation, and seizures (DOORS) syndrome is a rare autosomal recessive disorder of unknown cause. We aimed to identify the genetic basis of this syndrome by sequencing most coding exons in affected individuals. Methods: Through a search
Engineering of a genetically encodable fluorescent voltage sensor exploiting fast Ci-VSP voltage-sensing movements.

Directory of Open Access Journals (Sweden)

Alicia Lundby

2008-06-01

Ci-VSP contains a voltage-sensing domain (VSD) homologous to that of voltage-gated potassium channels. Using charge displacement ('gating' current) measurements we show that voltage-sensing movements of this VSD can occur within 1 ms in mammalian membranes. Our analysis led to development of a genetically encodable fluorescent protein voltage sensor (VSFP) in which the fast, voltage-dependent conformational changes of the Ci-VSP voltage sensor are transduced to similarly fast fluorescence read-outs.
Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data

Directory of Open Access Journals (Sweden)

Niko Beerenwinkel

2012-09-01

Many viruses, including the clinically relevant RNA viruses HIV and HCV, exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understanding the evolutionary dynamics of viruses, for designing effective vaccines, and for the success of antiviral therapy. Next-generation sequencing technologies allow the rapid and cost-effective acquisition of thousands to millions of short DNA sequences from a single sample. However, this approach entails several challenges in experimental design and computational data analysis. Here, we review the entire process of inferring viral diversity from sample collection to computing measures of genetic diversity. We discuss sample preparation, including reverse transcription and amplification, and the effect of experimental conditions on diversity estimates due to in vitro base substitutions, insertions, deletions, and recombination. The use of different next-generation sequencing platforms and their sequencing error profiles are compared in the context of various applications of diversity estimation, ranging from the detection of single nucleotide variants to the reconstruction of whole-genome haplotypes. We describe the statistical and computational challenges arising from these technical artifacts, and we review existing approaches, including available software, for their solution. Finally, we discuss open problems, and highlight successful biomedical applications and potential future clinical use of next-generation sequencing to estimate viral diversity.
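As an illustration of what "computing measures of genetic diversity" can look like in practice, the following minimal sketch computes the per-site Shannon entropy of a set of reads and averages it. It is not taken from the review: the choice of Shannon entropy, the function names `site_entropy` and `mean_entropy`, and the assumption that the reads are non-empty, already aligned, of equal length, and free of sequencing errors are all illustrative.

```python
import math
from collections import Counter


def site_entropy(column):
    """Shannon entropy (in bits) of the bases seen at one alignment column.

    Characters other than A, C, G, T (gaps, N) are ignored.
    """
    counts = Counter(base for base in column if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def mean_entropy(reads):
    """Average per-site entropy over equal-length, pre-aligned reads.

    Assumption: `reads` is non-empty and every read has the same length.
    """
    columns = list(zip(*reads))
    return sum(site_entropy(col) for col in columns) / len(columns)


# Hypothetical toy data: four aligned reads, variation at the last two sites.
reads = ["ACGT", "ACGA", "ACGT", "ACTT"]
print(round(mean_entropy(reads), 4))
```

A real pipeline would first have to separate true variants from the amplification and sequencing errors that the abstract describes; this sketch deliberately skips that step.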
A selfish genetic element confers non-Mendelian inheritance in rice.

Science.gov (United States)

Yu, Xiaowen; Zhao, Zhigang; Zheng, Xiaoming; Zhou, Jiawu; Kong, Weiyi; Wang, Peiran; Bai, Wenting; Zheng, Hai; Zhang, Huan; Li, Jing; Liu, Jiafan; Wang, Qiming; Zhang, Long; Liu, Kai; Yu, Yang; Guo, Xiuping; Wang, Jiulin; Lin, Qibing; Wu, Fuqing; Ren, Yulong; Zhu, Shanshan; Zhang, Xin; Cheng, Zhijun; Lei, Cailin; Liu, Shijia; Liu, Xi; Tian, Yunlu; Jiang, Ling; Ge, Song; Wu, Chuanyin; Tao, Dayun; Wang, Haiyang; Wan, Jianmin

2018-06-08

Selfish genetic elements are pervasive in eukaryote genomes, but their role remains controversial. We show that qHMS7, a major quantitative genetic locus for hybrid male sterility between wild rice (Oryza meridionalis) and Asian cultivated rice (O. sativa), contains two tightly linked genes [Open Reading Frame 2 (ORF2) and ORF3]. ORF2 encodes a toxic genetic element that aborts pollen in a sporophytic manner, whereas ORF3 encodes an antidote that protects pollen in a gametophytic manner. Pollens lacking ORF3 are selectively eliminated, leading to segregation distortion in the progeny. Analysis of the genetic sequence suggests that ORF3 arose first, followed by gradual functionalization of ORF2. Furthermore, this toxin-antidote system may have promoted the differentiation and/or maintained the genome stability of wild and cultivated rice. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
The genetic basis of DOORS syndrome: an exome-sequencing study

NARCIS (Netherlands)

Campeau, Philippe M.; Kasperaviciute, Dalia; Lu, James T.; Burrage, Lindsay C.; Kim, Choel; Hori, Mutsuki; Powell, Berkley R.; Stewart, Fiona; Félix, Têmis Maria; van den Ende, Jenneke; Wisniewska, Marzena; Kayserili, Hülya; Rump, Patrick; Nampoothiri, Sheela; Aftimos, Salim; Mey, Antje; Nair, Lal D. V.; Begleiter, Michael L.; de Bie, Isabelle; Meenakshi, Girish; Murray, Mitzi L.; Repetto, Gabriela M.; Golabi, Mahin; Blair, Edward; Male, Alison; Giuliano, Fabienne; Kariminejad, Ariana; Newman, William G.; Bhaskar, Sanjeev S.; Dickerson, Jonathan E.; Kerr, Bronwyn; Banka, Siddharth; Giltay, Jacques C.; Wieczorek, Dagmar; Tostevin, Anna; Wiszniewska, Joanna; Cheung, Sau Wai; Hennekam, Raoul C.; Gibbs, Richard A.; Lee, Brendan H.; Sisodiya, Sanjay M.

2014-01-01

Deafness, onychodystrophy, osteodystrophy, mental retardation, and seizures (DOORS) syndrome is a rare autosomal recessive disorder of unknown cause. We aimed to identify the genetic basis of this syndrome by sequencing most coding exons in affected individuals. Through a search of available case
A Naturally Encoded Dipeptide Handle for Bioorthogonal Chan-Lam Coupling.

Science.gov (United States)

Ohata, Jun; Zeng, Yimeng; Segatori, Laura; Ball, Zachary T

2018-04-03

Manipulation of biomacromolecules is ideally achieved through unique and bioorthogonal chemical reactions of genetically encoded, naturally occurring functional groups. The toolkit of methods for site-specific conjugation is limited by selectivity concerns and a dearth of naturally occurring functional groups with orthogonal reactivity. We report that pyroglutamate amide N-H bonds exhibit bioorthogonal copper-catalyzed Chan-Lam coupling at pyroglutamate-histidine dipeptide sequences. The pyroglutamate residue is readily incorporated into proteins of interest by natural enzymatic pathways, allowing specific bioconjugation at a minimalist dipeptide tag. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Characterization of unknown genetic modifications using high throughput sequencing and computational subtraction

Directory of Open Access Journals (Sweden)

Butenko Melinka A

2009-10-01

Background: When generating a genetically modified organism (GMO), the primary goal is to give a target organism one or several novel traits by using biotechnology techniques. A GMO will differ from its parental strain in that its pool of transcripts will be altered. Currently, there are no methods that are reliably able to determine if an organism has been genetically altered if the nature of the modification is unknown. Results: We show that the concept of computational subtraction can be used to identify transgenic cDNA sequences from genetically modified plants. Our datasets include 454-type sequences from a transgenic line of Arabidopsis thaliana and published EST datasets from commercially relevant species (rice and papaya). Conclusion: We believe that computational subtraction represents a powerful new strategy for determining if an organism has been genetically modified as well as to define the nature of the modification. Fewer assumptions have to be made compared to methods currently in use and this is an advantage particularly when working with unknown GMOs.
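The idea of computational subtraction can be sketched as follows: sequences from the sample that are explained by the reference (parental) sequence set are discarded, and whatever remains is a candidate for foreign, possibly transgenic, origin. The sketch below is only a toy version under stated assumptions. It replaces the alignment step with exact k-mer matching, ignores reverse complements and sequencing errors, and the names `kmers`, `subtract`, `k`, and `max_shared` are hypothetical rather than taken from the study.

```python
def kmers(seq, k):
    """Set of all length-k substrings of seq (empty if seq is shorter than k)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def subtract(reads, reference_seqs, k=8, max_shared=0.5):
    """Return reads that are NOT explained by the reference.

    A read is kept when at most `max_shared` of its k-mers occur anywhere
    in the reference sequences. Both thresholds are illustrative defaults.
    """
    ref_index = set()
    for ref in reference_seqs:
        ref_index |= kmers(ref, k)

    unexplained = []
    for read in reads:
        read_kmers = kmers(read, k)
        if not read_kmers:
            continue  # read shorter than k: cannot be classified
        shared = len(read_kmers & ref_index) / len(read_kmers)
        if shared <= max_shared:
            unexplained.append(read)
    return unexplained


# Hypothetical toy data: the first read comes from the reference, the second does not.
reference = ["ACGTACGTTAGCCGATAGCTAGGCTA"]
reads = ["TACGTTAGCCGATAG", "GGGGGCCCCCAAAAATTTTT"]
print(subtract(reads, reference))
```

The reads that survive subtraction would still need to be inspected, for example by searching them against sequence databases, before anything could be called a transgene.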
Nucleotide sequence of a human cDNA encoding a ras-related protein (rap1B)

Energy Technology Data Exchange (ETDEWEB)

Pizon, V; Lerosey, I; Chardin, P; Tavitian, A [INSERM, Paris (France)]

1988-08-11

The authors have previously characterized two human ras-related genes rap1 and rap2. Using the rap1 clone as probe they isolated and sequenced a new rap cDNA encoding the 184aa rap1B protein. The rap1B protein is 95% identical to rap1 and shares several properties with the ras protein suggesting that it could bind GTP/GDP and have a membrane location. As for rap1, the structural characteristics of rap1B suggest that the rap and ras proteins might interact on the same effector.
Characterizing novel endogenous retroviruses from genetic variation inferred from short sequence reads

DEFF Research Database (Denmark)

Mourier, Tobias; Mollerup, Sarah; Vinner, Lasse

2015-01-01

From Illumina sequencing of DNA from brain and liver tissue from the lion, Panthera leo, and tumor samples from the pike-perch, Sander lucioperca, we obtained two assembled sequence contigs with similarity to known retroviruses. Phylogenetic analyses suggest that the pike-perch retrovirus belongs to the epsilonretroviruses, and the lion retrovirus to the gammaretroviruses. To determine if these novel retroviral sequences originate from an endogenous retrovirus or from a recently integrated exogenous retrovirus, we assessed the genetic diversity of the parental sequences from which the short Illumina reads...
Application of whole genome shotgun sequencing for detection and characterization of genetically modified organisms and derived products.

Science.gov (United States)

Holst-Jensen, Arne; Spilsberg, Bjørn; Arulandhu, Alfred J; Kok, Esther; Shi, Jianxin; Zel, Jana

2016-07-01

The emergence of high-throughput, massive or next-generation sequencing technologies has created a completely new foundation for molecular analyses. Various selective enrichment processes are commonly applied to facilitate detection of predefined (known) targets. Such approaches, however, inevitably introduce a bias and are prone to miss unknown targets. Here we review the application of high-throughput sequencing technologies and the preparation of fit-for-purpose whole genome shotgun sequencing libraries for the detection and characterization of genetically modified and derived products. The potential impact of these new sequencing technologies for the characterization, breeding selection, risk assessment, and traceability of genetically modified organisms and genetically modified products is yet to be fully acknowledged. The published literature is reviewed, and the prospects for future developments and use of the new sequencing technologies for these purposes are discussed.
Monitoring thioredoxin redox with a genetically encoded red fluorescent biosensor.

Science.gov (United States)

Fan, Yichong; Makar, Merna; Wang, Michael X; Ai, Hui-Wang

2017-09-01

Thioredoxin (Trx) is one of the two major thiol antioxidants, playing essential roles in redox homeostasis and signaling. Despite its importance, there is a lack of methods for monitoring Trx redox dynamics in live cells, hindering a better understanding of physiological and pathological roles of the Trx redox system. In this work, we developed the first genetically encoded fluorescent biosensor for Trx redox by engineering a redox relay between the active-site cysteines of human Trx1 and rxRFP1, a redox-sensitive red fluorescent protein. We used the resultant biosensor, TrxRFP1, to selectively monitor perturbations of Trx redox in various mammalian cell lines. We subcellularly localized TrxRFP1 to image compartmentalized Trx redox changes. We further combined TrxRFP1 with a green fluorescent Grx1-roGFP2 biosensor to simultaneously monitor Trx and glutathione redox dynamics in live cells in response to chemical and physiologically relevant stimuli.
Genetic polymorphism in Gymnodinium galatheanum chloroplast DNA sequences and development of a molecular detection assay.

Science.gov (United States)

Tengs, T; Bowers, H A; Ziman, A P; Stoecker, D K; Oldach, D W

2001-02-01

Nuclear and chloroplast-encoded small subunit ribosomal DNA sequences were obtained from several strains of the toxic dinoflagellate Gymnodinium galatheanum. Phylogenetic analyses and comparison of sequences indicate that the chloroplast sequences show a higher degree of sequence divergence than the nuclear homologue. The chloroplast sequences were chosen as targets for the development of a 5'-3' exonuclease assay for detection of the organism. The assay has a very high degree of specificity and has been used to screen environmental water samples from a fish farm where the presence of this dinoflagellate species has previously been associated with fish kills. Various hypotheses for the derived nature of the chloroplast sequences are discussed, as well as what is known about the toxicity of the species.
Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana.

Science.gov (United States)

Radakovits, Randor; Jinkerson, Robert E; Fuerstenberg, Susan I; Tae, Hongseok; Settlage, Robert E; Boore, Jeffrey L; Posewitz, Matthew C

2012-02-21

The potential use of algae in biofuels applications is receiving significant attention. However, none of the current algal model species are competitive production strains. Here we present a draft genome sequence and a genetic transformation method for the marine microalga Nannochloropsis gaditana CCMP526. We show that N. gaditana has highly favourable lipid yields, and is a promising production organism. The genome assembly includes nuclear (~29 Mb) and organellar genomes, and contains 9,052 gene models. We define the genes required for glycerolipid biogenesis and detail the differential regulation of genes during nitrogen-limited lipid biosynthesis. Phylogenomic analysis identifies genetic attributes of this organism, including unique stramenopile photosynthesis genes and gene expansions that may explain the distinguishing photoautotrophic phenotypes observed. The availability of a genome sequence and transformation methods will facilitate investigations into N. gaditana lipid biosynthesis and permit genetic engineering strategies to further improve this naturally productive alga.
Imaging Intracellular pH in Live Cells with a Genetically-Encoded Red Fluorescent Protein Sensor

OpenAIRE

Tantama, Mathew; Hung, Yin Pun; Yellen, Gary

2011-01-01

Intracellular pH affects protein structure and function, and proton gradients underlie the function of organelles such as lysosomes and mitochondria. We engineered a genetically-encoded pH sensor by mutagenesis of the red fluorescent protein mKeima, providing a new tool to image intracellular pH in live cells. This sensor, named pHRed, is the first ratiometric, single-protein red fluorescent sensor of pH. Fluorescence emission of pHRed peaks at 610 nm while exhibiting dual excitation peaks at...
Visualizing presynaptic calcium dynamics and vesicle fusion with a single genetically encoded reporter at individual synapses

Directory of Open Access Journals (Sweden)

Rachel E Jackson

2016-07-01

Synaptic transmission depends on the influx of calcium into the presynaptic compartment, which drives neurotransmitter release. Genetically encoded reporters are widely used tools to understand these processes, particularly pHluorin-based reporters that report vesicle exocytosis and endocytosis through pH-dependent changes in fluorescence, and genetically encoded calcium indicators (GECIs) that exhibit changes in fluorescence upon binding to calcium. The recent expansion of the color palette of available indicators has made it possible to image multiple probes simultaneously within a cell. We have constructed a single molecule reporter capable of concurrent imaging of both presynaptic calcium influx and exocytosis, by fusion of sypHy, the vesicle associated protein synaptophysin containing a GFP-based pHluorin sensor, with the red-shifted GECI R-GECO1. Due to the fixed stoichiometry of the two probes, the ratio of the two responses can also be measured, providing an all-optical correlate of the calcium dependence of release. Here, we have characterized stimulus-evoked sypHy-RGECO responses of hippocampal synapses in vitro, exploring the effects of different stimulus strengths and frequencies as well as variations in external calcium concentrations. By combining live sypHy-RGECO imaging with post-hoc fixation and immunofluorescence, we have also investigated correlations between structural and functional properties of synapses.
Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis.

Science.gov (United States)

Yutin, Natalya; Bäckström, Disa; Ettema, Thijs J G; Krupovic, Mart; Koonin, Eugene V

2018-04-10

Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruses, or those that have been considered rare, actually are important components of the global virome. We investigated the provenance of viruses related to tail-less bacteriophages of the family Tectiviridae by searching genomic and metagenomics sequence databases for distant homologs of the tectivirus-like Double Jelly-Roll major capsid proteins (DJR MCP). These searches resulted in the identification of numerous genomes of virus-like elements that are similar in size to tectiviruses (10-15 kilobases) and have diverse gene compositions. By comparison of the gene repertoires, the DJR MCP-encoding genomes were classified into 6 distinct groups that can be predicted to differ in reproduction strategies and host ranges. Only the DJR MCP gene that is present by design is shared by all these genomes, and most also encode a predicted DNA-packaging ATPase; the rest of the genes are present only in subgroups of this unexpectedly diverse collection of DJR MCP-encoding genomes. Only a minority encode a DNA polymerase which is a hallmark of the family Tectiviridae and the putative family "Autolykiviridae". Notably, one of the identified putative DJR MCP viruses encodes a homolog of Cas1 endonuclease, the integrase involved in CRISPR-Cas adaptation and integration of transposon-like elements called casposons. This is the first detected occurrence of Cas1 in a virus. Many of the identified elements are individual contigs flanked by inverted or direct repeats and appear to represent complete, extrachromosomal viral genomes, whereas others are flanked by bacterial genes and thus can be considered as proviruses. These contigs come from metagenomes of widely different environments, some dominated by
Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing

Directory of Open Access Journals (Sweden)

Wang Nian

2012-08-01

Background: Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results: An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions: The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison.
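In-silico digestion-site prediction, as mentioned in the Results, amounts to scanning a reference sequence for an enzyme's recognition site and examining the resulting fragments. The following minimal sketch shows that idea on one strand only. The abstract does not name the enzyme that was chosen, so EcoRI (recognition site GAATTC, cutting after the first G) is used purely as a familiar example, and the helper names `cut_positions` and `fragment_lengths` are hypothetical.

```python
def cut_positions(seq, site):
    """0-based start index of every occurrence of `site` in `seq`."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def fragment_lengths(seq, site, cut_offset):
    """Lengths of the fragments produced by cutting a linear sequence.

    `cut_offset` is the number of bases from the start of the recognition
    site to the cut point (1 for EcoRI, which cuts G^AATTC).
    """
    cuts = [pos + cut_offset for pos in cut_positions(seq, site)]
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Hypothetical 20-base toy sequence containing two EcoRI sites.
toy = "AAGAATTCTTTTGAATTCGG"
print(cut_positions(toy, "GAATTC"))        # site starts
print(fragment_lengths(toy, "GAATTC", 1))  # fragment sizes after digestion
```

Comparing the number of sites and the fragment-size distribution across candidate enzymes is one plausible way to choose among them; the criteria actually used in the study are not stated in the abstract.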
Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries.

Science.gov (United States)

Otsuki, Tetsuji; Ota, Toshio; Nishikawa, Tetsuo; Hayashi, Koji; Suzuki, Yutaka; Yamamoto, Jun-ichi; Wakamatsu, Ai; Kimura, Kouichi; Sakamoto, Katsuhiko; Hatano, Naoto; Kawai, Yuri; Ishii, Shizuko; Saito, Kaoru; Kojima, Shin-ichi; Sugiyama, Tomoyasu; Ono, Tetsuyoshi; Okano, Kazunori; Yoshikawa, Yoko; Aotsuka, Satoshi; Sasaki, Naokazu; Hattori, Atsushi; Okumura, Koji; Nagai, Keiichi; Sugano, Sumio; Isogai, Takao

2005-01-01

We have developed an in silico method of selection of human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combination of the oligo-capping method and ATGpr, software for prediction of translation start point and the coding potential. Then, using 5'-end single-pass sequences, cDNAs having the signal sequence were selected by PSORT ('signal sequence trap'). We also applied 'secretion or membrane protein-related keyword trap' based on the result of BLAST search against the SWISS-PROT database for the cDNAs which could not be selected by PSORT. Using the above procedures, 789 cDNAs were primarily selected and subjected to full-length sequencing, and 334 of these cDNAs were finally selected as novel. Most of the cDNAs (295 cDNAs: 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165 (80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by 'keyword trap' preserved the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including transporters, receptors, and ligands, involved in significant cellular functions. Thus, an efficient method of selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.
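The counts reported above can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers stated in the abstract; the variable names and the helper `pct` are illustrative. The two selection routes add up to the 334 novel cDNAs, and the first two percentages reproduce exactly. The last ratio rounds to 54.3% rather than the 54.2% printed in the abstract, a difference that would be consistent with truncation instead of rounding.

```python
def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


novel_total = 334        # cDNAs finally selected as novel
by_psort = 205           # selected by the signal sequence trap (PSORT)
by_keyword = 129         # selected by the keyword trap
predicted_secretion_or_membrane = 295
psort_with_signal = 165
keyword_preserved = 70

print(by_psort + by_keyword == novel_total)               # the two routes partition the set
print(pct(predicted_secretion_or_membrane, novel_total))  # abstract: 88.3
print(pct(psort_with_signal, by_psort))                   # abstract: 80.5
print(pct(keyword_preserved, by_keyword))                 # abstract: 54.2
```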
A genetically-encoded chloride and pH sensor for dissociating ion dynamics in the nervous system

OpenAIRE

Raimondo, Joseph V.; Joyce, Bradley; Kay, Louise; Schlagheck, Theresa; Newey, Sarah E.; Srinivas, Shankar; Akerman, Colin J.

2013-01-01

Within the nervous system, intracellular Cl− and pH regulate fundamental processes including cell proliferation, metabolism, synaptic transmission, and network excitability. Cl− and pH are often co-regulated, and network activity results in the movement of both Cl− and H+. Tools to accurately measure these ions are crucial for understanding their role under physiological and pathological conditions. Although genetically-encoded Cl− and pH sensors have been described previously, these either l...
Molecular mechanisms for protein-encoded inheritance

Science.gov (United States)

Wiltzius, Jed J. W.; Landau, Meytal; Nelson, Rebecca; Sawaya, Michael R.; Apostol, Marcin I.; Goldschmidt, Lukasz; Soriaga, Angela B.; Cascio, Duilio; Rajashankar, Kanagalaghatta; Eisenberg, David

2013-01-01

Strains are phenotypic variants, encoded by nucleic acid sequences in chromosomal inheritance and by protein “conformations” in prion inheritance and transmission. But how is a protein “conformation” stable enough to endure transmission between cells or organisms? Here new polymorphic crystal structures of segments of prion and other amyloid proteins offer structural mechanisms for prion strains. In packing polymorphism, prion strains are encoded by alternative packings (polymorphs) of β-sheets formed by the same segment of a protein; in a second mechanism, segmental polymorphism, prion strains are encoded by distinct β-sheets built from different segments of a protein. Both forms of polymorphism can produce enduring “conformations,” capable of encoding strains. These molecular mechanisms for transfer of information into prion strains share features with the familiar mechanism for transfer of information by nucleic acid inheritance, including sequence specificity and recognition by non-covalent bonds. PMID:19684598

The Bifidobacterium dentium Bd1 genome sequence reflects its genetic adaptation to the human oral cavity.

Directory of Open Access Journals (Sweden)

Marco Ventura

2009-12-01

Bifidobacteria, one of the relatively dominant components of the human intestinal microbiota, are considered one of the key groups of beneficial intestinal bacteria (probiotic bacteria). However, in addition to health-promoting taxa, the genus Bifidobacterium also includes Bifidobacterium dentium, an opportunistic cariogenic pathogen. The genetic basis for the ability of B. dentium to survive in the oral cavity and contribute to caries development is not understood. The genome of B. dentium Bd1, a strain isolated from dental caries, was sequenced to completion to uncover a single circular 2,636,368 base pair chromosome with 2,143 predicted open reading frames. Annotation of the genome sequence revealed multiple ways in which B. dentium has adapted to the oral environment through specialized nutrient acquisition, defences against antimicrobials, and gene products that increase fitness and competitiveness within the oral niche. B. dentium Bd1 was shown to metabolize a wide variety of carbohydrates, consistent with genome-based predictions, while colonization and persistence factors implicated in tissue adhesion, acid tolerance, and the metabolism of human saliva-derived compounds were also identified. Global transcriptome analysis demonstrated that many of the genes encoding these predicted traits are highly expressed under relevant physiological conditions. This is the first report to identify, through various genomic approaches, specific genetic adaptations of a Bifidobacterium taxon, Bifidobacterium dentium Bd1, to a lifestyle as a cariogenic microorganism in the oral cavity. In silico analysis and comparative genomic hybridization experiments clearly reveal a high level of genome conservation among various B. dentium strains. The data indicate that the genome of this opportunistic cariogen has evolved through a very limited number of horizontal gene acquisition events, highlighting the narrow boundaries that separate commensals from
Molecular evolution of avian reovirus: evidence for genetic diversity and reassortment of the S-class genome segments and multiple cocirculating lineages

International Nuclear Information System (INIS)

Liu, Hung J.; Lee, Long H.; Hsu, Hsiao W.; Kuo, Liam C.; Liao, Ming H.

2003-01-01

Nucleotide sequences of the S-class genome segments of 17 field-isolates and vaccine strains of avian reovirus (ARV) isolated over a 23-year period from different hosts, pathotypes, and geographic locations were examined and analyzed to define phylogenetic profiles and evolutionary mechanisms. The S1 genome segment showed noticeably higher divergence than the other S-class genes. The σC-encoding gene has evolved into six distinct lineages. In contrast, the other S-class genes showed less divergence than the σC-encoding gene and have each evolved into two to three major distinct lineages. Comparative sequence analysis provided evidence indicating extensive sequence divergence between ARV and other orthoreoviruses. The evolutionary trees of each gene were distinct, suggesting that these genes evolve in an independent manner. Furthermore, variable topologies were the result of frequent genetic reassortment among multiple cocirculating lineages. Results showed genetic diversity correlated more closely with date of isolation and geographic sites than with host species and pathotypes. This is the first evidence demonstrating genetic variability among circulating ARVs through a combination of evolutionary mechanisms involving multiple cocirculating lineages and genetic reassortment. The evolutionary rates and patterns of base substitutions were examined. The evolutionary rate for the σC-encoding gene and σC protein was higher than for the other S-class genes and other families of viruses. With the exception of the σC-encoding gene, in which nonsynonymous substitutions predominate over synonymous ones, the evolutionary process of the other S-class genes can be explained by the neutral theory of molecular evolution. Results revealed that synonymous substitutions predominate over nonsynonymous in the S-class genes, even though genetic diversity and substitution rates vary among the viruses
Asymmetrical distribution of non-conserved regulatory sequences at PHOX2B is reflected at the ENCODE loci and illuminates a possible genome-wide trend

Directory of Open Access Journals (Sweden)

McCallion Andrew S

2009-01-01

Background: Transcriptional regulatory elements are central to development and interspecific phenotypic variation. Current regulatory element prediction tools rely heavily upon conservation for prediction of putative elements. Recent in vitro observations from the ENCODE project combined with in vivo analyses at the zebrafish phox2b locus suggest that a significant fraction of regulatory elements may fall below commonly applied metrics of conservation. We propose to explore these observations in vivo at the human PHOX2B locus, and also evaluate the potential evidence for genome-wide applicability of these observations through a novel analysis of extant data. Results: Transposon-based transgenic analysis utilizing a tiling path proximal to human PHOX2B in zebrafish recapitulates the observations at the zebrafish phox2b locus of both conserved and non-conserved regulatory elements. Analysis of human sequences conserved with previously identified zebrafish phox2b regulatory elements demonstrates that the orthologous sequences exhibit overlapping regulatory control. Additionally, analysis of non-conserved sequences scattered over 135 kb 5' to PHOX2B provides evidence of non-conserved regulatory elements positively biased with close proximity to the gene. Furthermore, we provide a novel analysis of data from the ENCODE project, finding a non-uniform distribution of regulatory elements consistent with our in vivo observations at PHOX2B. These observations remain largely unchanged when one accounts for the sequence repeat content of the assayed intervals, when the intervals are sub-classified by biological role (developmental versus non-developmental), or by gene density (gene desert versus non-gene desert). Conclusion: While regulatory elements frequently display evidence of evolutionary conservation, a fraction appears to be undetected by current metrics of conservation. In vivo observations at the PHOX2B locus, supported by our analyses of in
Cloning, characterization and heterologous expression of epoxide hydrolase-encoding cDNA sequences from yeasts belonging to the genera Rhodotorula and Rhodosporidium

NARCIS (Netherlands)

Visser, H.; Weijers, C.A.G.M.; Ooyen, van A.J.J.; Verdoes, J.C.

2002-01-01

Epoxide hydrolase-encoding cDNA sequences were isolated from the basidiomycetous yeast species Rhodosporidium toruloides CBS 349, Rhodosporidium toruloides CBS 14 and Rhodotorula araucariae CBS 6031 in order to evaluate the molecular data and potential application of this type of enzyme. The
The protein encoded by the proto-oncogene DEK changes the topology of chromatin and reduces the efficiency of DNA replication in a chromatin-specific manner

DEFF Research Database (Denmark)

Alexiadis, V; Waldmann, T; Andersen, Jens S.

2000-01-01

The structure of chromatin regulates the genetic activity of the underlying DNA sequence. We report here that the protein encoded by the proto-oncogene DEK, which is involved in acute myelogenous leukemia, induces alterations of the superhelical density of DNA in chromatin. The change in topology...
A high-density genetic map for anchoring genome sequences and identifying QTLs associated with dwarf vine in pumpkin (Cucurbita maxima Duch.).

Science.gov (United States)

Zhang, Guoyu; Ren, Yi; Sun, Honghe; Guo, Shaogui; Zhang, Fan; Zhang, Jie; Zhang, Haiying; Jia, Zhangcai; Fei, Zhangjun; Xu, Yong; Li, Haizhen

2015-12-24

Pumpkin (Cucurbita maxima Duch.) is an economically important crop belonging to the Cucurbitaceae family. However, very few genomic and genetic resources are available for this species. As part of our ongoing efforts to sequence the pumpkin genome, a high-density genetic map is essential for anchoring and orienting the assembled scaffolds. In addition, a saturated genetic map can facilitate quantitative trait locus (QTL) mapping. A set of 186 F2 plants derived from the cross of pumpkin inbred lines Rimu and SQ026 was genotyped using the genotyping-by-sequencing approach. Using the SNPs we identified, a high-density genetic map containing 458 bin-markers was constructed, spanning a total genetic distance of 2,566.8 cM across the 20 linkage groups of C. maxima with a mean marker density of 5.60 cM. Using this map we were able to anchor 58 assembled scaffolds that covered about 194.5 Mb (71.7%) of the 271.4 Mb assembled pumpkin genome, of which 44 (183.0 Mb; 67.4%) were oriented. Furthermore, the high-density genetic map was used to identify genomic regions highly associated with an important agronomic trait, dwarf vine. Three QTLs on linkage groups (LGs) 1, 3 and 4, respectively, were recovered. One QTL, qCmB2, which was located in an interval of 0.42 Mb on LG 3, explained 21.4% of the phenotypic variation. Within qCmB2, one gene, Cma_004516, encoding the gibberellin (GA) 20-oxidase in the GA biosynthesis pathway, had a 1249-bp deletion in its promoter in bush type lines, and its expression level was significantly increased during the vine growth and higher in vine type lines than bush type lines, supporting Cma_004516 as a possible candidate gene controlling vine growth in pumpkin. A high-density pumpkin genetic map was constructed, which was used to successfully anchor and orient the assembled genome scaffolds, and to identify QTLs highly associated with pumpkin vine length. The map provided a valuable resource for gene cloning and marker assisted breeding in pumpkin and
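The summary figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are made up here, and it assumes the reported mean marker density is the total map length divided by the number of bin-markers (dividing by the number of marker intervals would give a slightly larger value).

```python
# Figures quoted in the abstract above; variable names are illustrative only.
total_map_length_cm = 2566.8   # total genetic distance of the map (cM)
bin_markers = 458              # number of bin-markers on the map
assembly_mb = 271.4            # size of the assembled genome (Mb)
anchored_mb = 194.5            # scaffolds anchored to the map (Mb)
oriented_mb = 183.0            # scaffolds anchored and oriented (Mb)

# Assumption: mean density = map length / number of bin-markers.
mean_spacing_cm = total_map_length_cm / bin_markers

# Share of the assembly that was anchored, and that was also oriented.
anchored_pct = 100 * anchored_mb / assembly_mb
oriented_pct = 100 * oriented_mb / assembly_mb

print(f"{mean_spacing_cm:.2f} cM per bin-marker")
print(f"{anchored_pct:.1f}% anchored, {oriented_pct:.1f}% oriented")
```

Under that assumption the three values round to the 5.60 cM, 71.7% and 67.4% reported above.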
Genetic variability of Echinococcus granulosus from the Tibetan plateau inferred by mitochondrial DNA sequences.

Science.gov (United States)

Yan, Ning; Nie, Hua-Ming; Jiang, Zhong-Rong; Yang, Ai-Guo; Deng, Shi-Jin; Guo, Li; Yu, Hua; Yan, Yu-Bao; Tsering, Dawa; Kong, Wei-Shu; Wang, Ning; Wang, Jia-Hai; Xie, Yue; Fu, Yan; Yang, De-Ying; Wang, Shu-Xian; Gu, Xiao-Bin; Peng, Xue-Rong; Yang, Guang-You

2013-09-01

To analyse genetic variability and population structure, 84 isolates of Echinococcus granulosus (Cestoda: Taeniidae) collected from various host species at different sites of the Tibetan plateau in China were sequenced for the whole mitochondrial nad1 (894 bp) and atp6 (513 bp) genes. The vast majority were classified as G1 genotype (n=82), and two samples from human patients in Sichuan province were identified as G3 genotype. Based on the concatenated sequences of nad1+atp6, 28 different haplotypes (NA1-NA28) were identified. A parsimonious network of the concatenated sequence haplotypes showed star-like features in the overall population, with NA1 as the major haplotype in the population networks. By AMOVA it was shown that variation of E. granulosus within the overall population was the main pattern of the total genetic variability. Neutrality indexes of the concatenated sequence (nad1+atp6) were computed by Tajima's D and Fu's Fs tests and showed high negative values for E. granulosus, indicating significant deviations from neutrality. FST and Nm values suggested that the populations were not genetically differentiated. Copyright © 2013 Elsevier B.V. All rights reserved.
An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people

DEFF Research Database (Denmark)

Nelson, Matthew R.; Wegmann, Daniel; Ehm, Margaret G.

2012-01-01

Rare genetic variants contribute to complex disease risk; however, the abundance of rare variants in human populations remains unknown. We explored this spectrum of variation by sequencing 202 genes encoding drug targets in 14,002 individuals. We find rare variants are abundant (1 every 17 bases)...
The Matrix Method of Representation, Analysis and Classification of Long Genetic Sequences

Directory of Open Access Journals (Sweden)

Ivan V. Stepanyan

2017-01-01

The article is devoted to a matrix method of comparative analysis of long nucleotide sequences by means of presenting each sequence in the form of three digital binary sequences. This method uses a set of symmetries of biochemical attributes of nucleotides. It also uses the possibility of presentation of every whole set of N-mers as one of the members of a Kronecker family of genetic matrices. With this method, a long nucleotide sequence can be visually represented as an individual fractal-like mosaic or another regular mosaic of binary type. In contrast to natural nucleotide sequences, artificial random sequences give non-regular patterns. Examples of binary mosaics of long nucleotide sequences are shown, including cases of human chromosomes and penicillins. The obtained results are then discussed.
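As a rough illustration of representing one nucleotide sequence as three binary sequences, the sketch below maps each base to three 0/1 attributes. The choice of attributes (purine versus pyrimidine, amino versus keto, strong versus weak hydrogen bonding) and the 0/1 assignments are assumptions made for this example; the abstract does not state the article's exact conventions, and the function name `to_binary_tracks` is hypothetical.

```python
# Assumed attribute tables (not taken from the article).
PURINE = {"A": 1, "G": 1, "C": 0, "T": 0}   # purine (1) vs pyrimidine (0)
AMINO = {"A": 1, "C": 1, "G": 0, "T": 0}    # amino (1) vs keto (0)
STRONG = {"G": 1, "C": 1, "A": 0, "T": 0}   # three (1) vs two (0) hydrogen bonds


def to_binary_tracks(seq):
    """Return three 0/1 lists, one per attribute, for a DNA string.

    Raises KeyError for symbols other than A, C, G, T (for example N).
    """
    seq = seq.upper()
    return tuple([table[base] for base in seq]
                 for table in (PURINE, AMINO, STRONG))


purine_track, amino_track, strong_track = to_binary_tracks("ACGT")
print(purine_track, amino_track, strong_track)
```

Each of the three tracks could then be laid out row by row as a black-and-white mosaic, which is one plausible reading of the visual representation the abstract describes.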
Human genetics and genomics a decade after the release of the draft sequence of the human genome

Science.gov (United States)

2011-01-01

Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade. PMID:22155605
Genome-wide comparative analysis of NBS-encoding genes between Brassica species and Arabidopsis thaliana.

Science.gov (United States)

Yu, Jingyin; Tehrim, Sadia; Zhang, Fengqi; Tong, Chaobo; Huang, Junyan; Cheng, Xiaohui; Dong, Caihua; Zhou, Yanqiu; Qin, Rui; Hua, Wei; Liu, Shengyi

2014-01-03

Plant disease resistance (R) genes with the nucleotide binding site (NBS) play an important role in offering resistance to pathogens. The availability of complete genome sequences of Brassica oleracea and Brassica rapa provides an important opportunity for researchers to identify and characterize NBS-encoding R genes in Brassica species and to compare with analogues in Arabidopsis thaliana based on a comparative genomics approach. However, little is known about the evolutionary fate of NBS-encoding genes in the Brassica lineage after the split from A. thaliana. Here we present genome-wide analysis of NBS-encoding genes in B. oleracea, B. rapa and A. thaliana. Through the employment of HMM search and manual curation, we identified 157, 206 and 167 NBS-encoding genes in B. oleracea, B. rapa and A. thaliana genomes, respectively. Phylogenetic analysis among 3 species classified NBS-encoding genes into 6 subgroups. Tandem duplication and whole genome triplication (WGT) analyses revealed that after WGT of the Brassica ancestor, NBS-encoding homologous gene pairs on triplicated regions in Brassica ancestor were deleted or lost quickly, but NBS-encoding genes in Brassica species experienced species-specific gene amplification by tandem duplication after divergence of B. rapa and B. oleracea. Expression profiling of NBS-encoding orthologous gene pairs indicated the differential expression pattern of retained orthologous gene copies in B. oleracea and B. rapa. Furthermore, evolutionary analysis of CNL type NBS-encoding orthologous gene pairs among 3 species suggested that orthologous genes in B. rapa have undergone stronger negative selection than those in B. oleracea. But for TNL type, there are no significant differences in the orthologous gene pairs between the two species. This study is the first identification and characterization of NBS-encoding genes in B. rapa and B. oleracea based on whole genome sequences. Through tandem duplication and whole genome
Genetical and functional investigation of fliC genes encoding flagellar serotype H4 in wildtype strains of Escherichia coli and in a laboratory E. coli K-12 strain expressing flagellar antigen type H48

Directory of Open Access Journals (Sweden)

Schaudinn Christoph

2005-01-01

Background: Serotyping of O- (lipopolysaccharide) and H- (flagellar) antigens is a widely used method for identification of pathogenic strains and clones of Escherichia coli. At present, 176 O- and 53 H-antigens are described for E. coli which occur in different combinations in the strains. The flagellar antigen H4 is widely present in E. coli strains of different O-serotypes and pathotypes and we have investigated the genetic relationship between H4 encoding fliC genes by PCR, nucleotide sequencing and expression studies. Results: The complete nucleotide sequence of fliC genes present in E. coli reference strains U9-41 (O2:K1:H4) and P12b (O15:H17) was determined and both were found to be 99.3% (1043 of 1050 nucleotides) identical in their coding sequence. A PCR/RFLP protocol was developed for typing of fliC-H4 strains and 88 E. coli strains reacting with H4 antiserum were investigated. Nucleotide sequencing of complete fliC genes of six E. coli strains which were selected based on serum agglutination titers, fliC-PCR genotyping and reference data revealed 96.6 to 100% identity on the amino acid level. The functional expression of flagellin encoded by fliC-H4 from strain U9-41 and from our strain P12b which is an H4 expressing variant type was investigated in the E. coli K-12 strain JM109 which encodes flagellar type H48. The fliC recombinant plasmid carrying JM109 strains reacted with both H4 and H48 specific antisera whereas JM109 reacted only with the H48 antiserum. By immunoelectron microscopy, we could show that the flagella made by the fliC-H4 recombinant plasmid carrying strain are constituted of H48 and H4 flagellins which are co-assembled into functional flagella. Conclusion: The flagellar serotype H4 is encoded by closely related fliC genes present in serologically different types of E. coli strains which were isolated at different time periods and geographical locations. Our expression studies show for the first time that flagellins of
Spatially conserved regulatory elements identified within human and mouse Cd247 gene using high-throughput sequencing data from the ENCODE project

DEFF Research Database (Denmark)

Pundhir, Sachin; Hannibal, Tine Dahlbæk; Bang-Berthelsen, Claus Heiner

2014-01-01

In this study, we have utilized the wealth of high-throughput sequencing data produced during the Encyclopedia of DNA Elements (ENCODE) project to identify spatially conserved regulatory elements within the Cd247 gene from human and mouse. We show the presence of two transcription factor binding sites...
Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing

NARCIS (Netherlands)

Aflitos, S.A.; Schijlen, E.G.W.M.; Jong, de J.H.S.G.M.; Ridder, de D.; Smit, S.; Finkers, H.J.; Bakker, F.T.; Geest, van de H.C.; Lintel Hekkert, te B.; Haarst, van J.C.; Smits, L.W.M.; Koops, A.J.; Sanchez-Perez, M.J.; Heusden, van A.W.; Visser, R.G.F.; Schranz, M.E.; Peters, S.A.

2014-01-01

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative for the Lycopersicon, Arcanum, Eriopersicon, and Neolycopersicon groups which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new
Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing

NARCIS (Netherlands)

Aflitos, S.; Schijlen, E.; de Jong, H.; de Ridder, D.; Smit, S.; Finkers, R.; Wang, J.; Zhang, G.; Li, N.; Mao, L.; Bakker, F.; Dirks, R.; Breit, T.; Gravendeel, B.; Huits, H.; Struss, D.; Swanson-Wagner, R.; van Leeuwen, H.; van Ham, R.C.H.J.; Fito, L.; Guignier, L.; Sevilla, M.; Ellul, P.; Ganko, E.; Kapur, A.; Reclus, E.; de Geus, B.; van de Geest, H.; te Lintel Hekkert, B.; van Haarst, J.; Smits, L.; Koops, A.; Sanchez-Perez, G.; van Heusden, A.W.; Visser, R.; Quan, Z.; Min, J.; Liao, L.; Wang, X.; Wang, G.; Yue, Z.; Yang, X.; Xu, N.; Schranz, E.; Smets, E.; Vos, R.; Rauwerda, J.; Ursem, R.; Schuit, C.; Kerns, M.; van den Berg, J.; Vriezen, W.; Janssen, A.; Datema, E.; Jahrman, T.; Moquet, F.; Bonnet, J.; Peters, S.

2014-01-01

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new
Genetic architecture of retinal and macular degenerative diseases: the promise and challenges of next-generation sequencing

Science.gov (United States)

2013-01-01

Inherited retinal degenerative diseases (RDDs) display wide variation in their mode of inheritance, underlying genetic defects, age of onset, and phenotypic severity. Molecular mechanisms have not been delineated for many retinal diseases, and treatment options are limited. In most instances, genotype-phenotype correlations have not been elucidated because of extensive clinical and genetic heterogeneity. Next-generation sequencing (NGS) methods, including exome, genome, transcriptome and epigenome sequencing, provide novel avenues towards achieving comprehensive understanding of the genetic architecture of RDDs. Whole-exome sequencing (WES) has already revealed several new RDD genes, whereas RNA-Seq and ChIP-Seq analyses are expected to uncover novel aspects of gene regulation and biological networks that are involved in retinal development, aging and disease. In this review, we focus on the genetic characterization of retinal and macular degeneration using NGS technology and discuss the basic framework for further investigations. We also examine the challenges of NGS application in clinical diagnosis and management. PMID:24112618
Cloning and sequencing of a gene encoding a 21-kilodalton outer membrane protein from Bordetella avium and expression of the gene in Salmonella typhimurium.

Science.gov (United States)

Gentry-Weeks, C R; Hultsch, A L; Kelly, S M; Keith, J M; Curtiss, R

1992-01-01

Three gene libraries of Bordetella avium 197 DNA were prepared in Escherichia coli LE392 by using the cosmid vectors pCP13 and pYA2329, a derivative of pCP13 specifying spectinomycin resistance. The cosmid libraries were screened with convalescent-phase anti-B. avium turkey sera and polyclonal rabbit antisera against B. avium 197 outer membrane proteins. One E. coli recombinant clone produced a 56-kDa protein which reacted with convalescent-phase serum from a turkey infected with B. avium 197. In addition, five E. coli recombinant clones were identified which produced B. avium outer membrane proteins with molecular masses of 21, 38, 40, 43, and 48 kDa. At least one of these E. coli clones, which encoded the 21-kDa protein, reacted with both convalescent-phase turkey sera and antibody against B. avium 197 outer membrane proteins. The gene for the 21-kDa outer membrane protein was localized by Tn5seq1 mutagenesis, and the nucleotide sequence was determined by dideoxy sequencing. DNA sequence analysis of the 21-kDa protein revealed an open reading frame of 582 bases that resulted in a predicted protein of 194 amino acids. Comparison of the predicted amino acid sequence of the gene encoding the 21-kDa outer membrane protein with protein sequences in the National Biomedical Research Foundation protein sequence data base indicated significant homology to the OmpA proteins of Shigella dysenteriae, Enterobacter aerogenes, E. coli, and Salmonella typhimurium and to Neisseria gonorrhoeae outer membrane protein III, Haemophilus influenzae protein P6, and Pseudomonas aeruginosa porin protein F. The gene (ompA) encoding the B. avium 21-kDa protein hybridized with 4.1-kb DNA fragments from EcoRI-digested, chromosomal DNA of Bordetella pertussis and Bordetella bronchiseptica and with 6.0- and 3.2-kb DNA fragments from EcoRI-digested, chromosomal DNA of B. avium and B. avium-like DNA, respectively. A 6.75-kb DNA fragment encoding the B. avium 21-kDa protein was subcloned into the
Multi-species sequence comparison reveals conservation of ghrelin gene-derived splice variants encoding a truncated ghrelin peptide.

Science.gov (United States)

Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K

2016-06-01

The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
Genetic analysis of 430 Chinese Cynodon dactylon accessions using sequence-related amplified polymorphism markers.

Science.gov (United States)

Huang, Chunqiong; Liu, Guodao; Bai, Changjun; Wang, Wenqiang

2014-10-21

Although Cynodon dactylon (C. dactylon) is widely distributed in China, information on its genetic diversity within the germplasm pool is limited. The objective of this study was to reveal the genetic variation and relationships of 430 C. dactylon accessions collected from 22 Chinese provinces using sequence-related amplified polymorphism (SRAP) markers. Fifteen primer pairs were used to amplify specific C. dactylon genomic sequences. A total of 481 SRAP fragments were generated, with fragment sizes ranging from 260 to 1800 base pairs (bp). Genetic similarity coefficients (GSC) among the 430 accessions averaged 0.72 and ranged from 0.53 to 0.96. Cluster analysis conducted by two methods, namely the unweighted pair-group method with arithmetic averages (UPGMA) and principal coordinate analysis (PCoA), separated the accessions into eight distinct groups. Our findings verify that Chinese C. dactylon germplasms have rich genetic diversity, which is an excellent basis for C. dactylon breeding for new cultivars.
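A genetic similarity coefficient of the kind reported above is typically computed from presence/absence scores of marker bands. The sketch below uses the Dice (Nei-Li) coefficient as an assumed choice; the abstract does not say which coefficient the authors used, and the band profiles and the function name `dice_similarity` are invented for illustration.

```python
def dice_similarity(profile_a, profile_b):
    """Dice coefficient 2a / (2a + b + c) for two 0/1 band profiles.

    a = bands present in both accessions, b and c = bands present in
    only one of them. Assumed coefficient, not confirmed by the source.
    """
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must score the same set of bands")
    shared = sum(1 for x, y in zip(profile_a, profile_b) if x == 1 and y == 1)
    total = sum(profile_a) + sum(profile_b)
    if total == 0:
        raise ValueError("similarity is undefined when neither profile has bands")
    return 2 * shared / total


# Hypothetical scores for five SRAP bands in two accessions.
accession_1 = [1, 1, 0, 1, 0]
accession_2 = [1, 0, 0, 1, 1]
print(round(dice_similarity(accession_1, accession_2), 3))
```

A full analysis would compute this value for every pair of accessions and pass the resulting matrix to a clustering method such as UPGMA.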
Genetic mapping and exome sequencing identify variants associated with five novel diseases.

Directory of Open Access Journals (Sweden)

Erik G Puffenberger

The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data.

Improved entropy encoding for high efficient video coding standard

Directory of Open Access Journals (Sweden)

B.S. Sunil Kumar

2018-03-01

High Efficiency Video Coding (HEVC) has better coding efficiency, but the encoding performance has to be improved to meet the growing multimedia applications. This paper improves the standard entropy encoding by introducing optimized weighting parameters, so that a higher rate of compression can be accomplished over the standard entropy encoding. The optimization is performed using the recently introduced firefly algorithm. The experimentation is carried out using eight benchmark video sequences and the PSNR for varying rate of data transmission is investigated. Comparative analysis based on the performance statistics is made with the standard entropy encoding. From the obtained results, it is clear that the originality of the decoded video sequence is preserved far better by the proposed method, even though the compression rate is increased. Keywords: Entropy, Encoding, HEVC, PSNR, Compression
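The PSNR used as the quality measure in this abstract is defined as 10·log10(MAX² / MSE), where MSE is the mean squared error between reference and decoded samples. The sketch below shows that computation for 8-bit samples held in flat lists; the sample values and the function name `psnr` are illustrative and not taken from the paper.

```python
import math


def psnr(reference, decoded, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-length sample lists."""
    if not reference or len(reference) != len(decoded):
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all samples.
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10 * math.log10(max_value ** 2 / mse)


# Hypothetical luma samples from an original and a decoded frame.
original = [52, 55, 61, 59]
decoded = [54, 55, 60, 58]
print(f"{psnr(original, decoded):.2f} dB")
```

A higher PSNR at the same bit rate means the decoded sequence is closer to the original, which is the sense in which the abstract compares the two entropy coders.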
A bacterial genetic screen identifies functional coding sequences of the insect mariner transposable element Famar1 amplified from the genome of the earwig, Forficula auricularia.

Science.gov (United States)

Barry, Elizabeth G; Witherspoon, David J; Lampe, David J

2004-02-01

Transposons of the mariner family are widespread in animal genomes and have apparently infected them by horizontal transfer. Most species carry only old defective copies of particular mariner transposons that have diverged greatly from their active horizontally transferred ancestor, while a few contain young, very similar, and active copies. We report here the use of a whole-genome screen in bacteria to isolate somewhat diverged Famar1 copies from the European earwig, Forficula auricularia, that encode functional transposases. Functional and nonfunctional coding sequences of Famar1 and nonfunctional copies of Ammar1 from the European honey bee, Apis mellifera, were sequenced to examine their molecular evolution. No selection for sequence conservation was detected in any clade of a tree derived from these sequences, not even on branches leading to functional copies. This agrees with the current model for mariner transposon evolution that expects neutral evolution within particular hosts, with selection for function occurring only upon horizontal transfer to a new host. Our results further suggest that mariners are not finely tuned genetic entities and that a greater amount of sequence diversification than had previously been appreciated can occur in functional copies in a single host lineage. Finally, this method of isolating active copies can be used to isolate other novel active transposons without resorting to reconstruction of ancestral sequences.
Using multi-locus allelic sequence data to estimate genetic divergence among four Lilium (Liliaceae) cultivars

NARCIS (Netherlands)

Shahin, A.; Smulders, M.J.M.; Tuyl, van J.M.; Arens, P.F.P.; Bakker, F.T.

2014-01-01

Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from
Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing

Science.gov (United States)

Dasgupta, Modhumita Ghosh; Dharanishanthi, Veeramuthu; Agarwal, Ishangi; Krutovsky, Konstantin V.

2015-01-01

The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety-four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed, high-quality reads were mapped to the E. grandis reference sequence, and single nucleotide variants (SNVs) and insertions/deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method for development of genetic markers for family-based QTL and association analysis in Eucalyptus. PMID:25602379
Development of genetic markers in Eucalyptus species by target enrichment and exome sequencing.

Directory of Open Access Journals (Sweden)

Modhumita Ghosh Dasgupta

The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety-four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed, high-quality reads were mapped to the E. grandis reference sequence, and single nucleotide variants (SNVs) and insertions/deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method for development of genetic markers for family-based QTL and association analysis in Eucalyptus.
[Application of next-generation semiconductor sequencing technologies in genetic diagnosis of inherited cardiomyopathies].

Science.gov (United States)

Zhao, Yue; Zhang, Hong; Xia, Xue-shan

2015-07-01

Inherited cardiomyopathy is the most common hereditary cardiac disease. It also causes a significant proportion of sudden cardiac deaths in young adults and athletes. So far, approximately one hundred genes have been reported to be involved in cardiomyopathies through different mechanisms. Therefore, identifying the genetic basis and disease mechanisms of cardiomyopathies is important for establishing a clinical diagnosis and genetic testing. The next-generation semiconductor sequencing (NGSS) technology platform is a high-throughput sequencer capable of analyzing clinically derived genomes with high productivity, sensitivity and specificity. It was launched in 2010 by Life Technologies of USA, and it is based on a high density semiconductor chip, which is covered with tens of thousands of wells. NGSS has been successfully used in candidate gene mutation screening to identify hereditary disease. In this review, we summarize these genetic variations, the challenges and applications of NGSS in inherited cardiomyopathy, and its value in disease diagnosis, prevention and treatment.
Transcriptome sequencing of the Antarctic vascular plant Deschampsia antarctica Desv. under abiotic stress.

Science.gov (United States)

Lee, Jungeun; Noh, Eun Kyeung; Choi, Hyung-Seok; Shin, Seung Chul; Park, Hyun; Lee, Hyoungseok

2013-03-01

Antarctic hairgrass (Deschampsia antarctica Desv.) is the only natural grass species in the maritime Antarctic. It has been studied as an extremophile that has successfully adapted to marginal land with the harshest environment for terrestrial plants. However, limited genetic research has focused on this species due to the lack of genomic resources. Here, we present the first de novo assembly of its transcriptome by massive parallel sequencing and its expression profile using D. antarctica grown under various stress conditions. Total sequence reads generated by pyrosequencing were assembled into 60,765 unigenes (28,177 contigs and 32,588 singletons). A total of 29,173 unique protein-coding genes were identified based on sequence similarities to known proteins. The combined results from all three stress conditions indicated differential expression of 3,110 genes. Quantitative reverse transcription polymerase chain reaction showed that several well-known stress-responsive genes encoding late embryogenesis abundant protein, dehydrin 1, and ice recrystallization inhibition protein were induced dramatically and that genes encoding U-box-domain-containing protein, electron transfer flavoprotein-ubiquinone, and F-box-containing protein were induced by abiotic stressors in a manner conserved with other plant species. We identified more than 2,000 simple sequence repeats that can be developed as functional molecular markers. This dataset is the most comprehensive transcriptome resource currently available for D. antarctica and is therefore expected to be an important foundation for future genetic studies of grasses and extremophiles.
MUREIN-METABOLIZING ENZYMES FROM ESCHERICHIA-COLI - SEQUENCE-ANALYSIS AND CONTROLLED OVEREXPRESSION OF THE SLT GENE, WHICH ENCODES THE SOLUBLE LYTIC TRANSGLYCOSYLASE

NARCIS (Netherlands)

ENGEL, H; KAZEMIER, B; KECK, W

The complete nucleotide sequence of the slt gene encoding the soluble lytic transglycosylase (Slt; EC 3.2.1.-) from Escherichia coli has been determined. The largest open reading frame identified on a 2.5-kb PvuII-SalI fragment indicates that the enzyme is translated as a preprotein of either 654 or
Global sequence diversity of the lactate dehydrogenase gene in Plasmodium falciparum.

Science.gov (United States)

Simpalipan, Phumin; Pattaradilokrat, Sittiporn; Harnyuttanakorn, Pongchai

2018-01-09

Antigen-detecting rapid diagnostic tests (RDTs) have been recommended by the World Health Organization for use in remote areas to improve malaria case management. Lactate dehydrogenase (LDH) of Plasmodium falciparum is one of the main parasite antigens employed by various commercial RDTs. It has been hypothesized that the poor detection of LDH-based RDTs is attributed in part to the sequence diversity of the gene. To test this, the present study aimed to investigate the genetic diversity of the P. falciparum ldh gene in Thailand and to construct the map of LDH sequence diversity in P. falciparum populations worldwide. The ldh gene was sequenced for 50 P. falciparum isolates in Thailand and compared with hundreds of sequences from P. falciparum populations worldwide. Several indices of molecular variation were calculated, including the proportion of polymorphic sites, the average nucleotide diversity index (π), and the haplotype diversity index (H). Tests of positive selection and neutrality tests were performed to determine signatures of natural selection on the gene. Mean genetic distance within and between species of Plasmodium ldh was analysed to infer evolutionary relationships. Nucleotide sequences of P. falciparum ldh could be classified into 9 alleles, encoding 5 isoforms of LDH. L1a was the most common allelic type and was distributed in P. falciparum populations worldwide. Plasmodium falciparum ldh sequences were highly conserved, with haplotype and nucleotide diversity values of 0.203 and 0.0004, respectively. The extremely low genetic diversity was maintained by purifying selection, likely due to functional constraints. Phylogenetic analysis inferred the close genetic relationship of P. falciparum to malaria parasites of great apes, rather than to other human malaria parasites. This study revealed the global genetic variation of the ldh gene in P. falciparum, providing knowledge for improving detection of LDH-based RDTs and supporting the candidacy of
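The two summary statistics named in the abstract above, nucleotide diversity (π) and haplotype diversity (H), can be illustrated with a minimal Python sketch. It assumes the sequences are already aligned and of equal length, and it uses the common textbook definitions (average pairwise differences per site for π, and Nei's estimator for H); the function names and the toy sequences are hypothetical and are not taken from the study.

```python
from collections import Counter
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average number of pairwise differences per site (pi) for aligned sequences."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    # Sum mismatches over every unordered pair of sequences.
    total = sum(
        sum(a != b for a, b in zip(s1, s2))
        for s1, s2 in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) // 2
    return total / (n_pairs * length)


def haplotype_diversity(seqs):
    """Nei's haplotype diversity: H = n / (n - 1) * (1 - sum of squared frequencies)."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    counts = Counter(seqs)
    return n / (n - 1) * (1 - sum((c / n) ** 2 for c in counts.values()))


# Toy alignment (hypothetical data): three identical sequences and one variant.
toy = ["ACGT", "ACGT", "ACGA", "ACGT"]
pi = nucleotide_diversity(toy)
h = haplotype_diversity(toy)
```

A highly conserved gene gives values of both statistics close to zero, which is the pattern the abstract reports for P. falciparum ldh.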
NUCLEOTIDE SEQUENCING AND TRANSCRIPTIONAL MAPPING OF THE GENES ENCODING BIPHENYL DIOXYGENASE, A MULTICOMPONENT POLYCHLORINATED-BIPHENYL-DEGRADING ENZYME IN PSEUDOMONAS STRAIN LB400

Science.gov (United States)

The DNA region encoding biphenyl dioxygenase, the first enzyme in the biphenyl-polychlorinated biphenyl degradation pathway of Pseudomonas species strain LB400, was sequenced. Six open reading frames were identified, four of which are homologous to the components of toluene dioxy...
Genotyping-By-Sequencing for Plant Genetic Diversity Analysis: A Lab Guide for SNP Genotyping

Directory of Open Access Journals (Sweden)

Gregory W. Peterson

2014-10-01

Genotyping-by-sequencing (GBS) has recently emerged as a promising genomic approach for exploring plant genetic diversity on a genome-wide scale. However, many uncertainties and challenges remain in the application of GBS, particularly in non-model species. Here, we present a GBS protocol we developed and use for plant genetic diversity analysis. It uses two restriction enzymes to reduce genome complexity, applies Illumina multiplexing indexes for barcoding and has a custom bioinformatics pipeline for genotyping. This genetic diversity-focused GBS (gd-GBS) protocol can serve as an easy-to-follow lab guide to assist a researcher through every step of a GBS application with five main components: sample preparation, library assembly, sequencing, SNP calling and diversity analysis. Specifically, in this presentation, we provide a brief overview of the GBS approach, describe the gd-GBS procedures, illustrate it with an application to analyze genetic diversity in 20 flax (Linum usitatissimum L.) accessions and discuss related issues in GBS application. Following these lab bench procedures and using the custom bioinformatics pipeline, one could generate genome-wide SNP genotype data for a conventional genetic diversity analysis of a non-model plant species.
Construction of a genetically modified wine yeast strain expressing the Aspergillus aculeatus rhaA gene, encoding an α-L-Rhamnosidase of enological interest

NARCIS (Netherlands)

Manzanares, P.; Orejas, M.; Vicente Gil, J.; Graaff, de L.H.; Visser, J.; Ramon, D.

2003-01-01

The Aspergillus aculeatus rhaA gene encoding an alpha-L-rhamnosidase has been expressed in both laboratory and industrial wine yeast strains. Wines produced in microvinifications, conducted using a combination of the genetically modified industrial strain expressing rhaA and another strain
Utilisation of ISA Reverse Genetics and Large-Scale Random Codon Re-Encoding to Produce Attenuated Strains of Tick-Borne Encephalitis Virus within Days.

Science.gov (United States)

de Fabritus, Lauriane; Nougairède, Antoine; Aubry, Fabien; Gould, Ernest A; de Lamballerie, Xavier

2016-01-01

Large-scale codon re-encoding is a new method of attenuating RNA viruses. However, the use of infectious clones to generate attenuated viruses has inherent technical problems. We previously developed a bacterium-free reverse genetics protocol, designated ISA, and now combined it with large-scale random codon-re-encoding method to produce attenuated tick-borne encephalitis virus (TBEV), a pathogenic flavivirus which causes febrile illness and encephalitis in humans. We produced wild-type (WT) and two re-encoded TBEVs, containing 273 or 273+284 synonymous mutations in the NS5 and NS5+NS3 coding regions respectively. Both re-encoded viruses were attenuated when compared with WT virus using a laboratory mouse model and the relative level of attenuation increased with the degree of re-encoding. Moreover, all infected animals produced neutralizing antibodies. This novel, rapid and efficient approach to engineering attenuated viruses could potentially expedite the development of safe and effective new-generation live attenuated vaccines.
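The idea of random codon re-encoding described above (introducing many synonymous mutations without changing the encoded protein) can be sketched in Python as follows. This is only an illustration under stated assumptions: it uses the standard genetic code, a hypothetical `fraction` parameter for the share of codons to change, and a toy coding sequence. It is not the authors' re-encoding procedure, which the abstract does not specify in detail.

```python
import random
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Group codons by the amino acid (or stop signal) they encode.
SYNONYMS = {}
for codon, aa in CODON_TO_AA.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(cds):
    """Translate an uppercase DNA coding sequence codon by codon."""
    return "".join(CODON_TO_AA[cds[i:i + 3]] for i in range(0, len(cds), 3))


def reencode(cds, fraction, rng):
    """Replace roughly `fraction` of the codons with a different synonymous codon."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        alternatives = [c for c in SYNONYMS[CODON_TO_AA[codon]] if c != codon]
        # Met (ATG) and Trp (TGG) have no synonymous alternatives and stay unchanged.
        if alternatives and rng.random() < fraction:
            codon = rng.choice(alternatives)
        out.append(codon)
    return "".join(out)


def count_mutations(a, b):
    """Number of nucleotide positions that differ between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


original = "ATGGCTCTGAAATGGTAA"  # toy sequence, hypothetical
recoded = reencode(original, 1.0, random.Random(0))
```

Because every substitution is synonymous, `translate(recoded)` equals `translate(original)`, while `count_mutations` reports how many nucleotides changed.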
Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity.

Science.gov (United States)

King, Brian R; Aburdene, Maurice; Thompson, Alex; Warres, Zach

2014-01-01

Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
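The abstract above does not give the exact definition of the ICD transformation, so the following Python sketch only illustrates the general family of alignment-free, Fourier-based comparisons it belongs to. The choices here are assumptions made for illustration: each base is encoded as a 0/1 indicator signal, the power spectrum is sampled at a fixed number of frequencies so that sequences of different lengths are comparable, the signature is the difference between adjacent spectral coefficients, and signatures are compared by Euclidean distance.

```python
import cmath
import math


def spectrum(indicator, n_points):
    """Fourier magnitudes of a 0/1 signal at the frequencies k / n_points."""
    mags = []
    for k in range(n_points // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n_points)
            for t, x in enumerate(indicator)
        )
        mags.append(abs(coeff))
    return mags


def signature(seq, n_points=16):
    """Differences between adjacent power-spectrum coefficients (assumed ICD-like signature)."""
    seq = seq.upper()
    power = [0.0] * (n_points // 2 + 1)
    for base in "ACGT":
        indicator = [1.0 if ch == base else 0.0 for ch in seq]
        for k, mag in enumerate(spectrum(indicator, n_points)):
            # Normalise by length so long and short sequences are on a similar scale.
            power[k] += mag * mag / len(seq)
    return [power[k + 1] - power[k] for k in range(len(power) - 1)]


def distance(sig_a, sig_b):
    """Euclidean distance between two signatures of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(sig_a, sig_b)))


sig_periodic = signature("ACGT" * 4)
sig_blocky = signature("AAAACCCCGGGGTTTT")
```

A matrix of such pairwise distances could then be passed to a standard tree-building method; no alignment step is involved at any point.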
Genetic control of environmental variation of two quantitative traits of Drosophila melanogaster revealed by whole-genome sequencing

DEFF Research Database (Denmark)

Sørensen, Peter; de los Campos, Gustavo; Morgante, Fabio

2015-01-01

and others more volatile performance. Understanding the mechanisms responsible for environmental variability not only informs medical questions but is relevant in evolution and in agricultural science. In this work fully sequenced inbred lines of Drosophila melanogaster were analyzed to study the nature...... of genetic control of environmental variance for two quantitative traits: starvation resistance (SR) and startle response (SL). The evidence for genetic control of environmental variance is compelling for both traits. Sequence information is incorporated in random regression models to study the underlying...... genetic signals, which are shown to be different in the two traits. Genomic variance in sexual dimorphism was found for SR but not for SL. Indeed, the proportion of variance captured by sequence information and the contribution to this variance from four chromosome segments differ between sexes in SR...
Whole-exome sequencing identified a variant in EFTUD2 gene in establishing a genetic diagnosis.

Science.gov (United States)

Rengasamy Venugopalan, S; Farrow, E G; Lypka, M

2017-06-01

Craniofacial anomalies are complex and have overlapping phenotypes. Mandibulofacial Dysostosis and Oculo-Auriculo-Vertebral Spectrum are conditions that share a common craniofacial phenotype and present a challenge in arriving at a diagnosis. In this report, we present the case of a female proband who was given a differential diagnosis of Treacher Collins syndrome or Hemifacial Microsomia without certainty. Prior genetic testing was negative for 22q deletion and FGFR screening. The objective of this study was to demonstrate the critical role of whole-exome sequencing in establishing a genetic diagnosis of the proband. A proband/parent trio, with a 14½-year-old affected female proband, was enrolled in the study. A surgical tissue sample from the proband and parental blood samples were collected and prepared for whole-exome sequencing. An Illumina HiSeq 2500 instrument was used for sequencing (125 nucleotide reads/84X coverage). Analyses of variants were performed using custom-developed software, RUNES and VIKING. Variant analyses following whole-exome sequencing identified a heterozygous de novo pathogenic variant, c.259C>T (p.Gln87*), in the EFTUD2 (NM_004247.3) gene in the proband. Previous studies have reported that variants in the EFTUD2 gene are associated with Mandibulofacial Dysostosis with Microcephaly. Patients with facial asymmetry, micrognathia, choanal atresia and microcephaly should be analyzed for variants in the EFTUD2 gene. Next-generation sequencing techniques, such as whole-exome sequencing, offer great promise to improve the understanding of etiologies of sporadic genetic diseases. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
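The variant notation in the abstract above, c.259C>T (p.Gln87*), can be checked with a little arithmetic: coding position 259 is the first base of codon 87, and changing the first C of either glutamine codon to T produces a stop codon. The short Python sketch below works through this; the helper names are hypothetical, and the reference transcript sequence itself is not used.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
GLN_CODONS = ("CAA", "CAG")  # the two glutamine codons in the standard genetic code


def locate_cds_position(pos):
    """Map a 1-based coding-sequence position to (codon number, position within codon)."""
    if pos < 1:
        raise ValueError("coding positions are 1-based")
    return (pos - 1) // 3 + 1, (pos - 1) % 3 + 1


def substitute(codon, offset, alt):
    """Return the codon with the base at 1-based `offset` replaced by `alt`."""
    return codon[:offset - 1] + alt + codon[offset:]


codon_number, offset = locate_cds_position(259)
mutated = [substitute(codon, offset, "T") for codon in GLN_CODONS]
```

Here `codon_number` is 87 and `offset` is 1, and both mutated codons (TAA and TAG) are stop codons, consistent with the reported nonsense change.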
Bacillus caldolyticus prs gene encoding phosphoribosyldiphosphate synthase

DEFF Research Database (Denmark)

Krath, Britta N.; Hove-Jensen, Bjarne

1996-01-01

The prs gene, encoding phosphoribosyl-diphosphate (PRPP) synthase, as well as the flanking DNA sequences were cloned and sequenced from the Gram-positive thermophile, Bacillus caldolyticus. Comparison with the homologous sequences from the mesophile, Bacillus subtilis, revealed a gene (gca......D) encoding N-acetylglucosamine-1-phosphate uridyltransferase upstream of prs, and a gene homologous to ctc downstream of prs. cDNA synthesis with a B. caldolyticus gcaD-prs-ctc-specified mRNA as template, followed by amplification utilising the polymerase chain reaction indicated that the three genes are co......-transcribed. Comparison of amino acid sequences revealed a high similarity among PRPP synthases across a wide phylogenetic range. An E. coli strain harbouring the B. caldolyticus prs gene in a multicopy plasmid produced PRPP synthase activity 33-fold over the activity of a haploid B. caldolyticus strain. B. caldolyticus...
The Mycobacterium tuberculosis Rv2540c DNA sequence encodes a bifunctional chorismate synthase

Directory of Open Access Journals (Sweden)

Santos Diógenes S

2008-04-01

Background: The emergence of multi- and extensively-drug resistant Mycobacterium tuberculosis strains has created an urgent need for new agents to treat tuberculosis (TB). The enzymes of the shikimate pathway are attractive targets for the development of antitubercular agents because the pathway is essential for M. tuberculosis and is absent from humans. Chorismate synthase (CS) is the seventh enzyme of this route and catalyzes the NADH- and FMN-dependent synthesis of chorismate, a precursor of aromatic amino acids, naphthoquinones, menaquinones, and mycobactins. Although the M. tuberculosis Rv2540c (aroF) sequence has been annotated to encode a chorismate synthase, there has been no report on its correct assignment and functional characterization of its protein product. Results: In the present work, we describe DNA amplification of aroF-encoded CS from M. tuberculosis (MtCS), molecular cloning, protein expression, and purification to homogeneity. N-terminal amino acid sequencing, mass spectrometry and gel filtration chromatography were employed to determine identity, subunit molecular weight and oligomeric state in solution of homogeneous recombinant MtCS. The bifunctionality of MtCS was determined by measurements of both chorismate synthase and NADH:FMN oxidoreductase activities. The flavin reductase activity was characterized, showing the existence of a complex between FMNox and MtCS. FMNox and NADH equilibrium binding was measured. Primary deuterium, solvent and multiple kinetic isotope effects are described and suggest distinct steps for hydride and proton transfers, with the former being more rate-limiting. Conclusion: This is the first report showing that a bacterial CS is bifunctional. Primary deuterium kinetic isotope effects show that C4-proS hydrogen is being transferred during the reduction of FMNox by NADH and that hydride transfer contributes significantly to the rate-limiting step of FMN reduction reaction. Solvent kinetic isotope effects and
Massively parallel sequencing and targeted exomes in familial kidney disease can diagnose underlying genetic disorders.

Science.gov (United States)

Mallett, Andrew J; McCarthy, Hugh J; Ho, Gladys; Holman, Katherine; Farnsworth, Elizabeth; Patel, Chirag; Fletcher, Jeffery T; Mallawaarachchi, Amali; Quinlan, Catherine; Bennetts, Bruce; Alexander, Stephen I

2017-12-01

Inherited kidney disease encompasses a broad range of disorders, with both multiple genes contributing to specific phenotypes and single gene defects having multiple clinical presentations. Advances in sequencing capacity may allow a genetic diagnosis for familial renal disease, by testing the increasing number of known causative genes. However, there has been limited translation of research findings of causative genes into clinical settings. Here, we report the results of a national accredited diagnostic genetic service for familial renal disease. An expert multidisciplinary team developed a targeted exomic sequencing approach with ten curated multigene panels (207 genes) and variant assessment individualized to the patient's phenotype. A genetic diagnosis (pathogenic genetic variant[s]) was identified in 58 of 135 families referred in two years. The genetic diagnosis rate was similar between families with a pediatric versus adult proband (46% vs 40%), although significant differences were found in certain panels such as atypical hemolytic uremic syndrome (88% vs 17%). High diagnostic rates were found for Alport syndrome (22 of 27) and tubular disorders (8 of 10), whereas the monogenic diagnostic rate for congenital anomalies of the kidney and urinary tract was one of 13. Quality reporting was aided by a strong clinical renal and genetic multidisciplinary committee review. Importantly, for a diagnostic service, few variants of uncertain significance were found with this targeted, phenotype-based approach. Thus, use of targeted massively parallel sequencing approaches in inherited kidney disease has a significant capacity to diagnose the underlying genetic disorder across most renal phenotypes. Copyright © 2017 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.
EGVII endoglucanase and nucleic acids encoding the same

Science.gov (United States)

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2009-05-05

The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.

In silico detection of sequence variations modifying transcriptional regulation.

Directory of Open Access Journals (Sweden)

Malin C Andersen

2008-01-01

Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation.
In Silico Detection of Sequence Variations Modifying Transcriptional Regulation

Science.gov (United States)

Andersen, Malin C; Engström, Pär G; Lithwick, Stuart; Arenillas, David; Eriksson, Per; Lenhard, Boris; Wasserman, Wyeth W; Odeberg, Jacob

2008-01-01

Identification of functional genetic variation associated with increased susceptibility to complex diseases can elucidate genes and underlying biochemical mechanisms linked to disease onset and progression. For genes linked to genetic diseases, most identified causal mutations alter an encoded protein sequence. Technological advances for measuring RNA abundance suggest that a significant number of undiscovered causal mutations may alter the regulation of gene transcription. However, it remains a challenge to separate causal genetic variations from linked neutral variations. Here we present an in silico driven approach to identify possible genetic variation in regulatory sequences. The approach combines phylogenetic footprinting and transcription factor binding site prediction to identify variation in candidate cis-regulatory elements. The bioinformatics approach has been tested on a set of SNPs that are reported to have a regulatory function, as well as background SNPs. In the absence of additional information about an analyzed gene, the poor specificity of binding site prediction is prohibitive to its application. However, when additional data is available that can give guidance on which transcription factor is involved in the regulation of the gene, the in silico binding site prediction improves the selection of candidate regulatory polymorphisms for further analyses. The bioinformatics software generated for the analysis has been implemented as a Web-based application system entitled RAVEN (regulatory analysis of variation in enhancers). The RAVEN system is available at http://www.cisreg.ca for all researchers interested in the detection and characterization of regulatory sequence variation. PMID:18208319
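One ingredient of the approach described above, scoring how a SNP changes a predicted transcription factor binding site, can be sketched with a position weight matrix. The Python example below is a simplified illustration and not the RAVEN implementation: the 4-bp frequency matrix is a made-up toy motif, the uniform background of 0.25 is an assumption, and the phylogenetic footprinting step is omitted entirely.

```python
import math

# Toy position frequency matrix for a hypothetical 4-bp motif (consensus AGCT).
TOY_PFM = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
BACKGROUND = 0.25  # assumed uniform base composition


def window_score(window, pfm):
    """Log-odds score (bits) of one motif-length window against the background."""
    return sum(math.log2(col[base] / BACKGROUND) for col, base in zip(pfm, window))


def best_score_over_snp(seq, snp_index, pfm):
    """Best score among all motif-length windows that overlap the SNP position."""
    width = len(pfm)
    first = max(0, snp_index - width + 1)
    last = min(snp_index, len(seq) - width)
    if last < first:
        raise ValueError("sequence is shorter than the motif")
    return max(window_score(seq[s:s + width], pfm) for s in range(first, last + 1))


def allele_scores(seq, snp_index, ref, alt, pfm):
    """Return (reference score, alternate score) for a SNP at 0-based `snp_index`."""
    if seq[snp_index] != ref:
        raise ValueError("reference allele does not match the sequence")
    alt_seq = seq[:snp_index] + alt + seq[snp_index + 1:]
    return best_score_over_snp(seq, snp_index, pfm), best_score_over_snp(alt_seq, snp_index, pfm)


# Hypothetical example: a G>T change inside the AGCT site of a toy sequence.
ref_score, alt_score = allele_scores("TTAGCTTT", 3, "G", "T", TOY_PFM)
```

A large drop from `ref_score` to `alt_score` flags the SNP as a candidate regulatory variant. As the abstract notes, such predictions have poor specificity unless other data point to the relevant transcription factor.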
Encoding asymmetry of the N-glycosylation motif facilitates glycoprotein evolution.

Directory of Open Access Journals (Sweden)

Ryan Williams

Protein N-glycosylation is found in all domains of life and has a conserved role in glycoprotein folding and stability. In animals, glycoproteins transit through the Golgi where the N-glycans are trimmed and rebuilt with sequences that bind lectins, an innovation that greatly increases structural diversity and redundancy of glycoprotein-lectin interaction at the cell surface. Here we ask whether the natural tension between increasing diversity (glycan-protein interactions) and site multiplicity (backup and status quo) might be revealed by a phylogenic examination of glycoproteins and NXS/T (X ≠ P) N-glycosylation sites. Site loss is more likely by mutation at Asn encoded by two adenosine (A)-rich codons, while site gain is more probable by generating Ser or Thr downstream of an existing Asn. Thus mutations produce sites at novel positions more frequently than the reversal of recently lost sites, and therefore more paths through sequence space are made available to natural selection. An intra-species comparison of secretory and cytosolic proteins revealed a departure from equilibrium in sequences one-mutation-away from NXS/T and in (A) content, indicating strong selective pressures and exploration of N-glycosylation positions during vertebrate evolution. Furthermore, secretory proteins have evolved at rates proportional to N-glycosylation site number, indicating adaptive interactions between the N-glycans and underlying protein. Given the topology of the genetic code, mutation of (A) is more often nonsynonymous, and Lys, another target of many PTMs, is also encoded by two (A)-rich codons. An examination of acetyl-Lys sites in proteins indicated similar evolutionary dynamics, consistent with asymmetry of the target and recognition portions of modified sites. Our results suggest that encoding asymmetry is an ancient mechanism of evolvability that increases diversity and experimentation with PTM site positions. Strong selective pressures on PTMs may have
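The N-glycosylation motif discussed above, Asn-X-Ser/Thr with X being any residue except Pro, is simple enough to locate with a regular expression. The Python sketch below is an illustration only; the function name and the toy protein sequence are hypothetical. A lookahead is used so that overlapping motifs are all reported.

```python
import re

# N, then any residue except P, then S or T; the lookahead permits overlapping hits.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return the 1-based positions of the Asn in every N-X-S/T (X != P) motif."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


# Toy sequence (hypothetical): N-G-S at 2, N-P-T at 6 (excluded), N-N-T at 9, N-T-S at 10.
sites = find_sequons("MNGSANPTNNTS")
```

The same scan run on sequences one substitution away from a motif would give a rough view of the site gain and loss paths that the abstract describes.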
pEVL: A Linear Plasmid for Generating mRNA IVT Templates With Extended Encoded Poly(A) Sequences

Directory of Open Access Journals (Sweden)

Alexandra E Grier

2016-01-01

Increasing demand for large-scale synthesis of in vitro transcribed (IVT) mRNA is being driven by the increasing use of mRNA for transient gene expression in cell engineering and therapeutic applications. An important determinant of IVT mRNA potency is the 3′ polyadenosine (poly(A)) tail, the length of which correlates with translational efficiency. However, present methods for generation of IVT mRNA rely on templates derived from circular plasmids or PCR products, in which homopolymeric tracts are unstable, thus limiting encoded poly(A) tail lengths to ≃120 base pairs (bp). Here, we have developed a novel method for generation of extended poly(A) tracts using a previously described linear plasmid system, pJazz. We find that linear plasmids can successfully propagate poly(A) tracts up to ≃500 bp in length for IVT mRNA production. We then modified pJazz by removing extraneous restriction sites, adding a T7 promoter sequence upstream from an extended multiple cloning site, and adding a unique type-IIS restriction site downstream from the encoded poly(A) tract to facilitate generation of IVT mRNA with precisely defined encoded poly(A) tracts and 3′ termini. The resulting plasmid, designated pEVL, can be used to generate IVT mRNA with consistent defined lengths and terminal residue(s).
Molecular genetic analysis of cereal β-amylase genes using exon-primed intron-crossing (EPIC) PCR

Directory of Open Access Journals (Sweden)

Stratula Olga

2014-01-01

The proteins encoded by the cereal β-amylase genes Bamy1 and Bamy2 play an important role in seedling germination and in the brewing process. Here, we use exon-primed intron-crossing (EPIC) PCR to analyse Bamy1 and Bamy2 genetic diversity among 38 accessions belonging to six Poaceae tribes. DNA sequence alignment of β-amylase sequences from multiple Poaceae species allowed the design of EPIC primers that simultaneously amplify Bamy1 and Bamy2 in all the cereal species investigated. The genetic variation observed in the samples investigated is analysed and discussed, and illustrates the effectiveness of this approach for intra- and interspecific analysis in plant species.
Visualization of local Ca2+ dynamics with genetically encoded bioluminescent reporters.

Science.gov (United States)

Rogers, Kelly L; Stinnakre, Jacques; Agulhon, Cendra; Jublot, Delphine; Shorte, Spencer L; Kremer, Eric J; Brûlet, Philippe

2005-02-01

Measurement of local Ca2+ signalling at different developmental stages and/or in specific cell types is important for understanding aspects of brain functioning. The use of light excitation in fluorescence imaging can cause phototoxicity, photobleaching and auto-fluorescence. In contrast, bioluminescence does not require the input of radiative energy and can therefore be measured over long periods, with very high temporal resolution. Aequorin is a genetically encoded Ca2+-sensitive bioluminescent protein; however, its low quantum yield prevents dynamic measurements of Ca2+ responses in single cells. To overcome this limitation, we recently reported the bi-functional Ca2+ reporter gene, GFP-aequorin (GA), which was developed specifically to improve the light output and stability of aequorin chimeras [V. Baubet, et al., (2000) PNAS, 97, 7260-7265]. In the current study, we have genetically targeted GA to different microdomains important in synaptic transmission, including the mitochondrial matrix, endoplasmic reticulum, synaptic vesicles and the postsynaptic density. We demonstrate that these reporters enable 'real-time' measurements of subcellular Ca2+ changes in single mammalian neurons using bioluminescence. The high signal-to-noise ratio of these reporters is also important in that it affords the visualization of Ca2+ dynamics in cell-cell communication in neuronal cultures and tissue slices. Further, we demonstrate the utility of this approach in ex-vivo preparations of mammalian retina, a paradigm in which external light input should be controlled. This represents a novel molecular imaging approach for non-invasive monitoring of local Ca2+ dynamics and cellular communication in tissue or whole animal studies.
Genetic characterization of the Pacific sheath-tailed bat (Emballonura semicaudata rotensis) using mitochondrial DNA sequence data

Science.gov (United States)

Oyler-McCance, Sara J.; Valdez, Ernest W.; O'Shea, Thomas J.; Fike, Jennifer A.

2013-01-01

Emballonura semicaudata occurs in the southwestern Pacific and populations on many islands have declined or disappeared. One subspecies (E. semicaudata rotensis) occurs in the Northern Mariana Islands, where it has been extirpated from all but 1 island (Aguiguan). We assessed genetic similarity between the last population of E. s. rotensis and 2 other subspecies, and examined genetic diversity on Aguiguan. We sampled 12 E. s. rotensis, sequenced them at 3 mitochondrial loci, and compared them with published sequences from 2 other subspecies. All 12 E. s. rotensis had identical sequences in each of the 3 regions. Using cytochrome-b (Cytb) data E. s. rotensis was sister to E. s. palauensis in a clade separate from E. s. semicaudata. 12S ribosomal RNA (12S) sequences grouped all E. s. semicaudata in 1 clade with E. s. rotensis in a clade by itself. Genetic distances among the 3 subspecies at Cytb were smallest between E. s. palauensis and E. s. rotensis. Distance between E. s. semicaudata and the other 2 subspecies was not different from the distance between E. s. semicaudata and the full species E. raffrayana. A similar relationship was found using the 12S data. These distances are larger than those typically reported for mammalian subspecies using Cytb sequence and within the range of sister species.
Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map

Directory of Open Access Journals (Sweden)

Xu Xiangming

2010-12-01

Background: Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method. Results: The strategy was tested on a draft genome of the fungal pathogen Venturia inaequalis, the causal agent of apple scab, and further validated using sequence contigs derived from the diploid plant genome Fragaria vesca. Using our novel method we were able to anchor 70% and 92% of sequence assemblies for V. inaequalis and F. vesca, respectively, to genetic linkage maps. Conclusions: We demonstrated the utility of this approach by accurately determining the bin map positions of the majority of the large sequence contigs from each genome sequence and validated our method by mapping single sequence repeat markers derived from sequence contigs on a full mapping population.
Two siblings with early infantile myoclonic encephalopathy due to mutation in the gene encoding mitochondrial glutamate/H+ symporter SLC25A22.

Science.gov (United States)

Cohen, Rony; Basel-Vanagaite, Lina; Goldberg-Stern, Hadassah; Halevy, Ayelet; Shuper, Avinoam; Feingold-Zadok, Michal; Behar, Doron M; Straussberg, Rachel

2014-11-01

To characterize a new subset of early myoclonic encephalopathy usually associated with metabolic etiologies with a new genetic entity. We describe two siblings with early myoclonic encephalopathy born to consanguineous parents of Arab Muslim origin from Israel. We used homozygosity mapping and candidate gene sequencing to reveal the genetic basis of the myoclonic syndrome. We found a rare missense mutation in the gene encoding one of the two mitochondrial glutamate/H+ symporters, SLC25A22. The phenotype of early myoclonic encephalopathy was first linked to the same mutation in 2005 in patients of the same ethnicity as our family. Owing to the devastating nature of this encephalopathy, we focus attention on its clinical history, epileptic semiology, distinct electroencephalography features, and genetic basis. We provide evidence that an integrated diagnostic strategy combining homozygosity mapping with candidate gene sequencing is efficient in consanguineous families with highly heterogeneous autosomal recessive diseases. Copyright © 2014 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.
Genetics of Infectious and Inflammatory Diseases: Overlapping Discoveries from Association and Exome-Sequencing Studies.

Science.gov (United States)

Langlais, David; Fodil, Nassima; Gros, Philippe

2017-04-26

Genome technologies have defined a complex genetic architecture in major infectious, inflammatory, and autoimmune disorders. High density marker arrays and Immunochips have powered genome-wide association studies (GWAS) that have mapped nearly 450 genetic risk loci in 22 major inflammatory diseases, including a core of common genes that play a central role in pathological inflammation. Whole-exome and whole-genome sequencing have identified more than 265 genes in which mutations cause primary immunodeficiencies and rare forms of severe inflammatory bowel disease. Combined analysis of inflammatory disease GWAS and primary immunodeficiencies points to shared proteins and pathways that are required for immune cell development and protection against infections and are also associated with pathological inflammation. Finally, sequencing of chromatin immunoprecipitates containing specific transcription factors, with parallel RNA sequencing, has charted epigenetic regulation of gene expression by proinflammatory transcription factors in immune cells, providing complementary information to characterize morbid genes at infectious and inflammatory disease loci.
The study of hydrogen peroxide level under cisplatin action using genetically encoded sensor HyPer

Science.gov (United States)

Belova, A. S.; Orlova, A. G.; Maslennikova, A. V.; Brilkina, A. A.; Balalaeva, I. V.; Antonova, N. O.; Mishina, N. M.; Shakhova, N. M.; Belousov, V. V.

2014-03-01

The aim of the work was to study the involvement of hydrogen peroxide in the response of the cervical cancer cell line HeLa Kyoto to cisplatin. Hydrogen peroxide levels were determined using the genetically encoded fluorescent sensor HyPer2. The dependence of cell viability on cisplatin concentration was determined using the MTT assay. Mechanisms of cell death and the HyPer2 response were assessed by flow cytometry after 6 hours of incubation with cisplatin at different concentrations. Cisplatin at low concentrations had no effect on the hydrogen peroxide level in HeLa Kyoto cells. An increase in HyPer2 fluorescence was detected only after exposure to cisplatin at high concentration. The response was not a consequence of cell death.
Intracellular pH imaging in cancer cells in vitro and tumors in vivo using the new genetically encoded sensor SypHer2.

Science.gov (United States)

Shirmanova, Marina V; Druzhkova, Irina N; Lukina, Maria M; Matlashov, Mikhail E; Belousov, Vsevolod V; Snopova, Ludmila B; Prodanetz, Natalia N; Dudenkova, Varvara V; Lukyanov, Sergey A; Zagaynova, Elena V

2015-09-01

Measuring intracellular pH (pHi) in tumors is essential for the monitoring of cancer progression and the response of cancer cells to various treatments. The purpose of the study was to develop a method for pHi mapping in living cancer cells in vitro and in tumors in vivo, using the novel genetically encoded indicator, SypHer2. A HeLa Kyoto cell line stably expressing SypHer2 in the cytoplasm was used, to perform ratiometric (dual excitation) imaging of the probe in cell culture, in 3D tumor spheroids and in tumor xenografts in living mice. Using SypHer2, pHi was demonstrated to be 7.34±0.11 in monolayer HeLa cells in vitro under standard cultivation conditions. An increasing pHi gradient from the center to the periphery of the spheroids was displayed. We obtained fluorescence ratio maps for HeLa tumors in vivo and ex vivo. Comparison of the map with the pathomorphology and with hypoxia staining of the tumors revealed a correspondence of the zones with higher pHi to the necrotic and hypoxic areas. Our results demonstrate that pHi imaging with the genetically encoded pHi indicator, SypHer2, can be a valuable tool for evaluating tumor progression in xenograft models. We have demonstrated, for the first time, the possibility of using the genetically encoded sensor SypHer2 for ratiometric pH imaging in cancer cells in vitro and in tumors in vivo. SypHer2 shows great promise as an instrument for pHi monitoring able to provide high accuracy and spatiotemporal resolution. Copyright © 2015 Elsevier B.V. All rights reserved.
Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats

OpenAIRE

Gymrek, Melissa

2016-01-01

This was presented as a BitesizeBio Webinar entitled "Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats". Accompanying scripts can be accessed on GitHub: https://github.com/mgymrek/mgymrek-bitesizebio-webinar
Embryo genome profiling by single-cell sequencing for preimplantation genetic diagnosis in a β-thalassemia family

DEFF Research Database (Denmark)

Xu, Yanwen; Chen, Shengpei; Yin, Xuyang

2015-01-01

for a β-thalassemia-carrier couple to have a healthy second baby. We carried out sequencing for single blastomere cells and the family trio and further developed the analysis pipeline, including recovery of the missing alleles, removal of the majority of errors, and phasing of the embryonic genome...... leukocyte antigen matching tests. CONCLUSIONS: This retrospective study in a β-thalassemia family demonstrates a method for embryo genome recovery through single-cell sequencing, which permits detection of genetic variations in preimplantation genetic diagnosis. It shows the potential of single...
Whole-genome and Transcriptome Sequencing of Prostate Cancer Identify New Genetic Alterations Driving Disease Progression

DEFF Research Database (Denmark)

Ren, Shancheng; Wei, Gong-Hong; Liu, Dongbing

2018-01-01

BACKGROUND: Global disparities in prostate cancer (PCa) incidence highlight the urgent need to identify genomic abnormalities in prostate tumors in different ethnic populations including Asian men. OBJECTIVE: To systematically explore the genomic complexity and define disease-driven genetic......-scale and comprehensive genomic data of prostate cancer from Asian population. Identification of these genetic alterations may help advance prostate cancer diagnosis, prognosis, and treatment....... alterations in PCa. DESIGN, SETTING, AND PARTICIPANTS: The study sequenced whole-genome and transcriptome of tumor-benign paired tissues from 65 treatment-naive Chinese PCa patients. Subsequent targeted deep sequencing of 293 PCa-relevant genes was performed in another cohort of 145 prostate tumors. OUTCOME...
Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

Science.gov (United States)

Wong, Lai-Ping; Lai, Jason Kuan-Han; Saw, Woei-Yuh; Ong, Rick Twee-Hee; Cheng, Anthony Youzhi; Pillai, Nisha Esakimuthu; Liu, Xuanyao; Xu, Wenting; Chen, Peng; Foo, Jia-Nee; Tan, Linda Wei-Lin; Koo, Seok-Hwee; Soong, Richie; Wenk, Markus Rene; Lim, Wei-Yen; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying

2014-05-01

South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.
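The heterozygous-to-homozygous genotype ratio mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name het_hom_ratio is hypothetical, genotypes are assumed to be diploid VCF-style strings such as "0/1", and the ratio is assumed to be heterozygous calls divided by homozygous non-reference calls, which is one common convention.

```python
def het_hom_ratio(genotypes):
    """Ratio of heterozygous to homozygous non-reference genotype calls.

    `genotypes` is an iterable of diploid VCF-style strings ("0/1", "1|1", ...).
    Missing calls ("./.") and homozygous-reference calls ("0/0") are ignored.
    This definition is an assumption made for illustration.
    """
    het = 0
    hom_alt = 0
    for gt in genotypes:
        alleles = gt.replace("|", "/").split("/")
        if len(alleles) != 2 or "." in alleles:
            continue  # skip missing or non-diploid calls
        if alleles[0] != alleles[1]:
            het += 1
        elif alleles[0] != "0":
            hom_alt += 1
    if hom_alt == 0:
        raise ValueError("no homozygous non-reference genotypes to divide by")
    return het / hom_alt


# Toy calls for one sample: three heterozygous, one homozygous non-reference.
calls = ["0/1", "1/1", "0|1", "0/0", "./.", "1/2"]
print(het_hom_ratio(calls))
```

A higher value for a sample indicates relatively more heterozygous sites, which is the direction of the difference the abstract reports for the SSIP samples.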
Maternal phylogenetic relationships and genetic variation among Arabian horse populations using whole mitochondrial DNA D-loop sequencing.

Science.gov (United States)

Khanshour, Anas M; Cothran, Ernest Gus

2013-09-13

Maternal inheritance is an essential point in Arabian horse population genetics and strains classification. The mitochondrial DNA (mtDNA) sequencing is a highly informative tool to investigate maternal lineages. We sequenced the whole mtDNA D-loop of 251 Arabian horses to study the genetic diversity and phylogenetic relationships of Arabian populations and to examine the traditional strain classification system that depends on maternal family lines using native Arabian horses from the Middle East. The variability in the upstream region of the D-loop revealed additional differences among the haplotypes that had identical sequences in the hypervariable region 1 (HVR1). While the American-Arabians showed relatively low diversity, the Syrian population was the most variable and contained a very rare and old haplogroup. The Middle Eastern horses had major genetic contributions to the Western horses and there was no clear pattern of differentiation among all tested populations. Our results also showed that several individuals from different strains shared a single haplotype, and individuals from a single strain were represented in clearly separated haplogroups. The whole mtDNA D-loop sequence was more powerful for analysis of the maternal genetic diversity in the Arabian horses than using just the HVR1. Native populations from the Middle East, such as Syrians, could be suggested as a hot spot of genetic diversity and may help in understanding the evolution history of the Arabian horse breed. Most importantly, there was no evidence that the Arabian horse breed has clear subdivisions depending on the traditional maternal based strain classification system.
Mapping DNA damage-dependent genetic interactions in yeast via party mating and barcode fusion genetics.

Science.gov (United States)

Díaz-Mejía, J Javier; Celaj, Albi; Mellor, Joseph C; Coté, Atina; Balint, Attila; Ho, Brandon; Bansal, Pritpal; Shaeri, Fatemeh; Gebbia, Marinella; Weile, Jochen; Verby, Marta; Karkhanina, Anna; Zhang, YiFan; Wong, Cassandra; Rich, Justin; Prendergast, D'Arcy; Gupta, Gaurav; Öztürk, Sedide; Durocher, Daniel; Brown, Grant W; Roth, Frederick P

2018-05-28

Condition-dependent genetic interactions can reveal functional relationships between genes that are not evident under standard culture conditions. State-of-the-art yeast genetic interaction mapping, which relies on robotic manipulation of arrays of double-mutant strains, does not scale readily to multi-condition studies. Here, we describe barcode fusion genetics to map genetic interactions (BFG-GI), by which double-mutant strains generated via en masse "party" mating can also be monitored en masse for growth to detect genetic interactions. By using site-specific recombination to fuse two DNA barcodes, each representing a specific gene deletion, BFG-GI enables multiplexed quantitative tracking of double mutants via next-generation sequencing. We applied BFG-GI to a matrix of DNA repair genes under nine different conditions, including methyl methanesulfonate (MMS), 4-nitroquinoline 1-oxide (4NQO), bleomycin, zeocin, and three other DNA-damaging environments. BFG-GI recapitulated known genetic interactions and yielded new condition-dependent genetic interactions. We validated and further explored a subnetwork of condition-dependent genetic interactions involving MAG1, SLX4, and genes encoding the Shu complex, and inferred that loss of the Shu complex leads to an increase in the activation of the checkpoint protein kinase Rad53. © 2018 The Authors. Published under the terms of the CC BY 4.0 license.
Third-Generation Sequencing and Analysis of Four Complete Pig Liver Esterase Gene Sequences in Clones Identified by Screening BAC Library.

Science.gov (United States)

Zhou, Qiongqiong; Sun, Wenjuan; Liu, Xiyan; Wang, Xiliang; Xiao, Yuncai; Bi, Dingren; Yin, Jingdong; Shi, Deshi

2016-01-01

Pig liver carboxylesterase (PLE) gene sequences in GenBank are incomplete, which has led to difficulties in studying the genetic structure and regulation mechanisms of gene expression of PLE family genes. The aim of this study was to obtain and analyse complete gene sequences of the PLE family by screening a Rongchang pig BAC library and by third-generation PacBio gene sequencing. After a number of existing incomplete PLE isoform gene sequences were analysed, primers were designed based on conserved regions in PLE exons, and the whole pig genome was used as a template for polymerase chain reaction (PCR) amplification. Specific primers were then selected based on the PCR amplification results. A three-step PCR screening method was used to identify PLE-positive clones by screening a Rongchang pig BAC library, and PacBio third-generation sequencing was performed. BLAST comparisons and other bioinformatics methods were applied for sequence analysis. Five PLE-positive BAC clones, designated BAC-10, BAC-70, BAC-75, BAC-119 and BAC-206, were identified. Sequence analysis yielded the complete sequences of four PLE genes, PLE1, PLE-B9, PLE-C4, and PLE-G2. Complete PLE gene sequences were defined as those containing regulatory sequences, exons, and introns. It was found that, not only did the PLE exon sequences of the four genes show a high degree of homology, but also that the intron sequences were highly similar. Additionally, the regulatory region of the genes contained two 720-bp reverse complement sequences that may have an important function in the regulation of PLE gene expression. This is the first report to confirm the complete sequences of four PLE genes. In addition, the study demonstrates that each PLE isoform is encoded by a single gene and that the various genes exhibit a high degree of sequence homology, suggesting that the PLE family evolved from a single ancestral gene. Obtaining the complete sequences of these PLE genes provides the necessary foundation for
Genome sequence of Shigella flexneri strain SP1, a diarrheal isolate that encodes an extended-spectrum β-lactamase (ESBL).

Science.gov (United States)

Shen, Ping; Fan, Jianzhong; Guo, Lihua; Li, Jiahua; Li, Ang; Zhang, Jing; Ying, Chaoqun; Ji, Jinru; Xu, Hao; Zheng, Beiwen; Xiao, Yonghong

2017-05-12

Shigellosis is the most common cause of gastrointestinal infections in developing countries. In China, the species most frequently responsible for shigellosis is Shigella flexneri. S. flexneri remains largely unexplored from a genomic standpoint and is still described using a vocabulary based on biochemical and serological properties. Moreover, increasing numbers of ESBL-producing Shigella strains have been isolated from clinical samples. Despite this, only a few cases of ESBL-producing Shigella have been described in China. Therefore, a better understanding of ESBL-producing Shigella from a genomic standpoint is required. In this study, an S. flexneri type 1a isolate SP1 harboring blaCTX-M-14, which was recovered from a patient with diarrhea, was subjected to whole genome sequencing. The draft genome assembly of S. flexneri strain SP1 consisted of 4,592,345 bp with a G+C content of 50.46%. RAST analysis revealed the genome contained 4798 coding sequences (CDSs) and 100 RNA-encoding genes. We detected one incomplete prophage and six candidate CRISPR loci in the genome. In vitro antimicrobial susceptibility testing demonstrated that strain SP1 is resistant to ampicillin, amoxicillin/clavulanic acid, cefazolin, ceftriaxone and trimethoprim. In silico analysis detected genes mediating resistance to aminoglycosides, β-lactams, phenicol, tetracycline, sulphonamides, and trimethoprim. The blaCTX-M-14 gene was located on an IncFII2 plasmid. A series of virulence factors were identified in the genome. In this study, we report the whole genome sequence of a blaCTX-M-14-encoding S. flexneri strain SP1. Dozens of resistance determinants were detected in the genome and may be responsible for the multidrug-resistance of this strain, although further confirmation studies are warranted. Numerous virulence factors identified in the strain suggest that isolate SP1 is potentially pathogenic. The availability of the genome sequence and comparative analysis with other S

Genetic identification of the bacteriocins produced by Enterococcus faecium IT62 and evidence that bacteriocin 32 is identical to enterocin IT.

Science.gov (United States)

Izquierdo, Esther; Cai, Yimin; Marchioni, Eric; Ennahar, Saïd

2009-05-01

Enterococcus faecium IT62, a strain isolated from ryegrass in Japan, produces three bacteriocins (enterocins L50A, L50B, and IT) that have been previously purified and the primary structures of which have been determined by amino acid sequencing (E. Izquierdo, A. Bednarczyk, C. Schaeffer, Y. Cai, E. Marchioni, A. Van Dorsselaer, and S. Ennahar, Antimicrob. Agents Chemother., 52:1917-1923, 2008). Genetic analysis showed that the bacteriocins of E. faecium IT62 are plasmid encoded, but with the structural genes specifying enterocin L50A and enterocin L50B being carried by a plasmid (pTAB1) that is separate from the one (pTIT1) carrying the structural gene of enterocin IT. Sequencing analysis of a 1,475-bp region from pTAB1 identified two consecutive open reading frames corresponding, with the exception of 2 bp, to the genes entL50A and entL50B, encoding EntL50A and EntL50B, respectively. Both bacteriocins are synthesized without N-terminal leader sequences. Genetic analysis of a sequenced 1,380-bp pTIT1 fragment showed that the genes entIT and entIM, encoding enterocin IT and its immunity protein, respectively, were both found in E. faecium VRE200 for bacteriocin 32. Enterocin IT, a 6,390-Da peptide made up of 54 amino acids, has been previously shown to be identical to the C-terminal part of bacteriocin 32, a 7,998-Da bacteriocin produced by E. faecium VRE200 whose structure was deduced from its structural gene (T. Inoue, H. Tomita, and Y. Ike, Antimicrob. Agents Chemother., 50:1202-1212, 2006). By combining the biochemical and genetic data on enterocin IT, it was concluded that bacteriocin 32 is in fact identical to enterocin IT, both being encoded by the same plasmid-borne gene, and that the N-terminal leader peptide for this bacteriocin is 35 amino acids long and not 19 amino acids long as previously reported.
Human cyclophilin B: A second cyclophilin gene encodes a peptidyl-prolyl isomerase with a signal sequence

International Nuclear Information System (INIS)

Price, E.R.; Zydowsky, L.D.; Jin, Mingjie; Baker, C.H.; McKeon, F.D.; Walsh, C.T.

1991-01-01

The authors report the cloning and characterization of a cDNA encoding a second human cyclosporin A-binding protein (hCyPB). Homology analyses reveal that hCyPB is a member of the cyclophilin B (CyPB) family, which includes yeast CyPB, Drosophila nina A, and rat cyclophilin-like protein. This family is distinguished from the cyclophilin A (CyPA) family by the presence of endoplasmic reticulum (ER)-directed signal sequences. hCyPB has a hydrophobic leader sequence not found in hCyPA, and its first 25 amino acids are removed upon expression in Escherichia coli. Moreover, they show that hCyPB is a peptidyl-prolyl cis-trans isomerase which can be inhibited by cyclosporin A. These observations suggest that other members of the CyPB family will have similar enzymatic properties. Sequence comparisons of the CyPB proteins show a central, 165-amino acid peptidyl-prolyl isomerase and cyclosporin A-binding domain, flanked by variable N-terminal and C-terminal domains. These two variable regions may impart compartmental specificity and regulation to this family of cyclophilin proteins containing the conserved core domain. Northern blot analyses show that hCyPB mRNA is expressed in the Jurkat T-cell line, consistent with its possible target role in cyclosporin A-mediated immunosuppression.
Genome survey sequencing and genetic background characterization of Gracilariopsis lemaneiformis (Rhodophyta) based on next-generation sequencing.

Science.gov (United States)

Zhou, Wei; Hu, Yiyi; Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

2013-01-01

Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite its economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with an N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon.
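Two of the assembly statistics quoted in this abstract, the contig N50 and the GC content, can be illustrated with a short Python sketch. This is an illustration under stated assumptions, not the authors' software: the function names n50 and gc_percent are hypothetical, N50 is taken as the length of the contig at which the running total of lengths (sorted longest first) reaches half the assembly size, and GC content is computed over unambiguous A/C/G/T bases only.

```python
def n50(lengths):
    """Return the N50 of a list of contig (or scaffold) lengths."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk from the longest contig down until half of the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def gc_percent(seq):
    """Return the GC content (%) of a sequence, ignoring non-ACGT symbols."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


# Toy data only; these are not values from the study.
print(n50([2, 3, 4, 5, 6]))
print(gc_percent("ATGCGCNN"))
```

In practice the lengths would be read from the assembled FASTA file; the toy lists above only show how the two definitions behave.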
Genome Survey Sequencing and Genetic Background Characterization of Gracilariopsis lemaneiformis (Rhodophyta) Based on Next-Generation Sequencing

Science.gov (United States)

Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

2013-01-01

Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite its economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with an N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon. PMID:23875008
Cloning, expression and characterisation of a novel gene encoding ...

African Journals Online (AJOL)

Microsoft User

2012-01-12

Jan 12, 2012 ... ... characterisation of a novel gene encoding a chemosensory protein from Bemisia ... The genomic DNA sequence comparisons revealed a 1490 bp intron ... have several conserved sequence motifs, including the. N-terminal ...
Insights into the genetic structure and diversity of 38 South Asian Indians from deep whole-genome sequencing.

Directory of Open Access Journals (Sweden)

Lai-Ping Wong

2014-05-01

South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language-speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP.
Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns.

Science.gov (United States)

Ortuño, Francisco M; Valenzuela, Olga; Rojas, Fernando; Pomares, Hector; Florido, Javier P; Urquiza, Jose M; Rojas, Ignacio

2013-09-01

Multiple sequence alignments (MSAs) are widely used approaches in bioinformatics to carry out other tasks such as structure predictions, biological function analyses or phylogenetic modeling. However, current tools usually provide partially optimal alignments, as each one is focused on specific biological features. Thus, the same set of sequences can produce different alignments, above all when sequences are less similar. Consequently, researchers and biologists do not agree about which is the most suitable way to evaluate MSAs. Recent evaluations tend to use more complex scores including further biological features. Among them, 3D structures are increasingly being used to evaluate alignments. Because structures are more conserved in proteins than sequences, scores with structural information are better suited to evaluate more distant relationships between sequences. The proposed multiobjective algorithm, based on the non-dominated sorting genetic algorithm, aims to jointly optimize three objectives: STRIKE score, non-gaps percentage and totally conserved columns. It was significantly assessed on the BAliBASE benchmark according to the Kruskal-Wallis test (P algorithm also outperforms other aligners, such as ClustalW, Multiple Sequence Alignment Genetic Algorithm (MSA-GA), PRRP, DIALIGN, Hidden Markov Model Training (HMMT), Pattern-Induced Multi-sequence Alignment (PIMA), MULTIALIGN, Sequence Alignment Genetic Algorithm (SAGA), PILEUP, Rubber Band Technique Genetic Algorithm (RBT-GA) and Vertical Decomposition Genetic Algorithm (VDGA), according to the Wilcoxon signed-rank test (P 0.05) with the advantage of being able to use fewer structures. Structural information is included within the objective function to evaluate more accurately the obtained alignments. The source code is available at http://www.ugr.es/~fortuno/MOSAStrE/MO-SAStrE.zip.
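Two of the three objectives named in this abstract, the non-gaps percentage and the number of totally conserved columns, are simple to compute for a given alignment. The Python sketch below is an illustration, not the MO-SAStrE source code. The function names are hypothetical, the alignment is assumed to be a list of equal-length strings with "-" as the gap symbol, and a totally conserved column is assumed to be one in which every sequence carries the same residue and no gap. The STRIKE score needs 3D structure data and is not covered.

```python
def non_gap_percentage(alignment, gap="-"):
    """Percentage of alignment cells that hold a residue rather than a gap."""
    cells = sum(len(row) for row in alignment)
    if cells == 0:
        raise ValueError("alignment is empty")
    gaps = sum(row.count(gap) for row in alignment)
    return 100.0 * (cells - gaps) / cells


def totally_conserved_columns(alignment, gap="-"):
    """Number of columns where all sequences share one residue and no gap."""
    if len({len(row) for row in alignment}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return sum(
        1
        for column in zip(*alignment)
        if column[0] != gap and len(set(column)) == 1
    )


# Toy alignment of three sequences over five columns.
msa = ["AC-GT", "ACAGT", "AC-GA"]
print(round(non_gap_percentage(msa), 2))
print(totally_conserved_columns(msa))
```

A multiobjective optimizer of the kind the abstract describes would evaluate functions like these for every candidate alignment and keep the candidates that no other candidate beats on all objectives at once.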
Application of massively parallel sequencing to genetic diagnosis in multiplex families with idiopathic sensorineural hearing impairment.

Directory of Open Access Journals (Sweden)

Chen-Chi Wu

Despite the clinical utility of genetic diagnosis to address idiopathic sensorineural hearing impairment (SNHI), the current strategy for screening mutations via Sanger sequencing suffers from the limitation that only a limited number of DNA fragments associated with common deafness mutations can be genotyped. Consequently, a definitive genetic diagnosis cannot be achieved in many families with discernible family history. To investigate the diagnostic utility of massively parallel sequencing (MPS), we applied the MPS technique to 12 multiplex families with idiopathic SNHI in which common deafness mutations had previously been ruled out. NimbleGen sequence capture array was designed to target all protein coding sequences (CDSs) and 100 bp of the flanking sequence of 80 common deafness genes. We performed MPS on the Illumina HiSeq2000, and applied BWA, SAMtools, Picard, GATK, Variant Tools, ANNOVAR, and IGV for bioinformatics analyses. Initial data filtering with allele frequencies (0.95 prioritized 5 indels (insertions/deletions) and 36 missense variants in the 12 multiplex families. After further validation by Sanger sequencing, segregation pattern, and evolutionary conservation of amino acid residues, we identified 4 variants in 4 different genes, which might lead to SNHI in 4 families compatible with autosomal dominant inheritance. These included GJB2 p.R75Q, MYO7A p.T381M, KCNQ4 p.S680F, and MYH9 p.E1256K. Among them, KCNQ4 p.S680F and MYH9 p.E1256K were novel. In conclusion, MPS allows genetic diagnosis in multiplex families with idiopathic SNHI by detecting mutations in relatively uncommon deafness genes.
Molecular and Genetic Basis of Hereditary Connective-Tissue Diseases Accompanied by Frequent Fractures

Directory of Open Access Journals (Sweden)

G. T. Yakhyaeva

2016-01-01

Frequent bone fractures in infancy require the exclusion of a large number (> 100) of genetic disorders. The modern diagnostic method for hereditary diseases with a debilitating course is next-generation sequencing. The article presents the results of a molecular genetic study conducted in 18 patients with clinical symptoms of connective tissue disorders. Ten patients (56%) had mutations in the genes encoding type I collagen chains, leading to the development of osteogenesis imperfecta; 5 (28%) had mutations in type IV and V collagen genes that are responsible for the development of Ehlers-Danlos syndrome; and 3 (17%) had mutations in the gene encoding the fibrillin-1 protein, deficiency of which manifests as Marfan syndrome. However, a correlation between the patient's phenotype and the mutations discovered in the investigated gene was not established in all cases.
Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.

Science.gov (United States)

Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J

1999-01-01

Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.
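The identity range quoted in this abstract (78.9-98.9%) is a pairwise comparison of aligned amino acid sequences. The following Python sketch shows one way such a figure can be computed. It is an assumption-laden illustration rather than the authors' method: the function name percent_identity is hypothetical, the two sequences are assumed to be already aligned to equal length, and columns with a gap in either sequence are excluded from the denominator, which is only one of several conventions in use.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns in which either sequence has a gap are skipped, so the
    denominator is the number of residue-to-residue columns.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]
    if not pairs:
        raise ValueError("no residue-to-residue columns to compare")
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


# Toy peptides, not UDA sequences: nine of ten positions match.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```

Applying such a function to every pair of deduced isolectin sequences and taking the minimum and maximum would give a range of the kind reported.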
Plasmid-encoded diacetyl (acetoin) reductase in Leuconostoc pseudomesenteroides

DEFF Research Database (Denmark)

Rattray, Fergal P; Myling-Petersen, Dorte; Larsen, Dianna

2003-01-01

A plasmid-borne diacetyl (acetoin) reductase (butA) from Leuconostoc pseudomesenteroides CHCC2114 was sequenced and cloned. Nucleotide sequence analysis revealed an open reading frame encoding a protein of 257 amino acids which had high identity at the amino acid level to diacetyl (acetoin...
Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

International Nuclear Information System (INIS)

Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

1987-01-01

The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO4/PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

Science.gov (United States)

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Genetic analysis of the pelA-pelE cluster encoding the acidic and basic pectate lyases in Erwinia chrysanthemi EC16.

Science.gov (United States)

Barras, F; Chatterjee, A K

1987-10-01

In Erwinia chrysanthemi (EC16) the clustered pelA and pelE genes encode an acidic (pI 4.2) and a basic (pI 10.0) pectate lyase (Pel), respectively. The pelA gene has been isolated on a 1.2 kb restriction fragment and the direction of transcription determined. DNA hybridization analysis showed that the pelE sequence shares DNA homology with pelA but not with pelB or pelC, two genes encoding other Pel species in EC16. Since Pel A and Pel E enzymes showed little similarity in terms of catalytic properties, it is proposed that pelA and pelE are duplicates which have highly diverged.
Mitochondrial DNA markers reveal high genetic diversity but low genetic differentiation in the black fly Simulium tani Takaoka & Davies along an elevational gradient in Malaysia.

Directory of Open Access Journals (Sweden)

Van Lun Low

The population genetic structure of Simulium tani was inferred from mitochondria-encoded sequences of cytochrome c oxidase subunits I (COI) and II (COII) along an elevational gradient in Cameron Highlands, Malaysia. A statistical parsimony network of 71 individuals revealed 71 haplotypes in the COI gene and 43 haplotypes in the COII gene; the concatenated sequences of the COI and COII genes revealed 71 haplotypes. High levels of genetic diversity but low levels of genetic differentiation were observed among populations of S. tani at five elevations. The degree of genetic diversity, however, was not in accordance with an altitudinal gradient, and a Mantel test indicated that elevation did not have a limiting effect on gene flow. No ancestral haplotype of S. tani was found among the populations. Pupae with unique structural characters at the highest elevation showed a tendency to form their own haplotype cluster, as revealed by the COII gene. Tajima's D, Fu's Fs, and mismatch distribution tests revealed population expansion of S. tani in Cameron Highlands. A strong correlation was found between nucleotide diversity and the levels of dissolved oxygen in the streams where S. tani was collected.
Genetic characterization of human herpesvirus type 1: Full-length genome sequence of strain obtained from an encephalitis case from India

Directory of Open Access Journals (Sweden)

Vijay P Bondre

2016-01-01

Interpretation & conclusions: Our results showed that the full-length genome sequence generated from an Indian HSV-1 isolate shared close genetic relationship with the American KOS and Chinese CR38 strains which belonged to the Asian genetic lineage. Recombination analysis of Indian isolate demonstrated multiple recombination crossover points throughout the genome. This full-length genome sequence amplified from the Indian isolate would be helpful to study HSV evolution, genetic basis of differential pathogenesis, host-virus interactions and viral factors contributing towards differential clinical outcome in human infections.
Complete Genome Sequence of Zucchini Yellow Mosaic Virus Strain Kurdistan, Iran.

Science.gov (United States)

Maghamnia, Hamid Reza; Hajizadeh, Mohammad; Azizi, Abdolbaset

2018-03-01

The complete genome sequence of Zucchini yellow mosaic virus strain Kurdistan (ZYMV-Kurdistan) infecting squash from Iran was determined from 13 overlapping fragments. Excluding the poly(A) tail, the ZYMV-Kurdistan genome consisted of 9593 nucleotides (nt), with 138 and 211 nt at the 5' and 3' non-translated regions, respectively. It contained two open-reading frames (ORFs), the large ORF encoding a polyprotein of 3080 amino acids (aa) and the small overlapping ORF encoding a P3N-PIPO protein of 74 aa. This isolate had six unique aa differences compared to other ZYMV isolates and shared 79.6-98.8% identities with other ZYMV genome sequences at the nt level and 90.1-99% identities at the aa level. A phylogenetic tree of ZYMV complete genomic sequences showed that Iranian and Central European isolates are closely related and form a phylogenetically homogeneous group. All values of the ratio of substitution rates at non-synonymous and synonymous sites (dN/dS) were below 1, suggestive of strong negative selection forces during ZYMV protein history. This is the first report of complete genome sequence information of the most prevalent virus in the west of Iran. This study helps our understanding of the genetic diversity of ZYMV isolates infecting cucurbit plants in Iran, virus evolution and epidemiology and can assist in designing better diagnostic tools.
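The identity figures quoted above (for example 79.6-98.8% at the nt level) are pairwise percent identities. As a minimal sketch of how such a figure could be computed from two already-aligned sequences, the hypothetical helper `percent_identity` below counts matching columns and skips gapped ones. The function name and the gap-handling convention are assumptions for illustration; the abstract does not say which software or convention the authors used.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumes the two sequences are already aligned to the same length.
    Skipping gapped columns is only one of several conventions in use.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # ignore columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

For instance, `percent_identity("ACGT", "ACGA")` compares four columns, three of which match.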
Identities among actin-encoding cDNAs of the Nile tilapia (Oreochromis niloticus) and other eukaryote species revealed by nucleotide and amino acid sequence analyses

Directory of Open Access Journals (Sweden)

Andréia B. Poletto

2008-01-01

Actin-encoding cDNAs of Nile tilapia (Oreochromis niloticus) were isolated by RT-PCR using total RNA samples of different tissues and further characterized by nucleotide sequencing and in silico amino acid (aa) sequence analysis. Comparisons among the actin gene sequences of O. niloticus and those of other species showed that the isolated genes are highly similar to the actin genes of other fish and other vertebrates. The highest nucleotide resemblance was observed between the O. niloticus and O. mossambicus α-actin and β-actin genes. Analysis of the predicted aa sequences revealed two distinct types of cytoplasmic actins, one cardiac muscle actin type and one skeletal muscle actin type that were expressed in different tissues of Nile tilapia. The evolutionary relationships between the Nile tilapia actin genes and those of diverse other organisms are discussed.
BAC-end sequence-based SNPs and Bin mapping for rapid integration of physical and genetic maps in apple.

Science.gov (United States)

Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S

2009-03-01

A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct integrated physical and genetic maps for apple.
Homozygosity mapping and targeted Sanger sequencing reveal genetic defects underlying inherited retinal disease in families from Pakistan.

Directory of Open Access Journals (Sweden)

Maleeha Maria

Homozygosity mapping has facilitated the identification of the genetic causes underlying inherited diseases, particularly in consanguineous families with multiple affected individuals. This knowledge has also resulted in a mutation dataset that can be used in a cost- and time-effective manner to screen frequent population-specific genetic variations associated with diseases such as inherited retinal disease (IRD). We genetically screened 13 families from a cohort of 81 Pakistani IRD families diagnosed with Leber congenital amaurosis (LCA), retinitis pigmentosa (RP), congenital stationary night blindness (CSNB), or cone dystrophy (CD). We employed genome-wide single nucleotide polymorphism (SNP) array analysis to identify homozygous regions shared by affected individuals and performed Sanger sequencing of IRD-associated genes located in the sizeable homozygous regions. In addition, based on population-specific mutation data, we performed targeted Sanger sequencing (TSS) of frequent variants in AIPL1, CEP290, CRB1, GUCY2D, LCA5, RPGRIP1 and TULP1 in probands from 28 LCA families. Homozygosity mapping and Sanger sequencing of IRD-associated genes revealed the underlying mutations in 10 families. TSS revealed causative variants in three families. In these 13 families, four novel mutations were identified in CNGA1, CNGB1, GUCY2D, and RPGRIP1. Homozygosity mapping and TSS revealed the underlying genetic cause in 13 IRD families, which is useful for genetic counseling as well as for therapeutic interventions that are likely to become available in the near future.
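The core idea of homozygosity mapping, finding stretches of consecutive SNPs at which all affected relatives are homozygous for the same allele, can be illustrated with a small sketch. Everything here is an assumption for illustration: the `"AA"`/`"AB"`/`"BB"` genotype coding, the function name `shared_homozygous_runs`, and the `min_snps` threshold. Real analyses work on genomic coordinates and thousands of array markers, and the abstract does not describe the software the authors used.

```python
from typing import Dict, List, Tuple


def shared_homozygous_runs(
    genotypes: Dict[str, List[str]],
    affected: List[str],
    min_snps: int = 3,
) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) SNP index ranges in which every affected
    individual is homozygous for the same allele at each consecutive SNP.

    genotypes maps an individual ID to a list of calls coded "AA", "AB" or "BB"
    (a hypothetical coding); all lists are assumed to have the same marker order.
    """
    n_snps = len(genotypes[affected[0]])
    runs: List[Tuple[int, int]] = []
    start = None
    for i in range(n_snps):
        calls = {genotypes[person][i] for person in affected}
        # Shared homozygosity: a single distinct call, and it is AA or BB.
        shared = len(calls) == 1 and next(iter(calls)) in ("AA", "BB")
        if shared and start is None:
            start = i
        elif not shared and start is not None:
            if i - start >= min_snps:
                runs.append((start, i - 1))
            start = None
    # Close a run that extends to the last marker.
    if start is not None and n_snps - start >= min_snps:
        runs.append((start, n_snps - 1))
    return runs
```

Candidate disease genes would then be sought inside the returned intervals, which mirrors the abstract's step of sequencing IRD-associated genes located in the sizeable homozygous regions.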

JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

Science.gov (United States)

Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

2015-01-01

The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter alia in silico prediction of primer specificity and GM target coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patent documentation. Finally, it can help annotate poorly described GM sequences and identify new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/. © The Author(s) 2015. Published by Oxford University Press.
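In silico determination of PCR amplification, as mentioned above, amounts to locating primer binding sites in a template and extracting the sequence between them. The sketch below is a deliberately simplified illustration and not the JRC pipeline: it assumes exact primer matches, searches only the given strand, and uses hypothetical names (`in_silico_pcr`, `max_len`). Production tools typically tolerate mismatches and search both strands.

```python
from typing import List


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def in_silico_pcr(template: str, forward: str, reverse: str,
                  max_len: int = 2000) -> List[str]:
    """Return predicted amplicons for exact primer matches on the top strand.

    forward is given 5'->3' on the top strand; reverse is given 5'->3' on the
    bottom strand, so its binding site on the top strand is its reverse
    complement. max_len caps the amplicon length (an illustrative default).
    """
    template = template.upper()
    fwd = forward.upper()
    rev_site = reverse_complement(reverse)
    amplicons: List[str] = []
    start = template.find(fwd)
    while start != -1:
        end = template.find(rev_site, start + len(fwd))
        while end != -1:
            stop = end + len(rev_site)
            if stop - start > max_len:
                break  # later reverse sites only give longer products
            amplicons.append(template[start:stop])
            end = template.find(rev_site, end + 1)
        start = template.find(fwd, start + 1)
    return amplicons
```

Screening a databank would repeat this search over every record for every reference primer pair and keep the hits as candidate amplicons.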
Genetic diversity based on 28S rDNA sequences among populations of Culex quinquefasciatus collected at different locations in Tamil Nadu, India.

Science.gov (United States)

Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S

2015-09-01

The present study aimed to determine whether genetic variability exists among populations of Culex quinquefasciatus, knowledge of which would be valuable in the management of mosquito control programmes. Populations of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for genetic variation based on nucleotide sequences of the D2 region of 28S rDNA. A high degree of genetic polymorphism was detected in the predicted secondary structures of the D2 region of 28S rDNA in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity among Cx. quinquefasciatus populations collected at different locations of Tamil Nadu, India. This complexity in genetic diversity within a single mosquito species collected at different locations is considered important because of its possible influence on the vector potential of these mosquitoes.
cDNA encoding a polypeptide including a hevein sequence

Energy Technology Data Exchange (ETDEWEB)

Raikhel, N.V.; Broekaert, W.F.; Namhai Chua; Kush, A.

1993-02-16

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1,018 nucleotides long and includes an open reading frame of 204 amino acids.
Current View on Phytoplasma Genomes and Encoded Metabolism

Directory of Open Access Journals (Sweden)

Michael Kube

2012-01-01

Phytoplasmas are specialised bacteria that are obligate parasites of plant phloem tissue and insects. These bacteria have resisted all attempts at cell-free cultivation. Genome research is of particular importance to analyse the genetic endowment of such bacteria. Here we review the gene content of the four completely sequenced ‘Candidatus Phytoplasma’ genomes that include those of ‘Ca. P. asteris’ strains OY-M and AY-WB, ‘Ca. P. australiense,’ and ‘Ca. P. mali’. These genomes are characterized by chromosome condensation resulting in sizes below 900 kb and a G + C content of less than 28%. Evolutionary adaptation of the phytoplasmas to nutrient-rich environments resulted in losses of genetic modules and increased host dependency, highlighted by the transport systems and limited metabolic repertoire. On the other hand, duplication and integration events enlarged the chromosomes and contribute to genome instability. Present differences in the content of membrane and secreted proteins reflect the host adaptation in the phytoplasma strains. General differences are obvious between different phylogenetic subgroups. ‘Ca. P. mali’ is separated from the other strains by its deviating chromosome organization, the genetic repertoire for recombination and excision repair of nucleotides or the loss of the complete energy-yielding part of the glycolysis. Apart from these differences, comparative analysis showed that all four phytoplasmas are likely to encode an alternative pathway to generate pyruvate and ATP.
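The G + C content quoted above is the percentage of guanine and cytosine bases among all called bases in a sequence. A minimal sketch follows; the function name `gc_content` and the choice to ignore ambiguous symbols such as N are illustrative assumptions, not details taken from the review.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T) of seq.

    Ambiguity codes such as N are ignored, which is an assumed convention.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)
```

Applied to a whole chromosome sequence, a value below 28 would match the low G + C content described for the phytoplasma genomes.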
Understanding invasion history and predicting invasive niches using genetic sequencing technology in Australia: case studies from Cucurbitaceae and Boraginaceae.

Science.gov (United States)

Shaik, Razia S; Zhu, Xiaocheng; Clements, David R; Weston, Leslie A

2016-01-01

Part of the challenge in dealing with invasive plant species is that they seldom represent a uniform, static entity. Often, an accurate understanding of the history of plant introduction and knowledge of the real levels of genetic diversity present in species and populations of importance is lacking. Currently, the role of genetic diversity in promoting the successful establishment of invasive plants is not well defined. Genetic profiling of invasive plants should enhance our understanding of the dynamics of colonization in the invaded range. Recent advances in DNA sequencing technology have greatly facilitated the rapid and complete assessment of plant population genetics. Here, we apply our current understanding of the genetics and ecophysiology of plant invasions to recent work on Australian plant invaders from the Cucurbitaceae and Boraginaceae. The Cucurbitaceae study showed that both prickly paddy melon (Cucumis myriocarpus) and camel melon (Citrullus lanatus) were represented by only a single genotype in Australia, implying that each was probably introduced as a single introduction event. In contrast, a third invasive melon, Citrullus colocynthis, possessed a moderate level of genetic diversity in Australia and was potentially introduced to the continent at least twice. The Boraginaceae study demonstrated the value of comparing two similar congeneric species; one, Echium plantagineum, is highly invasive and genetically diverse, whereas the other, Echium vulgare, exhibits less genetic diversity and occupies a more limited ecological niche. Sequence analysis provided precise identification of invasive plant species, as well as information on genetic diversity and phylogeographic history. Improved sequencing technologies will continue to allow greater resolution of genetic relationships among invasive plant populations, thereby potentially improving our ability to predict the impact of these relationships upon future spread and better manage invaders.
Nucleic acid compositions and the encoding proteins

Science.gov (United States)

Preston, III, James F.; Chow, Virginia; Nong, Guang; Rice, John D.; St. John, Franz J.

2014-09-02

The subject invention provides at least one nucleic acid sequence encoding an aldouronate-utilization regulon isolated from Paenibacillus sp. strain JDR-2, a bacterium which efficiently utilizes xylan and metabolizes aldouronates (methylglucuronoxylosaccharides). The subject invention also provides a means for providing a coordinately regulated process in which xylan depolymerization and product assimilation are coupled in Paenibacillus sp. strain JDR-2 to provide a favorable system for the conversion of lignocellulosic biomass to biobased products. Additionally, the nucleic acid sequences encoding the aldouronate-utilization regulon can be used to transform other bacteria to form organisms capable of producing a desired product (e.g., ethanol, 1-butanol, acetoin, 2,3-butanediol, 1,3-propanediol, succinate, lactate, acetate, malate or alanine) from lignocellulosic biomass.
Genetic sequencing for surveillance of drug resistance in tuberculosis in highly endemic countries: a multi-country population-based surveillance study.

Science.gov (United States)

Zignol, Matteo; Cabibbe, Andrea Maurizio; Dean, Anna S; Glaziou, Philippe; Alikhanova, Natavan; Ama, Cecilia; Andres, Sönke; Barbova, Anna; Borbe-Reyes, Angeli; Chin, Daniel P; Cirillo, Daniela Maria; Colvin, Charlotte; Dadu, Andrei; Dreyer, Andries; Driesen, Michèle; Gilpin, Christopher; Hasan, Rumina; Hasan, Zahra; Hoffner, Sven; Hussain, Alamdar; Ismail, Nazir; Kamal, S M Mostofa; Khanzada, Faisal Masood; Kimerling, Michael; Kohl, Thomas Andreas; Mansjö, Mikael; Miotto, Paolo; Mukadi, Ya Diul; Mvusi, Lindiwe; Niemann, Stefan; Omar, Shaheed V; Rigouts, Leen; Schito, Marco; Sela, Ivita; Seyfaddinova, Mehriban; Skenders, Girts; Skrahina, Alena; Tahseen, Sabira; Wells, William A; Zhurilo, Alexander; Weyer, Karin; Floyd, Katherine; Raviglione, Mario C

2018-03-21

In many countries, regular monitoring of the emergence of resistance to anti-tuberculosis drugs is hampered by the limitations of phenotypic testing for drug susceptibility. We therefore evaluated the use of genetic sequencing for surveillance of drug resistance in tuberculosis. Population-level surveys were done in hospitals and clinics in seven countries (Azerbaijan, Bangladesh, Belarus, Pakistan, Philippines, South Africa, and Ukraine) to evaluate the use of genetic sequencing to estimate the resistance of Mycobacterium tuberculosis isolates to rifampicin, isoniazid, ofloxacin, moxifloxacin, pyrazinamide, kanamycin, amikacin, and capreomycin. For each drug, we assessed the accuracy of genetic sequencing by a comparison of the adjusted prevalence of resistance, measured by genetic sequencing, with the true prevalence of resistance, determined by phenotypic testing. Isolates were taken from 7094 patients with tuberculosis who were enrolled in the study between November, 2009, and May, 2014. In all tuberculosis cases, the overall pooled sensitivity values for predicting resistance by genetic sequencing were 91% (95% CI 87-94) for rpoB (rifampicin resistance), 86% (74-93) for katG, inhA, and fabG promoter combined (isoniazid resistance), 54% (39-68) for pncA (pyrazinamide resistance), 85% (77-91) for gyrA and gyrB combined (ofloxacin resistance), and 88% (81-92) for gyrA and gyrB combined (moxifloxacin resistance). For nearly all drugs and in most settings, there was a large overlap in the estimated prevalence of drug resistance by genetic sequencing and the estimated prevalence by phenotypic testing. Genetic sequencing can be a valuable tool for surveillance of drug resistance, providing new opportunities to monitor drug resistance in tuberculosis in resource-poor countries. Before its widespread adoption for surveillance purposes, there is a need to standardise DNA extraction methods, recording and reporting nomenclature, and data interpretation. Bill & Melinda
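The sensitivity values quoted above treat phenotypic testing as the reference: sensitivity is the fraction of phenotypically resistant isolates that sequencing also calls resistant. The sketch below shows that calculation together with a Wilson score interval for a single sample. It is an illustration only: the counts in the usage note are hypothetical, and the study reports pooled estimates across countries, for which a single-sample interval like this is not a substitute.

```python
import math
from typing import Tuple


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of reference-positive (phenotypically resistant) isolates
    that the index test (sequencing) also classifies as resistant."""
    return true_pos / (true_pos + false_neg)


def wilson_interval(successes: int, n: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score confidence interval for a binomial proportion.

    z = 1.96 corresponds to an approximate 95% interval.
    """
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half
```

With hypothetical counts of 91 resistant isolates detected by sequencing out of 100 phenotypically resistant ones, `sensitivity(91, 9)` gives 0.91 and `wilson_interval(91, 100)` gives the accompanying interval.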
MicroRNA-encoding long non-coding RNAs

Directory of Open Access Journals (Sweden)

Zhu Xiaopeng

2008-05-01

Background: Recent analysis of the mouse transcriptional data has revealed the existence of ~34,000 messenger-like non-coding RNAs (ml-ncRNAs). Whereas the functional properties of these ml-ncRNAs are beginning to be unravelled, no functional information is available for the large majority of these transcripts. Results: A few ml-ncRNAs have been shown to have genomic loci that overlap with microRNA loci, leading us to suspect that a fraction of ml-ncRNAs may encode microRNAs. We therefore developed an algorithm (PriMir) for specifically detecting potential microRNA-encoding transcripts in the entire set of 34,030 mouse full-length ml-ncRNAs. In combination with mouse-rat sequence conservation, this algorithm detected 97 strong miRNA-encoding candidates (80 of them novel), and for 52 of these we obtained experimental evidence for the existence of their corresponding mature microRNA by microarray and stem-loop RT-PCR. Sequence analysis of the microRNA-encoding RNAs revealed an internal motif, whose presence correlates strongly (R^2 = 0.9, P-value = 2.2 × 10^-16) with the occurrence of stem-loops with characteristics of known pre-miRNAs, indicating the presence of a larger number of microRNA-encoding RNAs (from 300 up to 800) in the ml-ncRNA population. Conclusion: Our work highlights a unique group of ml-ncRNAs and offers clues to their functions.
Transcriptome Analysis Revealed Highly Expressed Genes Encoding Secondary Metabolite Pathways and Small Cysteine-Rich Proteins in the Sclerotium of Lignosus rhinocerotis.

Directory of Open Access Journals (Sweden)

Hui-Yeng Y Yap

Lignosus rhinocerotis (Cooke) Ryvarden (tiger milk mushroom) has long been known for its nutritional and medicinal benefits among the local communities in Southeast Asia. However, the molecular and genetic basis of its medicinal and nutraceutical properties at the transcriptional level has not been investigated. In this study, the transcriptome of L. rhinocerotis sclerotium, the part with medicinal value, was analyzed using the high-throughput Illumina HiSeq™ platform with good sequencing quality and alignment results. A total of 3,673, 117, and 59,649 events of alternative splicing, novel transcripts, and SNP variation, respectively, were found to enrich its current genome database. A large number of transcripts were expressed and involved in the processing of gene information and carbohydrate metabolism. A few highly expressed genes encoding the cysteine-rich cerato-platanin, hydrophobins, and sugar-binding lectins were identified and their possible roles in L. rhinocerotis were discussed. Genes encoding enzymes involved in the biosynthesis of glucans, six gene clusters encoding four terpene synthases and one each of non-ribosomal peptide synthetase and polyketide synthase, and 109 transcribed cytochrome P450 sequences were also identified in the transcriptome. The data from this study form a valuable foundation for future research in the exploitation of this mushroom in pharmacological and industrial applications.
BGL6 beta-glucosidase and nucleic acids encoding the same

Science.gov (United States)

Dunn-Coleman, Nigel [Los Gatos, CA]; Ward, Michael [San Francisco, CA]

2009-09-01

The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
A strategy for genetic modification of the spike-encoding segment of human reovirus T3D for reovirus targeting.

Science.gov (United States)

van den Wollenberg, D J M; van den Hengel, S K; Dautzenberg, I J C; Cramer, S J; Kranenburg, O; Hoeben, R C

2008-12-01

Human Orthoreovirus Type 3 Dearing is not pathogenic to humans and has been evaluated clinically as an oncolytic agent. Its transduction efficiency and the tumor cell selectivity may be enhanced by incorporating ligands for alternative receptors. However, the genetic modification of reoviruses has been difficult, and genetic targeting of reoviruses has not been reported so far. Here we describe a technique for generating genetically targeted reoviruses. The propagation of wild-type reoviruses on cells expressing a modified sigma 1-encoding segment embedded in a conventional RNA polymerase II transcript leads to substitution of the wild-type genome segment by the modified version. This technique was used for generating reoviruses that are genetically targeted to an artificial receptor expressed on U118MG cells. These cells lack the junction adhesion molecule-1 and therefore resist infection by wild-type reoviruses. The targeted reoviruses were engineered to carry the ligand for this receptor at the C terminus of the sigma 1 spike protein. This demonstrates that the C terminus of the sigma 1 protein is a suitable locale for the insertion of oligopeptide ligands and that targeting of reoviruses is feasible. The genetically targeted viruses can be propagated using the modified U118MG cells as helper cells. This technique may be applicable for the improvement of human reoviruses as oncolytic agents.
Cloning, sequence determination, and expression of the genes encoding the subunits of the nickel-containing 8-hydroxy-5-deazaflavin reducing hydrogenase from Methanobacterium thermoautotrophicum ΔH

International Nuclear Information System (INIS)

Alex, L.A.; Reeve, J.N.; Orme-Johnson, W.H.; Walsh, C.T.

1990-01-01

The genes frhA (1,217 bp), frhB (845 bp), and frhG (710 bp) encoding the three known subunits, α, β, and γ, of the 8-hydroxy-5-deazaflavin (F420) reducing hydrogenase (FRH) from the thermophilic methanogen Methanobacterium thermoautotrophicum ΔH have been cloned, sequenced, and shown to be tightly linked, indicative of a single transcriptional unit. The DNA sequence contains a fourth open reading frame, designated frhD (476 bp), encoding a polypeptide (δ) that does not copurify with the active enzyme. Expression of the frh gene cluster in Escherichia coli shows that four polypeptides are synthesized. When analyzed by SDS-PAGE, the proteins migrate with mobilities consistent with their calculated molecular weights. In order to understand the mechanism of H2 oxidation by this enzyme, localization of redox cofactors (Ni, Fe/S, FAD) to specific subunits and information on their structure is needed. This has been hindered by the refractory nature of the enzyme to the denaturation methods needed to obtain individual subunits with cofactors intact. In this paper the authors discuss the possible localization of the redox cofactors as implicated from the DNA-derived protein sequences of the subunits. The amino acid sequences of the subunits of the FRH are compared with those of other Ni-containing hydrogenases, including the methyl viologen reducing hydrogenase (MVH) of M. thermoautotrophicum ΔH.
An update on the genetics of hyperuricaemia and gout.

Science.gov (United States)

Major, Tanya J; Dalbeth, Nicola; Stahl, Eli A; Merriman, Tony R

2018-06-01

A central aspect of the pathogenesis of gout is elevated urate concentrations, which lead to the formation of monosodium urate crystals. The clinical features of gout result from an individual's immune response to these deposited crystals. Genome-wide association studies (GWAS) have confirmed the importance of urate excretion in the control of serum urate levels and the risk of gout and have identified the kidneys, the gut and the liver as sites of urate regulation. The genetic contribution to the progression from hyperuricaemia to gout remains relatively poorly understood, although genes encoding proteins that are involved in the NLRP3 (NOD-, LRR- and pyrin domain-containing 3) inflammasome pathway play a part. Genome-wide and targeted sequencing is beginning to identify uncommon population-specific variants that are associated with urate levels and gout. Mendelian randomization studies using urate-associated genetic variants as unconfounded surrogates for lifelong urate exposure have not supported claims that urate is causal for metabolic conditions that are comorbidities of hyperuricaemia and gout. Genetic studies have also identified genetic variants that predict responsiveness to therapies (for example, urate-lowering drugs) for treatment of hyperuricaemia. Future research should focus on large GWAS (that include asymptomatic hyperuricaemic individuals) and on increasing the use of whole-genome sequencing data to identify uncommon genetic variants with increased penetrance that might provide opportunities for clinical translation.
Exhaustive search of linear information encoding protein-peptide recognition.

Science.gov (United States)

Kelil, Abdellali; Dubreuil, Benjamin; Levy, Emmanuel D; Michnick, Stephen W

2017-04-01

High-throughput in vitro methods have been extensively applied to identify linear information that encodes peptide recognition. However, these methods are limited in number of peptides, sequence variation, and length of peptides that can be explored, and often produce solutions that are not found in the cell. Despite the large number of methods developed to attempt addressing these issues, the exhaustive search of linear information encoding protein-peptide recognition has been so far physically unfeasible. Here, we describe a strategy, called DALEL, for the exhaustive search of linear sequence information encoded in proteins that bind to a common partner. We applied DALEL to explore binding specificity of SH3 domains in the budding yeast Saccharomyces cerevisiae. Using only the polypeptide sequences of SH3 domain binding proteins, we succeeded in identifying the majority of known SH3 binding sites previously discovered either in vitro or in vivo. Moreover, we discovered a number of sites with both non-canonical sequences and distinct properties that may serve ancillary roles in peptide recognition. We compared DALEL to a variety of state-of-the-art algorithms in the blind identification of known binding sites of the human Grb2 SH3 domain. We also benchmarked DALEL on curated biological motifs derived from the ELM database to evaluate the effect of increasing/decreasing the enrichment of the motifs. Our strategy can be applied in conjunction with experimental data of proteins interacting with a common partner to identify binding sites among them. Yet, our strategy can also be applied to any group of proteins of interest to identify enriched linear motifs or to exhaustively explore the space of linear information encoded in a polypeptide sequence. Finally, we have developed a webserver located at http://michnick.bcm.umontreal.ca/dalel, offering user-friendly interface and providing different scenarios utilizing DALEL.
Genome sequence of a cluster A13 mycobacteriophage detected in Mycobacterium phlei over a half century ago.

Science.gov (United States)

Marton, Szilvia; Fehér, Enikő; Horváth, Balázs; Háber, Katalin; Somogyi, Pál; Minárovits, János; Bányai, Krisztián

2016-01-01

A phage infecting Mycobacterium phlei was isolated in 1958 from a soil sample in Hungary. Some physicochemical and biological properties of the virus were described in independent studies over the years. Here, we report the genome sequence of this early mycobacteriophage isolate. The Phlei phage genome measured 50,418 bp, had a GC content of 60.1 % and was predicted to encode 81 proteins and three tRNAs. Phylogeny of the tape measure protein revealed genetic relatedness to other early isolates of mycobacteriophages within subcluster A2. The genomic organization and genetic relationships to other strains showed that the Phlei phage belongs to a novel genetic cluster, designated A13.
Genetic Barrier to Direct Acting Antivirals in HCV Sequences Deposited in the European Databank.

Directory of Open Access Journals (Sweden)

Dimas Alexandre Kliemann

Development of resistance results from mutations in the viral genome, and the presence of selective drug pressure leads to the emergence of a resistant virus population. The aim of this study was to analyze the impact of genetic variability on the genetic barrier to drug resistance to DAAs. The genetic barrier was quantified based on the number and type of nucleotide mutations required to impart resistance, considering full-length HCV NS3, NS5A and NS5B regions segregated by genotype into subtypes 1a, 1b, 2a, 2b and 3a. This study analyzed 789 NS3 sequences, 708 NS5A sequences and 536 NS5B sequences deposited in the European Hepatitis C Virus Database, in the following resistance-associated positions: NS3: F43/I/L/S/V, Q80K/R, R155K/G, A156G/S/T and D168A/C/E/G/H/N/T/V/Y; NS5A: L/M28A/T/V, Q30E/H/R, L31F/I/M/V, H58D or P58S and Y93C/F/H/N/S; NS5B: S282P/R/T, C316H/N/Y, S368T, Y448C/H, S556G/R, D559R. Variants that require only one transversion in NS3 were found in 4 positions and include F43S, R80K, R155K/G and A156T. The genetic barrier to resistance shows subtypic differences at position 155 of the NS3 gene where a single transition is necessary in subtype 1a. In the NS5A gene, 5 positions where only one nucleotide change can confer resistance were found, such as L31M which requires one transversion in all subtypes, except in 0.28% of 1b sequences; and R30H, generated by a single transition, which was found in 10.25% of the sequences of genotype 1b. Other subtypic differences were observed at position 58, where resistance is less likely in genotype 1a because a transversion is required to create the variant 58S. For the NS5B inhibitors, the genetic barrier at positions conferring resistance was nearly identical in subtypes 1a and 1b, and single transitions or transversions were necessary in 5 positions to generate a drug-resistant variant of HCV. The positions C316Y and S556D required only one transition in all genotypes, Y448H and S556 G
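The record above quantifies the genetic barrier by the number and type of nucleotide changes needed to reach a resistant codon. A minimal sketch of that counting step follows. It assumes a simple position-by-position comparison of two codons, and the example codons are illustrative choices, not values taken from the study. The function names are hypothetical.

```python
# Illustrative sketch: count transitions and transversions between two codons.
# Codons used in the usage note are hypothetical examples, not data from the study.
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_change(a: str, b: str):
    """Return 'transition', 'transversion', or None if the bases are identical."""
    if a == b:
        return None
    pair = {a, b}
    # A change within purines or within pyrimidines is a transition.
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


def codon_changes(wild_type: str, variant: str):
    """Return (transitions, transversions) between two codons of equal length."""
    if len(wild_type) != len(variant):
        raise ValueError("codons must have the same length")
    kinds = [classify_change(a, b) for a, b in zip(wild_type.upper(), variant.upper())]
    return kinds.count("transition"), kinds.count("transversion")
```

For example, `codon_changes("AGG", "AAG")` counts a single G-to-A transition, while `codon_changes("CGG", "AAG")` counts one transition and one transversion, so the second starting codon would present a higher barrier under this simple count.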
Reconstruction of DNA sequences using genetic algorithms and cellular automata: towards mutation prediction?

Science.gov (United States)

Mizas, Ch; Sirakoulis, G Ch; Mardiris, V; Karafyllidis, I; Glykos, N; Sandaltzopoulos, R

2008-04-01

Change of DNA sequence that fuels evolution is, to a certain extent, a deterministic process because mutagenesis does not occur in an absolutely random manner. So far, it has not been possible to decipher the rules that govern DNA sequence evolution due to the extreme complexity of the entire process. In our attempt to approach this issue we focus solely on the mechanisms of mutagenesis and deliberately disregard the role of natural selection. Hence, in this analysis, evolution refers to the accumulation of genetic alterations that originate from mutations and are transmitted through generations without being subjected to natural selection. We have developed a software tool that allows modelling of a DNA sequence as a one-dimensional cellular automaton (CA) with four states per cell which correspond to the four DNA bases, i.e. A, C, T and G. The four states are represented by numbers of the quaternary number system. Moreover, we have developed genetic algorithms (GAs) in order to determine the rules of CA evolution that simulate the DNA evolution process. Linear evolution rules were considered and square matrices were used to represent them. If DNA sequences of different evolution steps are available, our approach allows the determination of the underlying evolution rule(s). Conversely, once the evolution rules are deciphered, our tool may reconstruct the DNA sequence in any previous evolution step for which the exact sequence information was unknown. The developed tool may be used to test various parameters that could influence evolution. We describe a paradigm relying on the assumption that mutagenesis is governed by a near-neighbour-dependent mechanism. Based on the satisfactory performance of our system in the deliberately simplified example, we propose that our approach could offer a starting point for future attempts to understand the mechanisms that govern evolution. The developed software is open-source and has a user-friendly graphical input interface.
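The abstract above models a DNA sequence as a one-dimensional, four-state cellular automaton with linear rules represented by square matrices. The following is only a sketch of that idea under stated assumptions: the base-to-number mapping (A=0, C=1, T=2, G=3), the periodic boundary, and the three-neighbour weights are choices made here for illustration and are not taken from the authors' software.

```python
# Illustrative sketch of a linear 1-D cellular automaton over four states.
# Assumptions (not from the source): A=0, C=1, T=2, G=3; periodic boundary;
# each cell depends only on its left neighbour, itself, and its right neighbour.
BASES = "ACTG"


def encode(seq: str):
    """Map a DNA string to a list of quaternary digits."""
    return [BASES.index(b) for b in seq.upper()]


def decode(states):
    """Map a list of quaternary digits back to a DNA string."""
    return "".join(BASES[s] for s in states)


def rule_matrix(n: int, left: int, centre: int, right: int):
    """Build the n x n matrix of a near-neighbour linear rule (arithmetic mod 4)."""
    m = [[0] * n for _ in range(n)]
    for i in range(n):
        m[i][(i - 1) % n] = (m[i][(i - 1) % n] + left) % 4
        m[i][i] = (m[i][i] + centre) % 4
        m[i][(i + 1) % n] = (m[i][(i + 1) % n] + right) % 4
    return m


def step(matrix, states):
    """Apply one evolution step: matrix-vector product modulo 4."""
    return [sum(w * s for w, s in zip(row, states)) % 4 for row in matrix]
```

With weights `(1, 0, 0)` every cell copies its left neighbour, so `decode(step(rule_matrix(4, 1, 0, 0), encode("ACTG")))` yields `"GACT"`. A search procedure such as a genetic algorithm would then look for the weights that best map one observed sequence onto the next.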
Assessing genetic diversity among Brettanomyces yeasts by DNA fingerprinting and whole-genome sequencing.

Science.gov (United States)

Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A; Verstrepen, Kevin J; Lievens, Bart

2014-07-01

Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Genetic Diversity and Phylogenetic Analysis of the Iranian Leishmania Parasites Based on HSP70 Gene PCR-RFLP and Sequence Analysis.

Science.gov (United States)

Nemati, Sara; Fazaeli, Asghar; Hajjaran, Homa; Khamesipour, Ali; Anbaran, Mohsen Falahati; Bozorgomid, Arezoo; Zarei, Fatah

2017-08-01

Despite the broad distribution of leishmaniasis among Iranians and animals across the country, little is known about the genetic characteristics of the causative agents. Applying both HSP70 PCR-RFLP and sequence analyses, this study aimed to evaluate the genetic diversity and phylogenetic relationships among Leishmania spp. isolated from Iranian endemic foci and available reference strains. A total of 36 Leishmania isolates from almost all districts across the country were genetically analyzed for the HSP70 gene using both PCR-RFLP and sequence analysis. The original HSP70 gene sequences were aligned along with homologous Leishmania sequences retrieved from NCBI, and subjected to the phylogenetic analysis. Basic parameters of genetic diversity were also estimated. The HSP70 PCR-RFLP presented 3 different electrophoretic patterns, with no further intraspecific variation, corresponding to 3 Leishmania species available in the country, L. tropica, L. major, and L. infantum. Phylogenetic analyses presented 5 major clades, corresponding to 5 species complexes. Iranian lineages, including L. major, L. tropica, and L. infantum, were distributed among 3 complexes L. major, L. tropica, and L. donovani. However, within the L. major and L. donovani species complexes, the HSP70 phylogeny was not able to distinguish clearly between the L. major and L. turanica isolates, and between the L. infantum, L. donovani, and L. chagasi isolates, respectively. Our results indicated that both HSP70 PCR-RFLP and sequence analyses are medically applicable tools for identification of Leishmania species in Iranian patients. However, the reduced genetic diversity of the target gene makes it inevitable that its phylogeny only resolves the major groups, namely, the species complexes.
Using high-throughput sequencing to leverage surveillance of genetic diversity and oseltamivir resistance: a pilot study during the 2009 influenza A(H1N1) pandemic.

Directory of Open Access Journals (Sweden)

Juan Téllez-Sosa

BACKGROUND: Influenza viruses display a high mutation rate and complex evolutionary patterns. Next-generation sequencing (NGS) has been widely used for qualitative and semi-quantitative assessment of genetic diversity in complex biological samples. The "deep sequencing" approach, enabled by the enormous throughput of current NGS platforms, allows the identification of rare genetic viral variants in targeted genetic regions, but is usually limited to a small number of samples. METHODOLOGY AND PRINCIPAL FINDINGS: We designed a proof-of-principle study to test whether redistributing sequencing throughput from a high depth-small sample number towards a low depth-large sample number approach is feasible and contributes to influenza epidemiological surveillance. Using 454-Roche sequencing, we sequenced, at a rather low depth, a 307 bp amplicon of the neuraminidase gene of the Influenza A(H1N1) pandemic (A(H1N1)pdm) virus from cDNA amplicons pooled in 48 barcoded libraries obtained from nasal swab samples of infected patients (n = 299) taken from May to November, 2009 pandemic period in Mexico. This approach revealed that during the transition from the first (May-July) to second wave (September-November) of the pandemic, the initial genetic variants were replaced by the N248D mutation in the NA gene, and enabled the establishment of temporal and geographic associations with genetic diversity and the identification of mutations associated with oseltamivir resistance. CONCLUSIONS: NGS sequencing of a short amplicon from the NA gene at low sequencing depth allowed genetic screening of a large number of samples, providing insights to viral genetic diversity dynamics and the identification of genetic variants associated with oseltamivir resistance. Further research is needed to explain the observed replacement of the genetic variants seen during the second wave. As sequencing throughput rises and library multiplexing and automation improves, we foresee that

Genetic diversity analysis of Leuconostoc mesenteroides from Korean vegetables and food products by multilocus sequence typing.

Science.gov (United States)

Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo

2018-06-01

In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged in length from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs), with 25 of them being represented by a single strain, indicating high genetic diversity, whereas the remaining 2 were characterized by five strains each. In total, 220 polymorphic nucleotide sites were detected among seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study signifies that multilocus sequence typing is a valuable tool to assess the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor evolutionary changes.
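In MLST, each strain is described by the allele number found at every locus, and strains sharing the same allelic profile share a sequence type (ST). The sketch below shows that grouping step only. The strain names and allele numbers in the usage note are made up for illustration, and numbering STs by order of first appearance is an assumption rather than the scheme used in the study.

```python
# Illustrative sketch: assign sequence types (STs) from allelic profiles.
# Allele numbers and strain names in the usage note are hypothetical.
from collections import Counter


def assign_sequence_types(profiles):
    """Map each strain to an ST; identical allelic profiles get the same ST.

    `profiles` maps strain name -> sequence of allele numbers, one per locus.
    STs are numbered in order of first appearance (an assumption made here).
    """
    st_of_profile = {}
    st_of_strain = {}
    for strain, profile in profiles.items():
        key = tuple(profile)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_strain[strain] = st_of_profile[key]
    return st_of_strain


def strains_per_st(st_of_strain):
    """Count how many strains fall into each ST."""
    return Counter(st_of_strain.values())
```

For instance, with `{"s1": (1, 1, 1, 1, 1, 1, 1), "s2": (1, 1, 2, 1, 1, 1, 1), "s3": (1, 1, 1, 1, 1, 1, 1)}` strains `s1` and `s3` receive the same ST. The counts reported above are consistent with such a grouping: 25 single-strain STs plus 2 STs of five strains each account for all 35 strains.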
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillus sphaericus

Science.gov (United States)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis of the genome sequence of Lysinibacillus sphaericus (L. sphaericus) to identify the gene encoding gelatinase. L. sphaericus was isolated from soil and is a gelatinase-producing bacterium that is species-specific to porcine and bovine gelatin. This bacterium offers the possibility of producing enzymes specific to each of the two meat species. The main focus of this research is to identify the gelatinase-encoding gene in L. sphaericus using bioinformatics analysis of the partially sequenced genome. Three candidate genes were identified: gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21, which is 1563 base pairs (bp) in size and encodes a sequence of 520 amino acids; gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17, which is 1776 bp in size and encodes a sequence of 591 amino acids; and gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8, which is 1701 bp in size and encodes a sequence of 566 amino acids. Three pairs of oligonucleotide primers, named F1, R1, F2, R2, F3 and R3, were designed to target short sequences of cDNA by PCR. The amplicons reliably yielded products of 1563 bp for candidate gene P1 and 1701 bp for candidate gene P3. Thus, bioinformatics analysis of L. sphaericus identified genes encoding gelatinase.
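The gene and protein lengths reported above can be cross-checked with simple codon arithmetic. The sketch below assumes that each reported gene length is a complete open reading frame that includes one terminal stop codon; that assumption is made here and is not stated in the abstract.

```python
# Illustrative sketch: relate ORF length in base pairs to protein length.
# Assumption: the reported length includes exactly one terminal stop codon.
def protein_length(orf_bp: int) -> int:
    """Return the number of amino acids encoded by an ORF of `orf_bp` base pairs."""
    if orf_bp < 3 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    # Each codon is 3 bp; the final codon is a stop and encodes no residue.
    return orf_bp // 3 - 1


# Candidate genes as reported in the abstract: name -> (bp, amino acids).
CANDIDATES = {"P1": (1563, 520), "P2": (1776, 591), "P3": (1701, 566)}
```

Under that assumption all three reported pairs agree: 1563 bp gives 520 residues, 1776 bp gives 591, and 1701 bp gives 566.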
SIGNIFICANCE OF TARGETED EXOME SEQUENCING AND METHODS OF DATA ANALYSIS IN THE DIAGNOSIS OF GENETIC DISORDERS LEADING TO THE DEVELOPMENT OF EPILEPTIC ENCEPHALOPATHY

Directory of Open Access Journals (Sweden)

Tatyana Victorovna Kozhanova

2017-08-01

Epilepsy is the most common serious neurological disorder, and there is a genetic basis in almost 50% of people with epilepsy. Diagnosing genetic epilepsies makes it possible to establish the cause of a patient's seizures. The last decade has seen tremendous growth in gene sequencing technologies, which has made genetic tests available. The aim is to show the significance of targeted exome sequencing and methods of data analysis in the diagnosis of hereditary syndromes leading to the development of epileptic encephalopathy. We examined 27 patients with early EE (resistant to antiepileptic drugs), psychomotor and speech development delay in the psycho-neurological department. Targeted exome sequencing was performed for patients without a previously identified molecular diagnosis using the 454 Sequencing GS Junior sequencer (Roche) and the Illumina NextSeq 500 platform. As a result of the analysis, specific epilepsy genetic variants were diagnosed in 27 patients. The greatest number of cases was due to mutations in the SCN1A gene (7/27). The structure of mutations for other genes (mutations with a minor allele frequency of less than 0.5%) is presented: ALDH7A1 (n=1), CACNA1C (n=1), CDKL5 (n=1), CNTNAP2 (n=2), DLGAP2 (n=2), DOCK7 (n=2), GRIN2B (n=2), HCN1 (n=1), NRXN1 (n=3), PCDH19 (n=1), RNASEH2B (n=2), SLC2A1 (n=1), UBE3A (n=1). The use of exome sequencing in genetic practice significantly improves the effectiveness of medical genetic counseling, as it makes it possible to diagnose certain variants of genetically heterogeneous groups of diseases with similar clinical manifestations.
Isolation and sequence analysis of the Pseudomonas syringae pv. tomato gene encoding a 2,3-diphosphoglycerate-independent phosphoglyceromutase.

Science.gov (United States)

Morris, V L; Jackson, D P; Grattan, M; Ainsworth, T; Cuppels, D A

1995-01-01

Pseudomonas syringae pv. tomato DC3481, a Tn5-induced mutant of the tomato pathogen DC3000, cannot grow and elicit disease symptoms on tomato seedlings. It also cannot grow on minimal medium containing malate, citrate, or succinate, three of the major organic acids found in tomatoes. We report here that this mutant also cannot use, as a sole carbon and/or energy source, a wide variety of hexoses and intermediates of hexose catabolism. Uptake studies have shown that DC3481 is not deficient in transport. A 3.8-kb EcoRI fragment of DC3000 DNA, which complements the Tn5 mutation, has been cloned and sequenced. The deduced amino acid sequences of two of the three open reading frames (ORFs) present on this fragment, ORF2 and ORF3, had no significant homology with sequences in the GenBank databases. However, the 510-amino-acid sequence of ORF1, the site of the Tn5 insertion, strongly resembled the deduced amino acid sequences of the Bacillus subtilis and Zea mays genes encoding 2,3-diphosphoglycerate (DPG)-independent phosphoglyceromutase (PGM) (52% identity and 72% similarity and 37% identity and 57% similarity, respectively). PGMs not requiring the cofactor DPG are usually found in plants and algae. Enzyme assays confirmed that P. syringae PGM activity required an intact ORF1. Not only is DC3481 the first PGM-deficient pseudomonad mutant to be described, but the P. syringae pgm gene is the first gram-negative bacterial gene identified that appears to code for a DPG-independent PGM. PGM activity appears essential for the growth and pathogenicity of P. syringae pv. tomato on its host plant. PMID:7896694
Isolation and sequence analysis of the Pseudomonas syringae pv. tomato gene encoding a 2,3-diphosphoglycerate-independent phosphoglyceromutase.

Science.gov (United States)

Morris, V L; Jackson, D P; Grattan, M; Ainsworth, T; Cuppels, D A

1995-04-01

Pseudomonas syringae pv. tomato DC3481, a Tn5-induced mutant of the tomato pathogen DC3000, cannot grow and elicit disease symptoms on tomato seedlings. It also cannot grow on minimal medium containing malate, citrate, or succinate, three of the major organic acids found in tomatoes. We report here that this mutant also cannot use, as a sole carbon and/or energy source, a wide variety of hexoses and intermediates of hexose catabolism. Uptake studies have shown that DC3481 is not deficient in transport. A 3.8-kb EcoRI fragment of DC3000 DNA, which complements the Tn5 mutation, has been cloned and sequenced. The deduced amino acid sequences of two of the three open reading frames (ORFs) present on this fragment, ORF2 and ORF3, had no significant homology with sequences in the GenBank databases. However, the 510-amino-acid sequence of ORF1, the site of the Tn5 insertion, strongly resembled the deduced amino acid sequences of the Bacillus subtilis and Zea mays genes encoding 2,3-diphosphoglycerate (DPG)-independent phosphoglyceromutase (PGM) (52% identity and 72% similarity and 37% identity and 57% similarity, respectively). PGMs not requiring the cofactor DPG are usually found in plants and algae. Enzyme assays confirmed that P. syringae PGM activity required an intact ORF1. Not only is DC3481 the first PGM-deficient pseudomonad mutant to be described, but the P. syringae pgm gene is the first gram-negative bacterial gene identified that appears to code for a DPG-independent PGM. PGM activity appears essential for the growth and pathogenicity of P. syringae pv. tomato on its host plant.
Applications of Genetic Programming

DEFF Research Database (Denmark)

Gaunholt, Hans; Toma, Laura

1996-01-01

In this report a study of genetic programming (GP) has been performed with respect to a number of applications such as Symbolic function regression, Solving Symbolic Differential Equations, Image encoding, the ant problem etc.
Cloning and sequence of cDNA encoding 1-aminocyclopropane-1-carboxylate oxidase in Vanda flowers

Directory of Open Access Journals (Sweden)

Pattana Srifah Huehne

2013-08-01

The 1-aminocyclopropane-1-carboxylate oxidase (ACO) gene in the final step of ethylene biosynthesis was isolated from ethylene-sensitive Vanda Miss Joaquim flowers. This consists of 1,242 base pairs (bp) encoding 326 amino acid residues. To investigate the specific divergence in orchid ACO sequences, the deduced Vanda ACO was aligned with five other orchid ACOs. The results reveal that the ACO sequences within Doritaenopsis, Phalaenopsis and Vanda are highly conserved, with almost 95% identity, while the ACOs isolated from Cymbidium, Dendrobium and Cattleya are 87-88% identical to Vanda ACO. In addition, the 2-oxoglutarate-Fe(II)_oxygenase (Oxy) domain of orchid ACOs shows a higher degree of amino acid conservation than the non-haem dioxygenase (DIOX_N) domain. The overall homology regions of Vanda ACO are commonly folded into 12 α-helices and 12 β-sheets similar to the three dimensional template-structure of Petunia ACO. This Vanda ACO cloned gene is highly expressed in flower tissue compared with root and leaf tissues. In particular, there is an abundance of ACO transcript accumulation in the column followed by the lip and the perianth of Vanda Miss Joaquim flowers at the fully-open stage.
Fluorescent protein Dendra2 as a ratiometric genetically encoded pH-sensor.

Science.gov (United States)

Pakhomov, Alexey A; Martynov, Vladimir I; Orsa, Alexander N; Bondarenko, Alena A; Chertkova, Rita V; Lukyanov, Konstantin A; Petrenko, Alexander G; Deyev, Igor E

2017-12-02

Fluorescent protein Dendra2 is a monomeric GFP-like protein that belongs to the group of Kaede-like photoconvertible fluorescent proteins with irreversible photoconversion from a green- to red-emitting state when exposed to violet-blue light. In an acidic environment, photoconverted Dendra2 turns green due to protonation of the phenolic group of the chromophore with pKa of about 7.5. Thus, photoconverted form of Dendra2 can be potentially used as a ratiometric pH-sensor in the physiological pH range. However, incomplete photoconversion makes ratiometric measurements irreproducible when using standard filter sets. Here, we describe the method to detect fluorescence of only photoconverted Dendra2 form, but not nonconverted green Dendra2. We show that the 350 nm excitation light induces solely the fluorescence of photoconverted protein. By measuring the red to green fluorescence ratio, we determined intracellular pH in live CHO and HEK 293 cells. Thus, Dendra2 can be used as a novel ratiometric genetically encoded pH sensor with emission maxima in the green-red spectral region, which is suitable for application in live cells. Copyright © 2017 Elsevier Inc. All rights reserved.
Visualization of Nicotine Adenine Dinucleotide Redox Homeostasis with Genetically Encoded Fluorescent Sensors.

Science.gov (United States)

Zhao, Yuzheng; Zhang, Zhuo; Zou, Yejun; Yang, Yi

2018-01-20

Beyond their roles as redox currency in living organisms, pyridine dinucleotides (NAD+/NADH and NADP+/NADPH) are also precursors or cosubstrates of great significance in various physiologic and pathologic processes. Recent Advances: For many years, it was challenging to develop methodologies for monitoring pyridine dinucleotides in situ or in vivo. Recent advances in fluorescent protein-based sensors provide a rapid, sensitive, specific, and real-time readout of pyridine dinucleotide dynamics in single cells or in vivo, thereby opening a new era of pyridine dinucleotide bioimaging. In this article, we summarize the developments in genetically encoded fluorescent sensors for NAD+/NADH and NADP+/NADPH redox states, as well as their applications in life sciences and drug discovery. The strengths and weaknesses of individual sensors are also discussed. These sensors have the advantages of being specific and organelle targetable, enabling real-time monitoring and subcellular-level quantification of targeted molecules in living cells and in vivo. NAD+/NADH and NADP+/NADPH have distinct functions in metabolic and redox regulation, and thus, a comprehensive evaluation of metabolic and redox states must be multiplexed with a combination of various metabolite sensors in a single cell. Antioxid. Redox Signal. 28, 213-229.
Characterizing ligand-gated ion channel receptors with genetically encoded Ca2+ sensors.

Directory of Open Access Journals (Sweden)

John G Yamauchi

2011-01-01

We present a cell based system and experimental approach to characterize agonist and antagonist selectivity for ligand-gated ion channels (LGIC) by developing sensor cells stably expressing a Ca(2+) permeable LGIC and a genetically encoded Förster (or fluorescence) resonance energy transfer (FRET)-based calcium sensor. In particular, we describe separate lines with human α7 and human α4β2 nicotinic acetylcholine receptors, mouse 5-HT(3A) serotonin receptors and a chimera of human α7/mouse 5-HT(3A) receptors. Complete concentration-response curves for agonists and Schild plots of antagonists were generated from these sensors and the results validate known pharmacology of the receptors tested. Concentration-response relations can be generated from either the initial rate or maximal amplitudes of FRET-signal. Although assaying at a medium throughput level, this pharmacological fluorescence detection technique employs a clonal line for stability and has versatility for screening laboratory generated congeners as agonists or antagonists on multiple subtypes of ligand-gated ion channels. The clonal sensor lines are also compatible with in vivo usage to measure indirectly receptor activation by endogenous neurotransmitters.
Harnessing the sorghum genome sequence: development of a genome-wide microsatellite (SSR) resource for swift genetic mapping and map based cloning in sorghum

Science.gov (United States)

Sorghum is the second cereal crop to have a full genome completely sequenced (Nature (2009), 457:551). This achievement is widely recognized as a scientific milestone for grass genetics and genomics in general. However, the true worth of genetic information lies in translating the sequence informa...
Sequence imputation of HPV16 genomes for genetic association studies.

Directory of Open Access Journals (Sweden)

Benjamin Smith

Human Papillomavirus type 16 (HPV16) causes over half of all cervical cancer and some HPV16 variants are more oncogenic than others. The genetic basis for the extraordinary oncogenic properties of HPV16 compared to other HPVs is unknown. In addition, we neither know which nucleotides vary across and within HPV types and lineages, nor which of the single nucleotide polymorphisms (SNPs) determine oncogenicity. A reference set of 62 HPV16 complete genome sequences was established and used to examine patterns of evolutionary relatedness amongst variants using a pairwise identity heatmap and HPV16 phylogeny. A BLAST-based algorithm was developed to impute complete genome data from partial sequence information using the reference database. To interrogate the oncogenic risk of determined and imputed HPV16 SNPs, odds-ratios for each SNP were calculated in a case-control viral genome-wide association study (VWAS) using biopsy confirmed high-grade cervix neoplasia and self-limited HPV16 infections from Guanacaste, Costa Rica. HPV16 variants display evolutionarily stable lineages that contain conserved diagnostic SNPs. The imputation algorithm indicated that an average of 97.5±1.03% of SNPs could be accurately imputed. The VWAS revealed specific HPV16 viral SNPs associated with variant lineages and elevated odds ratios; however, individual causal SNPs could not be distinguished with certainty due to the nature of HPV evolution. Conserved and lineage-specific SNPs can be imputed with a high degree of accuracy from limited viral polymorphic data due to the lack of recombination and the stochastic mechanism of variation accumulation in the HPV genome. However, to determine the role of novel variants or non-lineage-specific SNPs by VWAS will require direct sequence analysis. The investigation of patterns of genetic variation and the identification of diagnostic SNPs for lineages of HPV16 variants provides a valuable resource for future studies of HPV16
The complete nucleotide sequences of the 5 genetically distinct plastid genomes of Oenothera, subsection Oenothera: II. A microevolutionary view using bioinformatics and formal genetic data.

Science.gov (United States)

Greiner, Stephan; Wang, Xi; Herrmann, Reinhold G; Rauwolf, Uwe; Mayer, Klaus; Haberer, Georg; Meurer, Jörg

2008-09-01

A unique combination of genetic features and a rich stock of information make the flowering plant genus Oenothera an appealing model to explore the molecular basis of speciation processes including nucleus-organelle coevolution. From representative species, we have recently reported complete nucleotide sequences of the 5 basic and genetically distinguishable plastid chromosomes of subsection Oenothera (I-V). In nature, Oenothera plastid genomes are associated with 6 distinct, either homozygous or heterozygous, diploid nuclear genotypes of the 3 basic genomes A, B, or C. Artificially produced plastome-genome combinations that do not occur naturally often display interspecific plastome-genome incompatibility (PGI). In this study, we compare formal genetic data available from all 30 plastome-genome combinations with sequence differences between the plastomes to uncover potential determinants for interspecific PGI. Consistent with an active role in speciation, a remarkable number of genes have high Ka/Ks ratios. Different from the Solanacean cybrid model Atropa/tobacco, RNA editing seems not to be relevant for PGIs in Oenothera. However, predominantly sequence polymorphisms in intergenic segments are proposed as possible sources for PGI. A single locus, the bidirectional promoter region between psbB and clpP, is suggested to contribute to compartmental PGI in the interspecific AB hybrid containing plastome I (AB-I), consistent with its perturbed photosystem II activity.
Assessment of the genetic diversity of tomato yellow leaf curl virus.

Science.gov (United States)

Wan, H J; Yuan, W; Wang, R Q; Ye, Q J; Ruan, M Y; Li, Z M; Zhou, G Z; Yao, Z P; Yang, Y J

2015-01-26

The objective of the present study was to analyze the genetic diversity of tomato yellow leaf curl virus (TYLCV). Representative TYLCV sequences were retrieved from the National Center for Biotechnology Information database. A comprehensive bioinformatic analysis of TYLCV was performed by examining gene structure, sequence alignments, phylogeny, GC content, and homology. Forty-eight representative TYLCV sequences were selected from 48 regions in 29 countries. The results showed that all TYLCV sequences were 2752-2794 nucleotides in length and encoded 6 open reading frames (AV1, AV2, AC1, AC2, AC3, and AC4). GC content ranged from 0.41 to 0.42. Sequence alignment showed a number of insertions and deletions within these TYLCV sequences. The phylogenetic tree divided the sequences into 10 classes; homology of the sequences ranged from 72.8 to 98.6%. All 48 sequences contained the typical structure of TYLCV, including open reading frames and intergenic regions. These results provide a theoretical basis for future work on the identification and evolution of the virus.
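The GC content reported above is a simple per-sequence fraction. A minimal sketch of the calculation follows; the function name `gc_content` and the toy sequence are illustrative assumptions, and the abstract does not say which tool the authors used.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous A/C/G/T bases."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(base in "GC" for base in counted) / len(counted)


# Toy sequence for illustration only (not a TYLCV genome).
example = "ATGCGCATTA"
print(round(gc_content(example), 2))
```

Ambiguity codes such as N are skipped here rather than counted in the denominator; that is a design choice of this sketch, not something stated in the record.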
WONOEP appraisal: new genetic approaches to study epilepsy

Science.gov (United States)

Rossignol, Elsa; Kobow, Katja; Simonato, Michele; Loeb, Jeffrey A.; Grisar, Thierry; Gilby, Krista L.; Vinet, Jonathan; Kadam, Shilpa D.; Becker, Albert J.

2014-01-01

Objective: New genetic investigation techniques, including next-generation sequencing, epigenetic profiling, cell lineage mapping, targeted genetic manipulation of specific neuronal cell types, stem cell reprogramming and optogenetic manipulations within epileptic networks are progressively unravelling the mysteries of epileptogenesis and ictogenesis. These techniques have opened new avenues to discover the molecular basis of epileptogenesis and to study the physiological impacts of mutations in epilepsy-associated genes on a multilayer level, from cells to circuits. Methods: This manuscript reviews recently published applications of these new genetic technologies in the study of epilepsy, as well as work presented by the authors at the genetic session of the XII Workshop on the Neurobiology of Epilepsy in Quebec, Canada. Results: Next-generation sequencing is providing investigators with an unbiased means to assess the molecular causes of sporadic forms of epilepsy and has revealed the complexity and genetic heterogeneity of sporadic epilepsy disorders. To assess the functional impact of mutations in these newly identified genes on specific neuronal cell types during brain development, new modeling strategies in animals, including conditional genetics in mice and in utero knockdown approaches, are enabling functional validation with exquisite cell-type and temporal specificity. In addition, optogenetics, using cell-type-specific Cre recombinase driver lines, is enabling investigators to dissect networks involved in epilepsy. Genetically encoded cell-type labeling is also providing new means to assess the role of the non-neuronal components of epileptic networks such as glial cells. Furthermore, beyond its role in revealing coding variants involved in epileptogenesis, next-generation sequencing can be used to assess the epigenetic modifications that lead to sustained network hyperexcitability in epilepsy, including methylation changes in gene promoters and non
Next generation DNA sequencing technology delivers valuable genetic markers for the genomic orphan legume species, Bituminaria bituminosa

Directory of Open Access Journals (Sweden)

Pazos-Navarro María

2011-12-01

Background: Bituminaria bituminosa is a perennial legume species from the Canary Islands and Mediterranean region that has potential as a drought-tolerant pasture species and as a source of pharmaceutical compounds. Three botanical varieties have previously been identified in this species: albomarginata, bituminosa and crassiuscula. B. bituminosa can be considered a genomic 'orphan' species with very few genomic resources available. New DNA sequencing technologies provide an opportunity to develop high quality molecular markers for such orphan species. Results: 432,306 mRNA molecules were sampled from a leaf transcriptome of a single B. bituminosa plant using Roche 454 pyrosequencing, resulting in an average read length of 345 bp (149.1 Mbp in total). Sequences were assembled into 3,838 isotigs/contigs representing putatively unique gene transcripts. Gene ontology descriptors were identified for 3,419 sequences. Raw sequence reads containing simple sequence repeat (SSR) motifs were identified, and 240 primer pairs flanking these motifs were designed. Of 87 primer pairs developed this way, 75 (86.2%) successfully amplified primarily single fragments by PCR. Fragment analysis using 20 primer pairs in 79 accessions of B. bituminosa detected 130 alleles at 21 SSR loci. Genetic diversity analyses confirmed that variation at these SSR loci accurately reflected known taxonomic relationships in original collections of B. bituminosa and provided additional evidence that a division of the botanical variety bituminosa into two according to geographical origin (Mediterranean region and Canary Islands) may be appropriate. Evidence of cross-pollination was also found between botanical varieties within a B. bituminosa breeding programme. Conclusions: B. bituminosa can no longer be considered a genomic orphan species, having now a large (albeit incomplete) repertoire of expressed gene sequences that can serve as a resource for future genetic studies. This
A ripple-spreading genetic algorithm for the aircraft sequencing problem.

Science.gov (United States)

Hu, Xiao-Bing; Di Paolo, Ezequiel A

2011-01-01

When genetic algorithms (GAs) are applied to combinatorial problems, permutation representations are usually adopted. As a result, such GAs are often confronted with feasibility and memory-efficiency problems. With the aircraft sequencing problem (ASP) as a study case, this paper reports on a novel binary-representation-based GA scheme for combinatorial problems. Unlike existing GAs for the ASP, which typically use permutation representations based on aircraft landing order, the new GA introduces a novel ripple-spreading model which transforms the original landing-order-based ASP solutions into value-based ones. In the new scheme, arriving aircraft are projected as points into an artificial space. A deterministic method inspired by the natural phenomenon of ripple-spreading on liquid surfaces is developed, which uses a few parameters as input to connect points on this space to form a landing sequence. A traditional GA, free of feasibility and memory-efficiency problems, can then be used to evolve the ripple-spreading related parameters in order to find an optimal sequence. Since the ripple-spreading model is the centerpiece of the new algorithm, it is called the ripple-spreading GA (RSGA). The advantages of the proposed RSGA are illustrated by extensive comparative studies for the case of the ASP.
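The idea of decoding a landing sequence from a few real-valued parameters can be illustrated with a deliberately simplified sketch. It assumes a single circular ripple that spreads at constant speed from an epicentre, with aircraft sequenced in the order the ripple reaches their points. The abstract does not give the actual RSGA model, so the function `ripple_order`, the epicentre parameter and the coordinates are hypothetical.

```python
import math


def ripple_order(points, epicentre):
    """Order point indices by when a circular ripple from `epicentre` reaches them.

    Simplified assumption: one ripple, constant speed, so arrival time is
    proportional to Euclidean distance from the epicentre.
    """
    cx, cy = epicentre
    dist = [math.hypot(x - cx, y - cy) for x, y in points]
    # Ties are broken by index so the decoding stays deterministic.
    return sorted(range(len(points)), key=lambda i: (dist[i], i))


# Hypothetical aircraft projected as points in an artificial 2-D space.
aircraft = [(0.0, 0.0), (3.0, 4.0), (1.0, 1.0)]
print(ripple_order(aircraft, epicentre=(0.0, 0.0)))
print(ripple_order(aircraft, epicentre=(3.0, 3.0)))
```

Any real-valued epicentre decodes to a valid permutation of the aircraft, which is why a GA evolving such parameters needs no repair step for infeasible chromosomes.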
Molecular Comparison and Evolutionary Analyses of VP1 Nucleotide Sequences of New African Human Enterovirus 71 Isolates Reveal a Wide Genetic Diversity

Science.gov (United States)

Nougairède, Antoine; Joffret, Marie-Line; Deshpande, Jagadish M.; Dubot-Pérès, Audrey; Héraud, Jean-Michel

2014-01-01

Most circulating strains of Human enterovirus 71 (EV-A71) have been classified primarily into three genogroups (A to C) on the basis of genetic divergence in the 1D gene, which encodes the VP1 capsid protein. The aim of the present study was to provide further insights into the diversity of the EV-A71 genogroups following the recent description of highly divergent isolates, in particular those from African countries, including Madagascar. We classified recent EV-A71 isolates by a large comparison of 3,346 VP1 nucleotide sequences collected from GenBank. Analysis of genetic distances and phylogenetic investigations indicated that some recently reported isolates did not fall into the genogroups A-C and clustered into three additional genogroups, including one Indian genogroup (genogroup D) and 2 African ones (E and F). Our Bayesian phylogenetic analysis provided consistent data showing that the genogroup D isolates share a recent common ancestor with the members of genogroup E, while the isolates of genogroup F evolved from a recent common ancestor shared with the members of the genogroup B. Our results reveal the wide diversity that exists among EV-A71 isolates and suggest that the number of circulating genogroups is probably underestimated, particularly in developing countries where EV-A71 epidemiology has been poorly studied. PMID:24598878
Colorectal Cancer Genetic Heterogeneity Delineated by Multi-Region Sequencing.

Directory of Open Access Journals (Sweden)

You-Wang Lu

Intratumor heterogeneity (ITH) leads to an underestimation of the mutational landscape portrayed by a single needle biopsy and consequently affects treatment precision. The extent of colorectal cancer (CRC) genetic ITH is not well understood in Chinese patients. Thus, we conducted deep sequencing by using the OncoGxOne™ Plus panel, targeting 333 cancer-specific genes in multi-region biopsies of primary and liver metastatic tumors from three Chinese CRC patients. We determined that the extent of ITH varied among the three cases. On average, 65% of all the mutations detected were common within individual tumors. KMT2C aberrations and the NCOR1 mutation were the only ubiquitous events. Subsequent phylogenetic analysis showed that the tumors evolved in a branched manner. Comparison of the primary and metastatic tumors revealed that PPP2R1A (E370X), SETD2 (I1608V), SMAD4 (G382T), and AR splicing site mutations may be specific to liver metastatic cancer. These mutations might contribute to the initiation and progression of distant metastasis. Collectively, our analysis identified a substantial level of genetic ITH in CRC, which should be considered for personalized therapeutic strategies.
Genetic Diversity and Population Structure of F3:6 Nebraska Winter Wheat Genotypes Using Genotyping-By-Sequencing.

Science.gov (United States)

Eltaher, Shamseldeen; Sallam, Ahmed; Belamkar, Vikas; Emara, Hamdy A; Nower, Ahmed A; Salem, Khaled F M; Poland, Jesse; Baenziger, Peter S

2018-01-01

The availability of information on the genetic diversity and population structure in wheat (Triticum aestivum L.) breeding lines will help wheat breeders to better use their genetic resources and manage genetic variation in their breeding program. The recent advances in sequencing technology provide the opportunity to identify tens or hundreds of thousands of single nucleotide polymorphisms (SNPs) in large genome species (e.g., wheat). These SNPs can be utilized for understanding genetic diversity and performing genome-wide association studies (GWAS) for complex traits. In this study, the genetic diversity and population structure were investigated in a set of 230 genotypes (F3:6) derived from various crosses as a prerequisite for GWAS and genomic selection. Genotyping-by-sequencing provided 25,566 high-quality SNPs. The polymorphism information content (PIC) across chromosomes ranged from 0.09 to 0.37 with an average of 0.23. The number of SNP markers on the 21 chromosomes ranged from 319 on chromosome 3D to 2,370 on chromosome 3B. The analysis of population structure revealed three subpopulations (G1, G2, and G3). Analysis of molecular variance attributed 8% of the variance to differences among subpopulations and 92% to differences within them. Of the three subpopulations, G2 had the highest level of genetic diversity based on three genetic diversity indices: Shannon's information index (I) = 0.494, diversity index (h) = 0.328 and unbiased diversity index (uh) = 0.331, while G3 had the lowest level of genetic diversity (I = 0.348, h = 0.226 and uh = 0.236). This high genetic diversity identified among the subpopulations can be used to develop new wheat cultivars.
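The indices named above (PIC, I, h and uh) have standard per-locus textbook definitions, sketched below. The abstract does not state which software or exact estimators were used, so the formulas chosen here (Botstein's PIC, Shannon's index with natural logarithm, Nei's gene diversity, and an n/(n-1) small-sample correction for uh) and the function name `diversity_indices` are assumptions.

```python
import math


def diversity_indices(freqs, n):
    """Per-locus diversity indices from allele frequencies.

    freqs: allele frequencies at one locus (must sum to 1).
    n:     sample size used for the small-sample correction (assumption).
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Shannon's information index: I = -sum(p * ln p)
    shannon = -sum(p * math.log(p) for p in freqs if p > 0)
    # Gene diversity: h = 1 - sum(p^2)
    h = 1.0 - sum(p * p for p in freqs)
    # Unbiased diversity: uh = n / (n - 1) * h
    uh = n / (n - 1) * h
    # PIC = h - sum over allele pairs i<j of 2 * p_i^2 * p_j^2
    cross = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return {"I": shannon, "h": h, "uh": uh, "PIC": h - cross}


# A biallelic SNP with equal allele frequencies gives the maximum PIC of 0.375.
print(diversity_indices([0.5, 0.5], n=230))
```

Under this definition a biallelic marker cannot exceed PIC = 0.375, which is consistent with the upper end of the 0.09 to 0.37 range reported in the record.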

Next Generation Sequencing and ALS: known genes, different phenotyphes.

Science.gov (United States)

Campopiano, Rosa; Ryskalin, Larisa; Giardina, Emiliano; Zampatti, Stefania; Busceti, Carla L; Biagioni, Francesca; Ferese, Rosangela; Storto, Marianna; Gambardella, Stefano; Fornai, Francesco

2017-12-01

Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease clinically characterized by upper and lower motor neuron dysfunction resulting in rapidly progressive paralysis and death from respiratory failure. Most cases appear to be sporadic, but 5-10% of cases have a family history of the disease, and over the last decade, identification of mutations in about 20 genes predisposing to these disorders has provided the means to better understand their pathogenesis. Next-generation sequencing (NGS) is an advanced high-throughput DNA sequencing technology which has rapidly accelerated the discovery of genetic risk factors for both familial and sporadic neurological and neurodegenerative diseases. These strategies have allowed the rapid identification of disease-associated variants and genetic risk factors for both familial (fALS) and sporadic ALS (sALS), strongly contributing to the knowledge of the genetic architecture of ALS. Moreover, as the number of ALS genes grows, many of the proteins they encode are found in intracellular processes shared with other known diseases, suggesting an overlap of clinical and pathological features between different diseases. To emphasize this concept, the review focuses on genes coding for Valosin-containing protein (VCP) and two heterogeneous nuclear RNA-binding proteins (HNRNPA1 and hnRNPA2B1), recently identified through NGS, where different mutations have been associated with both ALS and other neurological and neurodegenerative diseases.
Developing a Genetically Encoded, Cross-Species Biosensor for Detecting Ammonium and Regulating Biosynthesis of Cyanophycin.

Science.gov (United States)

Xiao, Yi; Jiang, Wen; Zhang, Fuzhong

2017-10-20

Responding to nitrogen status is essential for all living organisms. Bacteria have evolved various complex and exquisite regulatory systems to control nitrogen metabolism. However, natural nitrogen regulatory systems, owing to their complexity, often function only in their original hosts and do not respond properly when transferred to another species. By harnessing the Lactococcus GlnRA system, we developed a genetically encoded, cross-species ammonium biosensor that displays a dynamic range up to 9-fold upon detection of ammonium ion. We demonstrated applications of this ammonium biosensor in three different species (Escherichia coli, Pseudomonas putida, and Synechocystis sp.) to detect different nitrogen sources. This ammonium sensor was further used to regulate the biosynthesis of a nitrogen-rich polymer, cyanophycin, based on ammonium concentration. Given the importance of nitrogen responses, the developed biosensor should be broadly applicable to synthetic biology and bioengineering.
Whole-Genome Sequencing and Comparative Genome Analysis Provided Insight into the Predatory Features and Genetic Diversity of Two Bdellovibrio Species Isolated from Soil

Directory of Open Access Journals (Sweden)

Omotayo Opemipo Oyedara

2018-01-01

Bdellovibrio spp. are predatory bacteria with great potential as antimicrobial agents. Studies have shown that members of the genus Bdellovibrio exhibit peculiar characteristics that influence their ecological adaptations. In this study, whole genomes of two different Bdellovibrio spp. designated SKB1291214 and SSB218315 isolated from soil were sequenced. The pangenome analysis identified 795 core genes shared by all the Bdellovibrio spp. considered, including the epibiotic B. exovorus. The number of unique genes identified in Bdellovibrio spp. SKB1291214, SSB218315, W, and B. exovorus JJS was 1343, 113, 857, and 1572, respectively. These unique genes encode hydrolytic, chemotaxis, and transporter proteins which might be useful for predation in the Bdellovibrio strains. Furthermore, the two Bdellovibrio strains exhibited differences based on the % GC content, amino acid identity, and 16S rRNA gene sequence. The 16S rRNA gene sequence of Bdellovibrio sp. SKB1291214 shared 99% identity with that of an uncultured Bdellovibrio sp. clone 12L 106 (a pairwise distance of 0.008) and 95–97% identity (a pairwise distance of 0.043) with that of other culturable terrestrial Bdellovibrio spp., including strain SSB218315. In Bdellovibrio sp. SKB1291214, a 174 bp sequence was inserted at the host interaction (hit) locus region usually attributed to prey attachment, invasion, and development of host-independent Bdellovibrio phenotypes. Also, a gene equivalent to Bd0108 in B. bacteriovorus HD100 was not conserved in Bdellovibrio sp. SKB1291214. The results of this study provided information on the genetic characteristics and diversity of the genus Bdellovibrio that can contribute to their successful applications as a biocontrol agent.
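The core and unique gene counts in a pangenome analysis reduce to set operations once genes have been grouped into families. The sketch below assumes that clustering has already been done; the function `core_and_unique`, the strain labels and the gene-family IDs are hypothetical and are not taken from the study.

```python
def core_and_unique(genomes):
    """Split a pangenome into core genes and strain-specific genes.

    genomes: dict mapping strain name -> set of gene-family IDs.
    Returns (core, unique), where core is shared by every strain and
    unique[name] holds families found in that strain only.
    """
    core = set.intersection(*genomes.values())
    unique = {}
    for name, genes in genomes.items():
        others = set().union(*(g for n, g in genomes.items() if n != name))
        unique[name] = genes - others
    return core, unique


# Toy gene-family assignments for three hypothetical strains.
toy = {
    "strainA": {"g1", "g2", "g3"},
    "strainB": {"g1", "g2", "g4"},
    "strainC": {"g1", "g5"},
}
core, unique = core_and_unique(toy)
print(sorted(core), {k: sorted(v) for k, v in unique.items()})
```

Families shared by some but not all strains (here `g2`) fall into neither category; they form the accessory genome.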
The mitochondrial gene encoding ribosomal protein S12 has been translocated to the nuclear genome in Oenothera.

Science.gov (United States)

Grohmann, L; Brennicke, A; Schuster, W

1992-01-01

The Oenothera mitochondrial genome contains only a gene fragment for ribosomal protein S12 (rps12), while other plants encode a functional gene in the mitochondrion. The complete Oenothera rps12 gene is located in the nucleus. The transit sequence necessary to target this protein to the mitochondrion is encoded by a 5'-extension of the open reading frame. Comparison of the amino acid sequence encoded by the nuclear gene with the polypeptides encoded by edited mitochondrial cDNA and genomic sequences of other plants suggests that gene transfer between mitochondrion and nucleus started from edited mitochondrial RNA molecules. Mechanisms and requirements of gene transfer and activation are discussed. PMID:1454526
A deep auto-encoder model for gene expression prediction.

Science.gov (United States)

Xie, Rui; Wen, Jia; Quitadamo, Andrew; Cheng, Jianlin; Shi, Xinghua

2017-11-17

Gene expression is a key intermediate level through which genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With the aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how well genetic variants contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance. To demonstrate the usage of this model, we apply MLP-SAE to a real genomic dataset with genotypes and gene expression profiles measured in yeast. Our results show that the MLP-SAE model with dropout outperforms other models including Lasso, Random Forests and the MLP-SAE model without dropout. Using the MLP-SAE model with dropout, we show that gene expression quantifications predicted by the model solely based on genotypes align well with true gene expression patterns. We provide a deep auto-encoder model for predicting gene expression from SNP genotypes. This study demonstrates that deep learning is appropriate for tackling another genomic problem, i.e., building predictive models to understand genotypes' contribution to gene expression. With the emerging availability of richer genomic data, we anticipate that deep learning models will play a bigger role in modeling and interpreting genomics.
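Two noise operations underlie the approach described above: input corruption, which makes an auto-encoder "denoising", and dropout on hidden units. The sketch below shows only those two operations in plain Python; it is not the MLP-SAE model. The corruption type (masking noise), the rates, the function names and the toy genotype vector are assumptions, since the abstract does not specify them.

```python
import random


def mask_noise(x, corruption, rng):
    """Denoising-style input corruption: zero each input with probability `corruption`.

    A denoising auto-encoder is trained to reconstruct the clean x from this
    corrupted copy.
    """
    return [0.0 if rng.random() < corruption else v for v in x]


def dropout(h, rate, rng, training=True):
    """Inverted dropout: drop each unit with probability `rate` (rate < 1) and
    rescale survivors by 1 / (1 - rate) so the expected activation is unchanged."""
    if not training or rate == 0.0:
        return list(h)
    keep = 1.0 - rate
    return [v / keep if rng.random() < keep else 0.0 for v in h]


rng = random.Random(0)
genotypes = [0.0, 1.0, 2.0, 1.0, 0.0, 2.0]  # toy SNP genotype coding
print(mask_noise(genotypes, corruption=0.3, rng=rng))
print(dropout([0.5, 1.5, 2.5, 3.5], rate=0.5, rng=rng))
```

At prediction time `training=False` leaves the activations untouched, which is the usual reason for the inverted (rescaling) form.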
Population genetic structure in farm and feral American mink (Neovison vison) inferred from RAD sequencing-generated single nucleotide polymorphisms

DEFF Research Database (Denmark)

Thirstrup, Janne Pia; Ruiz-Gonzalez, Aritz; Pujolar, José Martin

2015-01-01

Feral American mink populations (Neovison vison), derived from mink farms, are widespread in Europe. In this study we investigated genetic diversity and genetic differentiation between feral and farm mink using a panel of genetic markers (194 SNP) generated from RAD sequencing data. Sampling incl...
Analysis of the transcriptome of Erigeron breviscapus uncovers putative scutellarin and chlorogenic acids biosynthetic genes and genetic markers.

Science.gov (United States)

Jiang, Ni-Hao; Zhang, Guang-Hui; Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao

2014-01-01

Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are largely unknown. In addition, genomic information for this herb is unavailable. Using Illumina sequencing on the GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoid and chlorogenic acid biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroxylase, which converts apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were identified, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeats (SSRs) were identified from 9,255 unigenes. Among the SSRs, tri-nucleotide motifs were the most abundant. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Using next generation sequencing (NGS) technology, this study provides abundant genomic data for E. breviscapus for the first time. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, plenty of genetic markers were generated by identification of SSRs, which are a powerful tool for molecular breeding and genetics applications in this herb.
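SSR mining of assembled unigenes amounts to scanning each sequence for short motifs repeated in tandem. The sketch below does this with a regular-expression backreference. The motif lengths (2 to 6 nt), the minimum of four copies, and the function name `find_ssrs` are illustrative assumptions; the abstract does not report the tool or thresholds used.

```python
import re


def find_ssrs(seq, min_repeats=4):
    """Find perfect tandem repeats of 2-6 nt motifs.

    Returns a list of (start, motif, copies). The lazy quantifier tries the
    shortest motif first, so ATATATAT is reported as (AT)4 rather than (ATAT)2.
    Thresholds here are illustrative assumptions.
    """
    pattern = re.compile(r"([ACGT]{2,6}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        if len(set(motif)) == 1:
            continue  # skip homopolymer runs such as AAAAAAAA
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


# Toy sequence: a (AT)4 repeat and a (AGC)5 repeat between short flanks.
toy = "GGC" + "AT" * 4 + "GG" + "AGC" * 5 + "TT"
print(find_ssrs(toy))
```

Primers would then be designed in the flanking sequence on either side of each reported start and end position.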
Genetic alterations of hepatocellular carcinoma by random amplified polymorphic DNA analysis and cloning sequencing of tumor differential DNA fragment

Science.gov (United States)

Xian, Zhi-Hong; Cong, Wen-Ming; Zhang, Shu-Hui; Wu, Meng-Chao

2005-01-01

AIM: To study the genetic alterations and their association with clinicopathological characteristics of hepatocellular carcinoma (HCC), and to find the tumor-related DNA fragments. METHODS: DNA isolated from tumors and corresponding noncancerous liver tissues of 56 HCC patients was amplified by random amplified polymorphic DNA (RAPD) with 10 random 10-mer arbitrary primers. The RAPD bands showing obvious differences between tumor tissue DNA and normal tissue DNA were separated, purified, cloned and sequenced. DNA sequences were analyzed and compared with GenBank data. RESULTS: All 56 cases of HCC were demonstrated to have genetic alterations, which were detected by at least one primer. The detectability of genetic alterations ranged from 20% to 70% in each case, and from 17.9% to 50% for each primer. Serum HBV infection, tumor size, histological grade, tumor capsule, as well as tumor intrahepatic metastasis, might be correlated with genetic alterations on certain primers. A band of approximately 480 bp with higher intensity in tumor DNA than in normal DNA could be seen in 27 of 56 tumor samples using primer 4. Sequence analysis of these fragments showed 91% homology with the Homo sapiens double homeobox protein DUX10 gene. CONCLUSION: Genetic alterations are a frequent event in HCC, and tumor-related DNA fragments have been found in this study, which may be associated with hepatocarcinogenesis. RAPD is an effective method for the identification and analysis of genetic alterations in HCC, and may provide new information for further evaluating the molecular mechanism of hepatocarcinogenesis. PMID:15996039
Implication of lateral genetic transfer in the emergence of Aeromonas hydrophila isolates of epidemic outbreaks in channel catfish.

Directory of Open Access Journals (Sweden)

Mohammad J Hossain

To investigate the molecular basis of the emergence of Aeromonas hydrophila responsible for an epidemic outbreak of motile aeromonad septicemia of catfish in the Southeastern United States, we sequenced 11 A. hydrophila isolates, including five reference and six recent epidemic isolates. Comparative genomics revealed that recent epidemic A. hydrophila isolates are highly clonal, whereas reference isolates are greatly diverse. We identified 55 epidemic-associated genetic regions with 313 predicted genes that are present in epidemic isolates but absent from reference isolates, and 35% of these regions are located within genomic islands, suggesting their acquisition through lateral gene transfer. The epidemic-associated regions encode predicted prophage elements, pathogenicity islands, metabolic islands, fitness islands and genes of unknown functions, and 34 of the genes encoded in these regions were predicted as virulence factors. We found two pilus biogenesis gene clusters encoded within predicted pathogenicity islands. A functional metabolic island that encodes a complete pathway for myo-inositol catabolism was evident from the ability of epidemic A. hydrophila isolates to use myo-inositol as a sole carbon source. Testing of A. hydrophila field isolates found a consistent correlation between myo-inositol utilization as a sole carbon source and the presence of an epidemic-specific genetic marker. All epidemic isolates and one reference isolate shared a novel O-antigen cluster. Altogether we identified four different O-antigen biosynthesis gene clusters within the 11 sequenced A. hydrophila genomes. Our study reveals new insights into the evolutionary changes that have resulted in the emergence of recent epidemic A. hydrophila strains.
Horse cDNA clones encoding two MHC class I genes

Energy Technology Data Exchange (ETDEWEB)

Barbis, D.P.; Maher, J.K.; Stanek, J.; Klaunberg, B.A.; Antczak, D.F.

1994-12-31

Two full-length clones encoding MHC class I genes were isolated by screening a horse cDNA library, using a probe encoding a human HLA-A2.2Y allele. The library was made in the pcDNA1 vector (Invitrogen, San Diego, CA), using mRNA from peripheral blood lymphocytes obtained from a Thoroughbred stallion (No. 0834) homozygous for a common horse MHC haplotype (ELA-A2, -B2, -D2; Antczak et al. 1984; Donaldson et al. 1988). The clones were sequenced, using SP6 and T7 universal primers and horse-specific oligonucleotides designed to extend previously determined sequences.
Cloning of an epoxide hydrolase encoding gene from Rhodotorula mucilaginosa and functional expression in Yarrowia lipolytica

CSIR Research Space (South Africa)

Labuschagne, M

2007-01-01

..., were used to amplify the genomic EH-encoding gene from Rhodotorula mucilaginosa. The 2347 bp genomic sequence revealed a 1979 bp ORF containing nine introns. The cDNA sequence revealed an 1185 bp EH-encoding gene that translates into a 394 amino acid...
Genetic analysis of the Hungarian draft horse population using partial mitochondrial DNA D-loop sequencing

Science.gov (United States)

2018-01-01

Background: The Hungarian draft is a horse breed with a recent mixed ancestry created in the 1920s by crossing local mares with draught horses imported from France and Belgium. The interest in its conservation and characterization has increased over the last few years. The aim of this work is to contribute to the characterization of the endangered Hungarian heavy draft horse populations in order to obtain useful information to implement conservation strategies for these genetic stocks. Methods: To genetically characterize the breed and to set up the basis for a conservation program, in the present study a hypervariable region of the mitochondrial DNA (D-loop) was used to assess genetic diversity in Hungarian draft horses. Two hundred and eighty-five sequences obtained in our laboratory and 419 downloaded sequences available from GenBank were analyzed. Results: One hundred and sixty-four haplotypes and thirty-six polymorphic sites were observed. High haplotype and nucleotide diversity values (Hd = 0.954 ± 0.004; π = 0.028 ± 0.0004) were identified in the Hungarian population, although they were higher within than among the different populations (Hd = 0.972 ± 0.002; π = 0.03097 ± 0.002). Fourteen of the previously observed seventeen haplogroups were detected. Discussion: Our samples showed a large intra- and interbreed variation. There was no clear clustering on the median joining network figure. The overall information collected in this work led us to consider that the genetic scenario observed for the Hungarian draft breed is more likely the result of contributions from 'ancestrally' different genetic backgrounds. This study could contribute to the development of a breeding plan for Hungarian draft horses and help to formulate a genetic conservation plan while avoiding inbreeding. PMID:29404201
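Haplotype diversity (Hd) and nucleotide diversity (π) can be computed directly from a set of aligned sequences. The sketch below uses Nei's standard estimators; the abstract does not name the software or the exact estimators behind its values, so treating them as these formulas is an assumption, and the function names and toy alignment are hypothetical.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's Hd = n / (n - 1) * (1 - sum(p_i^2)), with p_i the haplotype frequencies."""
    n = len(seqs)
    counts = Counter(seqs)
    return n / (n - 1) * (1.0 - sum((c / n) ** 2 for c in counts.values()))


def nucleotide_diversity(seqs):
    """pi = mean proportion of differing sites over all pairs of aligned sequences."""
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)


# Toy alignment of four 4-nt sequences (three distinct haplotypes).
toy = ["ACGT", "ACGT", "ACGA", "TCGA"]
print(round(haplotype_diversity(toy), 4), round(nucleotide_diversity(toy), 4))
```

Hd depends only on how often each distinct haplotype occurs, whereas π also reflects how many sites separate the haplotypes, which is why the two are usually reported together.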
Molecular evolution of the Paramyxoviridae and Rhabdoviridae multiple-protein-encoding P gene.

Science.gov (United States)

Jordan, I K; Sutter, B A; McClure, M A

2000-01-01

Presented here is an analysis of the molecular evolutionary dynamics of the P gene among 76 representative sequences of the Paramyxoviridae and Rhabdoviridae RNA virus families. In a number of Paramyxoviridae taxa, as well as in vesicular stomatitis viruses of the Rhabdoviridae, the P gene encodes multiple proteins from a single genomic RNA sequence. These products include the phosphoprotein (P), as well as the C and V proteins. The complexity of the P gene makes it an intriguing locus to study from an evolutionary perspective. Amino acid sequence alignments of the proteins encoded at the P and N loci were used in independent phylogenetic reconstructions of the Paramyxoviridae and Rhabdoviridae families. P-gene-coding capacities were mapped onto the Paramyxoviridae phylogeny, and the most parsimonious path of multiple-coding-capacity evolution was determined. Levels of amino acid variation for Paramyxoviridae and Rhabdoviridae P-gene-encoded products were also analyzed. Proteins encoded in overlapping reading frames from the same nucleotides have different levels of amino acid variation. The nucleotide architecture that underlies the amino acid variation was determined in order to evaluate the role of selection in the evolution of the P gene overlapping reading frames. In every case, the evolution of one of the proteins encoded in the overlapping reading frames has been constrained by negative selection while the other has evolved more rapidly. The integrity of the overlapping reading frame that represents a derived state is generally maintained at the expense of the ancestral reading frame encoded by the same nucleotides. The evolution of such multicoding sequences is likely a response by RNA viruses to selective pressure to maximize genomic information content while maintaining small genome size. The ability to evolve such a complex genomic strategy is intimately related to the dynamics of the viral quasispecies, which allow enhanced exploration of the adaptive
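The record above notes that proteins encoded in overlapping reading frames from the same nucleotides can show different levels of amino acid variation. A small sketch can make this concrete: the same single-base change may be silent in one frame and amino-acid-changing in the other. The sequence below is a made-up toy, not a P gene sequence, and `translate` is a hypothetical helper built on the standard genetic code.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq, frame=0):
    """Translate a nucleotide sequence starting at offset `frame` (0, 1 or 2)."""
    seq = seq.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


wild_type = "ATGGCCAAATGA"  # toy sequence (assumption)
mutant = "ATGGCAAAATGA"     # single C->A change at 0-based position 5

for name, seq in (("wild type", wild_type), ("mutant", mutant)):
    print(name, "frame 0:", translate(seq, 0), "frame +1:", translate(seq, 1))
```

In this toy case the substitution leaves the frame 0 product unchanged but alters the frame +1 product, which is the kind of asymmetry that lets one overlapping frame be constrained while the other varies.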
Towards PDT with Genetically Encoded Photosensitizer KillerRed: A Comparison of Continuous and Pulsed Laser Regimens in an Animal Tumor Model.

Directory of Open Access Journals (Sweden)

Marina Shirmanova

The strong phototoxicity of the red fluorescent protein KillerRed allows it to be considered as a potential genetically encoded photosensitizer for the photodynamic therapy (PDT) of cancer. The advantages of KillerRed over chemical photosensitizers are its expression in tumor cells transduced with the appropriate gene and direct killing of cells through precise damage to any desired cell compartment. The ability of KillerRed to affect cell division and to induce cell death has already been demonstrated in cancer cell lines in vitro and HeLa tumor xenografts in vivo. However, the further development of this approach for PDT requires optimization of the method of treatment. In this study we tested the continuous wave (593 nm) and pulsed laser (584 nm, 10 Hz, 18 ns) modes to achieve an antitumor effect. The research was implemented on CT26 subcutaneous mouse tumors expressing KillerRed in fusion with histone H2B. The results showed that the pulsed mode provided a higher rate of photobleaching of KillerRed without any temperature increase on the tumor surface. PDT with the continuous wave laser was ineffective against CT26 tumors in mice, whereas the pulsed laser induced pronounced histopathological changes and inhibition of tumor growth. Therefore, we selected an effective regimen for PDT when using the genetically encoded photosensitizer KillerRed and pulsed laser irradiation.
Sequence Analysis of Inducible Prophage phIS3501 Integrated into the Haemolysin II Gene of Bacillus thuringiensis var israelensis ATCC35646

Directory of Open Access Journals (Sweden)

Bouziane Moumen

2012-01-01

Diarrheic food poisoning by bacteria of the Bacillus cereus group is mostly due to several toxins encoded in their genomes. One of them, cytotoxin K, was recently identified as responsible for severe necrotic syndromes. Cytotoxin K is similar to a class of proteins encoded by genes usually annotated as haemolysin II (hlyII) in the majority of genomes of the B. cereus group. The partially sequenced genome of Bacillus thuringiensis var israelensis ATCC35646 contains several potentially inducible prophages, one of them integrated into the hlyII gene. We determined the complete sequence and established the genomic organization of this prophage, designated phIS3501. When excision of this prophage is induced with mitomycin C, an intact hlyII gene is formed, giving the cells the genetic ability to synthesize the active toxin. Therefore, this prophage, upon its excision, can be implicated in the regulation of synthesis of the active toxin and thus in the virulence of the bacterial host. A generality of selection for such systems in bacterial pathogens is indicated by the similarity of this genetic arrangement to that of Staphylococcus aureus β-haemolysin.
A flow cytometric assay technology based on quantum dots-encoded beads

International Nuclear Information System (INIS)

Wang Haiqiao; Liu Tiancai; Cao Yuancheng; Huang Zhenli; Wang Jianhao; Li Xiuqing; Zhao Yuandi

2006-01-01

A flow cytometric detection technology based on quantum dot (QD)-encoded beads is described. Using this technology, several QD-encoded beads with different codes were identified effectively, and the target molecule (a DNA sequence) in solution was also detected accurately by hybridization to its complementary probe sequence on the QD-encoded beads. The technology resolves the encoded beads using two longer-wavelength fluorescence identification signals (the yellow and red fluorescent signals of the QDs), while a third, shorter-wavelength fluorescence signal (the green reporting signal of fluorescein isothiocyanate (FITC)) determines whether probe and target have reacted. Because of the unique optical properties of QDs, only one excitation light source was needed to excite the QDs and the probe dye FITC simultaneously, in contrast to other flow cytometric assay technologies. The results show that this technology has excellent repeatability and good accuracy. After further improvement, it could become a promising multiplexed assay platform in various application fields.
Molecular genetics of the Usher syndrome in Lebanon: identification of 11 novel protein truncating mutations by whole exome sequencing.

Science.gov (United States)

Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima

2014-01-01

Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Whole exome sequencing followed by expanded familial validation by Sanger sequencing. We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes.
Molecular Genetics of the Usher Syndrome in Lebanon: Identification of 11 Novel Protein Truncating Mutations by Whole Exome Sequencing

Science.gov (United States)

Reddy, Ramesh; Fahiminiya, Somayyeh; El Zir, Elie; Mansour, Ahmad; Megarbane, Andre; Majewski, Jacek; Slim, Rima

2014-01-01

Background Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II. Methods Whole exome sequencing followed by expanded familial validation by Sanger sequencing. Results We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98. Conclusion Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes. PMID:25211151

Analysis of genetically modified organisms by pyrosequencing on a portable photodiode-based bioluminescence sequencer.

Science.gov (United States)

Song, Qinxin; Wei, Guijiang; Zhou, Guohua

2014-07-01

A portable bioluminescence analyser for detecting the DNA sequence of genetically modified organisms (GMOs) was developed by using a photodiode (PD) array. Pyrosequencing of eight genes (zSSIIb, Bt11 and Bt176 gene of genetically modified maize; Lectin, 35S-CTP4, CP4EPSPS, CaMV35S promoter and NOS terminator of the genetically modified Roundup Ready soya) was successfully performed with this instrument. The corresponding limit of detection (LOD) was 0.01% with 35 PCR cycles. Maize and soya from three different provenances in China were tested. The results indicate that pyrosequencing with this small detector is a simple, inexpensive, and reliable method for farm or field testing of GMOs. Copyright © 2014 Elsevier Ltd. All rights reserved.
Modulating the Voltage-sensitivity of a Genetically Encoded Voltage Indicator.

Science.gov (United States)

Jung, Arong; Rajakumar, Dhanarajan; Yoon, Bong-June; Baker, Bradley J

2017-10-01

Saturation mutagenesis was performed on a single position in the voltage-sensing domain (VSD) of a genetically encoded voltage indicator (GEVI). The VSD consists of four transmembrane helices designated S1-S4. The V220 position located near the plasma membrane/extracellular interface had previously been shown to affect the voltage range of the optical signal. Introduction of polar amino acids at this position reduced the voltage-dependent optical signal of the GEVI. Negatively charged amino acids reduced the optical signal slightly, by 33%, while positively charged amino acids at this position reduced the optical signal by 80%. Surprisingly, the range of V220D was similar to that of V220K, with optical responses shifted towards negative potentials. In contrast, the V220E mutant mirrored the responses of the V220R mutation, suggesting that the length of the side chain plays a role in determining the voltage range of the GEVI. Charged mutations at the 219 position all behaved similarly, slightly shifting the optical response to more negative potentials. Charged mutations at the 221 position behaved erratically, suggesting interactions with the plasma membrane and/or other amino acids in the VSD. Introduction of bulky amino acids at the V220 position increased the range of the optical response to include hyperpolarizing signals. Combining the V220W mutant with the R217Q mutation resulted in a probe that reduced the depolarizing signal and enhanced the hyperpolarizing signal, which may lead to GEVIs that only report neuronal inhibition.
Genetic diversity of Taenia asiatica from Thailand and other geographical locations as revealed by cytochrome c oxidase subunit 1 sequences.

Science.gov (United States)

Anantaphruti, Malinee Thairungroj; Thaenkham, Urusa; Watthanakulpanich, Dorn; Phuphisut, Orawan; Maipanich, Wanna; Yoonuan, Tippayarat; Nuamtanong, Supaporn; Pubampen, Somjit; Sanguankiat, Surapol

2013-02-01

Twelve 924 bp cytochrome c oxidase subunit 1 (cox1) mitochondrial DNA sequences from Taenia asiatica isolates from Thailand were aligned and compared with multiple sequence isolates from Thailand and 6 other countries from the GenBank database. The genetic divergence of T. asiatica was also compared with Taenia saginata database sequences from 6 different countries in Asia, including Thailand, and 3 countries from other continents. The results showed that there were minor genetic variations within T. asiatica, while high intraspecies variation was found in T. saginata. There were only 2 haplotypes and 1 polymorphic site found in T. asiatica, but 8 haplotypes and 9 polymorphic sites in T. saginata. Haplotype diversity was very low (0.067) in T. asiatica and high (0.700) in T. saginata. The very low genetic diversity suggested that T. asiatica may be at risk due to the loss of potential adaptive alleles, resulting in reduced viability and decreased responses to environmental changes, which may endanger the species.
Detecting authorized and unauthorized genetically modified organisms containing vip3A by real-time PCR and next-generation sequencing.

Science.gov (United States)

Liang, Chanjuan; van Dijk, Jeroen P; Scholtens, Ingrid M J; Staats, Martijn; Prins, Theo W; Voorhuijzen, Marleen M; da Silva, Andrea M; Arisi, Ana Carolina Maisonnave; den Dunnen, Johan T; Kok, Esther J

2014-04-01

The growing number of biotech crops with novel genetic elements increasingly complicates the detection of genetically modified organisms (GMOs) in food and feed samples using conventional screening methods. Unauthorized GMOs (UGMOs) in food and feed are currently identified through combining GMO element screening with sequencing the DNA flanking these elements. In this study, a specific and sensitive qPCR assay was developed for vip3A element detection based on the vip3Aa20 coding sequences of the recently marketed MIR162 maize and COT102 cotton. Furthermore, SiteFinding-PCR in combination with Sanger, Illumina or Pacific BioSciences (PacBio) sequencing was performed targeting the flanking DNA of the vip3Aa20 element in MIR162. De novo assembly and Basic Local Alignment Search Tool searches were used to mimic UGMO identification. PacBio data resulted in relatively long contigs in the upstream (1,326 nucleotides (nt); 95 % identity) and downstream (1,135 nt; 92 % identity) regions, whereas Illumina data resulted in two smaller contigs of 858 and 1,038 nt with higher sequence identity (>99 % identity). Both approaches outperformed Sanger sequencing, underlining the potential for next-generation sequencing in UGMO identification.
Phylogenetic Analysis of Nucleus-Encoded Acetyl-CoA Carboxylases Targeted at the Cytosol and Plastid of Algae.

KAUST Repository

Huerlimann, Roger

2015-07-01

The understanding of algal phylogeny is being impeded by an unknown number of events of horizontal gene transfer (HGT), and primary and secondary/tertiary endosymbiosis. Through these events, previously heterotrophic eukaryotes developed photosynthesis and acquired new biochemical pathways. Acetyl-CoA carboxylase (ACCase) is a key enzyme in the fatty acid synthesis and elongation pathways in algae, where ACCase exists in two locations (cytosol and plastid) and in two forms (homomeric and heteromeric). All algae contain nucleus-encoded homomeric ACCase in the cytosol, independent of the origin of the plastid. Nucleus-encoded homomeric ACCase is also found in plastids of algae that arose from a secondary/tertiary endosymbiotic event. In contrast, plastids of algae that arose from a primary endosymbiotic event contain heteromeric ACCase, which consists of three nucleus-encoded and one plastid-encoded subunits. These properties of ACCase provide the potential to inform on the phylogenetic relationships of hosts and their plastids, allowing different hypotheses of endosymbiotic events to be tested. Alveolata (Dinoflagellata and Apicomplexa) and Chromista (Stramenopiles, Haptophyta and Cryptophyta) have traditionally been grouped together as Chromalveolata, forming the red lineage. However, recent genetic evidence groups the Stramenopiles, Alveolata and green-plastid-containing Rhizaria as SAR, excluding Haptophyta and Cryptophyta. Sequences coding for plastid- and cytosol-targeted homomeric ACCases were isolated from Isochrysis aff. galbana (TISO), Chromera velia and Nannochloropsis oculata, representing three taxonomic groups for which sequences were lacking. Phylogenetic analyses show that cytosolic ACCase strongly supports the SAR grouping. Conversely, plastidial ACCase groups the SAR with the Haptophyta, Cryptophyta and Prasinophyceae (Chlorophyta). These two ACCase-based phylogenetic relationships suggest that the plastidial homomeric ACCase was acquired by the
Allele Re-sequencing Technologies

DEFF Research Database (Denmark)

Byrne, Stephen; Farrell, Jacqueline Danielle; Asp, Torben

2013-01-01

The development of next-generation sequencing technologies has made sequencing an affordable approach for detection of genetic variations associated with various traits. However, the cost of whole genome re-sequencing still remains too high to be feasible for many plant species with large...... alternative to whole genome re-sequencing to identify causative genetic variations in plants. One challenge, however, will be efficient bioinformatics strategies for data handling and analysis from the increasing amount of sequence information....
Dynamic encoding of natural luminance sequences by LGN bursts.

Directory of Open Access Journals (Sweden)

Nicholas A Lesica

2006-07-01

In the lateral geniculate nucleus (LGN) of the thalamus, visual stimulation produces two distinct types of responses known as tonic and burst. Due to the dynamics of the T-type Ca(2+) channels involved in burst generation, the type of response evoked by a particular stimulus depends on the resting membrane potential, which is controlled by a network of modulatory connections from other brain areas. In this study, we use simulated responses to natural scene movies to describe how modulatory and stimulus-driven changes in LGN membrane potential interact to determine the luminance sequences that trigger burst responses. We find that at low resting potentials, when the T channels are de-inactivated and bursts are relatively frequent, an excitatory stimulus transient alone is sufficient to evoke a burst. However, to evoke a burst at high resting potentials, when the T channels are inactivated and bursts are relatively rare, prolonged inhibitory stimulation followed by an excitatory transient is required. We also observe evidence of these effects in vivo, where analysis of experimental recordings demonstrates that the luminance sequences that trigger bursts can vary dramatically with the overall burst percentage of the response. To characterize the functional consequences of the effects of resting potential on burst generation, we simulate LGN responses to different luminance sequences at a range of resting potentials with and without a mechanism for generating bursts. Using analysis based on signal detection theory, we show that bursts enhance detection of specific luminance sequences, ranging from the onset of excitatory sequences at low resting potentials to the offset of inhibitory sequences at high resting potentials. These results suggest a dynamic role for burst responses during visual processing that may change according to behavioral state.
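The record above relies on signal detection theory to compare detection with and without bursts. The abstract does not say which detection metric was used, so the following is only a generic sketch of one common measure, the sensitivity index d′ under an equal-variance Gaussian model; the hit and false-alarm rates are hypothetical numbers, not values from the study.

```python
from statistics import NormalDist


def d_prime(hit_rate, false_alarm_rate):
    """Sensitivity index d' = z(hit rate) - z(false-alarm rate)."""
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    return z(hit_rate) - z(false_alarm_rate)


# Hypothetical detection rates for a model with and without bursts.
with_bursts = d_prime(hit_rate=0.90, false_alarm_rate=0.10)
without_bursts = d_prime(hit_rate=0.75, false_alarm_rate=0.10)
print(f"d' with bursts: {with_bursts:.2f}, without: {without_bursts:.2f}")
```

A larger d′ means the stimulus-present and stimulus-absent response distributions are further apart, which is the sense in which one response mode can "enhance detection" relative to another.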
Genetic diversity of mtDNA D-loop sequences in four native Chinese chicken breeds.

Science.gov (United States)

Guo, H W; Li, C; Wang, X N; Li, Z J; Sun, G R; Li, G X; Liu, X J; Kang, X T; Han, R L

2017-10-01

1. To explore the genetic diversity of Chinese indigenous chicken breeds, a 585 bp fragment of the mitochondrial DNA (mtDNA) region was sequenced in 102 birds from the Xichuan black-bone chicken, Yunyang black-bone chicken and Lushi chicken. In addition, 30 mtDNA D-loop sequences of Silkie fowls were downloaded from NCBI. The mtDNA D-loop sequence polymorphism and maternal origin of 4 chicken breeds were analysed in this study. 2. The results showed that a total of 33 mutation sites and 28 haplotypes were detected in the 4 chicken breeds. The haplotype diversity and nucleotide diversity of these 4 native breeds were 0.916 ± 0.014 and 0.012 ± 0.002, respectively. Three clusters were formed in 4 Chinese native chickens and 12 reference breeds. Both the Xichuan black-bone chicken and Yunyang black-bone chicken were grouped into one cluster. Four haplogroups (A, B, C and E) emerged in the median-joining network in these breeds. 3. It was concluded that these 4 Chinese chicken breeds had high genetic diversity. The phylogenetic tree and median network profiles showed that native chickens from China and its neighbouring countries had at least two maternal origins, one from Yunnan, China and another from Southeast Asia or its surrounding area.
Molecular cloning and sequence analysis of complementary DNA encoding rat mammary gland medium-chain S-acyl fatty acid synthetase thio ester hydrolase

International Nuclear Information System (INIS)

Safford, R.; de Silva, J.; Lucas, C.

1987-01-01

Poly(A)+ RNA from pregnant rat mammary glands was size-fractionated by sucrose gradient centrifugation, and fractions enriched in medium-chain S-acyl fatty acid synthetase thio ester hydrolase (MCH) were identified by in vitro translation and immunoprecipitation. A cDNA library was constructed, in pBR322, from enriched poly(A)+ RNA and screened with two oligonucleotide probes deduced from rat MCH amino acid sequence data. Cross-hybridizing clones were isolated and found to contain cDNA inserts ranging from ∼1100 to 1550 base pairs (bp). A 1550-bp cDNA insert, from clone 43H09, was confirmed to encode MCH by hybrid-select translation/immunoprecipitation studies and by comparison of the amino acid sequence deduced from the DNA sequence of the clone to the amino acid sequence of the MCH peptides. Northern blot analysis revealed the size of the MCH mRNA to be 1500 nucleotides, and it is therefore concluded that the 1550-bp insert (including G x C tails) of clone 43H09 represents a full- or near-full-length copy of the MCH gene. The rat MCH sequence is the first reported sequence of a thioesterase from a mammalian source, but comparison of the deduced amino acid sequences of MCH and the recently published mallard duck medium-chain S-acyl fatty acid synthetase thioesterase reveals significant homology. In particular, a seven amino acid sequence containing the proposed active serine of the duck thioesterase is found to be perfectly conserved in rat MCH.
Functional analysis of the Gonococcal Genetic Island of Neisseria gonorrhoeae.

Directory of Open Access Journals (Sweden)

Emilia Pachulec

Neisseria gonorrhoeae is an obligate human pathogen that is responsible for the sexually transmitted disease gonorrhea. N. gonorrhoeae encodes a T4SS within the Gonococcal Genetic Island (GGI), which secretes ssDNA directly into the external milieu. Type IV secretion systems (T4SSs) play a role in horizontal gene transfer and delivery of effector molecules into target cells. We demonstrate that GGI-like T4SSs are present in other β-proteobacteria, as well as in α- and γ-proteobacteria. Sequence comparison of GGI-like T4SSs reveals that the GGI-like T4SSs form a highly conserved unit that can be found located both on chromosomes and on plasmids. To better understand the mechanism of DNA secretion by N. gonorrhoeae, we performed mutagenesis of all genes encoded within the GGI, and studied the effects of these mutations on DNA secretion. We show that genes required for DNA secretion are encoded within the yaa-atlA and parA-parB regions, while genes encoded in the yfeB-exp1 region could be deleted without any effect on DNA secretion. Genes essential for DNA secretion are encoded within at least four different operons.
Development of novel InDel markers and genetic diversity in Chenopodium quinoa through whole-genome re-sequencing.

Science.gov (United States)

Zhang, Tifu; Gu, Minfeng; Liu, Yuhe; Lv, Yuanda; Zhou, Ling; Lu, Haiyan; Liang, Shuaiqiang; Bao, Huabin; Zhao, Han

2017-09-05

Quinoa (Chenopodium quinoa Willd.) is a balanced nutritional crop, but its breeding improvement has been limited by the lack of information on its genetics and genomics. Therefore, it is necessary to obtain knowledge on genomic variation, population structure, and genetic diversity and to develop novel Insertion/Deletion (InDel) markers for quinoa by whole-genome re-sequencing. We re-sequenced 11 quinoa accessions and obtained a coverage depth of approximately 7× to 23× of the quinoa genome. Based on the 1453-megabase (Mb) assembly from the reference accession Riobamba, 8,441,022 filtered bi-allelic single nucleotide polymorphisms (SNPs) and 842,783 filtered InDels were identified, with an estimated SNP and InDel density of 5.81 and 0.58 per kilobase (kb), respectively. From the genomic InDel variations, 85 dimorphic InDel markers were newly developed and validated. Together with the 62 simple sequence repeat (SSR) markers reported, a total of 147 markers were used for genotyping the 129 quinoa accessions. Molecular grouping analysis showed classification into two major groups, the Andean highland (composed of the northern and southern highland subgroups) and Chilean coastal, based on combined STRUCTURE, phylogenetic tree and PCA (Principal Component Analysis) analyses. Further analysis of the genetic diversity exhibited a decreasing tendency from the Chilean coast group to the Andean highland group, and the gene flow between subgroups was more frequent than that between the two subgroups and the Chilean coastal group. An analysis of molecular variance (AMOVA) attributed the majority of the variation (approximately 70%) to the diversity between the groups. This was congruent with the observation of a highly significant FST value (0.705) between the groups, demonstrating significant genetic differentiation between the Andean highland type of quinoa and the Chilean coastal type. Moreover, a core set of 16 quinoa germplasms that capture all 362 alleles was
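The SNP and InDel densities quoted in the record above follow directly from the variant counts and the assembly size. As a quick worked check, using only the figures given in the abstract (the function name is a hypothetical helper):

```python
ASSEMBLY_MB = 1453   # reference assembly size in megabases (from the abstract)
SNPS = 8_441_022     # filtered bi-allelic SNPs (from the abstract)
INDELS = 842_783     # filtered InDels (from the abstract)


def density_per_kb(count, assembly_mb):
    """Variants per kilobase, taking 1 Mb = 1000 kb."""
    return count / (assembly_mb * 1000)


print(f"SNP density:   {density_per_kb(SNPS, ASSEMBLY_MB):.2f} per kb")
print(f"InDel density: {density_per_kb(INDELS, ASSEMBLY_MB):.2f} per kb")
```

The results round to 5.81 SNPs and 0.58 InDels per kb, matching the reported values.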
A model for visual memory encoding.

Science.gov (United States)

Nenert, Rodolphe; Allendorfer, Jane B; Szaflarski, Jerzy P

2014-01-01

Memory encoding engages multiple concurrent and sequential processes. While the individual processes involved in successful encoding have been examined in many studies, the sequence of events and the importance of the modules associated with memory encoding have not been established. For this reason, we sought to perform a comprehensive examination of the network for memory encoding using data-driven methods and to determine the directionality of the information flow in order to build a viable model of visual memory encoding. Forty healthy controls ages 19-59 performed a visual scene encoding task. FMRI data were preprocessed using SPM8 and then processed using independent component analysis (ICA) with the reliability of the identified components confirmed using ICASSO as implemented in GIFT. The directionality of the information flow was examined using Granger causality analyses (GCA). All participants performed the fMRI task well above the chance level (>90% correct on both active and control conditions) and the post-fMRI testing recall revealed correct memory encoding at 86.33 ± 5.83%. ICA identified involvement of components of five different networks in the process of memory encoding, and the GCA allowed for the directionality of the information flow to be assessed, from visual cortex via ventral stream to the attention network and then to the default mode network (DMN). Two additional networks involved in this process were the cerebellar and the auditory-insular network. This study provides evidence that successful visual memory encoding is dependent on multiple modules that are part of other networks that are only indirectly related to the main process. This model may help to identify the node(s) of the network that are affected by a specific disease process and explain the presence of memory encoding difficulties in patients in whom focal or global network dysfunction exists.
Citrus plastid-related gene profiling based on expressed sequence tag analyses

Directory of Open Access Journals (Sweden)

Tercilio Calsa Jr.

2007-01-01

Plastid-related sequences, derived from putative nuclear or plastome genes, were searched in a large collection of expressed sequence tags (ESTs) and genomic sequences from the Citrus Biotechnology initiative in Brazil. The identified putative Citrus chloroplast gene sequences were compared to those from Arabidopsis, Eucalyptus and Pinus. Differential expression profiling for plastid-directed nuclear-encoded proteins and photosynthesis-related gene expression variation between Citrus sinensis and Citrus reticulata, when inoculated or not with Xylella fastidiosa, were also analyzed. Presumed Citrus plastome regions were more similar to Eucalyptus. Some putative genes appeared to be preferentially expressed in vegetative tissues (leaves and bark) or in reproductive organs (flowers and fruits). Genes preferentially expressed in fruit and flower may be associated with hypothetical physiological functions. Expression pattern clustering analysis suggested that photosynthesis- and carbon fixation-related genes appeared to be up- or down-regulated in a resistant or susceptible Citrus species after Xylella inoculation in comparison to non-infected controls, generating novel information which may be helpful to develop novel genetic manipulation strategies to control Citrus variegated chlorosis (CVC).
Bacillus subtilis IolQ (DegA) is a transcriptional repressor of iolX encoding NAD+-dependent scyllo-inositol dehydrogenase.

Science.gov (United States)

Kang, Dong-Min; Michon, Christophe; Morinaga, Tetsuro; Tanaka, Kosei; Takenaka, Shinji; Ishikawa, Shu; Yoshida, Ken-Ichi

2017-07-11

Bacillus subtilis is able to utilize at least three inositol stereoisomers as carbon sources, myo-, scyllo-, and D-chiro-inositol (MI, SI, and DCI, respectively). NAD+-dependent SI dehydrogenase responsible for SI catabolism is encoded by iolX. Even in the absence of functional iolX, the presence of SI or MI in the growth medium was found to induce the transcription of iolX through an unknown mechanism. Immediately upstream of iolX, there is an operon that encodes two genes, yisR and iolQ (formerly known as degA), each of which could encode a transcriptional regulator. Here we performed an inactivation analysis of yisR and iolQ and found that iolQ encodes a repressor of the iolX transcription. The coding sequence of iolQ was expressed in Escherichia coli and the gene product was purified as a His-tagged fusion protein, which bound to two sites within the iolX promoter region in vitro. IolQ is a transcriptional repressor of iolX. Genetic evidence allowed us to speculate that SI and MI might be the intracellular inducers; however, they failed to antagonize DNA binding of IolQ in in vitro experiments.
Further genetic localization of the transforming sequences of the p21 v-ras gene of Harvey murine sarcoma virus

DEFF Research Database (Denmark)

Willumsen, B M; Ellis, R W; Scolnick, E M

1984-01-01

, DNA sequence analysis has found a single open reading frame large enough to encode the viral p21 (R. Dhar, R. W. Ellis, T. Y. Shih, S. Oroszlan, B. Shapiro, J. Maizel, D. Lowy, and E. M. Scolnick, Science 217:934-937, 1982). There are three potential in-frame ATG initiation codons at the 5' end...
Avian reovirus L2 genome segment sequences and predicted structure/function of the encoded RNA-dependent RNA polymerase protein

Directory of Open Access Journals (Sweden)

Xu Wanhong

2008-12-01

Background The orthoreoviruses are infectious agents that possess a genome comprised of 10 double-stranded RNA segments encased in two concentric protein capsids. Like virtually all RNA viruses, an RNA-dependent RNA polymerase (RdRp) enzyme is required for viral propagation. RdRp sequences have been determined for the prototype mammalian orthoreoviruses and for several other closely-related reoviruses, including aquareoviruses, but have not yet been reported for any avian orthoreoviruses. Results We determined the L2 genome segment nucleotide sequences, which encode the RdRp proteins, of two different avian reoviruses, strains ARV138 and ARV176 in order to define conserved and variable regions within reovirus RdRp proteins and to better delineate structure/function of this important enzyme. The ARV138 L2 genome segment was 3829 base pairs long, whereas the ARV176 L2 segment was 3830 nucleotides long. Both segments were predicted to encode λB RdRp proteins 1259 amino acids in length. Alignments of these newly-determined ARV genome segments, and their corresponding proteins, were performed with all currently available homologous mammalian reovirus (MRV) and aquareovirus (AqRV) genome segment and protein sequences. There was ~55% amino acid identity between ARV λB and MRV λ3 proteins, making the RdRp protein the most highly conserved of currently known orthoreovirus proteins, and there was ~28% identity between ARV λB and homologous MRV and AqRV RdRp proteins. Predictive structure/function mapping of identical and conserved residues within the known MRV λ3 atomic structure indicated most identical amino acids and conservative substitutions were located near and within predicted catalytic domains and lining RdRp channels, whereas non-identical amino acids were generally located on the molecule's surfaces. Conclusion The ARV λB and MRV λ3 proteins showed the highest ARV:MRV identity values (~55% amongst all currently known ARV and MRV
Molecular cloning of growth hormone encoding cDNA of Indian

Indian Academy of Sciences (India)

A modified rapid amplification of cDNA ends (RACE) strategy has been developed for cloning highly conserved cDNA sequences. Using this modified method, the growth hormone (GH) encoding cDNA sequences of Labeo rohita, Cirrhina mrigala and Catla catla have been cloned, characterized and overexpressed in ...
Next-generation sequencing for genetic testing of familial colorectal cancer syndromes.

Science.gov (United States)

Simbolo, Michele; Mafficini, Andrea; Agostini, Marco; Pedrazzani, Corrado; Bedin, Chiara; Urso, Emanuele D; Nitti, Donato; Turri, Giona; Scardoni, Maria; Fassan, Matteo; Scarpa, Aldo

2015-01-01

Genetic screening in families with high risk to develop colorectal cancer (CRC) prevents incurable disease and permits personalized therapeutic and follow-up strategies. The advancement of next-generation sequencing (NGS) technologies has revolutionized the throughput of DNA sequencing. A series of 16 probands for either familial adenomatous polyposis (FAP; 8 cases) or hereditary nonpolyposis colorectal cancer (HNPCC; 8 cases) were investigated for intragenic mutations in five CRC familial syndromes-associated genes (APC, MUTYH, MLH1, MSH2, MSH6) applying both a custom multigene Ion AmpliSeq NGS panel and conventional Sanger sequencing. Fourteen pathogenic variants were detected in 13/16 FAP/HNPCC probands (81.3 %); one FAP proband presented two co-existing pathogenic variants, one in APC and one in MUTYH. Thirteen of these 14 pathogenic variants were detected by both NGS and Sanger, while one MSH2 mutation (L280FfsX3) was identified only by Sanger sequencing. This is due to a limitation of the NGS approach in resolving sequences close to or within homopolymeric stretches of DNA. To evaluate the performance of our NGS custom panel we assessed its capability to resolve the DNA sequences corresponding to 2225 pathogenic variants reported in the COSMIC database for APC, MUTYH, MLH1, MSH2, MSH6. Our NGS custom panel resolves the sequences where 2108 (94.7 %) of these variants occur. The remaining 117 mutations reside inside or in close proximity to homopolymer stretches; of these 27 (1.2 %) are imprecisely identified by the software but can be resolved by visual inspection of the region, while the remaining 90 variants (4.0 %) are blind spots. In summary, our custom panel would miss 4 % (90/2225) of pathogenic variants that would need a small set of Sanger sequencing reactions to be solved. The multiplex NGS approach has the advantage of analyzing multiple genes in multiple samples simultaneously, requiring only a reduced number of Sanger sequences to resolve
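The coverage percentages quoted in this abstract can be re-derived from the stated counts. The following is a minimal Python sketch of that arithmetic; the variable and function names are illustrative and only the counts come from the abstract.

```python
# Counts quoted in the abstract (COSMIC pathogenic variants in the five genes).
total_variants = 2225
resolved = 2108      # variant positions the NGS panel resolves
imprecise = 27       # imprecisely called, resolvable by visual inspection
blind_spots = 90     # variants the panel would miss

def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)

# Variants lying in or near homopolymer stretches.
near_homopolymer = imprecise + blind_spots

coverage = {
    "resolved": percent(resolved, total_variants),
    "imprecise": percent(imprecise, total_variants),
    "blind_spots": percent(blind_spots, total_variants),
}
print(near_homopolymer, coverage)
```

The three categories partition the 2225 variants, which is why the homopolymer-associated count equals the 117 mentioned in the text.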
Toward allotetraploid cotton genome assembly: integration of a high-density molecular genetic linkage map with DNA sequence information

Science.gov (United States)

2012-01-01

Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. Conclusion This study will serve as a valuable genomic resource
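The map statistics in this abstract can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the average inter-locus distance is the total map length divided by the number of intervals (loci minus linkage groups), which is one reading that reproduces the quoted 1.08 cM; the abstract does not state how the average was computed.

```python
# Figures quoted in the abstract.
previous_loci = 2247
added_loci = 1167
linkage_groups = 26
map_length_cm = 3667.62

total_loci = previous_loci + added_loci

# Assumption: each linkage group with n loci contributes n - 1 intervals.
intervals = total_loci - linkage_groups
mean_interval_cm = round(map_length_cm / intervals, 2)

# Alternative reading: divide by the number of loci instead of intervals.
mean_per_locus_cm = round(map_length_cm / total_loci, 2)

print(total_loci, mean_interval_cm, mean_per_locus_cm)
```

Under the interval-based reading the result matches the published value, while dividing by the locus count gives a slightly smaller figure.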

Molecular sequence data of hepatitis B virus and genetic diversity after vaccination.

Science.gov (United States)

van Ballegooijen, W Marijn; van Houdt, Robin; Bruisten, Sylvia M; Boot, Hein J; Coutinho, Roel A; Wallinga, Jacco

2009-12-15

The effect of vaccination programs on transmission of infectious disease is usually assessed by monitoring programs that rely on notifications of symptomatic illness. For monitoring of infectious diseases with a high proportion of asymptomatic cases or a low reporting rate, molecular sequence data combined with modern coalescent-based techniques offer a complementary tool to assess transmission. Here, the authors investigate the added value of using viral sequence data to monitor a vaccination program that was started in 1998 and was targeted against hepatitis B virus in men who have sex with men in Amsterdam, the Netherlands. The incidence in this target group, as estimated from the notifications of acute infections with hepatitis B virus, was low; therefore, there was insufficient power to show a significant change in incidence. In contrast, the genetic diversity, as estimated from the viral sequence collected from the target group, revealed a marked decrease after vaccination was introduced. Taken together, the findings suggest that introduction of vaccination coincided with a change in the target group toward behavior with a higher risk of infection. The authors argue that molecular sequence data provide a powerful additional monitoring instrument, next to conventional case registration, for assessing the impact of vaccination.
Characterization, genetic diversity, and evolutionary link of Cucumber mosaic virus strain New Delhi from India.

Science.gov (United States)

Koundal, Vikas; Haq, Qazi Mohd Rizwanul; Praveen, Shelly

2011-02-01

The genome of Cucumber mosaic virus New Delhi strain (CMV-ND) from India, obtained from tomato, was completely sequenced and compared with full genome sequences of 14 known CMV strains from subgroups I and II, for their genetic diversity. Sequence analysis suggests CMV-ND shares maximum sequence identity at the nucleotide level with a CMV strain from Taiwan. Among all 15 strains of CMV, the encoded protein 2b is least conserved, whereas the coat protein (CP) is most conserved. Sequence identity values and phylogram results indicate that CMV-ND belongs to subgroup I. Based on the recombination detection program result, it appears that CMV is prone to recombination, and different RNA components of CMV-ND have evolved differently. Recombinational analysis of all 15 CMV strains detected maximum recombination breakpoints in RNA2; CP showed the least recombination sites.
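Sequence identity at the nucleotide level, as used in this abstract, is commonly computed over the columns of a pairwise alignment. The Python sketch below shows one such calculation; the function name and the toy sequences are hypothetical, and skipping gapped columns is an assumption, since different tools use different denominators.

```python
def percent_identity(seq_a, seq_b):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared

# Toy aligned fragments (hypothetical, not real CMV sequence).
example = percent_identity("ATGGAC-AAT", "ATGGTCTAAT")
print(f"{example:.1f}% identity")
```

In the toy pair, nine columns are ungapped and eight of them match.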
Genetic diversity of EBV-encoded LMP1 in the Swiss HIV Cohort Study and implication for NF-κB activation.

Directory of Open Access Journals (Sweden)

Emilie Zuercher

Epstein-Barr virus (EBV) is associated with several types of cancers including Hodgkin's lymphoma (HL) and nasopharyngeal carcinoma (NPC). EBV-encoded latent membrane protein 1 (LMP1), a multifunctional oncoprotein, is a powerful activator of the transcription factor NF-κB, a property that is essential for EBV-transformed lymphoblastoid cell survival. Previous studies reported LMP1 sequence variations and induction of higher NF-κB activation levels compared to the prototype B95-8 LMP1 by some variants. Here we used biopsies of EBV-associated cancers and blood of individuals included in the Swiss HIV Cohort Study (SHCS) to analyze LMP1 genetic diversity and impact of sequence variations on LMP1-mediated NF-κB activation potential. We found that a number of variants mediate higher NF-κB activation levels when compared to B95-8 LMP1 and mapped three single polymorphisms responsible for this phenotype: F106Y, I124V and F144I. F106Y was present in all LMP1 isolated in this study and its effect was variant dependent, suggesting that it was modulated by other polymorphisms. The two polymorphisms I124V and F144I were present in distinct phylogenetic groups and were linked with other specific polymorphisms nearby, I152L and D150A/L151I, respectively. The two sets of polymorphisms, I124V/I152L and F144I/D150A/L151I, which were markers of increased NF-κB activation in vitro, were not associated with EBV-associated HL in the SHCS. Taken together these results highlighted the importance of single polymorphisms for the modulation of LMP1 signaling activity and demonstrated that several groups of LMP1 variants, through distinct mutational paths, mediated enhanced NF-κB activation levels compared to B95-8 LMP1.
Isolation and sequence of cDNA encoding a cytochrome P-450 from an insecticide-resistant strain of the house fly, Musca domestica.

OpenAIRE

Feyereisen, R; Koener, J F; Farnsworth, D E; Nebert, D W

1989-01-01

A cDNA expression library from phenobarbital-treated house fly (Musca domestica) was screened with rabbit antisera directed against partially purified house fly cytochrome P-450. Two overlapping clones with insert lengths of 1.3 and 1.5 kilobases were isolated. The sequence of a 1629-base-pair (bp) cDNA was obtained, with an open reading frame (nucleotides 81-1610) encoding a P-450 protein of 509 residues (Mr = 58,738). The insect P-450 protein contains a hydrophobic NH2 terminus and a 22-res...
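The open reading frame coordinates and the protein length quoted above are mutually consistent, which a short calculation makes explicit. This Python sketch is illustrative; it assumes the quoted nucleotide range is 1-based, inclusive, and includes the stop codon, none of which the abstract states.

```python
def orf_summary(start, end):
    """Summarize an ORF given 1-based inclusive nucleotide coordinates."""
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length_nt // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return {"length_nt": length_nt, "codons": codons, "residues": codons - 1}

summary = orf_summary(81, 1610)
print(summary)
```

Under these assumptions the range spans 1530 nucleotides, or 510 codons, giving the 509 residues reported.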
Genetic structuring and differentiation of Echinococcus multilocularis in Slovakia assessed by sequencing and isoenzyme studies

DEFF Research Database (Denmark)

Snabel, V.; Miterpakova, M.; D'Amelio, S.

2006-01-01

Nucleotide sequencing of the mitochondrial cytochrome c oxidase subunit 1 (CO1) gene and isoenzyme analysis were used to survey the genetic variability in Echinococcus multilocularis populations from Slovakia. A sample of 12 isolates acquired from 10 different districts from red foxes exhibited......) in the CO1 fragment. These data, along with the recently gathered data from French isolates, are indicative of a genetically unique population occurring in Central and Western Europe. Electrophoretic examination of enzymes produced by 14 gene loci revealed intraspecific polymorphism only with the glucose...... between the species were obtained by isoenzyme analysis. Fixed genetic differences between the species were detected in the glucose-phosphate isomerase, esterase and aldolase systems, and partial differences were detected in four additional systems....
Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

Science.gov (United States)

Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

1993-01-01

Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. PMID:8497043
Southern-by-Sequencing: A Robust Screening Approach for Molecular Characterization of Genetically Modified Crops

Directory of Open Access Journals (Sweden)

Gina M. Zastrow-Hayes

2015-03-01

Molecular characterization of events is an integral part of the advancement process during genetically modified (GM) crop product development. Assessment of these events is traditionally accomplished by polymerase chain reaction (PCR) and Southern blot analyses. Southern blot analysis can be time-consuming and comparatively expensive and does not provide sequence-level detail. We have developed a sequence-based application, Southern-by-Sequencing (SbS), utilizing sequence capture coupled with next-generation sequencing (NGS) technology to replace Southern blot analysis for event selection in a high-throughput molecular characterization environment. SbS is accomplished by hybridizing indexed and pooled whole-genome DNA libraries from GM plants to biotinylated probes designed to target the sequence of transformation plasmids used to generate events within the pool. This sequence capture process enriches the sequence data obtained for targeted regions of interest (transformation plasmid DNA). Taking advantage of the DNA adjacent to the targeted bases (referred to as next-to-target sequence) that accompanies the targeted transformation plasmid sequence, the data analysis detects plasmid-to-genome and plasmid-to-plasmid junctions introduced during insertion into the plant genome. Analysis of these junction sequences provides sequence-level information as to the following: the number of insertion loci including detection of unlinked, independently segregating, small DNA fragments; copy number; rearrangements, truncations, or deletions of the intended insertion DNA; and the presence of transformation plasmid backbone sequences. This molecular evidence from SbS analysis is used to characterize and select GM plants meeting optimal molecular characterization criteria. SbS technology has proven to be a robust event screening tool for use in a high-throughput molecular characterization environment.
Genetic structuring of European anchovy (Engraulis encrasicolus) populations through mitochondrial DNA sequences.

Science.gov (United States)

Keskin, Emre; Atar, Hasan Huseyin

2012-04-01

Mitochondrial DNA sequence variation in 655 bp fragments of the cytochrome oxidase c subunit I gene, known as the DNA barcode, of European anchovy (Engraulis encrasicolus) was evaluated by analyzing 1529 individuals representing 16 populations from the Black Sea, through the Marmara Sea and the Aegean Sea to the Mediterranean Sea. A total of 19 (2.9%) variable sites were found among individuals, and these defined 10 genetically diverged populations with an overall mean distance of 1.2%. The highest nucleotide divergence was found between samples of eastern Mediterranean and northern Aegean (2.2%). Evolutionary history analysis among 16 populations clustered the Mediterranean Sea clades in one main branch and the other clades in another branch. The diverging pattern of the European anchovy populations, correlated with geographic dispersion, supports genetic structuring through the Black Sea-Marmara Sea-Aegean Sea-Mediterranean Sea quad.
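The summary statistics reported for the barcode region (variable sites and mean distance) can be illustrated on a toy alignment. The Python sketch below uses the uncorrected p-distance, which is an assumption: the abstract does not say which distance model was used, and the sequences and function names are hypothetical.

```python
from itertools import combinations

def variable_sites(alignment):
    """Count alignment columns that contain more than one distinct base."""
    return sum(1 for column in zip(*alignment) if len(set(column)) > 1)

def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two equal-length sequences."""
    return sum(a != b for a, b in zip(seq_a, seq_b)) / len(seq_a)

def mean_pairwise_distance(alignment):
    """Average p-distance over all unordered pairs of sequences."""
    pairs = list(combinations(alignment, 2))
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)

# Toy alignment of three 10-bp haplotypes (hypothetical data).
toy = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC"]
toy_variable = variable_sites(toy)
toy_mean = mean_pairwise_distance(toy)

# The abstract's 19 variable sites in a 655 bp fragment, as a percentage.
reported_fraction = round(100.0 * 19 / 655, 1)
print(toy_variable, round(toy_mean, 4), reported_fraction)
```

The last line reproduces the 2.9% figure quoted in the abstract from its own counts.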
Novel Genetic Variants of Sporadic Atrial Septal Defect (ASD) in a Chinese Population Identified by Whole-Exome Sequencing (WES).

Science.gov (United States)

Liu, Yong; Cao, Yu; Li, Yaxiong; Lei, Dongyun; Li, Lin; Hou, Zong Liu; Han, Shen; Meng, Mingyao; Shi, Jianlin; Zhang, Yayong; Wang, Yi; Niu, Zhaoyi; Xie, Yanhua; Xiao, Benshan; Wang, Yuanfei; Li, Xiao; Yang, Lirong; Wang, Wenju; Jiang, Lihong

2018-03-05

BACKGROUND Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. MATERIAL AND METHODS Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. RESULTS From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (PASD (PASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations.
Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers

Science.gov (United States)

Zoledziewska, Magdalena; Mulas, Antonella; Pistis, Giorgio; Steri, Maristella; Danjou, Fabrice; Kwong, Alan; Ortega del Vecchyo, Vicente Diego; Chiang, Charleston W. K.; Bragg-Gresham, Jennifer; Pitzalis, Maristella; Nagaraja, Ramaiah; Tarrier, Brendan; Brennan, Christine; Uzzau, Sergio; Fuchsberger, Christian; Atzeni, Rossano; Reinier, Frederic; Berutti, Riccardo; Huang, Jie; Timpson, Nicholas J; Toniolo, Daniela; Gasparini, Paolo; Malerba, Giovanni; Dedoussis, George; Zeggini, Eleftheria; Soranzo, Nicole; Jones, Chris; Lyons, Robert; Angius, Andrea; Kang, Hyun M.; Novembre, John; Sanna, Serena; Schlessinger, David; Cucca, Francesco; Abecasis, Gonçalo R

2015-01-01

We report ~17.6M genetic variants from whole-genome sequencing of 2,120 Sardinians; 22% are absent from prior sequencing-based compilations and enriched for predicted functional consequence. Furthermore, ~76K variants common in our sample (frequency >5%) are rare elsewhere (Genomes Project). We assessed the impact of these variants on circulating lipid levels and five inflammatory biomarkers. Fourteen signals, including two major new loci, were observed for lipid levels, and 19, including two novel loci, for inflammatory markers. New associations would be missed in analyses based on 1000 Genomes data, underlining the advantages of large-scale sequencing in this founder population. PMID:26366554
The genome sequence of the outbreeding globe artichoke constructed de novo incorporating a phase-aware low-pass sequencing strategy of F1 progeny

Science.gov (United States)

Scaglione, Davide; Reyes-Chin-Wo, Sebastian; Acquadro, Alberto; Froenicke, Lutz; Portis, Ezio; Beitel, Christopher; Tirone, Matteo; Mauro, Rosario; Lo Monaco, Antonino; Mauromicale, Giovanni; Faccioli, Primetta; Cattivelli, Luigi; Rieseberg, Loren; Michelmore, Richard; Lanteri, Sergio

2016-01-01

Globe artichoke (Cynara cardunculus var. scolymus) is an out-crossing, perennial, multi-use crop species that is grown worldwide and belongs to the Compositae, one of the most successful Angiosperm families. We describe the first genome sequence of globe artichoke. The assembly, comprising 13,588 scaffolds covering 725 of the 1,084 Mb genome, was generated using ~133-fold Illumina sequencing data and encodes 26,889 predicted genes. Re-sequencing (30×) of globe artichoke and cultivated cardoon (C. cardunculus var. altilis) parental genotypes and low-coverage (0.5 to 1×) genotyping-by-sequencing of 163 F1 individuals resulted in 73% of the assembled genome being anchored in 2,178 genetic bins ordered along 17 chromosomal pseudomolecules. This was achieved using a novel pipeline, SOILoCo (Scaffold Ordering by Imputation with Low Coverage), to detect heterozygous regions and assign parental haplotypes with low sequencing read depth and of unknown phase. SOILoCo provides a powerful tool for de novo genome analysis of outcrossing species. Our data will enable genome-scale analyses of evolutionary processes among crops, weeds, and wild species within and beyond the Compositae, and will facilitate the identification of economically important genes from related species. PMID:26786968
Genetically encoded ratiometric fluorescent thermometer with wide range and rapid response.

Directory of Open Access Journals (Sweden)

Masahiro Nakano

Temperature is a fundamental physical parameter that plays an important role in biological reactions and events. Although thermometers developed previously have been used to investigate several important phenomena, such as heterogeneous temperature distribution in a single living cell and heat generation in mitochondria, the development of a thermometer with a sensitivity over a wide temperature range and rapid response is still desired to quantify temperature change in not only homeotherms but also poikilotherms from the cellular level to in vivo. To overcome the weaknesses of the conventional thermometers, such as a limitation of applicable species and a low temporal resolution, owing to the narrow temperature range of sensitivity and the thermometry method, respectively, we developed a genetically encoded ratiometric fluorescent temperature indicator, gTEMP, by using two fluorescent proteins with different temperature sensitivities. Our thermometric method enabled a fast tracking of the temperature change with a time resolution of 50 ms. We used this method to observe the spatiotemporal temperature change between the cytoplasm and nucleus in cells, and quantified thermogenesis from the mitochondria matrix in a single living cell after stimulation with carbonyl cyanide 4-(trifluoromethoxy)phenylhydrazone, which was an uncoupler of oxidative phosphorylation. Moreover, exploiting the wide temperature range of sensitivity from 5°C to 50°C of gTEMP, we monitored the temperature in a living medaka embryo for 15 hours and showed the feasibility of in vivo thermometry in various living species.
Encoding color information for visual tracking: Algorithms and benchmark.

Science.gov (United States)

Liang, Pengpeng; Blasch, Erik; Ling, Haibin

2015-12-01

While color information is known to provide rich discriminative clues for visual inference, most modern visual trackers limit themselves to the grayscale realm. Despite recent efforts to integrate color in tracking, there is a lack of comprehensive understanding of the role color information can play. In this paper, we attack this problem by conducting a systematic study from both the algorithm and benchmark perspectives. On the algorithm side, we comprehensively encode 10 chromatic models into 16 carefully selected state-of-the-art visual trackers. On the benchmark side, we compile a large set of 128 color sequences with ground truth and challenge factor annotations (e.g., occlusion). A thorough evaluation is conducted by running all the color-encoded trackers, together with two recently proposed color trackers. A further validation is conducted on an RGBD tracking benchmark. The results clearly show the benefit of encoding color information for tracking. We also perform detailed analysis on several issues, including the behavior of various combinations between color model and visual tracker, the degree of difficulty of each sequence for tracking, and how different challenge factors affect the tracking performance. We expect the study to provide the guidance, motivation, and benchmark for future work on encoding color in visual tracking.
Design and development of genetically encoded fluorescent sensors to monitor intracellular chemical and physical parameters.

Science.gov (United States)

Germond, Arno; Fujita, Hideaki; Ichimura, Taro; Watanabe, Tomonobu M

2016-06-01

Over the past decades many researchers have made major contributions towards the development of genetically encoded (GE) fluorescent sensors derived from fluorescent proteins. GE sensors are now used to study biological phenomena by facilitating the measurement of biochemical behaviors at various scales, ranging from single molecules to single cells or even whole animals. Here, we review the historical development of GE fluorescent sensors and report on their current status. We specifically focus on the development strategies of the GE sensors used for measuring pH, ion concentrations (e.g., chloride and calcium), redox indicators, membrane potential, temperature, pressure, and molecular crowding. We demonstrate that these fluorescent protein-based sensors have a shared history of concepts and development strategies, and we highlight the most original concepts used to date. We believe that the understanding and application of these various concepts will pave the road for the development of future GE sensors and lead to new breakthroughs in bioimaging.
Fast two-photon imaging of subcellular voltage dynamics in neuronal tissue with genetically encoded indicators.

Science.gov (United States)

Chamberland, Simon; Yang, Helen H; Pan, Michael M; Evans, Stephen W; Guan, Sihui; Chavarha, Mariya; Yang, Ying; Salesse, Charleen; Wu, Haodi; Wu, Joseph C; Clandinin, Thomas R; Toth, Katalin; Lin, Michael Z; St-Pierre, François

2017-07-27

Monitoring voltage dynamics in defined neurons deep in the brain is critical for unraveling the function of neuronal circuits but is challenging due to the limited performance of existing tools. In particular, while genetically encoded voltage indicators have shown promise for optical detection of voltage transients, many indicators exhibit low sensitivity when imaged under two-photon illumination. Previous studies thus fell short of visualizing voltage dynamics in individual neurons in single trials. Here, we report ASAP2s, a novel voltage indicator with improved sensitivity. By imaging ASAP2s using random-access multi-photon microscopy, we demonstrate robust single-trial detection of action potentials in organotypic slice cultures. We also show that ASAP2s enables two-photon imaging of graded potentials in organotypic slice cultures and in Drosophila. These results demonstrate that the combination of ASAP2s and fast two-photon imaging methods enables detection of neural electrical activity with subcellular spatial resolution and millisecond-timescale precision.
Genomic view of bipolar disorder revealed by whole genome sequencing in a genetic isolate.

Directory of Open Access Journals (Sweden)

Benjamin Georgi

2014-03-01

Full Text Available Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders.
Genomic View of Bipolar Disorder Revealed by Whole Genome Sequencing in a Genetic Isolate

Science.gov (United States)

Georgi, Benjamin; Craig, David; Kember, Rachel L.; Liu, Wencheng; Lindquist, Ingrid; Nasser, Sara; Brown, Christopher; Egeland, Janice A.; Paul, Steven M.; Bućan, Maja

2014-01-01

Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders. PMID:24625924
Population genetic and evolution analysis of controversial genus Edwardsiella by multilocus sequence typing.

Science.gov (United States)

Buján, Noemí; Balboa, Sabela; L Romalde, Jesús; E Toranzo, Alicia; Magariños, Beatriz

2018-05-08

At present, the genus Edwardsiella comprises five species: E. tarda, E. hoshinae, E. ictaluri, E. piscicida and E. anguillarum. Some species of this genus, such as E. ictaluri and E. piscicida, are important pathogens of numerous fish species. With the description of the two latter species, the phylogeny of Edwardsiella became more complicated. With the aim of clarifying the relationships among all species in the genus, a multilocus sequence typing (MLST) approach was developed and applied to characterize 56 isolates and 6 reference strains belonging to the five Edwardsiella species. Moreover, several analyses based on the MLST scheme were performed to investigate the evolution within the genus, as well as the influence of recombination and mutation on speciation. Edwardsiella isolates presented high genetic variability, reflected in the fourteen sequence types (ST), out of eighteen in total, that were each represented by a single isolate. Mutation events were considerably more frequent than recombination, although both influenced genetic diversification approximately equally. However, speciation occurred mostly by recombination. The genus Edwardsiella displays a non-clonal population structure with some degree of geographical isolation followed by a population expansion of E. piscicida. A database from this study was created and hosted on pubmlst.org (http://pubmlst.org/edwardsiella/). Copyright © 2018 Elsevier Inc. All rights reserved.
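The sequence-type bookkeeping described in this abstract can be illustrated with a minimal Python sketch. It assumes each isolate has already been reduced to an allelic profile (one allele number per locus); the isolate names, the three-locus profiles, and the ST numbering are hypothetical and are not taken from the Edwardsiella scheme.

```python
from collections import Counter

def assign_sequence_types(profiles):
    """Give every distinct allelic profile its own ST number (in order of first appearance)."""
    st_of_profile = {}
    st_of_isolate = {}
    for isolate, profile in profiles.items():
        key = tuple(profile)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_isolate[isolate] = st_of_profile[key]
    return st_of_isolate

def singleton_sts(st_of_isolate):
    """Return the STs that are represented by exactly one isolate."""
    counts = Counter(st_of_isolate.values())
    return sorted(st for st, n in counts.items() if n == 1)

# Hypothetical allelic profiles for four isolates typed at three loci.
example_profiles = {
    "iso1": (1, 1, 2),
    "iso2": (1, 1, 2),
    "iso3": (2, 1, 2),
    "iso4": (1, 3, 2),
}
example_sts = assign_sequence_types(example_profiles)
example_singletons = singleton_sts(example_sts)
```

In this toy data set, two of the three STs are singletons, which is the kind of count the abstract reports (fourteen of eighteen).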
Quantum-Sequencing: Fast electronic single DNA molecule sequencing

Science.gov (United States)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of a unique "electronic fingerprint" of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shifts depending on the pH, with the most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of the beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.
Cloning of gene-encoded stem bromelain on system coming from Pichia pastoris as therapeutic protein candidate

Science.gov (United States)

Yusuf, Y.; Hidayati, W.

2018-01-01

Bacterial recombinants were identified using PCR and restriction analysis, and were then sequenced. This research aimed to obtain a Pichia pastoris yeast cell carrying the gene encoding the stem bromelain enzyme. Producing recombinant stem bromelain in P. pastoris yeast cells can yield pure stem bromelain with the same conformation as the enzyme in pineapple plants. This recombinant stem bromelain enzyme can be used as a therapeutic protein in inflammatory, cancer and degenerative diseases. This study was an early stage in a series of steps to obtain stem bromelain protein derived from pineapple by genetic engineering techniques. The research started by isolating RNA from pineapple stem, followed by constructing cDNA using reverse transcriptase PCR (RT-PCR) and amplifying the gene encoding bromelain by PCR with a specifically designed primer pair. The process continued with cloning into Escherichia coli cells. A vector carrying the gene encoding stem bromelain was inserted into P. pastoris yeast cells, which were then screened for the presence of the gene. The research has not yet found the stem bromelain gene in P. pastoris yeast cells. The next step is to repeat the process after buying new reagent (RNase inhibitor) and liquid nitrogen.

Storing data encoded DNA in living organisms

Science.gov (United States)

Wong, Pak C.; Wong, Kwong K.; Foote, Harlan P. [Richland, WA]

2006-06-06

Current technologies allow the generation of artificial DNA molecules and/or the alteration of the DNA sequences of existing DNA molecules. With a careful coding scheme and arrangement, it is possible to encode important information as an artificial DNA strand and store it in a living host safely and permanently. This inventive technology can be used to identify origins and protect R&D investments. It can also be used in environmental research to track generations of organisms and observe the ecological impact of pollutants. Today, there are microorganisms that can survive under extreme conditions. It is also advantageous to consider multicellular organisms as hosts for stored information. These living organisms can serve as memory housing and protection for stored data or information. The present invention provides for data storage in a living organism wherein at least one DNA sequence is encoded to represent data and incorporated into a living organism.
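To make the idea of a "coding scheme" concrete, here is a minimal Python sketch that maps data to a DNA string and back. The two-bits-per-base mapping is an assumption chosen for simplicity; it is not the scheme of the cited patent, and it ignores biological constraints (for example homopolymer runs or accidental start codons) that a real design would have to handle.

```python
# Illustrative mapping only: two bits per base (an assumption, not the patent's scheme).
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}

def encode_text(message: str) -> str:
    """Encode UTF-8 text as a DNA string, four bases per byte."""
    bits = "".join(f"{byte:08b}" for byte in message.encode("utf-8"))
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))

def decode_dna(dna: str) -> str:
    """Invert encode_text: read two bits per base and rebuild the bytes."""
    bits = "".join(BASE_TO_BITS[base] for base in dna)
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("utf-8")

# Round trip on a short message.
encoded = encode_text("DNA")
decoded = decode_dna(encoded)
```

Each byte becomes four bases, so the letter "A" (binary 01000001) is written as CAAC under this mapping.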
Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris)

Directory of Open Access Journals (Sweden)

Scott Jackson

2014-07-01

Full Text Available Common bean (Phaseolus vulgaris) is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs) are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs) were detected, which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein, which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF), termed ORF2, from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3’LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. This transposon data provides a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing a transposon-tagging system in common bean and other related genomes.
Nucleotide sequences of two cellulase genes from alkalophilic Bacillus sp. strain N-4 and their strong homology.

OpenAIRE

Fukumori, F; Sashihara, N; Kudo, T; Horikoshi, K

1986-01-01

Two genes for cellulases of alkalophilic Bacillus sp. strain N-4 (ATCC 21833) have been sequenced. From the DNA sequences the cellulases encoded in the plasmids pNK1 and pNK2 consist of 488 and 409 amino acids, respectively. The DNA and protein sequences of the pNK1-encoded cellulase are related to those of the pNK2-encoded cellulase. The pNK2-encoded cellulase lacks the direct repeat sequence of a stretch of 60 amino acids near the C-terminal end of the pNK1-encoded cellulase. The duplicatio...
Genetics of Vitiligo

Science.gov (United States)

Spritz, Richard; Andersen, Genevieve

2016-01-01

Synopsis: Vitiligo is a “complex disorder” (also termed polygenic and multifactorial), reflecting simultaneous contributions of multiple genetic risk factors and environmental triggers. Large-scale genome-wide association studies, principally in European-derived whites and in Chinese, have discovered approximately 50 different genetic loci that contribute to vitiligo risk, some of which also contribute to other autoimmune diseases that are epidemiologically associated with vitiligo. At many of these vitiligo susceptibility loci the corresponding relevant genes have now been identified, and for some of these genes the specific DNA sequence variants that contribute to vitiligo risk are also now known. A large fraction of these genes encode proteins involved in immune regulation, a number of others play roles in cellular apoptosis, and still others are involved in regulating functions of melanocytes. For this last group, there appears to be an opposite relationship between susceptibility to vitiligo and susceptibility to melanoma, suggesting that vitiligo may engage a normal mechanism of immune surveillance for melanoma. While many of the specific biologic mechanisms through which these genetic factors operate to cause vitiligo remain to be elucidated, it is now clear that vitiligo is an autoimmune disease involving a complex relationship between programming and function of the immune system, aspects of the melanocyte autoimmune target, and dysregulation of the immune response. PMID:28317533
Exploring the influence of encoding format on subsequent memory.

Science.gov (United States)

Turney, Indira C; Dennis, Nancy A; Maillet, David; Rajah, M Natasha

2017-05-01

Distinctive encoding is greatly influenced by gist-based processes and has been shown to suffer when highly similar items are presented in close succession. Thus, elucidating the mechanisms underlying how presentation format affects gist processing is essential in determining the factors that influence these encoding processes. The current study utilised multivariate partial least squares (PLS) analysis to identify encoding networks directly associated with retrieval performance in a blocked and an intermixed presentation condition. Subsequent memory analysis for successfully encoded items indicated no significant differences in reaction time or retrieval performance between presentation formats. Despite no significant behavioural differences, behaviour PLS revealed differences in brain-behaviour correlations and mean condition activity in brain regions associated with gist-based vs. distinctive encoding. Specifically, the intermixed format encouraged more distinctive encoding, showing increased activation of regions associated with strategy use and visual processing (e.g., frontal and visual cortices, respectively). Alternatively, the blocked format exhibited increased gist-based processes, accompanied by increased activity in the right inferior frontal gyrus. Together, results suggest that the sequence in which information is presented during encoding affects the degree to which distinctive encoding is engaged. These findings extend our understanding of the Fuzzy Trace Theory and the role of presentation format on encoding processes.
Prediction of novel archaeal enzymes from sequence-derived features

DEFF Research Database (Denmark)

Jensen, Lars Juhl; Skovgaard, Marie; Brunak, Søren

2002-01-01

The completely sequenced archaeal genomes potentially encode, among their many functionally uncharacterized genes, novel enzymes of biotechnological interest. We have developed a prediction method for detection and classification of enzymes from sequence alone (available at http://www.cbs.dtu.dk/services/ArchaeaFun/). The method does not make use of sequence similarity; rather, it relies on predicted protein features like cotranslational and posttranslational modifications, secondary structure, and simple physical/chemical properties....
Linkage of congenital isolated adrenocorticotropic hormone deficiency to the corticotropin releasing hormone locus using simple sequence repeat polymorphisms

Energy Technology Data Exchange (ETDEWEB)

Kyllo, J.H.; Collins, M.M.; Vetter, K.L. [Univ. of Iowa College of Medicine, Iowa City, IA (United States)] [and others

1996-03-29

Genetic screening techniques using simple sequence repeat polymorphisms were applied to investigate the molecular nature of congenital isolated adrenocorticotropic hormone (ACTH) deficiency. We hypothesize that this rare cause of hypocortisolism shared by a brother and sister with two unaffected sibs and unaffected parents is inherited as an autosomal recessive single gene mutation. Genes involved in the hypothalamic-pituitary axis controlling cortisol sufficiency were investigated for a causal role in this disorder. Southern blotting showed no detectable mutations of the gene encoding pro-opiomelanocortin (POMC), the ACTH precursor. Other candidate genes subsequently considered were those encoding neuroendocrine convertase-1, and neuroendocrine convertase-2 (NEC-1, NEC-2), and corticotropin releasing hormone (CRH). Tests for linkage were performed using polymorphic di- and tetranucleotide simple sequence repeat markers flanking the reported map locations for POMC, NEC-1, NEC-2, and CRH. The chromosomal haplotypes determined by the markers flanking the loci for POMC, NEC-1, and NEC-2 were not compatible with linkage. However, 22 individual markers defining the chromosomal haplotypes flanking CRH were compatible with linkage of the disorder to the immediate area of this gene of chromosome 8. Based on these data, we hypothesize that the ACTH deficiency in this family is due to an abnormality of CRH gene structure or expression. These results illustrate the useful application of high density genetic maps constructed with simple sequence repeat markers for inclusion/exclusion studies of candidate genes in even very small nuclear families segregating for unusual phenotypes. 25 refs., 5 figs., 2 tabs.
Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences.

Directory of Open Access Journals (Sweden)

Jianbin Liu

Full Text Available The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries) is not well understood, and little is known about this species' genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D) were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau.
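The summary statistics quoted in this abstract (haplotype diversity, average number of nucleotide differences, nucleotide diversity) can be sketched in Python as follows. The sketch uses the standard Nei estimators and assumes equal-length, gap-free aligned sequences. That is a simplification, since the D-loop sequences in the study varied in length and would need alignment and gap handling first. The toy sequences are hypothetical.

```python
from collections import Counter
from itertools import combinations

def haplotype_diversity(seqs):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    counts = Counter(seqs)
    return n / (n - 1) * (1 - sum((c / n) ** 2 for c in counts.values()))

def mean_pairwise_differences(seqs):
    """Average number of differing sites over all pairs of sequences."""
    pairs = list(combinations(seqs, 2))
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)

def nucleotide_diversity(seqs):
    """Mean pairwise differences per site (assumes equal-length aligned sequences)."""
    return mean_pairwise_differences(seqs) / len(seqs[0])

# Hypothetical four-sequence alignment of length 4.
toy = ["AAAA", "AAAT", "AATT", "AAAA"]
h = haplotype_diversity(toy)          # 5/6
k = mean_pairwise_differences(toy)    # 7/6
pi = nucleotide_diversity(toy)        # 7/24
```

The same relationship holds in the abstract's figures: about 19.6 average differences over sequences of roughly a thousand base pairs gives a per-site diversity near 0.019.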
Horse domestication and conservation genetics of Przewalski's horse inferred from sex chromosomal and autosomal sequences.

Science.gov (United States)

Lau, Allison N; Peng, Lei; Goto, Hiroki; Chemnick, Leona; Ryder, Oliver A; Makova, Kateryna D

2009-01-01

Despite their ability to interbreed and produce fertile offspring, there is continued disagreement about the genetic relationship of the domestic horse (Equus caballus) to its endangered wild relative, Przewalski's horse (Equus przewalskii). Analyses have differed as to whether or not Przewalski's horse is placed phylogenetically as a separate sister group to domestic horses. Because Przewalski's horse and domestic horse are so closely related, genetic data can also be used to infer domestication-specific differences between the two. To investigate the genetic relationship of Przewalski's horse to the domestic horse and to address whether evolution of the domestic horse is driven by males or females, five homologous introns (a total of approximately 3 kb) were sequenced on the X and Y chromosomes in two Przewalski's horses and three breeds of domestic horses: Arabian horse, Mongolian domestic horse, and Dartmoor pony. Five autosomal introns (a total of approximately 6 kb) were sequenced for these horses as well. The sequences of sex chromosomal and autosomal introns were used to determine nucleotide diversity and the forces driving evolution in these species. As a result, X chromosomal and autosomal data do not place Przewalski's horses in a separate clade within phylogenetic trees for horses, suggesting a close relationship between domestic and Przewalski's horses. It was also found that there was a lack of nucleotide diversity on the Y chromosome and higher nucleotide diversity than expected on the X chromosome in domestic horses as compared with the Y chromosome and autosomes. This supports the hypothesis that very few male horses along with numerous female horses founded the various domestic horse breeds. Patterns of nucleotide diversity among different types of chromosomes were distinct for Przewalski's in contrast to domestic horses, supporting unique evolutionary histories of the two species.
Mob/oriT, a mobilizable site-specific recombination system for unmarked genetic manipulation in Bacillus thuringiensis and Bacillus cereus.

Science.gov (United States)

Wang, Pengxia; Zhu, Yiguang; Zhang, Yuyang; Zhang, Chunyi; Xu, Jianyi; Deng, Yun; Peng, Donghai; Ruan, Lifang; Sun, Ming

2016-06-10

Bacillus thuringiensis and Bacillus cereus are two important species in B. cereus group. The intensive study of these strains at the molecular level and construction of genetically modified bacteria requires the development of efficient genetic tools. To insert genes into or delete genes from bacterial chromosomes, marker-less manipulation methods were employed. We present a novel genetic manipulation method for B. thuringiensis and B. cereus strains that does not leave selection markers. Our approach takes advantage of the relaxase Mob02281 encoded by plasmid pBMB0228 from Bacillus thuringiensis. In addition to its mobilization function, this Mob protein can mediate recombination between oriT sites. The Mob02281 mobilization module was associated with a spectinomycin-resistance gene to form a Mob-Spc cassette, which was flanked by the core 24-bp oriT sequences from pBMB0228. A strain in which the wild-type chromosome was replaced with the modified copy containing the Mob-Spc cassette at the target locus was obtained via homologous recombination. Thus, the spectinomycin-resistance gene can be used to screen for Mob-Spc cassette integration mutants. Recombination between the two oriT sequences mediated by Mob02281, encoded by the Mob-Spc cassette, resulted in the excision of the Mob-Spc cassette, producing the desired chromosomal alteration without introducing unwanted selection markers. We used this system to generate an in-frame deletion of a target gene in B. thuringiensis as well as a gene located in an operon of B. cereus. Moreover, we demonstrated that this system can be used to introduce a single gene or an expression cassette of interest in B. thuringiensis. The Mob/oriT recombination system provides an efficient method for unmarked genetic manipulation and for constructing genetically modified bacteria of B. thuringiensis and B. cereus. Our method extends the available genetic tools for B. thuringiensis and B. cereus strains.
Illumination of the Spatial Order of Intracellular pH by Genetically Encoded pH-Sensitive Sensors

Directory of Open Access Journals (Sweden)

Mojca Benčina

2013-12-01

Full Text Available Fluorescent proteins have been extensively used for engineering genetically encoded sensors that can monitor levels of ions, enzyme activities, redox potential, and metabolites. Certain fluorescent proteins possess specific pH-dependent spectroscopic features, and thus can be used as indicators of intracellular pH. Moreover, concatenating pH-sensitive proteins with target proteins pins the pH sensors to a definite location within the cell, compartment, or tissue. This study provides an overview of the continually expanding family of pH-sensitive fluorescent proteins that have become essential tools for studies of pH homeostasis and cell physiology. We describe and discuss the design of intensity-based and ratiometric pH sensors, their spectral properties and pH-dependency, as well as their performance. Finally, we illustrate some examples of the applications of pH sensors targeted at different subcellular compartments.
Genetic divergence of Asiatic Bdellocephala (Turbellaria, Tricladida, Paludicola) as revealed by partial 18S rRNA gene sequence comparisons.

Science.gov (United States)

Kuznedelov, K D; Timoshkin, O A; Goldman, E

1997-01-01

Polymerase chain reaction (PCR) and direct sequencing of small ribosomal RNA genes were used for analysis of genetic differences among Asiatic species of the freshwater triclad genus Bdellocephala. Representatives of four species and four subspecies of this genus were used to establish homology between nucleotides in the 5'-end portion of small ribosomal RNA gene sequences. Within 552 nucleotide sites of aligned sequences compared, six variable base positions were discovered, dividing Bdellocephala into five different genotypes. Sequence data allow two groups of these genotypes to be distinguished. One of them unites species from Kamchatka and Japan; the other unites Baikalian taxa. Agreement between available morphological, cytological and sequence data is discussed.
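The step from an alignment to "variable base positions" and "genotypes" can be sketched in a few lines of Python. The sketch assumes the sequences are already aligned to equal length; the taxon labels and the six-column alignment are hypothetical and far shorter than the 552 sites compared in the study.

```python
def variable_sites(alignment):
    """Return the 0-based columns at which the aligned sequences are not all identical."""
    seqs = list(alignment.values())
    return [i for i in range(len(seqs[0])) if len({s[i] for s in seqs}) > 1]

def genotypes(alignment):
    """Group taxa by the bases they carry at the variable columns."""
    sites = variable_sites(alignment)
    groups = {}
    for name, seq in alignment.items():
        key = "".join(seq[i] for i in sites)
        groups.setdefault(key, []).append(name)
    return groups

# Hypothetical alignment of four taxa.
toy_alignment = {
    "taxon1": "ACGTAC",
    "taxon2": "ACGTAC",
    "taxon3": "ACCTAC",
    "taxon4": "ATCTAC",
}
toy_sites = variable_sites(toy_alignment)
toy_genotypes = genotypes(toy_alignment)
```

Here two variable columns split the four taxa into three genotypes, in the same way that six variable positions yield five genotypes in the abstract.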
Comprehensive Genetic Database of Expressed Sequence Tags for Coccolithophorids

Science.gov (United States)

Ranji, Mohammad; Hadaegh, Ahmad R.

Coccolithophorids are unicellular, marine, golden-brown algae (Haptophyta) commonly found in near-surface waters in patchy distributions. They belong to the phytoplankton, which are known to be responsible for much of the Earth's primary production. Phytoplankton, just like plants, live on the energy obtained by photosynthesis, which produces oxygen. A substantial amount of the oxygen in the Earth's atmosphere is produced by phytoplankton through photosynthesis. Emiliania huxleyi is the most commonly known species of coccolithophorid and is known for extracting bicarbonate (HCO3) from its environment and producing calcium carbonate to form coccoliths. Coccolithophorids are one of the world's primary producers, contributing about 15% of the average oceanic phytoplankton biomass to the oceans. They produce elaborate, minute calcite platelets (coccoliths), covering the cell to form a coccosphere and supplying up to 60% of the bulk pelagic calcite deposited on the sea floors. In order to understand the genetics of coccolithophorids and the complexities of their biochemical reactions, we decided to build a database to store a complete profile of these organisms' genomes. Although a variety of such databases currently exist (http://www.geneservice.co.uk/home/), none have yet been developed to comprehensively address the sequencing efforts underway by the coccolithophorid research community. This database is called CocooExpress and is available to the public (http://bioinfo.csusm.edu) for both data queries and sequence contribution.
Molecular cloning, sequence characterization and expression pattern of Rab18 gene from watermelon (Citrullus lanatus).

Science.gov (United States)

Xinli, Xiao; Lei, Peng

2015-03-04

The complete mRNA sequence of watermelon Rab18 gene was amplified through the rapid amplification of cDNA ends (RACE) method. The full-length mRNA was 1010 bp containing a 645 bp open reading frame, which encodes a protein of 214 amino acids. Sequence analysis revealed that watermelon Rab18 protein shares high homology with the Rab18 of cucumber (99%), muskmelon (98%), Morus notabilis (90%), tomato (89%), wine grape (89%) and potato (88%). Phylogenetic analysis revealed that watermelon Rab18 gene has a closer genetic relationship with Rab18 gene of cucumber and muskmelon. Tissue expression profile analysis indicated that watermelon Rab18 gene was highly expressed in root, stem and leaf, moderately expressed in flower and weakly expressed in fruit.
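The relationship between the 645 bp open reading frame and the 214-residue protein is simple codon arithmetic, shown below as a small Python check. It assumes the reported ORF length includes the stop codon, which the abstract does not state explicitly; the function name is hypothetical.

```python
def protein_length_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of the given length in base pairs."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if includes_stop else codons

rab18_protein_length = protein_length_from_orf(645)  # 215 codons, one of them the stop
```

With 645 bp the ORF holds 215 codons; subtracting the stop codon gives the 214 amino acids reported.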
The cDNA sequence of a neutral horseradish peroxidase.

Science.gov (United States)

Bartonek-Roxå, E; Eriksson, H; Mattiasson, B

1991-02-16

A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail and the deduced protein contains 327 amino acids, including a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C) and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. Compared to the earlier described HRP-C, this is three fewer glycosylation sites. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight several thousand lower than that of the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not earlier described, horseradish peroxidase group.
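Potential N-glycosylation sites of the form Asn-X-Thr/Ser can be located with a short pattern scan. The Python sketch below uses one-letter amino acid codes and additionally excludes proline at the X position, which is the conventional refinement of the sequon and an assumption here, since the abstract only gives the general motif. The test peptide is hypothetical and is not the peroxidase sequence.

```python
import re

# Lookahead so that overlapping sequons are all reported.
# N = Asn, [^P] = any residue except Pro (assumed refinement), [ST] = Ser or Thr.
SEQUON = re.compile(r"(?=N[^P][ST])")

def find_sequons(protein: str):
    """Return 1-based positions of the Asn in each potential N-glycosylation sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]

# Hypothetical peptide: N-A-S and N-G-T match, N-P-T is excluded.
toy_peptide = "MNASNPTKNGTQ"
toy_sites = find_sequons(toy_peptide)
```

A scan like this only flags candidate sites; whether a site is actually glycosylated cannot be read from the sequence alone.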
Genetic variability of Echinococcus granulosus complex in various geographical populations of Iran inferred by mitochondrial DNA sequences.

Science.gov (United States)

Spotin, Adel; Mahami-Oskouei, Mahmoud; Harandi, Majid Fasihi; Baratchian, Mehdi; Bordbar, Ali; Ahmadpour, Ehsan; Ebrahimi, Sahar

2017-01-01

To investigate the genetic variability and population structure of the Echinococcus granulosus complex, 79 isolates were sequenced from different host species (human, dog, camel, goat, sheep and cattle) from various geographical sub-populations of Iran (Northwestern, Northern, and Southeastern). In addition, 36 sequences of the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene from other geographical populations (Western, Southeastern and Central Iran) were retrieved directly from the GenBank database. The confirmed isolates were grouped as G1 genotype (n=92), G6 genotype (n=14), G3 genotype (n=8) and G2 genotype (n=1). Fifty unique haplotypes were identified based on the analyzed cox1 sequences. A parsimonious network of the sequence haplotypes displayed star-like features in the overall population, with IR23 (22: 19.1%) as the most common haplotype. According to the analysis of molecular variance (AMOVA) test, the high value of haplotype diversity of the E. granulosus complex showed that the total genetic variability lay within populations, while nucleotide diversity was low in all populations. Neutrality indices of cox1 (Tajima's D and Fu's Fs tests) showed negative values in the Western-Northwestern, Northern and Southeastern populations, indicating significant divergence from neutrality, and were positive but not significant in Central isolates. The pairwise fixation index (Fst), as a measure of gene flow, was generally low for all populations (0.00647-0.15198). Statistically, the Fst values indicate that Echinococcus sensu stricto (genotype G1-G3) populations are not genetically well differentiated among the various geographical regions of Iran. To appraise the hypothetical evolutionary scenario, further study is needed to analyze concatenated mitogenomes, and a panel of single-locus nuclear markers should be considered across wider areas of Iran and neighboring countries. Copyright © 2016 Elsevier B.V. All rights reserved.
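One of the neutrality indices mentioned above, Tajima's D, compares two estimates of diversity: the mean number of pairwise differences and the number of segregating sites scaled by a sample-size constant. The Python sketch below follows the standard formula from Tajima (1989) and assumes equal-length, gap-free aligned sequences; the toy alignments are hypothetical and are not the cox1 data. The sign of D is what the abstract interprets: an excess of rare variants gives negative values.

```python
from itertools import combinations
from math import sqrt

def tajimas_d(seqs):
    """Tajima's D for a list of aligned, equal-length sequences (at least 4 recommended)."""
    n = len(seqs)
    length = len(seqs[0])
    # S: number of segregating (polymorphic) columns.
    s = sum(1 for i in range(length) if len({q[i] for q in seqs}) > 1)
    if s == 0:
        raise ValueError("no segregating sites; D is undefined")
    pairs = list(combinations(seqs, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))

# Hypothetical samples: only singleton variants versus intermediate-frequency variants.
d_rare = tajimas_d(["AAAA", "TAAA", "ATAA", "AATA"])
d_common = tajimas_d(["AA", "AA", "TT", "TT"])
```

Significance testing, which the abstract relies on, needs additional steps (for example coalescent simulation) that are not shown here.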
Genotyping-By-Sequencing (GBS) Detects Genetic Structure and Confirms Behavioral QTL in Tame and Aggressive Foxes (Vulpes vulpes).

Science.gov (United States)

Johnson, Jennifer L; Wittgenstein, Helena; Mitchell, Sharon E; Hyma, Katie E; Temnykh, Svetlana V; Kharlamova, Anastasiya V; Gulevich, Rimma G; Vladimirova, Anastasiya V; Fong, Hiu Wa Flora; Acland, Gregory M; Trut, Lyudmila N; Kukekova, Anna V

2015-01-01

The silver fox (Vulpes vulpes) offers a novel model for studying the genetics of social behavior and animal domestication. Selection of foxes, separately, for tame and for aggressive behavior has yielded two strains with markedly different, genetically determined, behavioral phenotypes. Tame strain foxes are eager to establish human contact while foxes from the aggressive strain are aggressive and difficult to handle. These strains have been maintained as separate outbred lines for over 40 generations but their genetic structure has not been previously investigated. We applied a genotyping-by-sequencing (GBS) approach to provide insights into the genetic composition of these fox populations. Sequence analysis of EcoT22I genomic libraries of tame and aggressive foxes identified 48,294 high quality SNPs. Population structure analysis revealed genetic divergence between the two strains and more diversity in the aggressive strain than in the tame one. Significant differences in allele frequency between the strains were identified for 68 SNPs. Three of these SNPs were located on fox chromosome 14 within an interval of a previously identified behavioral QTL, further supporting the importance of this region for behavior. The GBS SNP data confirmed that significant genetic diversity has been preserved in both fox populations despite many years of selective breeding. Analysis of SNP allele frequencies in the two populations identified several regions of genetic divergence between the tame and aggressive foxes, some of which may represent targets of selection for behavior. The GBS protocol used in this study significantly expanded genomic resources for the fox, and can be adapted for SNP discovery and genotyping in other canid species.
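One way to compare allele frequencies at a single SNP between two strains is a two-sided Fisher exact test on a 2x2 table of allele counts. The abstract does not name the test the authors used, so the sketch below is an assumption for illustration, and the counts are made up.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are the two strains; columns are counts of the two alleles.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x):
        # Hypergeometric probability of seeing x copies of allele 1 in row 1.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum all tables that are no more probable than the observed one.
    p = sum(
        q for q in (prob(x) for x in range(lowest, highest + 1))
        if q <= p_obs * (1 + 1e-9)
    )
    return min(1.0, p)


# Hypothetical allele counts: strain 1 has 3 and 1, strain 2 has 1 and 3.
p_value = fisher_exact_two_sided(3, 1, 1, 3)
```

With tens of thousands of SNPs tested, a multiple-testing correction would also be needed; that step is omitted here.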
Genetic signatures of adaptation revealed from transcriptome sequencing of Arctic and red foxes

OpenAIRE

Kumar, Vikas; Kutschera, Verena E.; Nilsson, Maria A.; Janke, Axel

2015-01-01

Background: The genus Vulpes (true foxes) comprises numerous species that inhabit a wide range of habitats and climatic conditions, including one species, the Arctic fox (Vulpes lagopus), which is adapted to the arctic region. A close relative of the Arctic fox, the red fox (Vulpes vulpes), occurs in subarctic to subtropical habitats. To study the genetic basis of their adaptations to different environments, transcriptome sequences from two Arctic foxes and one red fox individual were generated...
Identification and characterization of a gene encoding a putative ...

Indian Academy of Sciences (India)

2012-10-30

Oct 30, 2012 ... Genetic Improvement of Oil Crops, Ministry of Agriculture, Wuhan 430062, China. 2Institute of ... Its encoding gene is an essential candidate for oil crops to .... higher level in leaves than in other organs (Kim and Huang. 2004) ...
Sequence analysis of the Epstein-Barr virus (EBV) latent membrane protein-1 gene and promoter region

DEFF Research Database (Denmark)

Sandvej, Kristian; Gratama, J W; Munch, M

1997-01-01

Sequence variations in the Epstein-Barr virus (EBV) encoded latent membrane protein-1 (LMP-1) gene have been described in a Chinese nasopharyngeal carcinoma-derived isolate (CAO), and in viral isolates from various EBV-associated tumors. It has been suggested that these genetic changes, which include loss of a Xho I restriction site (position 169425) and a C-terminal 30-base pair (bp) deletion (position 168287-168256), define EBV genotypes associated with increased tumorigenicity or with disease among particular geographic populations. To determine the frequency of LMP-1 variations in European wild-type virus isolates, we sequenced the LMP-1 promoter and gene in EBV from lymphoblastoid cell lines from healthy carriers and patients without EBV-associated disease. Sequence changes were often present, and defined at least four main groups of viral isolates, which we designate Groups A through D...

Escherichia coli rpiA gene encoding ribose phosphate isomerase A

DEFF Research Database (Denmark)

Hove-Jensen, Bjarne; Maigaard, Marianne

1993-01-01

The rpiA gene encoding ribose phosphate isomerase A was cloned from phage 1A2(471) of the Kohara gene library. Subcloning, restriction, and complementation analyses revealed an 1,800-bp SspI-generated DNA fragment that contained the entire control and coding sequences. This DNA fragment was seque...
Genetic distance of Malaysian mousedeer based on mitochondrial DNA cytochrome oxidase I (COI) and D-loop region sequences

Science.gov (United States)

Bakar, Mohamad-Azam Akmal Abu; Rovie-Ryan, Jeffrine Japning; Ampeng, Ahmad; Yaakop, Salmah; Nor, Shukor Md; Md-Zain, Badrul Munir

2018-04-01

Mousedeer are primitive mammals found mainly in Southeast Asia. Two species occur in Malaysia, Tragulus kanchil and Tragulus napu. The two species can be distinguished by size, coat coloration, and throat pattern, but clear diagnostic characters have not yet been established. The objective of this study is to show the genetic distance between T. kanchil and T. napu and among their populations based on the mitochondrial DNA (mtDNA) cytochrome oxidase I (COI) and D-loop regions. A total of 42 mousedeer samples, collected by PERHILITAN from different localities, were used in this study. Another 29 D-loop sequences were retrieved from GenBank for comparative analysis. All samples were amplified by PCR using universal and species-specific primers for the COI and D-loop regions. The amplified sequences were analyzed to determine the genetic distance between T. kanchil and T. napu. The average genetic distance between T. kanchil and T. napu was 0.145 for COI and 0.128 for the D-loop. The genetic distance between populations of T. kanchil based on COI ranged from 0.003 to 0.013. For the D-loop, genetic distance analysis showed a distant relationship between the west-coast populations and the east-coast population of T. kanchil. The COI and D-loop mtDNA regions provided a clear picture of the relationships within the mousedeer species. Finally, studying the molecular genetics of these species can support conservation efforts to protect them and prevent their extinction.
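Pairwise genetic distances of the kind reported here can be sketched as follows. The abstract does not state which substitution model was used, so both the uncorrected p-distance and the Kimura 2-parameter (K2P) distance are shown as assumptions; the sequences are invented and are not mousedeer data.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def p_and_k2p_distance(seq1, seq2):
    """Return (p-distance, K2P distance) for two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    sites = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguous bases
        sites += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1  # A<->G or C<->T
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p_ts = transitions / sites
    q_tv = transversions / sites
    # K2P: d = -1/2 * ln((1 - 2P - Q) * sqrt(1 - 2Q)).
    # math.log raises ValueError for very divergent pairs where the
    # argument is not positive.
    k2p = -0.5 * math.log((1 - 2 * p_ts - q_tv) * math.sqrt(1 - 2 * q_tv))
    return p_ts + q_tv, k2p


# Hypothetical 10-bp fragments with one transition and one transversion.
p_dist, k2p_dist = p_and_k2p_distance("ACGTACGTAC", "GCGTACGTAA")
```

The K2P value is slightly larger than the p-distance because it corrects for multiple substitutions at the same site.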
Informed consent for exome sequencing research in families with genetic disease: the emerging issue of incidental findings.

Science.gov (United States)

Bergner, Amanda L; Bollinger, Juli; Raraigh, Karen S; Tichnell, Crystal; Murray, Brittney; Blout, Carrie Lynn; Telegrafi, Aida Bytyci; James, Cynthia A

2014-11-01

Genomic sequencing technology is increasingly used in genetic research. Studies of informed consent for exome and genome sequencing (ES/GS) research have largely involved hypothetical scenarios or healthy individuals enrolling in population-based studies. Studies have yet to explore the consent experiences of adults with inherited disease. We conducted a qualitative interview study of 15 adults recently enrolled in a large-scale ES/GS study (11 affected adults, four parents of affected children). Our study had two goals: (1) to explore three theoretical barriers to consent for ES/GS research (interpretive/technical complexity, possibility of incidental findings, and risks of loss of privacy); and (2) to explore how interviewees experienced the consent process. Interviewees could articulate study goals and processes, describe incidental findings, discuss risks of privacy loss, and reflect on their consent experience. Few expected the study would identify the genetic cause of their condition. All elected to receive incidental findings. Interviewees acknowledged paying little attention to potential implications of incidental findings in light of more pressing goals of supporting research regarding their own medical conditions. Interviewees suggested that experience living with a genetic condition prepared them to adjust to incidental findings. Interviewees also expressed little concern about loss of confidentiality of study data. Some experienced the consent process as very long. None desired reconsent prior to return of study results. Families with inherited disease likely would benefit from a consent process in which study risks and benefits were discussed in the context of prior experiences with genetic research and genetic disease. © 2014 Wiley Periodicals, Inc.
VRprofile: gene-cluster-detection-based profiling of virulence and antibiotic resistance traits encoded within genome sequences of pathogenic bacteria.

Science.gov (United States)

Li, Jun; Tai, Cui; Deng, Zixin; Zhong, Weihong; He, Yongqun; Ou, Hong-Yu

2017-01-10

VRprofile is a Web server that facilitates rapid investigation of virulence and antibiotic resistance genes, as well as the genetic contexts related to the transfer of these traits, in newly sequenced pathogenic bacterial genomes. The backend database, MobilomeDB, was first built on sets of known gene cluster loci of bacterial type III/IV/VI/VII secretion systems and mobile genetic elements, including integrative and conjugative elements, prophages, class I integrons, IS elements and pathogenicity/antibiotic resistance islands. VRprofile is thus able to co-localize the homologs of these conserved gene clusters using HMMer or BLASTp searches. With the integration of the homologous gene cluster search module with a sequence composition module, VRprofile has exhibited better performance for island-like region predictions than other widely used methods. In addition, VRprofile also provides an integrated Web interface for aligning and visualizing identified gene clusters with MobilomeDB-archived gene clusters, or a variety of bacterial genomes. VRprofile might help meet the increasing demand for re-annotation of bacterial variable regions, and aid in the real-time definition of disease-relevant gene clusters in pathogenic bacteria of interest. VRprofile is freely available at http://bioinfo-mml.sjtu.edu.cn/VRprofile. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A Rickettsia Genome Overrun by Mobile Genetic Elements Provides Insight into the Acquisition of Genes Characteristic of an Obligate Intracellular Lifestyle

Science.gov (United States)

Joardar, Vinita; Williams, Kelly P.; Driscoll, Timothy; Hostetler, Jessica B.; Nordberg, Eric; Shukla, Maulik; Walenz, Brian; Hill, Catherine A.; Nene, Vishvanath M.; Azad, Abdu F.; Sobral, Bruno W.; Caler, Elisabet

2012-01-01

We present the draft genome for the Rickettsia endosymbiont of Ixodes scapularis (REIS), a symbiont of the deer tick vector of Lyme disease in North America. Among Rickettsia species (Alphaproteobacteria: Rickettsiales), REIS has the largest genome sequenced to date (>2 Mb) and contains 2,309 genes across the chromosome and four plasmids (pREIS1 to pREIS4). The most remarkable finding within the REIS genome is the extraordinary proliferation of mobile genetic elements (MGEs), which contributes to a limited synteny with other Rickettsia genomes. In particular, an integrative conjugative element named RAGE (for Rickettsiales amplified genetic element), previously identified in scrub typhus rickettsiae (Orientia tsutsugamushi) genomes, is present on both the REIS chromosome and plasmids. Unlike the pseudogene-laden RAGEs of O. tsutsugamushi, REIS encodes nine conserved RAGEs that include F-like type IV secretion systems similar to that of the tra genes encoded in the Rickettsia bellii and R. massiliae genomes. An unparalleled abundance of encoded transposases (>650) relative to genome size, together with the RAGEs and other MGEs, comprise ∼35% of the total genome, making REIS one of the most plastic and repetitive bacterial genomes sequenced to date. We present evidence that conserved rickettsial genes associated with an intracellular lifestyle were acquired via MGEs, especially the RAGE, through a continuum of genomic invasions. Robust phylogeny estimation suggests REIS is ancestral to the virulent spotted fever group of rickettsiae. As REIS is not known to invade vertebrate cells and has no known pathogenic effects on I. scapularis, its genome sequence provides insight on the origin of mechanisms of rickettsial pathogenicity. PMID:22056929
A haplotype regression approach for genetic evaluation using sequences from the 1000 bull genomes Project

International Nuclear Information System (INIS)

Lakhssassi, K.; González-Recio, O.

2017-01-01

Haplotypes from sequencing data may improve the prediction accuracy in genomic evaluations, as haplotypes are in stronger linkage disequilibrium with quantitative trait loci than markers from SNP chips. This study focuses, first, on the creation of haplotypes in a population sample of 450 Holstein animals with full-sequence data from the 1000 bull genomes project and, second, on incorporating them into the whole-genome prediction model. In total, 38,319,258 SNPs (and indels) from next-generation sequencing were included in the analysis. After filtering out variants with low minor allele frequency (MAF < 0.025), 13,912,326 SNPs were available for haplotype extraction with findhap.f90. The haploblocks contained on average 924 SNPs (166,552 bp). Unique haplotypes made up around 97% in all chromosomes and were ignored, leaving 153,428 haplotypes. Estimated haplotypes made a large contribution to the total variance of genomic estimated breeding values for kilograms of protein, Global Type Index, Somatic Cell Score and Days Open (between 32 and 99.9%). Haploblocks containing haplotypes with large effects were selected by filtering, for each trait, haplotypes whose effect was larger/lower than the mean plus/minus 3 times the standard deviation (SD), and haplotypes 1 SD above the mean of the haplotype effect distribution. Results showed that filtering by 3 SD would not be enough to capture a large proportion of genetic variance, whereas filtering by 1 SD could be useful but model convergence should be considered. Additionally, sequence haplotypes were able to capture genetic variance additional to the polygenic effect for traits undergoing lower selection intensity, like fertility and health traits.
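The two filtering steps described in this abstract (a minor allele frequency cutoff and selection of haplotypes whose effects lie far from the mean) can be sketched as below. This is only an illustration under stated assumptions: the function names are hypothetical, the effect values are invented, and the SD filter is applied symmetrically for both thresholds, which the abstract states explicitly only for the 3 SD case.

```python
from statistics import mean, stdev


def minor_allele_frequency(alt_count, n_alleles):
    """Frequency of the rarer allele at a biallelic variant."""
    freq = alt_count / n_alleles
    return min(freq, 1 - freq)


def passes_maf(alt_count, n_alleles, threshold=0.025):
    """Keep a variant unless its MAF is below the threshold (0.025 in the abstract)."""
    return minor_allele_frequency(alt_count, n_alleles) >= threshold


def large_effect_indices(effects, k):
    """Indices of haplotypes whose effect deviates from the mean by more than k SD."""
    m = mean(effects)
    s = stdev(effects)  # sample standard deviation
    return [i for i, e in enumerate(effects) if abs(e - m) > k * s]


# 450 diploid animals carry 900 allele copies at each autosomal variant.
N_ALLELES = 2 * 450

# Hypothetical haplotype effects: mostly zero, one moderate and one large effect.
toy_effects = [0.0] * 20 + [3.0, 10.0]
strict = large_effect_indices(toy_effects, 3)
lenient = large_effect_indices(toy_effects, 1)
```

In this toy example the 3 SD rule keeps only the largest effect, while the 1 SD rule also keeps the moderate one, mirroring the trade-off the abstract reports between the two thresholds.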
A haplotype regression approach for genetic evaluation using sequences from the 1000 bull genomes Project

Energy Technology Data Exchange (ETDEWEB)

Lakhssassi, K.; González-Recio, O.

2017-07-01

Haplotypes from sequencing data may improve the prediction accuracy in genomic evaluations, as haplotypes are in stronger linkage disequilibrium with quantitative trait loci than markers from SNP chips. This study focuses, first, on the creation of haplotypes in a population sample of 450 Holstein animals with full-sequence data from the 1000 bull genomes project and, second, on incorporating them into the whole-genome prediction model. In total, 38,319,258 SNPs (and indels) from next-generation sequencing were included in the analysis. After filtering out variants with low minor allele frequency (MAF < 0.025), 13,912,326 SNPs were available for haplotype extraction with findhap.f90. The haploblocks contained on average 924 SNPs (166,552 bp). Unique haplotypes made up around 97% in all chromosomes and were ignored, leaving 153,428 haplotypes. Estimated haplotypes made a large contribution to the total variance of genomic estimated breeding values for kilograms of protein, Global Type Index, Somatic Cell Score and Days Open (between 32 and 99.9%). Haploblocks containing haplotypes with large effects were selected by filtering, for each trait, haplotypes whose effect was larger/lower than the mean plus/minus 3 times the standard deviation (SD), and haplotypes 1 SD above the mean of the haplotype effect distribution. Results showed that filtering by 3 SD would not be enough to capture a large proportion of genetic variance, whereas filtering by 1 SD could be useful but model convergence should be considered. Additionally, sequence haplotypes were able to capture genetic variance additional to the polygenic effect for traits undergoing lower selection intensity, like fertility and health traits.
Evolutionary dynamics of an expressed MHC class IIβ locus in the Ranidae (Anura) uncovered by genome walking and high-throughput amplicon sequencing

Science.gov (United States)

Mulder, Kevin P.; Cortazar-Chinarro, Maria; Harris, D. James; Crottini, Angelica; Grant, Evan H. Campbell; Fleischer, Robert C.; Savage, Anna E.

2017-01-01

The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa.
Accidental genetic engineers: horizontal sequence transfer from parasitoid wasps to their Lepidopteran hosts.

Directory of Open Access Journals (Sweden)

Sean E Schneider

We show here that 105 regions in two Lepidoptera genomes appear to derive from horizontally transferred wasp DNA. We experimentally verified the presence of two of these sequences in a diverse set of silkworm (Bombyx mori) genomes. We hypothesize that these horizontal transfers are made possible by the unusual strategy many parasitoid wasps employ of injecting hosts with endosymbiotic polydnaviruses to minimize the host's defense response. Because these virus-like particles deliver wasp DNA to the cells of the host, there has been much interest in whether genetic information can be permanently transferred from the wasp to the host. Two transferred sequences code for a BEN domain, known to be associated with polydnaviruses and transcriptional regulation. These findings represent the first documented cases of horizontal transfer of genes between two organisms by a polydnavirus. This presents an interesting evolutionary paradigm in which host species can acquire new sequences from parasitoid wasps that attack them. Hymenoptera and Lepidoptera diverged ∼300 MYA, making this type of event a source of novel sequences for recipient species. Unlike many other cases of horizontal transfer between two eukaryote species, these sequence transfers can be explained without the need to invoke the sequences 'hitchhiking' on a third organism (e.g. retrovirus) capable of independent reproduction. The cellular machinery necessary for the transfer is contained entirely in the wasp genome. The work presented here is the first such discovery of what is likely to be a broader phenomenon among species affected by these wasps.
Calling genotypes from public RNA-sequencing data enables identification of genetic variants that affect gene-expression levels

NARCIS (Netherlands)

Deelen, Patrick; Zhernakova, Daria V.; de Haan, Mark; van der Sijde, Marijke; Bonder, Marc Jan; Karjalainen, Juha; van der Velde, K. Joeri; Abbott, Kristin M.; Fu, Jingyuan; Wijmenga, Cisca; Sinke, Richard J.; Swertz, Morris A.; Franke, Lude

2015-01-01

Background: RNA-sequencing (RNA-seq) is a powerful technique for the identification of genetic variants that affect gene-expression levels, either through expression quantitative trait locus (eQTL) mapping or through allele-specific expression (ASE) analysis. Given increasing numbers of RNA-seq
Alignment-free microbial phylogenomics under scenarios of sequence divergence, genome rearrangement and lateral genetic transfer.

Science.gov (United States)

Bernard, Guillaume; Chan, Cheong Xin; Ragan, Mark A

2016-07-01

Alignment-free (AF) approaches have recently been highlighted as alternatives to methods based on multiple sequence alignment in phylogenetic inference. However, the sensitivity of AF methods to genome-scale evolutionary scenarios is little known. Here, using simulated microbial genome data we systematically assess the sensitivity of nine AF methods to three important evolutionary scenarios: sequence divergence, lateral genetic transfer (LGT) and genome rearrangement. Among these, AF methods are most sensitive to the extent of sequence divergence, less sensitive to low and moderate frequencies of LGT, and most robust against genome rearrangement. We describe the application of AF methods to three well-studied empirical genome datasets, and introduce a new application of the jackknife to assess node support. Our results demonstrate that AF phylogenomics is computationally scalable to multi-genome data and can generate biologically meaningful phylogenies and insights into microbial evolution.
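As a generic illustration of the alignment-free idea, the sketch below compares sequences through their sets of k-mers and converts the overlap into a Jaccard distance. It is not claimed to be one of the nine methods assessed in the study, and the value of `k` and the toy sequences are arbitrary choices for the example.

```python
def kmer_set(seq, k):
    """All distinct substrings of length k in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(seq_a, seq_b, k=4):
    """1 - |shared k-mers| / |all k-mers|, computed without any alignment."""
    kmers_a, kmers_b = kmer_set(seq_a, k), kmer_set(seq_b, k)
    union = kmers_a | kmers_b
    if not union:
        raise ValueError("sequences are shorter than k")
    return 1 - len(kmers_a & kmers_b) / len(union)


def distance_matrix(named_seqs, k=4):
    """Pairwise distances keyed by (name, name); input is a dict of name -> sequence."""
    names = list(named_seqs)
    return {
        (x, y): jaccard_distance(named_seqs[x], named_seqs[y], k)
        for x in names
        for y in names
    }


# Hypothetical toy sequences.
genomes = {"g1": "ACGTACGTTTGA", "g2": "ACGTACGATTGA", "g3": "TTTTCCCCGGGG"}
dists = distance_matrix(genomes, k=3)
```

The resulting matrix could be passed to a distance-based tree builder such as neighbor joining. Because k-mer content is largely preserved when blocks of sequence are reordered, measures of this kind tend to tolerate rearrangement better than point substitutions, which is consistent with the sensitivity ranking reported in the abstract.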
Phylogeny and genetic diversity of Bridgeoporus nobilissimus inferred using mitochondrial and nuclear rDNA sequences

Science.gov (United States)

Redberg, G.L.; Hibbett, D.S.; Ammirati, J.F.; Rodriguez, R.J.

2003-01-01

The genetic diversity and phylogeny of Bridgeoporus nobilissimus have been analyzed. DNA was extracted from spores collected from individual fruiting bodies representing six geographically distinct populations in Oregon and Washington. Spore samples collected contained low levels of bacteria, yeast and a filamentous fungal species. Using taxon-specific PCR primers, it was possible to discriminate among rDNA from bacteria, yeast, a filamentous associate and B. nobilissimus. Nuclear rDNA internal transcribed spacer (ITS) region sequences of B. nobilissimus were compared among individuals representing six populations and were found to have less than 2% variation. These sequences also were used to design dual and nested PCR primers for B. nobilissimus-specific amplification. Mitochondrial small-subunit rDNA sequences were used in a phylogenetic analysis that placed B. nobilissimus in the hymenochaetoid clade, where it was associated with Oxyporus and Schizopora.
Human growth hormone-related iatrogenic Creutzfeldt-Jakob disease: Search for a genetic susceptibility by analysis of the PRNP coding region

Energy Technology Data Exchange (ETDEWEB)

Jaegly, A.; Boussin, F.; Deslys, J.P. [CEA/CRSSA/DSV/DPTE, Fontenay-aux-Roses (France)] [and others]

1995-05-20

The human PRNP gene encoding PrP is located on chromosome 20 and consists of two exons and a single intron. The open reading frame is entirely fitted into the second exon. Genetic studies indicate that all of the familial and several sporadic forms of TSSEs are associated with mutations in the PRNP 759-bp coding region. Moreover, homozygosity at codon 129, a locus harboring a polymorphism among the general population, was proposed as a genetic susceptibility marker for both sporadic and iatrogenic CJD. To assess whether additional genetic predisposition markers exist in the PRNP gene, the authors sequenced the PRNP coding region of 17 of the 32 French patients who developed a hGH-related CJD.
Characterization of the dsDNA prophage sequences in the genome of Neisseria gonorrhoeae and visualization of productive bacteriophage

Directory of Open Access Journals (Sweden)

Maugel Timothy K

2007-07-01

Background: Bioinformatic analysis of the genome sequence of Neisseria gonorrhoeae revealed the presence of nine probable prophage islands. The distribution, conservation and function of many of these sequences, and their ability to produce bacteriophage particles, are unknown. Results: Our analysis of the genomic sequence of FA1090 identified five genomic regions (NgoΦ1 – 5) that are related to dsDNA lysogenic phage. The genetic content of the dsDNA prophage sequences was examined in detail and found to contain blocks of genes encoding proteins homologous to proteins responsible for phage DNA replication, structural proteins and proteins responsible for phage assembly. The DNA sequences from NgoΦ1, NgoΦ2 and NgoΦ3 contain some significant regions of identity. A unique region of NgoΦ2 showed very high similarity with the Pseudomonas aeruginosa generalized transducing phage F116. Comparative analysis at the nucleotide and protein levels suggests that the sequences of NgoΦ1 and NgoΦ2 encode functionally active phages, while NgoΦ3, NgoΦ4 and NgoΦ5 encode incomplete genomes. Expression of the NgoΦ1 and NgoΦ2 repressors in Escherichia coli inhibits the growth of E. coli and the propagation of phage λ. The NgoΦ2 repressor was able to inhibit transcription of N. gonorrhoeae genes and Haemophilus influenzae HP1 phage promoters. The holin gene of NgoΦ1 (identical to that encoded by NgoΦ2), when expressed in E. coli, could serve as a substitute for the phage λ s gene. We were able to detect the presence of the DNA derived from NgoΦ1 in the cultures of N. gonorrhoeae. Electron microscopy analysis of culture supernatants revealed the presence of multiple forms of bacteriophage particles. Conclusion: These data suggest that the genes similar to dsDNA lysogenic phage present in the gonococcus are generally conserved in this pathogen and that they are able to regulate the expression of other neisserial genes. Since phage particles were
Genetic structure of Florida green turtle rookeries as indicated by mitochondrial DNA control region sequences

Science.gov (United States)

Shamblin, Brian M.; Bagley, Dean A.; Ehrhart, Llewellyn M.; Desjardin, Nicole A.; Martin, R. Erik; Hart, Kristen M.; Naro-Maciel, Eugenia; Rusenko, Kirt; Stiner, John C.; Sobel, Debra; Johnson, Chris; Wilmers, Thomas; Wright, Laura J.; Nairn, Campbell J.

2014-01-01

Green turtle (Chelonia mydas) nesting has increased dramatically in Florida over the past two decades, ranking the Florida nesting aggregation among the largest in the Greater Caribbean region. Individual beaches that comprise several hundred kilometers of Florida’s east coast and Keys support tens to thousands of nests annually. These beaches encompass natural to highly developed habitats, and the degree of demographic partitioning among rookeries was previously unresolved. We characterized the genetic structure of ten Florida rookeries from Cape Canaveral to the Dry Tortugas through analysis of 817 base pair mitochondrial DNA (mtDNA) control region sequences from 485 nesting turtles. Two common haplotypes, CM-A1.1 and CM-A3.1, accounted for 87 % of samples, and the haplotype frequencies were strongly partitioned by latitude along Florida’s Atlantic coast. Most genetic structure occurred between rookeries on either side of an apparent genetic break in the vicinity of the St. Lucie Inlet that separates Hutchinson Island and Jupiter Island, representing the finest scale at which mtDNA structure has been documented in marine turtle rookeries. Florida and Caribbean scale analyses of population structure support recognition of at least two management units: central eastern Florida and southern Florida. More thorough sampling and deeper sequencing are necessary to better characterize connectivity among Florida green turtle rookeries as well as between the Florida nesting aggregation and others in the Greater Caribbean region.
Phylogeny and evolution of the auks (subfamily Alcinae) based on mitochondrial DNA sequences

Science.gov (United States)

Moum, Truls; Johansen, Steinar; Erikstad, Kjell Einar; Piatt, John F.

1994-01-01

The genetic divergence and phylogeny of the auks was assessed by mitochondrial DNA sequence comparisons in a study using 19 of the 22 auk species and two outgroup representatives. We compared more than 500 nucleotides from each of two mitochondrial genes encoding 12S rRNA and the NADH dehydrogenase subunit 6. Divergence times were estimated from transversional substitutions. The dovekie (Alle alle) is related to the razorbill (Alca torda) and the murres (Uria spp). Furthermore, the Xantus's murrelet (Synthliboramphus hypoleucus) and the ancient (Synthliboramphus antiquus) and Japanese murrelets (Synthliboramphus wumizusume) are genetically distinct members of the same main lineage, whereas brachyramphine and synthliboramphine murrelets are not closely related. An early adaptive radiation of six main species groups of auks seems to trace back to Middle Miocene. Later speciation probably involved ecological differentiations and geographical isolations.
Extreme expansion of NBS-encoding genes in Rosaceae.

Science.gov (United States)

Jia, YanXiao; Yuan, Yang; Zhang, Yanchun; Yang, Sihai; Zhang, Xiaohui

2015-05-03

Nucleotide binding site leucine-rich repeat (NBS-LRR) genes encode a large class of disease resistance (R) proteins in plants. Extensive studies have been carried out to identify and investigate NBS-encoding gene families in many important plant species. However, no comprehensive research into NBS-encoding genes in the Rosaceae has been performed. In this study, five whole-genome sequenced Rosaceae species, including apple, pear, peach, mei, and strawberry, were analyzed to investigate the evolutionary pattern of NBS-encoding genes and to compare them to those of three Cucurbitaceae species, cucumber, melon, and watermelon. Considerable differences in the copy number of NBS-encoding genes were observed between Cucurbitaceae and Rosaceae species. In Rosaceae species, a large number and a high proportion of NBS-encoding genes were observed in peach (437, 1.52%), mei (475, 1.51%), strawberry (346, 1.05%) and pear (617, 1.44%), and apple contained a whopping 1303 (2.05%) NBS-encoding genes, which might be the highest number of R-genes among all diploid plants reported so far. However, no more than 100 NBS-encoding genes were identified in Cucurbitaceae. Many more species-specific gene families were classified and detected with the signature of positive selection in Rosaceae species, especially in the apple genome. Taken together, our findings indicate that NBS-encoding genes in Rosaceae, especially in apple, have undergone extreme expansion and rapid adaptive evolution. Useful information was provided for further research on the evolutionary mode of disease resistance genes in Rosaceae crops.
Imaging intracellular pH in live cells with a genetically encoded red fluorescent protein sensor.

Science.gov (United States)

Tantama, Mathew; Hung, Yin Pun; Yellen, Gary

2011-07-06

Intracellular pH affects protein structure and function, and proton gradients underlie the function of organelles such as lysosomes and mitochondria. We engineered a genetically encoded pH sensor by mutagenesis of the red fluorescent protein mKeima, providing a new tool to image intracellular pH in live cells. This sensor, named pHRed, is the first ratiometric, single-protein red fluorescent sensor of pH. Fluorescence emission of pHRed peaks at 610 nm while exhibiting dual excitation peaks at 440 and 585 nm that can be used for ratiometric imaging. The intensity ratio responds with an apparent pK(a) of 6.6 and a >10-fold dynamic range. Furthermore, pHRed has a pH-responsive fluorescence lifetime that changes by ~0.4 ns over physiological pH values and can be monitored with single-wavelength two-photon excitation. After characterizing the sensor, we tested pHRed's ability to monitor intracellular pH by imaging energy-dependent changes in cytosolic and mitochondrial pH.
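As a hedged illustration of how a ratiometric readout such as the one described above might be converted to pH, the sketch below inverts a single-site titration curve using the reported apparent pK(a) of 6.6. The function names, the direction of the response (ratio rising with pH), the unit Hill coefficient, and the `r_min`/`r_max` calibration end points are all assumptions for illustration and are not taken from the study.

```python
import math

PKA_APPARENT = 6.6  # apparent pK(a) reported in the abstract


def ph_to_ratio(ph, r_min, r_max, pka=PKA_APPARENT):
    """Assumed single-site model: the ratio rises sigmoidally with pH.

    r_min and r_max are hypothetical calibration end points
    (ratio at very acidic and very alkaline pH, respectively).
    """
    return r_min + (r_max - r_min) / (1 + 10 ** (pka - ph))


def ratio_to_ph(ratio, r_min, r_max, pka=PKA_APPARENT):
    """Invert the assumed titration curve to recover pH from a measured ratio."""
    if not r_min < ratio < r_max:
        raise ValueError("ratio must lie strictly between r_min and r_max")
    return pka - math.log10((r_max - ratio) / (ratio - r_min))
```

With hypothetical end points `r_min=1.0` and `r_max=10.0` (a 10-fold range), a measured ratio halfway between them maps back to the apparent pK(a). In practice the end points would come from an in situ calibration rather than from fixed constants.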
Genetic diversity among Puccinia melanocephala isolates from Brazil assessed using simple sequence repeat markers.

Science.gov (United States)

Peixoto-Junior, R F; Creste, S; Landell, M G A; Nunes, D S; Sanguino, A; Campos, M F; Vencovsky, R; Tambarussi, E V; Figueira, A

2014-09-26

Brown rust (causal agent Puccinia melanocephala) is an important sugarcane disease that is responsible for large losses in yield worldwide. Despite its importance, little is known regarding the genetic diversity of this pathogen in the main Brazilian sugarcane cultivation areas. In this study, we characterized the genetic diversity of 34 P. melanocephala isolates from 4 Brazilian states using loci identified from an enriched simple sequence repeat (SSR) library. The aggressiveness of 3 isolates from major sugarcane cultivation areas was evaluated by inoculating an intermediately resistant and a susceptible cultivar. From the enriched library, 16 SSR-specific primers were developed, which produced scorable alleles. Of these, 4 loci were polymorphic and 12 were monomorphic for all isolates evaluated. The molecular characterization of the 34 isolates of P. melanocephala conducted using 16 SSR loci revealed the existence of low genetic variability among the isolates. The average estimated genetic distance was 0.12. Phenetic analysis based on Nei's genetic distance clustered the isolates into 2 major groups. Groups I and II included 18 and 14 isolates, respectively, and both groups contained isolates from all 4 geographic regions studied. Two isolates did not cluster with these groups. It was not possible to obtain clusters according to location or state of origin. Analysis of disease severity data revealed that the isolates did not show significant differences in aggressiveness between regions.
A deep learning method for lincRNA detection using auto-encoder algorithm.

Science.gov (United States)

Yu, Ning; Yu, Zeng; Pan, Yi

2017-12-06

RNA sequencing (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meanwhile, knowledge-based technologies are experiencing a potential revolution ignited by new deep learning methods. By scanning newly found RNA-seq data sets, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, relevance exists along the DNA sequences; (2) lincRNAs contain some conserved patterns/motifs tethered together by non-conserved regions. These two lines of evidence motivate adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than messenger RNAs are generated. That is, the transcribed RNAs participate in biological processes as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of the predictive deep neural network on the lincRNA data sets compared with a support vector machine and a traditional neural network. In addition, the method is validated on the newly discovered lincRNA data set, and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that the deep learning method has broad capability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly

An Information Theoretic Characterisation of Auditory Encoding

Science.gov (United States)

Overath, Tobias; Cusack, Rhodri; Kumar, Sukhbinder; von Kriegstein, Katharina; Warren, Jason D; Grube, Manon; Carlyon, Robert P; Griffiths, Timothy D

2007-01-01

The entropy metric derived from information theory provides a means to quantify the amount of information transmitted in acoustic streams like speech or music. By systematically varying the entropy of pitch sequences, we sought brain areas where neural activity and energetic demands increase as a function of entropy. Such a relationship is predicted to occur in an efficient encoding mechanism that uses less computational resource when less information is present in the signal: we specifically tested the hypothesis that such a relationship is present in the planum temporale (PT). In two convergent functional MRI studies, we demonstrated this relationship in PT for encoding, while furthermore showing that a distributed fronto-parietal network for retrieval of acoustic information is independent of entropy. The results establish PT as an efficient neural engine that demands less computational resource to encode redundant signals than those with high information content. PMID:17958472
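To make the entropy metric mentioned above concrete, the following minimal sketch computes the zeroth-order Shannon entropy (in bits per symbol) of a pitch sequence from the relative frequencies of its pitches. This is a simplified illustration only: the function name is hypothetical, the pitches are assumed to be encoded as MIDI note numbers, and the study's own stimulus construction and entropy estimate are not described in the abstract and may differ.

```python
import math
from collections import Counter


def shannon_entropy(sequence):
    """Zeroth-order Shannon entropy in bits per symbol.

    Treats each element (here, a pitch) as an independent draw and
    uses the observed relative frequencies as probabilities.
    """
    counts = Counter(sequence)
    total = len(sequence)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


# A fully redundant sequence (one repeated pitch) carries no information,
# while four equally likely pitches carry two bits per note.
redundant = [60] * 8
varied = [60, 62, 64, 65] * 2
print(shannon_entropy(redundant), shannon_entropy(varied))
```

Under this simple measure, a sequence drawn from more, and more evenly used, pitches has higher entropy, which is the sense in which a signal contains "more information" in the abstract above.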
Multi-Temporal Land Cover Classification with Sequential Recurrent Encoders

Science.gov (United States)

Rußwurm, Marc; Körner, Marco

2018-03-01

Earth observation (EO) sensors deliver data with daily or weekly temporal resolution. Most land use and land cover (LULC) approaches, however, expect cloud-free and mono-temporal observations. The increasing temporal capabilities of today's sensors enable the use of temporal, along with spectral and spatial, features. Domains such as speech recognition or neural machine translation work with inherently temporal data and, today, achieve impressive results using sequential encoder-decoder structures. Inspired by these sequence-to-sequence models, we adapt an encoder structure with convolutional recurrent layers in order to approximate a phenological model for vegetation classes based on a temporal sequence of Sentinel 2 (S2) images. In our experiments, we visualize internal activations over a sequence of cloudy and non-cloudy images and find several recurrent cells, which reduce the input activity for cloudy observations. Hence, we assume that our network has learned cloud-filtering schemes solely from input data, which could alleviate the need for tedious cloud-filtering as a preprocessing step for many EO approaches. Moreover, using unfiltered temporal series of top-of-atmosphere (TOA) reflectance data, we achieved in our experiments state-of-the-art classification accuracies on a large number of crop classes with minimal preprocessing compared to other classification approaches.
Human pro α1(III) collagen: cDNA sequence for the 3' end

Energy Technology Data Exchange (ETDEWEB)

Mankoo, B S; Dalgleish, R

1988-03-25

The authors have previously isolated two overlapping cDNA clones, pIII-21 and pIII-33, which encode the C-terminal end of human type III procollagen. They now present the sequence of 2520 bases encoded in these cDNAs which overlaps other previously published sequences for the same gene. The sequence presented differs from previously published sequences at five positions.
A linear-encoding model explains the variability of the target morphology in regeneration

Science.gov (United States)

Lobo, Daniel; Solano, Mauricio; Bubenik, George A.; Levin, Michael

2014-01-01

A fundamental assumption of today's molecular genetics paradigm is that complex morphology emerges from the combined activity of low-level processes involving proteins and nucleic acids. An inherent characteristic of such nonlinear encodings is the difficulty of creating the genetic and epigenetic information that will produce a given self-assembling complex morphology. This ‘inverse problem’ is vital not only for understanding the evolution, development and regeneration of bodyplans, but also for synthetic biology efforts that seek to engineer biological shapes. Importantly, the regenerative mechanisms in deer antlers, planarian worms and fiddler crabs can solve an inverse problem: their target morphology can be altered specifically and stably by injuries in particular locations. Here, we discuss the class of models that use pre-specified morphological goal states and propose the existence of a linear encoding of the target morphology, making the inverse problem easy for these organisms to solve. Indeed, many model organisms such as Drosophila, hydra and Xenopus also develop according to nonlinear encodings producing linear encodings of their final morphologies. We propose the development of testable models of regeneration regulation that combine emergence with a top-down specification of shape by linear encodings of target morphology, driving transformative applications in biomedicine and synthetic bioengineering. PMID:24402915
Shotgun protein sequencing.

Energy Technology Data Exchange (ETDEWEB)

Faulon, Jean-Loup Michel; Heffelfinger, Grant S.

2009-06-01

A novel experimental and computational technique based on multiple enzymatic digestion of a protein or protein mixture that reconstructs protein sequences from sequences of overlapping peptides is described in this SAND report. This approach, analogous to shotgun sequencing of DNA, is to be used to sequence alternative spliced proteins, to identify post-translational modifications, and to sequence genetically engineered proteins.
Recent advances in the molecular genetics of the lignin-degrading fungus, Phanerochaete chrysosporium

International Nuclear Information System (INIS)

Covert, S.F.

1991-01-01

During the past several years, molecular genetics research on Phanerochaete chrysosporium, a white-rot basidiomycete, has increased dramatically. It is known that families of highly homologous, clustered genes encode the lignin peroxidases. The same appears to be true of the exocellobiohydrolase genes. Functional domains and active sites have been tentatively identified from the deduced amino acid sequences of these genes. Current investigations focus on elucidating the genomic organization of gene families, the mechanism(s) of gene regulation, and the role and interaction of specific gene products in lignocellulose degradation. (author)
Development of cleaved amplified polymorphic sequence markers and a CAPS-based genetic linkage map in watermelon (Citrullus lanatus [Thunb.] Matsum. and Nakai) constructed using whole-genome re-sequencing data.

Science.gov (United States)

Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R; Wang, Xiaolu

2016-03-01

Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177, were re-sequenced and analyzed with a custom Perl script for CAPS marker development. Of the assembled sequences of the two parental materials, 88.7% and 78.5%, respectively, could be mapped to the reference watermelon genome. Comparative analysis of the assembled genome data provided 225,693 and 19,268 SNPs and indels between the two materials. A total of 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. Twelve QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly developed CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon.
Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes

Science.gov (United States)

Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, bioinformatic analysis of the genome sequence of Enterobacter aerogenes was carried out to identify genes encoding gelatinase. E. aerogenes was isolated from hot spring water and is a gelatinase-producing bacterium with species-specific activity towards porcine and fish gelatin. This bacterium offers the possibility of producing enzymes that are specific to the gelatin of each of these species. E. aerogenes was partially genome sequenced, yielding a total sequence size of 5.0 megabase pairs (Mbp). The pre-processing pipeline produced 87.6 Mbp of total reads and 68.8 Mbp of high-quality reads, a high-quality percentage of 78.58%. Genome assembly produced 120 contigs, with 67.5% of contigs over 1 kilobase pair (kbp), an N50 contig length of 124,856 bp and a GC content of 55.17%. About 4705 protein-coding genes were identified by protein prediction analysis. Two candidate genes were selected that had the highest percentage identity to gelatinase enzymes available in the Swiss-Prot and NCBI online databases. They were NODE_9_length_26866_cov_148.013245_12, containing a 1029 base pair (bp) sequence encoding 342 amino acids, and NODE_24_length_155103_cov_177.082458_62, containing a 717 bp sequence encoding 238 amino acids. Thus, two pairs of primers (forward and reverse) were designed based on the open reading frames (ORFs) of the selected genes. Through genome analysis of E. aerogenes, genes encoding gelatinase were identified.
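The assembly statistics quoted above (N50 contig length and GC content) follow standard definitions, which the sketch below illustrates. It is a minimal example under stated assumptions: the function names are hypothetical, the inputs are a plain list of contig lengths and a plain nucleotide string, and it is not the pipeline the authors used.

```python
def n50(contig_lengths):
    """Return the N50: the length L such that contigs of length >= L
    together cover at least half of the total assembly size."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no contigs given")


def gc_percent(sequence):
    """Percentage of G and C bases in a nucleotide string (case-insensitive)."""
    if not sequence:
        raise ValueError("empty sequence")
    seq = sequence.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)
```

For a toy assembly with contig lengths `[2, 2, 2, 3, 3, 4, 8, 8]` (total 32), the two longest contigs already cover half the assembly, so `n50` returns 8.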
Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

Science.gov (United States)

Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

2011-09-01

Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.
Dog Y chromosomal DNA sequence: identification, sequencing and SNP discovery

Directory of Open Access Journals (Sweden)

Kirkness Ewen

2006-10-01

Background: Population genetic studies of dogs have so far mainly been based on analysis of mitochondrial DNA, describing only the history of female dogs. To get a picture of the male history, as well as a second independent marker, there is a need for studies of biallelic Y-chromosome polymorphisms. However, there are no biallelic polymorphisms reported, and only 3200 bp of non-repetitive dog Y-chromosome sequence deposited in GenBank, necessitating the identification of dog Y chromosome sequence and the search for polymorphisms therein. The genome has been only partially sequenced for one male dog, disallowing mapping of the sequence into specific chromosomes. However, by comparing the male genome sequence to the complete female dog genome sequence, candidate Y-chromosome sequence may be identified by exclusion. Results: The male dog genome sequence was analysed by Blast search against the human genome to identify sequences with a best match to the human Y chromosome and to the female dog genome to identify those absent in the female genome. Candidate sequences were then tested for male specificity by PCR of five male and five female dogs. 32 sequences from the male genome, with a total length of 24 kbp, were identified as male specific, based on a match to the human Y chromosome, absence in the female dog genome and male specific PCR results. 14437 bp were then sequenced for 10 male dogs originating from Europe, Southwest Asia, Siberia, East Asia, Africa and America. Nine haplotypes were found, which were defined by 14 substitutions. The genetic distance between the haplotypes indicates that they originate from at least five wolf haplotypes. There was no obvious trend in the geographic distribution of the haplotypes. Conclusion: We have identified 24159 bp of dog Y-chromosome sequence to be used for population genetic studies. We sequenced 14437 bp in a worldwide collection of dogs, identifying 14 SNPs for future SNP analyses, and
Identification and validation of human papillomavirus encoded microRNAs.

Directory of Open Access Journals (Sweden)

Kui Qian

We report here identification and validation of the first papillomavirus encoded microRNAs expressed in human cervical lesions and cell lines. We established small RNA libraries from ten human papillomavirus associated cervical lesions including cancer and two human papillomavirus harboring cell lines. These libraries were sequenced using SOLiD 4 technology. We used the sequencing data to predict putative viral microRNAs and discovered nine putative papillomavirus encoded microRNAs. Validation was performed for five candidates, four of which were successfully validated by qPCR from cervical tissue samples and cell lines: two were encoded by HPV 16, one by HPV 38 and one by HPV 68. The expression of HPV 16 microRNAs was further confirmed by in situ hybridization, and colocalization with p16INK4A was established. Prediction of cellular target genes of HPV 16 encoded microRNAs suggests that they may play a role in cell cycle, immune functions, cell adhesion and migration, development, and cancer. Two putative viral target sites for the two validated HPV 16 miRNAs were mapped to the E5 gene, one in the E1 gene, two in the L1 gene and one in the LCR region. This is the first report to show that papillomaviruses encode their own microRNA species. Importantly, microRNAs were found in libraries established from human cervical disease and carcinoma cell lines, and their expression was confirmed in additional tissue samples. To our knowledge, this is also the first paper to use in situ hybridization to show the expression of a viral microRNA in human tissue.
Cloning and sequence of the gene encoding a cefotaxime-hydrolyzing class A beta-lactamase isolated from Escherichia coli.

Science.gov (United States)

Ishii, Y; Ohno, A; Taguchi, H; Imajo, S; Ishiguro, M; Matsuzawa, H

1995-01-01

Escherichia coli TUH12191, which is resistant to piperacillin, cefazolin, cefotiam, ceftizoxime, cefuzonam, and aztreonam but is susceptible to cefoxitin, latamoxef, flomoxef, and imipenem, was isolated from the urine of a patient treated with beta-lactam antibiotics. The beta-lactamase (Toho-1) purified from the bacteria had a pI of 7.8, had a molecular weight of about 29,000, and hydrolyzed beta-lactam antibiotics such as penicillin G, ampicillin, oxacillin, carbenicillin, piperacillin, cephalothin, cefoxitin, cefotaxime, ceftazidime, and aztreonam. Toho-1 was markedly inhibited by beta-lactamase inhibitors such as clavulanic acid and tazobactam. Resistance to beta-lactams, streptomycin, spectinomycin, sulfamethoxazole, and trimethoprim was transferred by conjugational transfer from E. coli TUH12191 to E. coli ML4903, and the transferred plasmid was about 58 kbp, belonging to incompatibility group M. The cefotaxime resistance gene for Toho-1 was subcloned from the 58-kbp plasmid by transformation of E. coli MV1184. The sequence of the gene for Toho-1 was determined, and the open reading frame of the gene consisted of 873 or 876 bases (initial sequence, ATGATG). The nucleotide sequence of the gene (DDBJ accession number D37830) was found to be about 73% homologous to the sequence of the gene encoding a class A beta-lactamase produced by Klebsiella oxytoca E23004. According to the amino acid sequence deduced from the DNA sequence, the precursor consisted of 290 or 291 amino acid residues, which contained amino acid motifs common to class A beta-lactamases (70SXXK, 130SDN, and 234KTG). Toho-1 was about 83% homologous to the beta-lactamase mediated by the chromosome of K. oxytoca D488 and the beta-lactamase mediated by the plasmid of E. coli MEN-1. Therefore, the newly isolated beta-lactamase Toho-1 produced by E. coli TUH12191 is similar to beta-lactamases produced by K. oxytoca D488, K. oxytoca E23004, and E. coli MEN-1 rather than to mutants of TEM or SHV enzymes
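The two ORF lengths and the two precursor lengths quoted above are mutually consistent if the base count includes the stop codon: each codon is three bases, and the stop codon does not contribute a residue. The short sketch below works through that arithmetic. The function name and the `includes_stop` flag are hypothetical, and the assumption that the quoted ORF lengths include the stop codon is inferred from the numbers rather than stated in the abstract.

```python
def protein_length(orf_bases, includes_stop=True):
    """Number of amino acid residues encoded by an ORF of the given length.

    Assumes a complete ORF whose length is a multiple of three; if the
    stop codon is counted in the length, it is subtracted because it
    does not encode a residue.
    """
    if orf_bases <= 0 or orf_bases % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bases // 3
    return codons - 1 if includes_stop else codons


# The two alternative start codons (initial sequence ATGATG) differ by
# one codon, hence 873 vs 876 bases and 290 vs 291 residues.
for bases in (873, 876):
    print(bases, "bases ->", protein_length(bases), "residues")
```

The same arithmetic applies to any complete ORF, so it can serve as a quick sanity check when a record reports both nucleotide and amino acid lengths.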
Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.

Science.gov (United States)

Aflitos, Saulo; Schijlen, Elio; de Jong, Hans; de Ridder, Dick; Smit, Sandra; Finkers, Richard; Wang, Jun; Zhang, Gengyun; Li, Ning; Mao, Likai; Bakker, Freek; Dirks, Rob; Breit, Timo; Gravendeel, Barbara; Huits, Henk; Struss, Darush; Swanson-Wagner, Ruth; van Leeuwen, Hans; van Ham, Roeland C H J; Fito, Laia; Guignier, Laëtitia; Sevilla, Myrna; Ellul, Philippe; Ganko, Eric; Kapur, Arvind; Reclus, Emannuel; de Geus, Bernard; van de Geest, Henri; Te Lintel Hekkert, Bas; van Haarst, Jan; Smits, Lars; Koops, Andries; Sanchez-Perez, Gabino; van Heusden, Adriaan W; Visser, Richard; Quan, Zhiwu; Min, Jiumeng; Liao, Li; Wang, Xiaoli; Wang, Guangbiao; Yue, Zhen; Yang, Xinhua; Xu, Na; Schranz, Eric; Smets, Erik; Vos, Rutger; Rauwerda, Johan; Ursem, Remco; Schuit, Cees; Kerns, Mike; van den Berg, Jan; Vriezen, Wim; Janssen, Antoine; Datema, Erwin; Jahrman, Torben; Moquet, Frederic; Bonnet, Julien; Peters, Sander

2014-10-01

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Genetic diversity among five T4-like bacteriophages

Directory of Open Access Journals (Sweden)

Bertrand Claire

2006-05-01

Background: Bacteriophages are an important repository of genetic diversity. As one of the major constituents of terrestrial biomass, they exert profound effects on the earth's ecology and microbial evolution by mediating horizontal gene transfer between bacteria and controlling their growth. Only limited genomic sequence data are currently available for phages but even this reveals an overwhelming diversity in their gene sequences and genomes. The contribution of the T4-like phages to this overall phage diversity is difficult to assess, since only a few examples of complete genome sequence exist for these phages. Our analysis of five T4-like genomes represents half of the known T4-like genomes in GenBank. Results: Here, we have examined in detail the genetic diversity of the genomes of five relatives of bacteriophage T4: the Escherichia coli phages RB43, RB49 and RB69, the Aeromonas salmonicida phage 44RR2.8t (or 44RR) and the Aeromonas hydrophila phage Aeh1. Our data define a core set of conserved genes common to these genomes as well as hundreds of additional open reading frames (ORFs) that are nonconserved. Although some of these ORFs resemble known genes from bacterial hosts or other phages, most show no significant similarity to any known sequence in the databases. The five genomes analyzed here all have similarities in gene regulation to T4. Sequence motifs resembling T4 early and late consensus promoters were observed in all five genomes. In contrast, only two of these genomes, RB69 and 44RR, showed similarities to T4 middle-mode promoter sequences and to the T4 motA gene product required for their recognition. In addition, we observed that each phage differed in the number and assortment of putative genes encoding host-like metabolic enzymes, tRNA species, and homing endonucleases. Conclusion: Our observations suggest that evolution of the T4-like phages has drawn on a highly diverged pool of genes in the microbial world. The T4
Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.

Science.gov (United States)

Wang, Ying; Ding, Jun; Daniell, Henry; Hu, Haiyan; Li, Xiaoman

2012-09-01

Chloroplasts play critical roles in land plant cells. Despite their importance and the availability of at least 200 sequenced chloroplast genomes, the number of known DNA regulatory sequences in chloroplast genomes is limited. In this paper, we designed computational methods to systematically study putative DNA regulatory sequences in intergenic regions near chloroplast genes in seven plant species and in promoter sequences of nuclear genes in Arabidopsis and rice. We found that -35/-10 elements alone cannot explain the transcriptional regulation of chloroplast genes. We also concluded that motifs shared by the intergenic sequences of most chloroplast genes are unlikely to exist, indicating that these genes are regulated differently. Finally and surprisingly, we found that five conserved motifs, each of which occurs in no more than six chloroplast intergenic sequences, are significantly shared by promoters of nuclear genes encoding chloroplast proteins. By integrating information from gene function annotation, protein subcellular localization analyses, protein-protein interaction data, and gene expression data, we further showed support for the functionality of these conserved motifs. Our study implies the existence of unknown nuclear-encoded transcription factors that regulate both chloroplast genes and nuclear genes encoding chloroplast proteins, which sheds light on the transcriptional regulation of chloroplast genes.
Rapid Cellular Phenotyping of Human Pluripotent Stem Cell-Derived Cardiomyocytes using a Genetically Encoded Fluorescent Voltage Sensor

Directory of Open Access Journals (Sweden)

Jordan S. Leyton-Mange

2014-02-01

In addition to their promise in regenerative medicine, pluripotent stem cells have proved to be faithful models of many human diseases. In particular, patient-specific stem cell-derived cardiomyocytes recapitulate key features of several life-threatening cardiac arrhythmia syndromes. For both modeling and regenerative approaches, phenotyping of stem cell-derived tissues is critical. Cellular phenotyping has largely relied upon expression of lineage markers rather than physiologic attributes. This is especially true for cardiomyocytes, in part because electrophysiological recordings are labor intensive. Likewise, most optical voltage indicators suffer from phototoxicity, which damages cells and degrades signal quality. Here we present the use of a genetically encoded fluorescent voltage indicator, ArcLight, which we demonstrate can faithfully report transmembrane potentials in human stem cell-derived cardiomyocytes. We demonstrate the application of this fluorescent sensor in high-throughput, serial phenotyping of differentiating cardiomyocyte populations and in screening for drug-induced cardiotoxicity.
ERPs and oscillations during encoding predict retrieval of digit memory in superior mnemonists.

Science.gov (United States)

Pan, Yafeng; Li, Xianchun; Chen, Xi; Ku, Yixuan; Dong, Yujie; Dou, Zheng; He, Lin; Hu, Yi; Li, Weidong; Zhou, Xiaolin

2017-10-01

Previous studies have consistently demonstrated that superior mnemonists (SMs) outperform normal individuals in domain-specific memory tasks. However, the neural correlates of memory-related processes remain unclear. In the current EEG study, SMs and control participants performed a digit memory task during which their brain activity was recorded. Chinese SMs used a digit-image mnemonic for encoding digits, in which they associated 2-digit groups with images immediately after the presentation of each even-position digit in sequences. Behaviorally, SMs' memory of digit sequences was better than the controls'. During encoding in the study phase, SMs showed an increased right central P2 (150-250ms post onset) and a larger right posterior high-alpha (10-14Hz, 500-1720ms) oscillation on digits at even-positions compared with digits at odd-positions. Both P2 and high-alpha oscillations in the study phase co-varied with performance in the recall phase, but only in SMs, indicating that neural dynamics during encoding could predict successful retrieval of digit memory in SMs. Our findings suggest that representation of a digit sequence in SMs using mnemonics may recruit both the early-stage attention allocation process and the sustained information preservation process. This study provides evidence for the role of dynamic and efficient neural encoding processes in mnemonists. Copyright © 2017. Published by Elsevier Inc.
A clinical utility study of exome sequencing versus conventional genetic testing in pediatric neurology.

Science.gov (United States)

Vissers, Lisenka E L M; van Nimwegen, Kirsten J M; Schieving, Jolanda H; Kamsteeg, Erik-Jan; Kleefstra, Tjitske; Yntema, Helger G; Pfundt, Rolph; van der Wilt, Gert Jan; Krabbenborg, Lotte; Brunner, Han G; van der Burg, Simone; Grutters, Janneke; Veltman, Joris A; Willemsen, Michèl A A P

2017-09-01

Implementation of novel genetic diagnostic tests is generally driven by technological advances because they promise shorter turnaround times and/or higher diagnostic yields. Other aspects, including impact on clinical management or cost-effectiveness, are often not assessed in detail prior to implementation. We studied the clinical utility of whole-exome sequencing (WES) in complex pediatric neurology in terms of diagnostic yield and costs. We analyzed 150 patients (and their parents) presenting with complex neurological disorders of suspected genetic origin. In a parallel study, all patients received both the standard diagnostic workup (e.g., cerebral imaging, muscle biopsies or lumbar punctures, and sequential gene-by-gene-based testing) and WES simultaneously. Our unique study design allowed direct comparison of diagnostic yield of both trajectories and provided insight into the economic implications of implementing WES in this diagnostic trajectory. We showed that WES identified significantly more conclusive diagnoses (29.3%) than the standard care pathway (7.3%) without incurring higher costs. Exploratory analysis of WES as a first-tier diagnostic test indicates that WES may even be cost-saving, depending on the extent of other tests being omitted. Our data support such a use of WES in pediatric neurology for disorders of presumed genetic origin. Genet Med advance online publication 23 March 2017.
Hc-daf-2 encodes an insulin-like receptor kinase in the barber's pole worm, Haemonchus contortus, and restores partial dauer regulation.

Science.gov (United States)

Li, Facai; Lok, James B; Gasser, Robin B; Korhonen, Pasi K; Sandeman, Mark R; Shi, Deshi; Zhou, Rui; Li, Xiangrui; Zhou, Yanqin; Zhao, Junlong; Hu, Min

2014-06-01

Infective L3s (iL3s) of parasitic nematodes share common behavioural, morphological and developmental characteristics with the developmentally arrested (dauer) larvae of the free-living nematode Caenorhabditis elegans. It is proposed that similar molecular mechanisms regulate entry into or exit from the dauer stage in C. elegans, and the transition from free-living to parasitic forms of parasitic nematodes. In C. elegans, one of the key factors regulating the dauer transition is the insulin-like receptor (designated Ce-DAF-2) encoded by the gene Ce-daf-2. However, nothing is known about DAF-2 homologues in most parasitic nematodes. Here, using a PCR-based approach, we identified and characterised a gene (Hc-daf-2) and its inferred product (Hc-DAF-2) in Haemonchus contortus (a socioeconomically important parasitic nematode of ruminants). The sequence of Hc-DAF-2 displays significant sequence homology to insulin receptors (IR) in both vertebrates and invertebrates, and contains conserved structural domains. A sequence encoding an important proteolytic motif (RKRR) identified in the predicted peptide sequence of Hc-DAF-2 is consistent with that of the human IR, suggesting that it is involved in the formation of the IR complex. The Hc-daf-2 gene was transcribed in all life stages of H. contortus, with a significant up-regulation in the iL3 compared with other stages. To compare patterns of expression between Hc-daf-2 and Ce-daf-2, reporter constructs fusing the Ce-daf-2 or Hc-daf-2 promoter to sequence encoding GFP were microinjected into the N2 strain of C. elegans, and transgenic lines were established and examined. Both genes showed similar patterns of expression in amphidial (head) neurons, which relate to sensation and signal transduction. Further study by heterologous genetic complementation in a daf-2-deficient strain of C. elegans (CB1370) showed partial rescue of function by Hc-daf-2. Taken together, these findings provide a first insight into the roles of Hc
Transcription factor IID in the Archaea: sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes

Science.gov (United States)

Marsh, T. L.; Reich, C. I.; Whitelock, R. B.; Olsen, G. J.; Woese, C. R. (Principal Investigator)

1994-01-01

The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer), it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.

Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis.

Science.gov (United States)

Zhu, Huayu; Song, Pengyao; Koo, Dal-Hoe; Guo, Luqin; Li, Yanman; Sun, Shouru; Weng, Yiqun; Yang, Luming

2016-08-05

Microsatellite markers are one of the most informative and versatile DNA-based markers used in plant genetic research, but their development has traditionally been difficult and costly. Whole-genome sequencing with next-generation sequencing (NGS) technologies provides large amounts of sequence data to develop numerous microsatellite markers at whole-genome scale. SSR markers offer a great advantage in cross-species comparisons and allow investigation of karyotype and genome evolution through highly efficient computational approaches such as in silico PCR. Here we describe genome-wide development and characterization of SSR markers in the watermelon (Citrullus lanatus) genome, which were then used in comparative analysis with two other important crop species in the Cucurbitaceae family: cucumber (Cucumis sativus L.) and melon (Cucumis melo L.). We further applied these markers in evaluating the genetic diversity and population structure in watermelon germplasm collections. A total of 39,523 microsatellite loci were identified from the watermelon draft genome with an overall density of 111 SSRs/Mbp, and 32,869 SSR primers were designed with suitable flanking sequences. The dinucleotide SSRs were the most common type, representing 34.09% of the total SSR loci, and the AT-rich motifs were the most abundant in all nucleotide repeat types. In silico PCR analysis identified 832 and 925 SSR markers with each having a single amplicon in the cucumber and melon draft genome, respectively. Comparative analysis with these cross-species SSR markers revealed complicated mosaic patterns of syntenic blocks among the genomes of the three species. In addition, genetic diversity analysis of 134 watermelon accessions with 32 highly informative SSR loci placed these lines into two groups with all accessions of C. lanatus var. citroides and three accessions of C. colocynthis clustered in one group and all accessions of C. lanatus var. lanatus and the remaining accessions of C. colocynthis
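The abstract above reports genome-wide SSR discovery but does not state the search tool or its thresholds. As a rough illustration only, the sketch below finds perfect microsatellites with a regular expression. The function names, the minimum repeat counts, and the toy sequence are hypothetical, not the authors' settings.

```python
import re

# Hypothetical minimum repeat counts per motif length (1 to 6 bp);
# the study's actual thresholds are not given in the abstract.
DEFAULT_MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    thresholds = min_repeats or DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for k, n in thresholds.items():
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT"),
            # since the shorter motif length already reports that locus.
            if any(motif == motif[:d] * (k // d)
                   for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


def implied_genome_mbp(n_loci, density_per_mbp):
    """Assembly size implied by a locus count and a density in loci/Mbp."""
    return n_loci / density_per_mbp


toy = "GGC" + "AT" * 8 + "CCT" + "AAG" * 5 + "T"
print(find_ssrs(toy))
print(round(implied_genome_mbp(39523, 111)))
```

On the toy sequence this reports one dinucleotide and one trinucleotide locus. Dividing the reported 39,523 loci by the reported density of 111 SSRs/Mbp implies a searched assembly of roughly 356 Mbp; that figure is derived here and is not stated in the abstract.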
Progress toward characterization of the group A Streptococcus metagenome: complete genome sequence of a macrolide-resistant serotype M6 strain.

Science.gov (United States)

Banks, David J; Porcella, Stephen F; Barbian, Kent D; Beres, Stephen B; Philips, Lauren E; Voyich, Jovanka M; DeLeo, Frank R; Martin, Judith M; Somerville, Greg A; Musser, James M

2004-08-15

We describe the genome sequence of a macrolide-resistant strain (MGAS10394) of serotype M6 group A Streptococcus (GAS). The genome is 1,900,156 bp in length, and 8 prophage-like elements or remnants compose 12.4% of the chromosome. An 8.3-kb prophage remnant encodes the SpeA4 variant of streptococcal pyrogenic exotoxin A. The genome of strain MGAS10394 contains a chimeric genetic element composed of prophage genes and a transposon encoding the mefA gene conferring macrolide resistance. This chimeric element also has a gene encoding a novel surface-exposed protein (designated "R6 protein"), with an LPKTG cell-anchor motif located at the carboxy terminus. Surface expression of this protein was confirmed by flow cytometry. Humans with GAS pharyngitis caused by serotype M6 strains had antibody against the R6 protein present in convalescent, but not acute, serum samples. Our studies add to the theme that GAS prophage-encoded extracellular proteins contribute to host-pathogen interactions in a strain-specific fashion.
An encoding device and a method of encoding

DEFF Research Database (Denmark)

2012-01-01

The present invention relates to an encoding device, such as an optical position encoder, for encoding input from an object, and a method for encoding input from an object, for determining a position of an object that interferes with light of the device. The encoding device comprises a light source...... in the area in the space and may interfere with the light, which interference may be encoded into a position or activation....
Genetic Diversity of Pinus nigra Arn. Populations in Southern Spain and Northern Morocco Revealed By Inter-Simple Sequence Repeat Profiles

Directory of Open Access Journals (Sweden)

Oussama Ahrazem

2012-05-01

Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA) and Nei’s genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst) was 0.233. Cuenca showed the highest Nei’s genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups (Group 1 contained all populations from Cuenca, while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco), while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. The data indicate that the inter-simple sequence repeat (ISSR) method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra.
Characterization of Durham virus, a novel rhabdovirus that encodes both a C and SH protein.

Science.gov (United States)

Allison, A B; Palacios, G; Travassos da Rosa, A; Popov, V L; Lu, L; Xiao, S Y; DeToy, K; Briese, T; Lipkin, W I; Keel, M K; Stallknecht, D E; Bishop, G R; Tesh, R B

2011-01-01

The family Rhabdoviridae is a diverse group of non-segmented, negative-sense RNA viruses that are distributed worldwide and infect a wide range of hosts including vertebrates, invertebrates, and plants. Of the 114 currently recognized vertebrate rhabdoviruses, relatively few have been well characterized at both the antigenic and genetic level; hence, the phylogenetic relationships between many of the vertebrate rhabdoviruses remain unknown. The present report describes a novel rhabdovirus isolated from the brain of a moribund American coot (Fulica americana) that exhibited neurological signs when found in Durham County, North Carolina, in 2005. Antigenic characterization of the virus revealed that it was serologically unrelated to 68 other known vertebrate rhabdoviruses. Genomic sequencing of the virus indicated that it shared the highest identity to Tupaia rhabdovirus (TUPV), and as only previously observed in TUPV, the genome encoded a putative C protein in an overlapping open reading frame (ORF) of the phosphoprotein gene and a small hydrophobic (SH) protein located in a novel ORF between the matrix and glycoprotein genes. Phylogenetic analysis of partial amino acid sequences of the nucleoprotein and polymerase protein indicated that, in addition to TUPV, the virus was most closely related to avian and small mammal rhabdoviruses from Africa and North America. In this report, we present the morphological, pathological, antigenic, and genetic characterization of the new virus, tentatively named Durham virus (DURV), and discuss its potential evolutionary relationship to other vertebrate rhabdoviruses. Copyright © 2010 Elsevier B.V. All rights reserved.
Characterization of Durham virus, a novel rhabdovirus that encodes both a C and SH protein

Science.gov (United States)

Allison, A. B.; Palacios, G.; Rosa, A. Travassos da; Popov, V. L.; Lu, L.; Xiao, S. Y.; DeToy, K.; Briese, T.; Lipkin, W. Ian; Keel, M. K.; Stallknecht, D. E.; Bishop, G. R.; Tesh, R. B.

2010-01-01

The family Rhabdoviridae is a diverse group of non-segmented, negative-sense RNA viruses that are distributed worldwide and infect a wide range of hosts including vertebrates, invertebrates, and plants. Of the 114 currently recognized vertebrate rhabdoviruses, relatively few have been well characterized at both the antigenic and genetic level; hence, the phylogenetic relationships between many of the vertebrate rhabdoviruses remain unknown. The present report describes a novel rhabdovirus isolated from the brain of a moribund American coot (Fulica americana) that exhibited neurological signs when found in Durham County, North Carolina, in 2005. Antigenic characterization of the virus revealed that it was serologically unrelated to 68 other known vertebrate rhabdoviruses. Genomic sequencing of the virus indicated that it shared the highest identity to Tupaia rhabdovirus (TUPV), and as only previously observed in TUPV, the genome encoded a putative C protein in an overlapping open reading frame (ORF) of the phosphoprotein gene and a small hydrophobic protein located in a novel ORF between the matrix and glycoprotein genes. Phylogenetic analysis of partial amino acid sequences of the nucleoprotein and polymerase proteins indicated that, in addition to TUPV, the virus was most closely related to avian and small mammal rhabdoviruses from Africa and North America. In this report, we present the morphological, pathological, antigenic, and genetic characterization of the new virus, tentatively named Durham virus (DURV), and discuss its potential evolutionary relationship to other vertebrate rhabdoviruses. PMID:20863863
Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

Science.gov (United States)

Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

2013-08-01

To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattered across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded those of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from the sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to humans and from human to human in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.
Valyl-tRNA synthetase gene of Escherichia coli K12: Molecular genetic characterization and homology within a family of aminoacyl-tRNA synthetases

International Nuclear Information System (INIS)

Heck, J.D. III.

1988-01-01

This work reports the subcloning and characterization of the molecular elements necessary for the expression of the Escherichia coli valS gene encoding valyl-tRNA synthetase. The valS gene was subcloned from plasmid pLC26-22 by genetic complementation of a valS ts strain. The DNA region encoding the valS structural gene was determined by in vitro coupled transcription-translation assays. Cells transformed with a plasmid containing a full length copy of the valS gene enhanced in vivo valyl-tRNA synthetase specific activity twelve-fold. DNA sequences flanking the valS structural gene are presented. The transcription initiation sites of the valS gene were determined, in vivo and in vitro, by S1 nuclease protection studies, primer-extension analysis and both [α-32P]labeled and [γ-32P]end-labeled in vitro transcription assays. The DNA sequence of the valS gene of Escherichia coli has been determined. Significant similarity at the primary sequence level was detected between valyl-tRNA synthetase of E. coli and other known branched-chain aminoacyl-tRNA synthetases. An extended open reading frame (ORF) encoded on the DNA strand opposite the valS structural gene is described.
Cloning and DNA sequence of the mercuric- and organomercurial-resistance determinants of plasmid pDU1358

International Nuclear Information System (INIS)

Griffin, H.G.; Foster, T.J.; Silver, S.; Misra, T.K.

1987-01-01

The broad-spectrum mercurial-resistance plasmid pDU1358 was analyzed by cloning the resistance determinants and preparing a physical and genetic map of a 45-kilobase (kb) region of the plasmid that contains two separate mercurial-resistance operons that mapped about 20 kb apart. One encoded narrow-spectrum mercurial resistance to Hg2+ and a few organomercurials; the other specified broad-spectrum resistance to phenylmercury and additional organomercurials. Each determinant governed mercurial transport functions. Southern DNA-DNA hybridization experiments using gene-specific probes from the plasmid R100 mer operon indicated close homology with the R100 determinant. The 2153 base pairs of the promoter-distal part of the broad-spectrum Hg2+-resistance operon of pDU1358 were sequenced. This region included the 3'-terminal part of the merA gene, merD, unidentified reading frame URF1, and a part of URF2 homologous to previously sequenced determinants of plasmid R100. Between the merA and merD genes, an open reading frame encoding a 212 amino acid polypeptide was identified as the merB gene that determines the enzyme organomercurial lyase that cleaves the C-Hg bond of phenylmercury.
Exploration of genetically encoded voltage indicators based on a chimeric voltage sensing domain

Directory of Open Access Journals (Sweden)

Yukiko Mishina

2014-09-01

Deciphering how the brain generates cognitive function from patterns of electrical signals is one of the ultimate challenges in neuroscience. To this end, it would be highly desirable to monitor the activities of very large numbers of neurons while an animal engages in complex behaviours. Optical imaging of electrical activity using genetically encoded voltage indicators (GEVIs) has the potential to meet this challenge. Currently prevalent GEVIs are based on the voltage-sensitive fluorescent protein (VSFP) prototypical design or on the voltage-dependent state transitions of microbial opsins. We recently introduced a new VSFP design in which the voltage-sensing domain (VSD) is sandwiched between a FRET pair of fluorescent proteins (termed VSFP-Butterflies) and also demonstrated a series of chimeric VSD in which portions of the VSD of Ciona intestinalis voltage-sensitive phosphatase (Ci-VSP) are substituted by homologous portions of a voltage-gated potassium channel subunit. These chimeric VSD had faster sensing kinetics than that of the native Ci-VSD. Here, we describe a new set of VSFPs that combine chimeric VSD with the Butterfly structure. We show that these chimeric VSFP-Butterflies can report membrane voltage oscillations of up to 200 Hz in cultured cells and report sensory evoked cortical population responses in living mice. This class of GEVIs may be suitable for imaging of brain rhythms in behaving mammalians.
Exploration of genetically encoded voltage indicators based on a chimeric voltage sensing domain.

Science.gov (United States)

Mishina, Yukiko; Mutoh, Hiroki; Song, Chenchen; Knöpfel, Thomas

2014-01-01

Deciphering how the brain generates cognitive function from patterns of electrical signals is one of the ultimate challenges in neuroscience. To this end, it would be highly desirable to monitor the activities of very large numbers of neurons while an animal engages in complex behaviors. Optical imaging of electrical activity using genetically encoded voltage indicators (GEVIs) has the potential to meet this challenge. Currently prevalent GEVIs are based on the voltage-sensitive fluorescent protein (VSFP) prototypical design or on the voltage-dependent state transitions of microbial opsins. We recently introduced a new VSFP design in which the voltage-sensing domain (VSD) is sandwiched between a fluorescence resonance energy transfer pair of fluorescent proteins (termed VSFP-Butterflies) and also demonstrated a series of chimeric VSD in which portions of the VSD of Ciona intestinalis voltage-sensitive phosphatase are substituted by homologous portions of a voltage-gated potassium channel subunit. These chimeric VSD had faster sensing kinetics than that of the native Ci-VSD. Here, we describe a new set of VSFPs that combine chimeric VSD with the Butterfly structure. We show that these chimeric VSFP-Butterflies can report membrane voltage oscillations of up to 200 Hz in cultured cells and report sensory evoked cortical population responses in living mice. This class of GEVIs may be suitable for imaging of brain rhythms in behaving mammalians.
A Toolbox of Genetically Encoded FRET-Based Biosensors for Rapid l-Lysine Analysis

Directory of Open Access Journals (Sweden)

Victoria Steffen

2016-09-01

Background: The fast development of microbial production strains for basic and fine chemicals is increasingly carried out in small scale cultivation systems to allow for higher throughput. Such parallelized systems create a need for new rapid online detection systems to quantify the respective target compound. In this regard, biosensors, especially genetically encoded Förster resonance energy transfer (FRET)-based biosensors, offer tremendous opportunities. As a proof-of-concept, we have created a toolbox of FRET-based biosensors for the ratiometric determination of l-lysine in fermentation broth. Methods: The sensor toolbox was constructed based on a sensor that consists of an optimized central lysine-/arginine-/ornithine-binding protein (LAO-BP) flanked by two fluorescent proteins (enhanced cyan fluorescent protein (ECFP) and Citrine). Further sensor variants with altered affinity and sensitivity were obtained by circular permutation of the binding protein as well as the introduction of flexible and rigid linkers between the fluorescent proteins and the LAO-BP, respectively. Results: The sensor prototype was applied to monitor the extracellular l-lysine concentration of the l-lysine producing Corynebacterium glutamicum (C. glutamicum) strain DM1933 in a BioLector® microscale cultivation device. The results matched well with data obtained by HPLC analysis and the Ninhydrin assay, demonstrating the high potential of FRET-based biosensors for high-throughput microbial bioprocess optimization.
Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

Directory of Open Access Journals (Sweden)

Yang Jie

2017-01-01

The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2) of five species: horse (89%), human (90%), chimpanzee (89%), rhesus monkey (89%) and mouse (85%); thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.
Application of Next Generation Sequencing on Genetic Testing

DEFF Research Database (Denmark)

Li, Jian

The discovery of genetic factors behind increasing number of human diseases and the growth of education of genetic knowledge to the public make demands for genetic testing increase rapidly. However, traditional genetic testing methods cannot meet all kinds of the requirements. Next generation seq...
Cloning and Sequencing of Gene Encoding Outer Membrane Lipoprotein LipL41 of Leptospira Interrogans Serovar Grippotyphosa

Directory of Open Access Journals (Sweden)

M.S. Soltani

2014-12-01

Background: Leptospirosis is an infectious bacterial disease caused by pathogenic serovars of Leptospira. Development of reliable and applicable diagnostic tests and of a recombinant vaccine for this disease requires specific antigens that are highly conserved among diverse pathogenic leptospiral serovars. Outer membrane proteins (OMPs) of Leptospira are effective antigens that can stimulate remarkable immune responses during infection. Among them, LipL41 is an immunogenic lipoprotein that is present only in pathogenic serovars, so it could be regarded as a good candidate for vaccine development and diagnostic methods. To assess the genetic conservation of the lipL41 gene, we cloned and sequenced this gene from vaccinal and field strains of Leptospira interrogans serovar Grippotyphosa. Materials and Methods: Leptospira interrogans serovar Grippotyphosa vaccinal strain (RTCC2808) and field strain (RTCC2825) were inoculated into selective culture medium (EMJH). Genomic DNA was extracted by the standard phenol-chloroform method. The lipL41 gene was amplified with specific primers, cloned into the pTZ57R/T vector and transformed into competent E. coli (Top10) cells. The extracted recombinant plasmids were sequenced, and the resulting sequences were subjected to homology analysis by comparison with sequences in the GenBank database. Results: PCR amplification of the lipL41 gene yielded a 1065 bp product. DNA sequence analysis revealed that the lipL41 sequences of the vaccinal (RTCC2808) and field (RTCC2825) Grippotyphosa strains in Iran were 100% identical. The lipL41 gene also showed high identity (96%-100%) with other pathogenic serovars deposited in the GenBank database. Conclusion: The results of this study show that the lipL41 gene is highly conserved among various pathogenic Leptospira serovars (>95.9% identity). Hence the cloned gene could be further used for expression of recombinant protein for serodiagnosis
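The identity figures quoted above come from comparing aligned sequences. The abstract does not say which alignment tool was used, so the sketch below shows only the final counting step on two sequences assumed to be already aligned to equal length. The function name and the toy sequences are hypothetical and are not lipL41 data.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of gap-free alignment columns where both sequences match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap character.
    columns = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no gap-free columns to compare")
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


# Toy example: one mismatch in eight aligned positions.
print(percent_identity("ATGCATGC", "ATGCATGA"))
```

Whether gap columns count toward the denominator varies between tools; skipping them, as here, is one common convention and is an assumption of this sketch.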
Touch imprint cytology with massively parallel sequencing (TIC-seq): a simple and rapid method to snapshot genetic alterations in tumors.

Science.gov (United States)

Amemiya, Kenji; Hirotsu, Yosuke; Goto, Taichiro; Nakagomi, Hiroshi; Mochizuki, Hitoshi; Oyama, Toshio; Omata, Masao

2016-12-01

Identifying genetic alterations in tumors is critical for molecular targeting of therapy. In the clinical setting, formalin-fixed paraffin-embedded (FFPE) tissue is usually employed for genetic analysis. However, DNA extracted from FFPE tissue is often not suitable for analysis because of its low levels and poor quality. Additionally, FFPE sample preparation is time-consuming. To provide early treatment for cancer patients, a more rapid and robust method is required for precision medicine. We present a simple method for genetic analysis, called touch imprint cytology combined with massively parallel sequencing (touch imprint cytology [TIC]-seq), to detect somatic mutations in tumors. We prepared FFPE tissues and TIC specimens from tumors in nine lung cancer patients and one patient with breast cancer. We found that the quality and quantity of TIC DNA was higher than that of FFPE DNA, which requires microdissection to enrich DNA from target tissues. Targeted sequencing using a next-generation sequencer obtained sufficient sequence data using TIC DNA. Most (92%) somatic mutations in lung primary tumors were found to be consistent between TIC and FFPE DNA. We also applied TIC DNA to primary and metastatic tumor tissues to analyze tumor heterogeneity in a breast cancer patient, and showed that common and distinct mutations among primary and metastatic sites could be classified into two distinct histological subtypes. TIC-seq is an alternative and feasible method to analyze genomic alterations in tumors by simply touching the cut surface of specimens to slides. © 2016 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Multichannel compressive sensing MRI using noiselet encoding.

Directory of Open Access Journals (Sweden)

Kamlesh Pawar

The incoherence between measurement and sparsifying transform matrices and the restricted isometry property (RIP) of the measurement matrix are two of the key factors in determining the performance of compressive sensing (CS). In CS-MRI, the randomly under-sampled Fourier matrix is used as the measurement matrix and the wavelet transform is usually used as the sparsifying transform matrix. However, the incoherence between the randomly under-sampled Fourier matrix and the wavelet matrix is not optimal, which can deteriorate the performance of CS-MRI. Using the mathematical result that noiselets are maximally incoherent with wavelets, this paper introduces the noiselet unitary bases as the measurement matrix to improve the incoherence and RIP in CS-MRI. Based on an empirical RIP analysis that compares the multichannel noiselet and multichannel Fourier measurement matrices in CS-MRI, we propose a multichannel compressive sensing (MCS) framework to take advantage of the multichannel data acquisition used in MRI scanners. Simulations are presented in the MCS framework to compare the performance of noiselet encoding reconstructions and Fourier encoding reconstructions at different acceleration factors. The comparisons indicate that the multichannel noiselet measurement matrix has better RIP than that of its Fourier counterpart, and that noiselet encoded MCS-MRI outperforms Fourier encoded MCS-MRI in preserving image resolution and can achieve higher acceleration factors. To demonstrate the feasibility of the proposed noiselet encoding scheme, a pulse sequence with tailored spatially selective RF excitation pulses was designed and implemented on a 3T scanner to acquire the data in the noiselet domain from a phantom and a human brain. The results indicate that noiselet encoding preserves image resolution better than Fourier encoding.
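The abstract's remark that Fourier and wavelet bases are not optimally incoherent can be checked numerically on a small case. The sketch below assumes the usual mutual coherence measure, mu = sqrt(n) * max |<phi_k, psi_j>|, which ranges from 1 (maximally incoherent) to sqrt(n). It compares the full, not under-sampled, discrete Fourier basis with a spike basis and with a Haar wavelet basis. It does not construct noiselets, and all function names are illustrative rather than taken from the paper.

```python
import cmath
import math


def dft_matrix(n):
    """Rows are the orthonormal discrete Fourier basis vectors."""
    s = 1 / math.sqrt(n)
    return [[s * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n)]
            for k in range(n)]


def identity_matrix(n):
    """Rows are spikes (the canonical basis)."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def haar_matrix(n):
    """Rows are orthonormal Haar wavelets; n must be a power of two."""
    rows = [[1 / math.sqrt(n)] * n]  # coarsest scaling function
    width = n
    while width >= 2:
        half, amp = width // 2, 1 / math.sqrt(width)
        for start in range(0, n, width):
            row = [0.0] * n
            for t in range(start, start + half):
                row[t] = amp
            for t in range(start + half, start + width):
                row[t] = -amp
            rows.append(row)
        width //= 2
    return rows


def mutual_coherence(phi, psi):
    """sqrt(n) times the largest absolute inner product between rows."""
    n = len(phi[0])
    best = 0.0
    for p in phi:
        for q in psi:
            inner = sum(a.conjugate() * b for a, b in zip(p, q))
            best = max(best, abs(inner))
    return math.sqrt(n) * best


n = 8
print(mutual_coherence(dft_matrix(n), identity_matrix(n)))
print(mutual_coherence(dft_matrix(n), haar_matrix(n)))
```

In this toy setting the Fourier and spike bases reach the minimum coherence of 1, while Fourier and Haar reach the maximum sqrt(n), because the Fourier DC vector coincides with the coarsest Haar scaling vector. This says nothing about the paper's RIP analysis of under-sampled multichannel matrices; it only illustrates the coherence measure.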
Nucleotide sequence of Hungarian grapevine chrome mosaic nepovirus RNA1.

Science.gov (United States)

Le Gall, O; Candresse, T; Brault, V; Dunez, J

1989-10-11

The nucleotide sequence of the RNA1 of Hungarian grapevine chrome mosaic virus, a nepovirus very closely related to tomato black ring virus, has been determined from cDNA clones. It is 7212 nucleotides in length excluding the 3' terminal poly(A) tail and contains a large open reading frame extending from nucleotides 216 to 6971. The presumably encoded polyprotein is 2252 amino acids in length with a molecular weight of 250 kDa. The primary structure of the polyprotein was compared with that of other viral polyproteins, revealing the same general genetic organization as that of other picorna-like viruses (comoviruses, potyviruses and picornaviruses), except that an additional protein is suspected to occupy the N-terminus of the polyprotein.
Expression analysis of a Cucurbita cDNA encoding endonuclease

International Nuclear Information System (INIS)

Szopa, J.

1995-01-01

The nuclear matrices of plant cell nuclei display intrinsic nuclease activity which consists in nicking supercoiled DNA. A cDNA encoding a 32 kDa endonuclease has been cloned and sequenced. The nucleotide and deduced amino-acid sequences show high homology to known 14-3-3-protein sequences from other sources. The amino-acid sequence shows agreement with consensus sequences for potential phosphorylation by protein kinase A and C and for calcium, lipid and membrane-binding sites. The nucleotide-binding site is also present within the conserved part of the sequence. By Northern blot analysis, the differential expression of the corresponding mRNA was detected; it was the strongest in sink tissues. The endonuclease activity found on DNA-polyacrylamide gel electrophoresis coincided with mRNA content and was the highest in tuber. (author). 22 refs, 6 figs
Molecular phylogeny of Toxoplasmatinae: comparison between inferences based on mitochondrial and apicoplast genetic sequences

Directory of Open Access Journals (Sweden)

Michelle Klein Sercundes

2016-03-01

Full Text Available Abstract Phylogenies within Toxoplasmatinae have been widely investigated with different molecular markers. Here, we studied molecular phylogenies of the Toxoplasmatinae subfamily based on apicoplast and mitochondrial genes. Partial sequences of apicoplast genes coding for caseinolytic protease (clpC) and beta subunit of RNA polymerase (rpoB), and of the mitochondrial gene coding for cytochrome B (cytB), were analyzed. Laboratory-adapted strains of the closely related parasites Sarcocystis falcatula and Sarcocystis neurona were investigated, along with Neospora caninum, Neospora hughesi, Toxoplasma gondii (strains RH, CTG and PTG), Besnoitia akodoni, Hammondia hammondi and two genetically divergent lineages of Hammondia heydorni. The molecular analysis based on organellar genes did not clearly differentiate between N. caninum and N. hughesi, but the two lineages of H. heydorni were confirmed. Slight differences between the strains of S. falcatula and S. neurona were encountered in all markers. In conclusion, congruent phylogenies were inferred from the three different genes and they might be used for screening undescribed sarcocystid parasites in order to ascertain their phylogenetic relationships with organisms of the family Sarcocystidae. The evolutionary studies based on organellar genes confirm that the genus Hammondia is paraphyletic. The primers used for amplification of clpC and rpoB were able to amplify genetic sequences of organisms of the genus Sarcocystis and organisms of the subfamily Toxoplasmatinae as well.

Sequence Algebra, Sequence Decision Diagrams and Dynamic Fault Trees

International Nuclear Information System (INIS)

Rauzy, Antoine B.

2011-01-01

Considerable attention has been focused on Dynamic Fault Trees in the past few years. By adding new gates to static (regular) Fault Trees, Dynamic Fault Trees aim to take into account dependencies among events. Merle et al. recently proposed an algebraic framework to give a formal interpretation to these gates. In this article, we extend Merle et al.'s work by adopting a slightly different perspective. We introduce Sequence Algebras that can be seen as Algebras of Basic Events, representing failures of non-repairable components. We show how to interpret Dynamic Fault Trees within this framework. Finally, we propose a new data structure to encode sets of sequences of Basic Events: Sequence Decision Diagrams. Sequence Decision Diagrams are strongly inspired by Minato's Zero-Suppressed Binary Decision Diagrams. We show that all operations of Sequence Algebras can be performed on this data structure.
Genetic diversity studies in pea (Pisum sativum L.) using simple sequence repeat markers.

Science.gov (United States)

Kumari, P; Basal, N; Singh, A K; Rai, V P; Srivastava, C P; Singh, P K

2013-03-13

The genetic diversity among 28 pea (Pisum sativum L.) genotypes was analyzed using 32 simple sequence repeat markers. A total of 44 polymorphic bands, with an average of 2.1 bands per primer, were obtained. The polymorphism information content ranged from 0.657 to 0.309 with an average of 0.493. The variation in genetic diversity among these cultivars ranged from 0.11 to 0.73. Cluster analysis based on Jaccard's similarity coefficient using the unweighted pair-group method with arithmetic mean (UPGMA) revealed 2 distinct clusters, I and II, comprising 6 and 22 genotypes, respectively. Cluster II was further differentiated into 2 subclusters, IIA and IIB, with 12 and 10 genotypes, respectively. Principal component (PC) analysis revealed results similar to those of UPGMA. The first, second, and third PCs contributed 21.6, 16.1, and 14.0% of the variation, respectively; cumulative variation of the first 3 PCs was 51.7%.
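The Jaccard similarity coefficient mentioned in the abstract above can be illustrated with a minimal Python sketch. The band-presence profiles, genotype labels, and function name below are hypothetical and are not taken from the study; the sketch only shows how a pairwise similarity matrix of the kind fed into UPGMA clustering might be computed from binary marker scores.

```python
from itertools import combinations


def jaccard_similarity(a, b):
    """Jaccard coefficient between two binary band profiles (1 = band present)."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    # Bands present in both genotypes.
    shared = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    # Bands present in at least one genotype (shared absences are ignored).
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    return shared / either if either else 0.0


# Hypothetical band-presence profiles for three genotypes (illustrative only).
profiles = {
    "G1": [1, 1, 0, 1, 0, 1],
    "G2": [1, 0, 0, 1, 0, 1],
    "G3": [0, 1, 1, 0, 1, 0],
}

# Pairwise similarities; a clustering step such as UPGMA would start from these.
similarity = {
    (g1, g2): jaccard_similarity(profiles[g1], profiles[g2])
    for g1, g2 in combinations(profiles, 2)
}

for pair, value in similarity.items():
    print(pair, round(value, 3))
```

Jaccard's coefficient ignores bands absent from both genotypes, which is one common reason it is chosen for dominant or band-scored marker data.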
Physiology of SLC12 transporters: lessons from inherited human genetic mutations and genetically engineered mouse knockouts.

Science.gov (United States)

Gagnon, Kenneth B; Delpire, Eric

2013-04-15

Among the over 300 members of the solute carrier (SLC) group of integral plasma membrane transport proteins are the nine electroneutral cation-chloride cotransporters belonging to the SLC12 gene family. Seven of these transporters have been functionally described as coupling the electrically silent movement of chloride with sodium and/or potassium. Although in silico analysis has identified two additional SLC12 family members, no physiological role has been ascribed to the proteins encoded by either the SLC12A8 or the SLC12A9 genes. Evolutionary conservation of this gene family from protists to humans confirms their importance. A wealth of physiological, immunohistochemical, and biochemical studies have revealed a great deal of information regarding the importance of this gene family to human health and disease. The sequencing of the human genome has provided investigators with the capability to link several human diseases with mutations in the genes encoding these plasma membrane proteins. The availability of bacterial artificial chromosomes, recombination engineering techniques, and the mouse genome sequence has simplified the creation of targeting constructs to manipulate the expression/function of these cation-chloride cotransporters in the mouse in an attempt to recapitulate some of these human pathologies. This review will summarize the three human disorders that have been linked to the mutation/dysfunction of the Na-Cl, Na-K-2Cl, and K-Cl cotransporters (i.e., Bartter's, Gitelman's, and Andermann's syndromes), and examine some additional pathologies arising from genetically modified mouse models of these cotransporters including deafness, blood pressure, hyperexcitability, and epithelial transport deficit phenotypes.
New Complexity Scalable MPEG Encoding Techniques for Mobile Applications

Directory of Open Access Journals (Sweden)

Stephan Mietens

2004-03-01

Full Text Available Complexity scalability offers the advantage of one-time design of video applications for a large product family, including mobile devices, without the need of redesigning the applications on the algorithmic level to meet the requirements of the different products. In this paper, we present complexity scalable MPEG encoding having core modules with modifications for scalability. The interdependencies of the scalable modules and the system performance are evaluated. Experimental results show scalability giving a smooth change in complexity and corresponding video quality. Scalability is basically achieved by varying the number of computed DCT coefficients and the number of evaluated motion vectors but other modules are designed such that they scale with the previous parameters. In the experiments using the "Stefan" sequence, the elapsed execution time of the scalable encoder, reflecting the computational complexity, can be gradually reduced to roughly 50% of its original execution time. The video quality scales between 20 dB and 48 dB PSNR with unity quantizer setting, and between 21.5 dB and 38.5 dB PSNR for different sequences targeting 1500 kbps. The implemented encoder and the scalability techniques can be successfully applied in mobile systems based on MPEG video compression.
Genetic diversity in two Japanese flounder populations from China seas inferred using microsatellite markers and COI sequences

Science.gov (United States)

Xu, Dongdong; Li, Sanlei; Lou, Bao; Zhang, Yurong; Zhan, Wei; Shi, Huilai

2012-07-01

Japanese flounder is one of the most important commercial species in China; however, information on the genetic background of natural populations in China seas is scarce. The lack of genetic data has hampered fishery management and aquaculture development programs for this species. In the present study, we have analyzed the genetic diversity in natural populations of Japanese flounder sampled from the Yellow Sea (Qingdao population, QD) and East China Sea (Zhoushan population, ZS) using 10 polymorphic microsatellite loci and cytochrome c oxidase subunit I (COI) sequencing data. A total of 68 different alleles were observed over 10 microsatellite loci. The total number of alleles per locus ranged from 2 to 9, and the number of genotypes per locus ranged from 3 to 45. The observed heterozygosity and expected heterozygosity in QD were 0.733 and 0.779, respectively, and in ZS the heterozygosity values were 0.708 and 0.783, respectively. Significant departures from Hardy-Weinberg equilibrium were observed in 7 of the 10 microsatellite loci in each of the two populations. The COI sequencing analysis revealed 25 polymorphic sites and 15 haplotypes in the two populations. The haplotype diversity and nucleotide diversity in the QD population were 0.746±0.0728 and 0.00334±0.00103, respectively, and in the ZS population the genetic diversity values were 0.712±0.0470 and 0.00318±0.00049, respectively. The microsatellite data (Fst = 0.0487, P < 0.001) and mitochondrial DNA data (Fst = 0.128, P < 0.001) both revealed significant genetic differentiation between the two populations. The information on the genetic variation and differentiation in Japanese flounder obtained in this study could be used to set up suitable guidelines for the management and conservation of this species, as well as for managing artificial selection programs. In future studies, more geographically diverse stocks should be used to obtain a deeper understanding of the population structure of Japanese
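As a rough illustration of the diversity statistics reported in the record above, the following Python sketch computes expected heterozygosity from allele frequencies and Nei's unbiased haplotype diversity from haplotype counts. These are the textbook estimators and are assumed here; the record does not state which software or formulas the authors used, and the input numbers are invented for the example.

```python
def expected_heterozygosity(allele_freqs):
    """Expected heterozygosity at one locus: 1 - sum(p_i^2)."""
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    return 1.0 - sum(p * p for p in allele_freqs)


def haplotype_diversity(haplotype_counts):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum(p_i^2))."""
    n = sum(haplotype_counts)
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    freqs = [c / n for c in haplotype_counts]
    return n / (n - 1) * (1.0 - sum(p * p for p in freqs))


# Hypothetical inputs (not data from the study).
he = expected_heterozygosity([0.5, 0.3, 0.2])   # one microsatellite locus
h = haplotype_diversity([5, 3, 2])              # ten COI sequences, three haplotypes
print(round(he, 3), round(h, 3))
```

The n/(n-1) factor corrects for small sample size, which matters for mitochondrial data where only a few dozen sequences per population are typically available.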
Flow Cytometry Enables Multiplexed Measurements of Genetically Encoded Intramolecular FRET Sensors Suitable for Screening.

Science.gov (United States)

Doucette, Jaimee; Zhao, Ziyan; Geyer, Rory J; Barra, Melanie M; Balunas, Marcy J; Zweifach, Adam

2016-07-01

Genetically encoded sensors based on intramolecular FRET between CFP and YFP are used extensively in cell biology research. Flow cytometry has been shown to offer a means to measure CFP-YFP FRET; we suspected it would provide a unique way to conduct multiplexed measurements from cells expressing different FRET sensors, which is difficult to do with microscopy, and that this could be used for screening. We confirmed that flow cytometry accurately measures FRET signals using cells transiently transfected with an ERK activity reporter, comparing responses measured with imaging and cytometry. We created polyclonal long-term transfectant lines, each expressing a different intramolecular FRET sensor, and devised a way to bar-code four distinct populations of cells. We demonstrated the feasibility of multiplexed measurements and determined that robust multiplexed measurements can be conducted in plate format. To validate the suitability of the method for screening, we measured responses from a plate of bacterial extracts that in unrelated experiments we had determined contained the protein kinase C (PKC)-activating compound teleocidin A-1. The multiplexed assay correctly identified the teleocidin A-1-containing well. We propose that multiplexed cytometric FRET measurements will be useful for analyzing cellular function and for screening compound collections. © 2016 Society for Laboratory Automation and Screening.
Molecular cloning and expression of cDNA encoding a lumenal calcium binding glycoprotein from sarcoplasmic reticulum

International Nuclear Information System (INIS)

Leberer, E.; Charuk, J.H.M.; MacLennan, D.H.; Green, N.M.

1989-01-01

Antibody screening was used to isolate a cDNA encoding the 160-kDa glycoprotein of rabbit skeletal muscle sarcoplasmic reticulum. The cDNA is identical to that encoding the 53-kDa glycoprotein except that it contains an in-frame insertion of 1,308 nucleotides near its 5' end, apparently resulting from alternative splicing. The protein encoded by the cDNA would contain a 19-residue NH2-terminal signal sequence and a 453-residue COOH-terminal sequence identical to the 53-kDa glycoprotein. It would also contain a 436-amino acid insert between these sequences. This insert would be highly acidic, suggesting that it might bind Ca2+. The purified 160-kDa glycoprotein and the glycoprotein expressed in COS-1 cells transfected with cDNA encoding the 160-kDa glycoprotein were shown to bind 45Ca2+ in a gel overlay assay. The protein was shown to be located in the lumen of the sarcoplasmic reticulum and to be associated through Ca2+ with the membrane. The authors propose that this lumenal Ca2+ binding glycoprotein of the sarcoplasmic reticulum be designated sarcalumenin.
Identification of functional elements and regulatory circuits by Drosophila modENCODE

Energy Technology Data Exchange (ETDEWEB)

Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.; Kheradpour, Pouya; Negre, Nicolas; Eaton, Matthew L.; Landolin, Jane M.; Bristow, Christopher A.; Ma, Lijia; Lin, Michael F.; Washietl, Stefan; Arshinoff, Bradley I.; Ay, Ferhat; Meyer, Patrick E.; Robine, Nicolas; Washington, Nicole L.; Stefano, Luisa Di; Berezikov, Eugene; Brown, Christopher D.; Candeias, Rogerio; Carlson, Joseph W.; Carr, Adrian; Jungreis, Irwin; Marbach, Daniel; Sealfon, Rachel; Tolstorukov, Michael Y.; Will, Sebastian; Alekseyenko, Artyom A.; Artieri, Carlo; Booth, Benjamin W.; Brooks, Angela N.; Dai, Qi; Davis, Carrie A.; Duff, Michael O.; Feng, Xin; Gorchakov, Andrey A.; Gu, Tingting; Henikoff, Jorja G.; Kapranov, Philipp; Li, Renhua; MacAlpine, Heather K.; Malone, John; Minoda, Aki; Nordman, Jared; Okamura, Katsutomo; Perry, Marc; Powell, Sara K.; Riddle, Nicole C.; Sakai, Akiko; Samsonova, Anastasia; Sandler, Jeremy E.; Schwartz, Yuri B.; Sher, Noa; Spokony, Rebecca; Sturgill, David; van Baren, Marijke; Wan, Kenneth H.; Yang, Li; Yu, Charles; Feingold, Elise; Good, Peter; Guyer, Mark; Lowdon, Rebecca; Ahmad, Kami; Andrews, Justen; Berger, Bonnie; Brenner, Steven E.; Brent, Michael R.; Cherbas, Lucy; Elgin, Sarah C. R.; Gingeras, Thomas R.; Grossman, Robert; Hoskins, Roger A.; Kaufman, Thomas C.; Kent, William; Kuroda, Mitzi I.; Orr-Weaver, Terry; Perrimon, Norbert; Pirrotta, Vincenzo; Posakony, James W.; Ren, Bing; Russell, Steven; Cherbas, Peter; Graveley, Brenton R.; Lewis, Suzanna; Micklem, Gos; Oliver, Brian; Park, Peter J.; Celniker, Susan E.; Henikoff, Steven; Karpen, Gary H.; Lai, Eric C.; MacAlpine, David M.; Stein, Lincoln D.; White, Kevin P.; Kellis, Manolis

2010-12-22

To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- and tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. The functions
Statistical framework for detection of genetically modified organisms based on Next Generation Sequencing.

Science.gov (United States)

Willems, Sander; Fraiture, Marie-Alice; Deforce, Dieter; De Keersmaecker, Sigrid C J; De Loose, Marc; Ruttink, Tom; Herman, Philippe; Van Nieuwerburgh, Filip; Roosens, Nancy

2016-02-01

Because the number and diversity of genetically modified (GM) crops has significantly increased, their analysis based on real-time PCR (qPCR) methods is becoming increasingly complex and laborious. While several pioneers already investigated Next Generation Sequencing (NGS) as an alternative to qPCR, its practical use has not been assessed for routine analysis. In this study a statistical framework was developed to predict the number of NGS reads needed to detect transgene sequences, to prove their integration into the host genome and to identify the specific transgene event in a sample with known composition. This framework was validated by applying it to experimental data from food matrices composed of pure GM rice, processed GM rice (noodles) or a 10% GM/non-GM rice mixture, revealing some influential factors. Finally, feasibility of NGS for routine analysis of GM crops was investigated by applying the framework to samples commonly encountered in routine analysis of GM crops. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
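To make the idea of "predicting the number of NGS reads needed" concrete, here is a deliberately simplified Python sketch. It is not the statistical framework of the paper: it assumes every read is an independent draw that comes from the transgene with a fixed probability, and asks how many reads give at least one transgene read with a chosen confidence. The function name, the transgene length, the genome size, and the GM fraction are all illustrative assumptions.

```python
import math


def reads_for_one_hit(target_fraction, confidence=0.95):
    """Smallest N such that P(at least one target read among N) >= confidence.

    Assumes independent reads, each hitting the target with probability
    `target_fraction` (a simplifying assumption, not the published model).
    """
    if not 0.0 < target_fraction < 1.0:
        raise ValueError("target_fraction must be in (0, 1)")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be in (0, 1)")
    # Solve 1 - (1 - f)^N >= confidence for N; log1p keeps precision for tiny f.
    return math.ceil(math.log(1.0 - confidence) / math.log1p(-target_fraction))


# Illustrative numbers only: a 5 kb insert in a genome of roughly 400 Mb,
# present in 10% of the sample material.
insert_bp = 5_000
genome_bp = 400_000_000
gm_fraction = 0.10
f = insert_bp / genome_bp * gm_fraction

print(reads_for_one_hit(f, confidence=0.95))
```

A realistic model would also need to account for read length, junction-spanning reads (to prove integration), ploidy, and sequencing bias, which is where a fuller framework such as the one described above comes in.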
SAMPEG: a scene-adaptive parallel MPEG-2 software encoder

NARCIS (Netherlands)

Farin, D.S.; Mache, N.; With, de P.H.N.; Girod, B.; Bouman, C.A.; Steinbach, E.G.

2001-01-01

This paper presents a fully software-based MPEG-2 encoder architecture, which uses scene-change detection to optimize the Group-of-Picture (GOP) structure for the actual video sequence. This feature enables easy, lossless edit cuts at scene-change positions and it also improves overall picture
Genetic basis of chronic pancreatitis

NARCIS (Netherlands)

Jansen, JBMJ; Morsche, RT; van Goor, Harry; Drenth, JPH

2002-01-01

Background: Pancreatitis has a proven genetic basis in a minority of patients. Methods: Review of the literature on genetics of pancreatitis. Results: Ever since the discovery that in most patients with hereditary pancreatitis a mutation in the gene encoding for cationic trypsinogen (R122H) was
Flanking sequence determination and event-specific detection of genetically modified wheat B73-6-1.

Science.gov (United States)

Xu, Junyi; Cao, Jijuan; Cao, Dongmei; Zhao, Tongtong; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2013-05-01

In order to establish a specific identification method for genetically modified (GM) wheat, exogenous insert DNA and flanking sequence between exogenous fragment and recombinant chromosome of GM wheat B73-6-1 were successfully acquired by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. Newly acquired exogenous fragment covered the full-length sequence of transformed genes such as transformed plasmid and corresponding functional genes including marker uidA, herbicide-resistant bar, ubiquitin promoter, and high-molecular-weight gluten subunit. The flanking sequence between insert DNA revealed high similarity with Triticum turgidum A gene (GenBank: AY494981.1). A specific PCR detection method for GM wheat B73-6-1 was established on the basis of primers designed according to the flanking sequence. This specific PCR method was validated by GM wheat, GM corn, GM soybean, GM rice, and non-GM wheat. The specifically amplified target band was observed only in GM wheat B73-6-1. This method is of high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of GM wheat B73-6-1.
Genetic variation and DNA fingerprinting of durian types in Malaysia using simple sequence repeat (SSR) markers.

Science.gov (United States)

Siew, Ging Yang; Ng, Wei Lun; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Tan, Soon Guan; Yeap, Swee Keong

2018-01-01

Durian (Durio zibethinus) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data are important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in GenBank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, HE = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10^-3. Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only show the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenge the current classification of durian types, e.g., on whether the different types should be called "clones", "varieties", or "cultivars". Such matters have a direct impact on the regulation and management of durian genetic resources in the region.
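The combined probability of identity (PI) quoted in the abstract above is commonly computed per locus as the chance that two unrelated individuals share a genotype, then multiplied across independent loci. The Python sketch below uses that standard per-locus formula, PI = Σ p_i^4 + Σ_{i<j} (2 p_i p_j)^2, as an assumption; the abstract does not say which estimator or software was used, and the allele frequencies are invented.

```python
from itertools import combinations
from math import prod


def locus_pi(allele_freqs):
    """Probability that two unrelated individuals share a genotype at one locus."""
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygous = sum(p ** 4 for p in allele_freqs)
    heterozygous = sum((2 * p * q) ** 2 for p, q in combinations(allele_freqs, 2))
    return homozygous + heterozygous


def combined_pi(loci):
    """Combined PI across loci, assuming the loci are independent."""
    return prod(locus_pi(freqs) for freqs in loci)


# Hypothetical allele frequencies for three loci (the last one is monomorphic).
loci = [
    [0.5, 0.5],
    [0.6, 0.3, 0.1],
    [1.0],
]
print(combined_pi(loci))
```

A monomorphic locus has PI = 1 and therefore adds no discriminating power, which is consistent with the abstract's note that only the polymorphic markers contributed to unique fingerprints.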
DNA-encoded chemical libraries: advancing beyond conventional small-molecule libraries.

Science.gov (United States)

Franzini, Raphael M; Neri, Dario; Scheuermann, Jörg

2014-04-15

DNA-encoded chemical libraries (DECLs) represent a promising tool in drug discovery. DECL technology allows the synthesis and screening of chemical libraries of unprecedented size at moderate costs. In analogy to phage-display technology, where large antibody libraries are displayed on the surface of filamentous phage and are genetically encoded in the phage genome, DECLs feature the display of individual small organic chemical moieties on DNA fragments serving as amplifiable identification barcodes. The DNA-tag facilitates the synthesis and allows the simultaneous screening of very large sets of compounds (up to billions of molecules), because the hit compounds can easily be identified and quantified by PCR-amplification of the DNA-barcode followed by high-throughput DNA sequencing. Several approaches have been used to generate DECLs, differing both in the methods used for library encoding and for the combinatorial assembly of chemical moieties. For example, DECLs can be used for fragment-based drug discovery, displaying a single molecule on DNA or two chemical moieties at the extremities of complementary DNA strands. DECLs can vary substantially in the chemical structures and the library size. While ultralarge libraries containing billions of compounds have been reported containing four or more sets of building blocks, also smaller libraries have been shown to be efficient for ligand discovery. In general, it has been found that the overall library size is a poor predictor for library performance and that the number and diversity of the building blocks are rather important indicators. Smaller libraries consisting of two to three sets of building blocks better fulfill the criteria of drug-likeness and often have higher quality. In this Account, we present advances in the DECL field from proof-of-principle studies to practical applications for drug discovery, both in industry and in academia. DECL technology can yield specific binders to a variety of target
Two Genes Encoding Uracil Phosphoribosyltransferase Are Present in Bacillus subtilis

DEFF Research Database (Denmark)

Martinussen, Jan; Glaser, Philippe; Andersen, Paal S.

1995-01-01

Uracil phosphoribosyltransferase (UPRTase) catalyzes the key reaction in the salvage of uracil in many microorganisms. Surprisingly, two genes encoding UPRTase activity were cloned from Bacillus subtilis by complementation of an Escherichia coli mutant. The genes were sequenced, and the putative...
Use of Whole Genome Sequencing for Diagnosis and Discovery in the Cancer Genetics Clinic

Directory of Open Access Journals (Sweden)

Samantha B. Foley

2015-01-01

Full Text Available Despite the potential of whole-genome sequencing (WGS) to improve patient diagnosis and care, the empirical value of WGS in the cancer genetics clinic is unknown. We performed WGS on members of two cohorts of cancer genetics patients: those with BRCA1/2 mutations (n = 176) and those without (n = 82). Initial analysis of potentially pathogenic variants (PPVs, defined as nonsynonymous variants with allele frequency < 1% in ESP6500) in 163 clinically-relevant genes suggested that WGS will provide useful clinical results. This is despite the fact that a majority of PPVs were novel missense variants likely to be classified as variants of unknown significance (VUS). Furthermore, previously reported pathogenic missense variants did not always associate with their predicted diseases in our patients. This suggests that the clinical use of WGS will require large-scale efforts to consolidate WGS and patient data to improve accuracy of interpretation of rare variants. While loss-of-function (LoF) variants represented only a small fraction of PPVs, WGS identified additional cancer risk LoF PPVs in patients with known BRCA1/2 mutations and led to cancer risk diagnoses in 21% of non-BRCA cancer genetics patients after expanding our analysis to 3209 ClinVar genes. These data illustrate how WGS can be used to improve our ability to discover patients' cancer genetic risks.
Cloning and characterization of the ddc homolog encoding L-2,4-diaminobutyrate decarboxylase in Enterobacter aerogenes.

Science.gov (United States)

Yamamoto, S; Mutoh, N; Tsuzuki, D; Ikai, H; Nakao, H; Shinoda, S; Narimatsu, S; Miyoshi, S I

2000-05-01

L-2,4-diaminobutyrate decarboxylase (DABA DC) catalyzes the formation of 1,3-diaminopropane (DAP) from DABA. In the present study, the ddc gene encoding DABA DC from Enterobacter aerogenes ATCC 13048 was cloned and characterized. Determination of the nucleotide sequence revealed an open reading frame of 1470 bp encoding a 53659-Da protein of 490 amino acids, whose deduced NH2-terminal sequence was identical to that of purified DABA DC from E. aerogenes. The deduced amino acid sequence was highly similar to those of Acinetobacter baumannii and Haemophilus influenzae DABA DCs encoded by the ddc genes. The lysine-307 of the E. aerogenes DABA DC was identified as the pyridoxal 5'-phosphate binding residue by site-directed mutagenesis. Furthermore, PCR analysis revealed the distribution of E. aerogenes ddc homologs in some other species of Enterobacteriaceae. Such a relatively wide occurrence of the ddc homologs implies biological significance of DABA DC and its product DAP.
Mutations in STX1B, encoding a presynaptic protein, cause fever-associated epilepsy syndromes

DEFF Research Database (Denmark)

Schubert, J.; Siekierska, A.; Langlois, M.

2014-01-01

Febrile seizures affect 2-4% of all children(1) and have a strong genetic component(2). Recurrent mutations in three main genes (SCN1A, SCN1B and GABRG2)(3-5) have been identified that cause febrile seizures with or without epilepsy. Here we report the identification of mutations in STX1B, encoding...... syntaxin-1B(6), that are associated with both febrile seizures and epilepsy. Whole-exome sequencing in independent large pedigrees(7,8) identified cosegregating STX1B mutations predicted to cause an early truncation or an in-frame insertion or deletion. Three additional nonsense or missense mutations...... and a de novo microdeletion encompassing STX1B were then identified in 449 familial or sporadic cases. Video and local field potential analyses of zebrafish larvae with antisense knockdown of stx1b showed seizure-like behavior and epileptiform discharges that were highly sensitive to increased temperature...
ENCODE whole-genome data in the UCSC genome browser (2011 update).

Science.gov (United States)

Raney, Brian J; Cline, Melissa S; Rosenbloom, Kate R; Dreszer, Timothy R; Learned, Katrina; Barber, Galt P; Meyer, Laurence R; Sloan, Cricket A; Malladi, Venkat S; Roskin, Krishna M; Suh, Bernard B; Hinrichs, Angie S; Clawson, Hiram; Zweig, Ann S; Kirkup, Vanessa; Fujita, Pauline A; Rhead, Brooke; Smith, Kayla E; Pohl, Andy; Kuhn, Robert M; Karolchik, Donna; Haussler, David; Kent, W James

2011-01-01

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.
A Ti plasmid-encoded enzyme required for degradation of mannopine is functionally homologous to the T-region-encoded enzyme required for synthesis of this opine in crown gall tumors.

OpenAIRE

Kim, K S; Chilton, W S; Farrand, S K

1996-01-01

The mocC gene encoded by the octopine/mannityl opine-type Ti plasmid pTi15955 is related at the nucleotide sequence level to mas1' encoded by the T region of this plasmid. While Mas1 is required for the synthesis of mannopine (MOP) by crown gall tumor cells, MocC is essential for the utilization of MOP by Agrobacterium spp. A cosmid clone of pTi15955, pYDH208, encodes mocC and confers the utilization of MOP on strain NT1 and on strain UIA5, a derivative of NT1 lacking the 450-kb cryptic plasm...

Improved Efficiency and Reliability of NGS Amplicon Sequencing Data Analysis for Genetic Diagnostic Procedures Using AGSA Software

Directory of Open Access Journals (Sweden)

Axel Poulet

2016-01-01

Screening for BRCA mutations in women with familial risk of breast or ovarian cancer is an ideal situation for high-throughput sequencing, providing large amounts of low cost data. However, 454 (Roche) and Ion Torrent (Thermo Fisher) technologies produce homopolymer-associated indel errors, complicating their use in routine diagnostics. We developed software, named AGSA, which helps to detect false positive mutations in homopolymeric sequences. Seventy-two familial breast cancer cases were analysed in parallel by amplicon 454 pyrosequencing and Sanger dideoxy sequencing for genetic variations of the BRCA genes. All 565 variants detected by dideoxy sequencing were also detected by pyrosequencing. Furthermore, pyrosequencing detected 42 variants that were missed with the Sanger technique. Six amplicons contained homopolymer tracts in the coding sequence that were systematically misread by the software supplied by Roche. Read data plotted as histograms by AGSA software aided the analysis considerably and allowed validation of the majority of homopolymers. As an optimisation, an additional 250 patients were analysed using microfluidic amplification of regions of interest (Access Array, Fluidigm) of the BRCA genes, followed by 454 sequencing and AGSA analysis. AGSA complements a complete line of high-throughput diagnostic sequence analysis, reducing time and costs while increasing reliability, notably for homopolymer tracts.
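The abstract does not describe how AGSA works internally, so the following is only a minimal sketch of the first step any such check needs: locating homopolymer tracts in an amplicon sequence so that indel calls inside them can be flagged for review. The function name `find_homopolymers`, the `min_len` threshold and the example sequence are all hypothetical and are not taken from AGSA.

```python
import re


def find_homopolymers(seq, min_len=4):
    """Return (start, base, length) for every run of a single base
    that is at least min_len long. Positions are 0-based.

    Hypothetical helper for illustration; not part of AGSA.
    """
    # A captured base followed by at least (min_len - 1) repeats of itself.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [
        (m.start(), m.group(1), len(m.group(0)))
        for m in pattern.finditer(seq.upper())
    ]


# Made-up amplicon fragment with a 5-base T run and a 4-base G run.
example = "GATTTTTCAGGGGCAAC"
print(find_homopolymers(example))
```

A variant caller could then treat any insertion or deletion that falls inside one of the reported intervals as a candidate false positive that needs confirmation, which is the kind of case the abstract says was misread by the vendor software.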
Genetic stability of attenuated mengovirus vectors with duplicate primary cleavage sequences

International Nuclear Information System (INIS)

Binder, J.J.; Hoffman, M.A.; Palmenberg, A.C.

2003-01-01

Short poly(C)-tract Mengoviruses have proven vaccine efficacy in many species of animals. A novel vector for the delivery of foreign proteins was created by insertion of a second autoproteolytic primary cleavage cassette linked to a multiple cloning site (MCS) into an attenuated variant of Mengo. Nineteen cDNAs from foreign sequences that ranged from 39 to 1653 bases were cloned into the MCS. The viral reading frame was maintained and translation resulted in dual, autocatalytic excision of the foreign peptides without disruption of any Mengo proteins. All cDNAs except those with the largest insertions produced viable virus. Active proteins such as GFP, CAT, and SIV p27 were expressed within infected cells. Relative to parental Mengo, the growth kinetics and genetic stability of each vector were inversely proportional to the size of the inserted sequence. While segments up to 1000 bases could be carried, inserts greater than 500-600 bases were usually reduced in size during serial passage. The limit on carrying capacity was probably due to difficulties in virion assembly or particle stability. Yet for inserts less than 500-600 bases, the Mengo vectors provided an effective system for the delivery of foreign epitopes into cells and mice.
Modulating and Measuring Intracellular H2O2 Using Genetically Encoded Tools to Study Its Toxicity to Human Cells.

Science.gov (United States)

Huang, Beijing K; Stein, Kassi T; Sikes, Hadley D

2016-12-16

Reactive oxygen species (ROS) such as H2O2 play paradoxical roles in mammalian physiology. It is hypothesized that low, baseline levels of H2O2 are necessary for growth and differentiation, while increased intracellular H2O2 concentrations are associated with pathological phenotypes and genetic instability, eventually reaching a toxic threshold that causes cell death. However, the quantities of intracellular H2O2 that lead to these different responses remain an unanswered question in the field. To address this question, we used genetically encoded constructs that both generate and quantify H2O2 in a dose-response study of H2O2-mediated toxicity. We found that, rather than a simple concentration-response relationship, a combination of intracellular concentration and the cumulative metric of H2O2 concentration multiplied by time (i.e., the area under the curve) determined the occurrence and level of cell death. Establishing the quantitative relationship between H2O2 and cell toxicity promotes a deeper understanding of the intracellular effects of H2O2 specifically as an individual reactive oxygen species, and it contributes to an understanding of its role in various redox-related diseases.
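The cumulative metric named in the abstract, concentration multiplied by time, is the area under a concentration-time curve. As a rough illustration of how such a value can be obtained from sampled measurements, the sketch below applies the trapezoidal rule. The function name, the time points and the concentrations are invented for the example; the abstract does not state which integration method or units the authors used.

```python
def area_under_curve(times, concentrations):
    """Trapezoidal-rule estimate of the area under a sampled curve.

    times and concentrations are equal-length sequences; times must be
    in increasing order. Units of the result are (concentration x time).
    """
    if len(times) != len(concentrations):
        raise ValueError("times and concentrations must have the same length")
    total = 0.0
    for i in range(1, len(times)):
        dt = times[i] - times[i - 1]
        # Area of the trapezoid between two consecutive samples.
        total += 0.5 * (concentrations[i] + concentrations[i - 1]) * dt
    return total


# Made-up exposure profile: rises, plateaus, then decays (arbitrary units).
hours = [0, 1, 2, 4]
level = [0, 2, 2, 0]
print(area_under_curve(hours, level))
```

Two exposure profiles with the same peak concentration can give different areas, which is consistent with the abstract's point that peak concentration alone did not determine cell death.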
SequenceCEROSENE: a computational method and web server to visualize spatial residue neighborhoods at the sequence level.

Science.gov (United States)

Heinke, Florian; Bittrich, Sebastian; Kaiser, Florian; Labudde, Dirk

2016-01-01

To understand the molecular function of biopolymers, studying their structural characteristics is of central importance. Graphics programs are often utilized to conceive these properties, but with the increasing number of available structures in databases or structure models produced by automated modeling frameworks this process requires assistance from tools that allow automated structure visualization. In this paper a web server and its underlying method for generating graphical sequence representations of molecular structures is presented. The method, called SequenceCEROSENE (color encoding of residues obtained by spatial neighborhood embedding), retrieves the sequence of each amino acid or nucleotide chain in a given structure and produces a color coding for each residue based on three-dimensional structure information. From this, color-highlighted sequences are obtained, where residue coloring represents three-dimensional residue locations in the structure. This color encoding thus provides a one-dimensional representation, from which spatial interactions, proximity and relations between residues or entire chains can be deduced quickly and solely from color similarity. Furthermore, additional heteroatoms and chemical compounds bound to the structure, like ligands or coenzymes, are processed and reported as well. To provide free access to SequenceCEROSENE, a web server has been implemented that allows generating color codings for structures deposited in the Protein Data Bank or structure models uploaded by the user. Besides retrieving visualizations in popular graphic formats, underlying raw data can be downloaded as well. In addition, the server provides user interactivity with generated visualizations and the three-dimensional structure in question. Color encoded sequences generated by SequenceCEROSENE can help to quickly perceive the general characteristics of a structure of interest (or entire sets of complexes), thus supporting the researcher in the initial
A genetic study of SSV1, the prototypical fusellovirus.

Directory of Open Access Journals (Sweden)

Eric Iverson

2012-06-01

Viruses of thermophilic Archaea are unique in both their structures and genomic sequences. The most widespread and arguably best studied are the lemon-shaped fuselloviruses. The spindle-shaped virus morphology is unique to Archaea but widespread therein. The best studied fusellovirus is SSV1 from Beppu, Japan, which infects Sulfolobus solfataricus. Very little is known about the function of the genes in the SSV1 genome. Recently we have developed genetic tools to analyze these genes. In this study, we have deleted three SSV1 open reading frames ranging from completely conserved to poorly conserved: VP2, d244, and b129. Deletion of the universally conserved ORF b129, which encodes a predicted transcriptional regulator, results in loss of infectivity. Deletion of the poorly-conserved predicted DNA binding protein gene VP2 yields viable virus that is indistinguishable from wild-type. Deletion of the well-conserved ORF d244 that encodes a predicted nuclease yields viable virus. However, infection of Sulfolobus solfataricus with virus lacking ORF d244 dramatically retards host growth, compared to the wild-type virus.
Genetically Encoded Biosensors Reveal PKA Hyperphosphorylation on the Myofilaments in Rabbit Heart Failure.

Science.gov (United States)

Barbagallo, Federica; Xu, Bing; Reddy, Gopireddy R; West, Toni; Wang, Qingtong; Fu, Qin; Li, Minghui; Shi, Qian; Ginsburg, Kenneth S; Ferrier, William; Isidori, Andrea M; Naro, Fabio; Patel, Hemal H; Bossuyt, Julie; Bers, Donald; Xiang, Yang K

2016-09-30

In heart failure, myofilament proteins display abnormal phosphorylation, which contributes to contractile dysfunction. The mechanisms underlying the dysregulation of protein phosphorylation on myofilaments are not clear. This study aims to understand the mechanisms underlying altered phosphorylation of myofilament proteins in heart failure. We generate a novel genetically encoded protein kinase A (PKA) biosensor anchored onto the myofilaments in rabbit cardiac myocytes to examine PKA activity at the myofilaments in response to adrenergic stimulation. We show that PKA activity is shifted from the sarcolemma to the myofilaments in hypertrophic failing rabbit myocytes. In particular, the increased PKA activity on the myofilaments is because of an enhanced β2 adrenergic receptor signal selectively directed to the myofilaments together with a reduced phosphodiesterase activity associated with the myofibrils. Mechanistically, the enhanced PKA activity on the myofilaments is associated with downregulation of caveolin-3 in the hypertrophic failing rabbit myocytes. Reintroduction of caveolin-3 in the failing myocytes is able to normalize the distribution of β2 adrenergic receptor signal by preventing PKA signal access to the myofilaments and to restore contractile response to adrenergic stimulation. In hypertrophic rabbit myocytes, selectively enhanced β2 adrenergic receptor signaling toward the myofilaments contributes to elevated PKA activity and PKA phosphorylation of myofilament proteins. Reintroduction of caveolin-3 is able to confine β2 adrenergic receptor signaling and restore myocyte contractility in response to β adrenergic stimulation. © 2016 American Heart Association, Inc.
Genome-Wide Single-Nucleotide Polymorphisms Discovery and High-Density Genetic Map Construction in Cauliflower Using Specific-Locus Amplified Fragment Sequencing

Science.gov (United States)

Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan

2016-01-01

Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515
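The reported average marker interval can be checked against the other figures in the abstract with simple arithmetic. The sketch below divides the total map length by the number of markers, which is one common convention; the abstract does not say which convention the authors used, and the function name is hypothetical.

```python
def average_marker_interval(map_length_cm, n_markers):
    """Mean spacing in cM, taken here as total map length / marker count.

    This is an assumed convention; some maps divide by the number of
    intervals between adjacent markers instead.
    """
    return map_length_cm / n_markers


# Figures quoted in the abstract: 890.01 cM spanned by 1776 SLAF markers.
interval = average_marker_interval(890.01, 1776)
print(round(interval, 2))
```

Under this convention the result rounds to the 0.50 cM stated in the abstract.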
Genetic Diversity Assessment and Identification of New Sour Cherry Genotypes Using Intersimple Sequence Repeat Markers

Directory of Open Access Journals (Sweden)

Roghayeh Najafzadeh

2014-01-01

Iran is one of the chief origins of subgenus Cerasus germplasm. In this study, the genetic variation of new Iranian sour cherries (which had such superior growth characteristics and fruit quality as to be considered for the introduction of new cultivars) was investigated and identified using 23 intersimple sequence repeat (ISSR) markers. Results indicated a high level of polymorphism of the genotypes based on these markers. According to these results, the primers tested in this study, especially ISSR-4, ISSR-6, ISSR-13, ISSR-14, ISSR-16, and ISSR-19, produced good and various levels of amplification, which can be effectively used in genetic studies of the sour cherry. The genetic similarity among genotypes showed a high diversity among the genotypes. Cluster analysis separated improved cultivars from promising Iranian genotypes, and the PCoA supported the cluster analysis results. Since the Iranian genotypes were superior to the improved cultivars and were separated from them in most groups, these genotypes can be considered as distinct genotypes for further evaluations in the framework of breeding programs and new cultivar identification in cherries. Results also confirmed that ISSR is a reliable DNA marker that can be used for exact genetic studies and in sour cherry breeding programs.
cDNA sequences of two inducible T-cell genes

Energy Technology Data Exchange (ETDEWEB)

Kwon, B.S. (Indiana Univ. School of Medicine, Indianapolis (USA) Guthrie Research Institute, Sayre, PA (USA)); Weissman, S.M. (Yale Univ., New Haven, CT (USA))

1989-03-01

The authors have previously described a set of human T-lymphocyte-specific cDNA clones isolated by a modified differential screening procedure. Apparent full-length cDNAs containing the sequences of 14 of the 16 initial isolates were sequenced and were found to represent five different species of mRNA; three of the five species were identical to previously reported cDNA sequences of preproenkephalin, T-cell-replacing factor, and a serine esterase, respectively. The other two species, 4-1BB and L2G25B, were inducible sequences found in mRNA from both a cytolytic T-lymphocyte and a helper T-lymphocyte clone and were not previously described in T-cell mRNA; these mRNA sequences encode peptides of 256 and 92 amino acids, respectively. Both peptides contain putative leader sequences. The protein encoded by 4-1BB also has a potential membrane anchor segment and other features also seen in known receptor proteins.
Imaging activity in astrocytes and neurons with genetically encoded calcium indicators following in utero electroporation

Directory of Open Access Journals (Sweden)

J. Michael Gee

2015-04-01

Complex interactions between networks of astrocytes and neurons are beginning to be appreciated, but remain poorly understood. Transgenic mice expressing fluorescent protein reporters of cellular activity, such as the GCaMP family of genetically encoded calcium indicators, have been used to explore network behavior. However, in some cases, it may be desirable to use long-established rat models that closely mimic particular aspects of human conditions such as Parkinson’s disease and the development of epilepsy following status epilepticus. Methods for expressing reporter proteins in the rat brain are relatively limited. Transgenic rat technologies exist but are fairly immature. Viral-mediated expression is robust but unstable, requires invasive injections, and only works well for fairly small genes (< 5 kb). In utero electroporation offers a valuable alternative. IUE is a proven method for transfecting populations of astrocytes and neurons in the rat brain without the strict limitations on transgene size. We built a toolset of IUE plasmids carrying GCaMP variants 3, 6s or 6f driven by CAG and targeted to the cytosol or the plasma membrane. Because low baseline fluorescence of GCaMP can hinder identification of transfected cells, we included the option of co-expressing a cytosolic tdTomato protein. A binary system consisting of a plasmid carrying a piggyBac inverted terminal repeat-flanked CAG-GCaMP-IRES-tdTomato cassette and a separate plasmid encoding for expression of piggyBac transposase was employed to stably express GCaMP and tdTomato. The plasmids were co-electroporated on embryonic days 13.5-14.5 and astrocytic and neuronal activity was subsequently imaged in acute or cultured brain slices prepared from the cortex or hippocampus. Large spontaneous transients were detected in slices obtained from rats of varying ages up to 127 days. In this report, we demonstrate the utility of this toolset for interrogating astrocytic and neuronal
Genetically Engineered Cyanobacteria

Science.gov (United States)

Zhou, Ruanbao (Inventor); Gibbons, William (Inventor)

2015-01-01

The disclosed embodiments provide cyanobacteria spp. that have been genetically engineered to have increased production of carbon-based products of interest. These genetically engineered hosts efficiently convert carbon dioxide and light into carbon-based products of interest such as long chained hydrocarbons. Several constructs containing polynucleotides encoding enzymes active in the metabolic pathways of cyanobacteria are disclosed. In many instances, the cyanobacteria strains have been further genetically modified to optimize production of the carbon-based products of interest. The optimization includes both up-regulation and down-regulation of particular genes.
Identification and characterization of genes encoding polycyclic aromatic hydrocarbon dioxygenase and polycyclic aromatic hydrocarbon dihydrodiol dehydrogenase in Pseudomonas putida OUS82.

OpenAIRE

Takizawa, N; Kaida, N; Torigoe, S; Moritani, T; Sawada, T; Satoh, S; Kiyohara, H

1994-01-01

Naphthalene and phenanthrene are transformed by enzymes encoded by the pah gene cluster of Pseudomonas putida OUS82. The pahA and pahB genes, which encode the first and second enzymes, dioxygenase and cis-dihydrodiol dehydrogenase, respectively, were identified and sequenced. The DNA sequences showed that pahA and pahB were clustered and that pahA consisted of four cistrons, pahAa, pahAb, pahAc, and pahAd, which encode ferredoxin reductase, ferredoxin, and two subunits of the iron-sulfur prot...
Comparative Sequence Analysis of Multidrug-Resistant IncA/C Plasmids from Salmonella enterica.

Science.gov (United States)

Hoffmann, Maria; Pettengill, James B; Gonzalez-Escalona, Narjol; Miller, John; Ayers, Sherry L; Zhao, Shaohua; Allard, Marc W; McDermott, Patrick F; Brown, Eric W; Monday, Steven R

2017-01-01

Determinants of multidrug resistance (MDR) are often encoded on mobile elements, such as plasmids, transposons, and integrons, which have the potential to transfer among foodborne pathogens, as well as to other virulent pathogens, increasing the threats these traits pose to human and veterinary health. Our understanding of MDR among Salmonella has been limited by the lack of closed plasmid genomes for comparisons across resistance phenotypes, due to difficulties in effectively separating the DNA of these high-molecular weight, low-copy-number plasmids from chromosomal DNA. To resolve this problem, we demonstrate an efficient protocol for isolating, sequencing and closing IncA/C plasmids from Salmonella sp. using single molecule real-time sequencing on a Pacific Biosciences (Pacbio) RS II Sequencer. We obtained six Salmonella enterica isolates from poultry, representing six different serovars, each exhibiting the MDR-Ampc resistance profile. Salmonella plasmids were obtained using a modified mini preparation and transformed with Escherichia coli DH10Br. A Qiagen Large-Construct kit™ was used to recover highly concentrated and purified plasmid DNA that was sequenced using PacBio technology. These six closed IncA/C plasmids ranged in size from 104 to 191 kb and shared a stable, conserved backbone containing 98 core genes, with only six differences among those core genes. The plasmids encoded a number of antimicrobial resistance genes, including those for quaternary ammonium compounds and mercury. We then compared our six IncA/C plasmid sequences: first with 14 IncA/C plasmids derived from S. enterica available at the National Center for Biotechnology Information (NCBI), and then with an additional 38 IncA/C plasmids derived from different taxa. These comparisons allowed us to build an evolutionary picture of how antimicrobial resistance may be mediated by this common plasmid backbone. Our project provides detailed genetic information about resistance genes in
Comparative Sequence Analysis of Multidrug-Resistant IncA/C Plasmids from Salmonella enterica

Directory of Open Access Journals (Sweden)

Maria Hoffmann

2017-08-01

Determinants of multidrug resistance (MDR) are often encoded on mobile elements, such as plasmids, transposons, and integrons, which have the potential to transfer among foodborne pathogens, as well as to other virulent pathogens, increasing the threats these traits pose to human and veterinary health. Our understanding of MDR among Salmonella has been limited by the lack of closed plasmid genomes for comparisons across resistance phenotypes, due to difficulties in effectively separating the DNA of these high-molecular weight, low-copy-number plasmids from chromosomal DNA. To resolve this problem, we demonstrate an efficient protocol for isolating, sequencing and closing IncA/C plasmids from Salmonella sp. using single molecule real-time sequencing on a Pacific Biosciences (Pacbio) RS II Sequencer. We obtained six Salmonella enterica isolates from poultry, representing six different serovars, each exhibiting the MDR-Ampc resistance profile. Salmonella plasmids were obtained using a modified mini preparation and transformed with Escherichia coli DH10Br. A Qiagen Large-Construct kit™ was used to recover highly concentrated and purified plasmid DNA that was sequenced using PacBio technology. These six closed IncA/C plasmids ranged in size from 104 to 191 kb and shared a stable, conserved backbone containing 98 core genes, with only six differences among those core genes. The plasmids encoded a number of antimicrobial resistance genes, including those for quaternary ammonium compounds and mercury. We then compared our six IncA/C plasmid sequences: first with 14 IncA/C plasmids derived from S. enterica available at the National Center for Biotechnology Information (NCBI), and then with an additional 38 IncA/C plasmids derived from different taxa. These comparisons allowed us to build an evolutionary picture of how antimicrobial resistance may be mediated by this common plasmid backbone. Our project provides detailed genetic information about
GENETIC DIFFERENTIATION AMONG POPULATIONS OF Chromobotia macracanthus BLEEKER FROM SUMATRA AND KALIMANTAN BASED ON SEQUENCING GENE OF MTDNA CYTOCHROME B AND NUCLEUS DNA RAG2

Directory of Open Access Journals (Sweden)

Sudarto Sudarto

2008-12-01

Genetic differentiation among populations of Chromobotia macracanthus Bleeker from Sumatra and Kalimantan was studied by sequencing the mtDNA Cytochrome b gene and the nuclear DNA RAG2 gene. The objectives of the study were to characterize the genetic differentiation among populations of clown loach, or botia (Chromobotia macracanthus), from Sumatra and Kalimantan and to estimate the divergence time of the two population groups. Samples were taken from three rivers in Sumatra, namely Batanghari, Musi, and Tulang Bawang, and from one river in Kalimantan, namely Kapuas. The genetic analysis was based on the sequencing of mtDNA Cytochrome b and nuclear DNA RAG2. The statistical analysis was done using the APE package in the R language. The parameters observed were nucleotide diversity, genetic distance, and the neighbor-joining tree. The results showed that nucleotide diversity was highest in the Musi population, while the other two populations, Tulang Bawang (Sumatra) and Kapuas (Kalimantan), had the lowest genetic diversity, especially based on nuclear DNA RAG2 sequencing. Based on mtDNA Cytochrome b sequencing, the populations most distinct from each other in terms of genetic distance were those of Musi and Kapuas. According to the neighbor-joining tree analysis, the botia populations were classified into two groups, namely the Sumatra group and the Kalimantan group. The estimated divergence time between the Sumatra and Kalimantan groups based on mtDNA Cytochrome b was about 9.25-9.46 million years (Miocene era). The high genetic differences between the Sumatra and Kalimantan groups suggest that restocking botia from Sumatra into Kalimantan has to be done carefully, because it may disturb the genetic originality of both botia populations.
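Two of the parameters named in the abstract, genetic distance and nucleotide diversity, can be illustrated with a small sketch. The study itself used the APE package in R; the Python below is not that code and uses the simplest possible definitions (uncorrected p-distance, and nucleotide diversity as the mean pairwise p-distance over a sample of aligned sequences). The function names and the toy sequences are assumptions made for illustration.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of aligned sites at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def nucleotide_diversity(seqs):
    """Mean pairwise p-distance across all pairs of sampled sequences."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(p_distance(a, b) for a, b in pairs) / len(pairs)


# Toy alignment of three 8-bp haplotypes (not real Cytochrome b data).
sample = ["ACGTACGT", "ACGTACGA", "ACGAACGA"]
print(p_distance(sample[0], sample[2]))
print(nucleotide_diversity(sample))
```

A matrix of such pairwise distances is also the usual input to neighbor-joining tree construction, the third parameter mentioned in the abstract, although real analyses normally apply a substitution model rather than the raw proportion of differences.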
Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

Science.gov (United States)

Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay

2016-01-01

There is a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both within and across countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequencing typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. Measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggests the dispersion of B. pseudomallei across the south western coastal part of India with limited geographical clustering. However, the majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.
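An MLST allelic profile, such as the seven-number profile quoted for ST 1368, is naturally represented as a tuple of allele numbers, one per locus. The sketch below shows how such profiles could be tallied and compared. Only the ST 1368 profile comes from the abstract; the second profile, the isolate list, and both function names are made up for illustration, and this is not the eBURST algorithm itself.

```python
from collections import Counter


def profile_frequencies(profiles):
    """Map each allelic profile to (count, percentage of isolates)."""
    counts = Counter(profiles)
    n = len(profiles)
    return {p: (c, 100.0 * c / n) for p, c in counts.items()}


def loci_differing(a, b):
    """Number of loci at which two allelic profiles carry different alleles."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same loci")
    return sum(x != y for x, y in zip(a, b))


st1368 = (1, 4, 6, 4, 1, 1, 3)        # profile quoted in the abstract
hypothetical = (1, 4, 6, 4, 1, 1, 4)  # invented single-locus variant
isolates = [st1368, st1368, st1368, hypothetical]  # invented sample

print(profile_frequencies(isolates)[st1368])
print(loci_differing(st1368, hypothetical))
```

Profiles that differ at a single locus are the kind of close relatives that clustering tools group together, whereas profiles differing at many loci would tend to appear as unlinked outliers in a population snapshot.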
Unravelling the Molecular Epidemiology and Genetic Diversity among Burkholderia pseudomallei Isolates from South India Using Multi-Locus Sequence Typing.

Directory of Open Access Journals (Sweden)

Chaitanya Tellapragada

There is a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both within and across countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequencing typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. Measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. Occurrence of the same ST across three adjacent states of south India suggests the dispersion of B. pseudomallei across the south western coastal part of India with limited geographical clustering. However, the majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.
The promoter of the glucoamylase-encoding gene of Aspergillus niger functions in Ustilago maydis

Energy Technology Data Exchange (ETDEWEB)

Smith, T.L. (Dept. of Agriculture, Madison, WI (United States) Univ. of Wisconsin, Madison (United States)); Gaskell, J.; Cullen, D. (Dept. of Agriculture, Madison, WI (United States)); Berka, R.M.; Yang, M.; Henner, D.J. (Genentech Inc., San Francisco, CA (United States))

1990-01-01

Promoter sequences from the Aspergillus niger glucoamylase-encoding gene (glaA) were linked to the bacterial hygromycin (Hy) phosphotransferase-encoding gene (hph) and this chimeric marker was used to select Hy-resistant (Hy[sup R]) Ustilago maydis transformants. This is an example of an Ascomycete promoter functioning in a Basidiomycete. Hy[sup R] transformants varied with respect to copy number of integrated vector, mitotic stability, and tolerance to Hy. Only 216 bp of glaA promoter sequence is required for expression in U. maydis but this promoter is not induced by starch as it is in Aspergillus spp. The transcription start points are the same in U. maydis and A. niger.
Estimation of genetic structure of a Mycosphaerella musicola population using inter-simple sequence repeat markers.

Science.gov (United States)

Peixouto, Y S; Dórea Bragança, C A; Andrade, W B; Ferreira, C F; Haddad, F; Oliveira, S A S; Darosci Brito, F S; Miller, R N G; Amorim, E P

2015-07-17

Among the diseases affecting banana (Musa sp.), yellow Sigatoka, caused by the fungal pathogen Mycosphaerella musicola Leach, is considered one of the most important in Brazil, causing losses throughout the year. Understanding the genetic structure of pathogen populations will provide insight into the life history of pathogens, including the evolutionary processes occurring in agrosystems. Tools for estimating the possible emergence of pathogen variants with altered pathogenicity, virulence, or aggressiveness, as well as resistance to systemic fungicides, can also be developed from such data. The objective of this study was to analyze the genetic diversity and population genetics of M. musicola in the main banana-producing regions in Brazil. A total of 83 isolates collected from different banana cultivars in the Brazilian states of Bahia, Rio Grande do Norte, and Minas Gerais were evaluated using inter-simple sequence repeat markers. High variability was detected between the isolates, and 85.5% of the haplotypes were singletons in the populations. The highest source of genetic diversity (97.22%) was attributed to variations within populations. Bayesian cluster analysis revealed the presence of 2 probable ancestral groups but showed no relationship to population structure in terms of collection site, state of origin, or cultivar. Similarly, we detected no evidence of genetic recombination between individuals within different states, indicating that asexual cycles play a major role in M. musicola reproduction and that long-distance dispersal of the pathogen is the main factor contributing to the lack of population structure in the fungus.
A massively parallel sequencing approach uncovers ancient origins and high genetic variability of endangered Przewalski's horses.

Science.gov (United States)

Goto, Hiroki; Ryder, Oliver A; Fisher, Allison R; Schultz, Bryant; Kosakovsky Pond, Sergei L; Nekrutenko, Anton; Makova, Kateryna D

2011-01-01

The endangered Przewalski's horse is the closest relative of the domestic horse and is the only true wild horse species surviving today. The question of whether Przewalski's horse is the direct progenitor of domestic horse has been hotly debated. Studies of DNA diversity within Przewalski's horses have been sparse but are urgently needed to ensure their successful reintroduction to the wild. In an attempt to resolve the controversy surrounding the phylogenetic position and genetic diversity of Przewalski's horses, we used massively parallel sequencing technology to decipher the complete mitochondrial and partial nuclear genomes for all four surviving maternal lineages of Przewalski's horses. Unlike single-nucleotide polymorphism (SNP) typing usually affected by ascertainment bias, the present method is expected to be largely unbiased. Three mitochondrial haplotypes were discovered: two similar ones, haplotypes I/II, and one substantially divergent from the other two, haplotype III. Haplotypes I/II versus III did not cluster together on a phylogenetic tree, rejecting the monophyly of Przewalski's horse maternal lineages, and were estimated to split 0.117-0.186 Ma, significantly preceding horse domestication. In the phylogeny based on autosomal sequences, Przewalski's horses formed a monophyletic clade, separate from the Thoroughbred domestic horse lineage. Our results suggest that Przewalski's horses have ancient origins and are not the direct progenitors of domestic horses. The analysis of the vast amount of sequence data presented here suggests that Przewalski's and domestic horse lineages diverged at least 0.117 Ma but since then have retained ancestral genetic polymorphism and/or experienced gene flow.

Human visual system automatically encodes sequential regularities of discrete events.

Science.gov (United States)

Kimura, Motohiro; Schröger, Erich; Czigler, István; Ohira, Hideki

2010-06-01

For our adaptive behavior in a dynamically changing environment, an essential task of the brain is to automatically encode sequential regularities inherent in the environment into a memory representation. Recent studies in neuroscience have suggested that sequential regularities embedded in discrete sensory events are automatically encoded into a memory representation at the level of the sensory system. This notion is largely supported by evidence from investigations using auditory mismatch negativity (auditory MMN), an event-related brain potential (ERP) correlate of an automatic memory-mismatch process in the auditory sensory system. However, it is still largely unclear whether or not this notion can be generalized to other sensory modalities. The purpose of the present study was to investigate the contribution of the visual sensory system to the automatic encoding of sequential regularities using visual mismatch negativity (visual MMN), an ERP correlate of an automatic memory-mismatch process in the visual sensory system. To this end, we conducted a sequential analysis of visual MMN in an oddball sequence consisting of infrequent deviant and frequent standard stimuli, and tested whether the underlying memory representation of visual MMN generation contains only a sensory memory trace of standard stimuli (trace-mismatch hypothesis) or whether it also contains sequential regularities extracted from the repetitive standard sequence (regularity-violation hypothesis). The results showed that visual MMN was elicited by first deviant (deviant stimuli following at least one standard stimulus), second deviant (deviant stimuli immediately following first deviant), and first standard (standard stimuli immediately following first deviant), but not by second standard (standard stimuli immediately following first standard). These results are consistent with the regularity-violation hypothesis, suggesting that the visual sensory system automatically encodes sequential
Targeted exon sequencing in Usher syndrome type I.

Science.gov (United States)

Bujakowska, Kinga M; Consugar, Mark; Place, Emily; Harper, Shyana; Lena, Jaclyn; Taub, Daniel G; White, Joseph; Navarro-Gomez, Daniel; Weigel DiFranco, Carol; Farkas, Michael H; Gai, Xiaowu; Berson, Eliot L; Pierce, Eric A

2014-12-02

Patients with Usher syndrome type I (USH1) have retinitis pigmentosa, profound congenital hearing loss, and vestibular ataxia. This syndrome is currently thought to be associated with at least six genes, which are encoded by over 180 exons. Here, we present the use of state-of-the-art techniques in the molecular diagnosis of a cohort of 47 USH1 probands. The cohort was studied with selective exon capture and next-generation sequencing of currently known inherited retinal degeneration genes, comparative genomic hybridization, and Sanger sequencing of new USH1 exons identified by human retinal transcriptome analysis. With this approach, we were able to genetically solve 14 of the 47 probands by confirming the biallelic inheritance of mutations. We detected two likely pathogenic variants in an additional 19 patients, for whom family members were not available for cosegregation analysis to confirm biallelic inheritance. Ten patients, in addition to primary disease-causing mutations, carried rare likely pathogenic USH1 alleles or variants in other genes associated with deaf-blindness, which may influence disease phenotype. Twenty-one of the identified mutations were novel among the 33 definite or likely solved patients. Here, we also present a clinical description of the studied cohort at their initial visits. We found a remarkable genetic heterogeneity in the studied USH1 cohort with multiplicity of mutations, of which many were novel. No obvious influence of genotype on phenotype was found, possibly due to small sample sizes of the genotypes under study. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
Evolution of Sphingomonad Gene Clusters Related to Pesticide Catabolism Revealed by Genome Sequence and Mobilomics of Sphingobium herbicidovorans MH.

Science.gov (United States)

Nielsen, Tue Kjærgaard; Rasmussen, Morten; Demanèche, Sandrine; Cecillon, Sébastien; Vogel, Timothy M; Hansen, Lars Hestbjerg

2017-09-01

Bacterial degraders of chlorophenoxy herbicides have been isolated from various ecosystems, including pristine environments. Among these degraders, the sphingomonads constitute a prominent group that displays versatile xenobiotic-degradation capabilities. Four separate sequencing strategies were required to provide the complete sequence of the complex and plastic genome of the canonical chlorophenoxy herbicide-degrading Sphingobium herbicidovorans MH. The genome has an intricate organization of the chlorophenoxy-herbicide catabolic genes sdpA, rdpA, and cadABCD that encode the (R)- and (S)-enantiomer-specific 2,4-dichlorophenoxypropionate dioxygenases and four subunits of a Rieske non-heme iron oxygenase involved in 2-methyl-chlorophenoxyacetic acid degradation, respectively. Several major genomic rearrangements are proposed to help understand the evolution and mobility of these important genes and their genetic context. Single-strain mobilomic sequence analysis uncovered plasmids and insertion sequence-associated circular intermediates in this environmentally important bacterium and enabled the description of evolutionary models for pesticide degradation in strain MH and related organisms. The mobilome presented a complex mosaic of mobile genetic elements including four plasmids and several circular intermediate DNA molecules of insertion-sequence elements and transposons that are central to the evolution of xenobiotics degradation. Furthermore, two individual chromosomally integrated prophages were shown to excise and form free circular DNA molecules. This approach holds great potential for improving the understanding of genome plasticity, evolution, and microbial ecology. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Development of a functional cell-based assay that probes the specific interaction between influenza A virus NP and its packaging signal sequence RNA.

Science.gov (United States)

Woo, Jiwon; Yu, Kyung Lee; Lee, Sun Hee; You, Ji Chang

2015-02-06

Although cis-acting packaging signal RNA sequences for the influenza virus NP-encoding vRNA have been identified recently through genetic studies, little is known about the interaction between NP and the vRNA packaging signals either in vivo or in vitro. Here, we provide evidence that NP is able to interact specifically with the vRNA packaging sequence RNA within living cells and that the specific RNA binding activity of NP in vivo requires both the N-terminal and central regions of the protein. The assay established here would be a valuable tool for further detailed studies of the NP-packaging signal RNA interaction in living cells. Copyright © 2014 Elsevier Inc. All rights reserved.
Chimeric Feline Coronaviruses That Encode Type II Spike Protein on Type I Genetic Background Display Accelerated Viral Growth and Altered Receptor Usage

Science.gov (United States)

Tekes, Gergely; Hofmann-Lehmann, Regina; Bank-Wolf, Barbara; Maier, Reinhard; Thiel, Heinz-Jürgen; Thiel, Volker

2010-01-01

Persistent infection of domestic cats with feline coronaviruses (FCoVs) can lead to a highly lethal, immunopathological disease termed feline infectious peritonitis (FIP). Interestingly, there are two serotypes, type I and type II FCoVs, that can cause both persistent infection and FIP, even though their main determinant of host cell tropism, the spike (S) protein, is of different phylogeny and displays limited sequence identity. In cell culture, however, there are apparent differences. Type II FCoVs can be propagated to high titers by employing feline aminopeptidase N (fAPN) as a cellular receptor, whereas the propagation of type I FCoVs is usually difficult, and the involvement of fAPN as a receptor is controversial. In this study we have analyzed the phenotypes of recombinant FCoVs that are based on the genetic background of type I FCoV strain Black but encode the type II FCoV strain 79-1146 S protein. Our data demonstrate that recombinant FCoVs expressing a type II FCoV S protein acquire the ability to efficiently use fAPN for host cell entry and corroborate the notion that type I FCoVs use another main host cell receptor. We also observed that recombinant FCoVs display a large-plaque phenotype and, unexpectedly, accelerated growth kinetics indistinguishable from that of type II FCoV strain 79-1146. Thus, the main phenotypic differences for type I and type II FCoVs in cell culture, namely, the growth kinetics and the efficient usage of fAPN as a cellular receptor, can be attributed solely to the FCoV S protein. PMID:19906918
Accurate Local-Ancestry Inference in Exome-Sequenced Admixed Individuals via Off-Target Sequence Reads

Science.gov (United States)

Hu, Youna; Willer, Cristen; Zhan, Xiaowei; Kang, Hyun Min; Abecasis, Gonçalo R.

2013-01-01

Estimates of the ancestry of specific chromosomal regions in admixed individuals are useful for studies of human evolutionary history and for genetic association studies. Previously, this ancestry inference relied on high-quality genotypes from genome-wide association study (GWAS) arrays. These high-quality genotypes are not always available when samples are exome sequenced, and exome sequencing is the strategy of choice for many ongoing genetic studies. Here we show that off-target reads generated during exome-sequencing experiments can be combined with on-target reads to accurately estimate the ancestry of each chromosomal segment in an admixed individual. To reconstruct local ancestry, our method SEQMIX models aligned bases directly instead of relying on hard genotype calls. We evaluate the accuracy of our method through simulations and analysis of samples sequenced by the 1000 Genomes Project and the NHLBI Grand Opportunity Exome Sequencing Project. In African Americans, we show that local-ancestry estimates derived by our method are very similar to those derived with Illumina’s Omni 2.5M genotyping array and much improved in relation to estimates that use only exome genotypes and ignore off-target sequencing reads. Software implementing this method, SEQMIX, can be applied to analysis of human population history or used for genetic association studies in admixed individuals. PMID:24210252
Programming molecular self-assembly of intrinsically disordered proteins containing sequences of low complexity

Science.gov (United States)

Simon, Joseph R.; Carroll, Nick J.; Rubinstein, Michael; Chilkoti, Ashutosh; López, Gabriel P.

2017-06-01

Dynamic protein-rich intracellular structures that contain phase-separated intrinsically disordered proteins (IDPs) composed of sequences of low complexity (SLC) have been shown to serve a variety of important cellular functions, which include signalling, compartmentalization and stabilization. However, our understanding of these structures and our ability to synthesize models of them have been limited. We present design rules for IDPs possessing SLCs that phase separate into diverse assemblies within droplet microenvironments. Using theoretical analyses, we interpret the phase behaviour of archetypal IDP sequences and demonstrate the rational design of a vast library of multicomponent protein-rich structures that ranges from uniform nano-, meso- and microscale puncta (distinct protein droplets) to multilayered orthogonally phase-separated granular structures. The ability to predict and program IDP-rich assemblies in this fashion offers new insights into (1) genetic-to-molecular-to-macroscale relationships that encode hierarchical IDP assemblies, (2) design rules of such assemblies in cell biology and (3) molecular-level engineering of self-assembled recombinant IDP-rich materials.
Genetic Diversity and Sequence Variations at Growth Hormone Loci among Composite and Hereford Populations of Beef Cattle

Directory of Open Access Journals (Sweden)

ALAN J. LYMBERY

2000-07-01

A total of 194 Hereford and 235 composite-breed cattle from Wokalup Research Station were used in this study. The aims of the study were to investigate polymorphisms in the growth hormone gene in the composite and purebred Hereford herds from the Wokalup selection experiment, to compare genetic diversity in the growth hormone gene between the breeds, and to sequence the growth hormone loci of the composite and purebred Hereford herds and compare them with the published sequence from GenBank. Genomic DNA was extracted using the Wizard genomic DNA purification system from Promega. Two fragments of the growth hormone gene were amplified by PCR and then analyzed by RFLP. Each genotype at both loci was sequenced: PCR products of each genotype were cloned into the pCR II vector and transformed, colonies were selected, and plasmid DNA was extracted and cycle sequenced. PCR-RFLP analysis revealed polymorphisms in both breeds at both loci, GH-L1 and GH-L2, of the growth hormone gene. Sequencing confirmed the RFLP data: the polymorphism detected with AluI at GH-L1 is due to a leucine/valine substitution at position 127, while the polymorphism at the MspI restriction site is caused by a C-to-T transition at position +837.
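The PCR-RFLP step in the abstract above depends on a single base change creating or destroying a restriction site. The Python sketch below illustrates that idea only; it is not the authors' protocol. The recognition sequences (AluI: AGCT, MspI: CCGG) are the standard ones, while the two short allele sequences and the function name `find_sites` are hypothetical and do not come from the growth hormone gene.

```python
# Standard recognition sequences for the two enzymes named in the abstract.
RECOGNITION_SITES = {"AluI": "AGCT", "MspI": "CCGG"}


def find_sites(sequence, enzyme):
    """Return the 0-based start positions of every recognition site for `enzyme`."""
    site = RECOGNITION_SITES[enzyme]
    seq = sequence.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)  # step by 1 so overlapping sites are found
    return positions


# Hypothetical alleles (illustration only): a C-to-T transition removes the MspI site.
allele_c = "TTGACCGGATAGCTAA"
allele_t = "TTGACTGGATAGCTAA"

for name, allele in (("C allele", allele_c), ("T allele", allele_t)):
    print(name, "MspI sites:", find_sites(allele, "MspI"),
          "AluI sites:", find_sites(allele, "AluI"))
```

In this toy case the C allele would be cut by MspI and the T allele would not, which is the kind of difference that produces distinct fragment patterns on a gel.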
An Exome Sequencing Study to Assess the Role of Rare Genetic Variation in Pulmonary Fibrosis.

Science.gov (United States)

Petrovski, Slavé; Todd, Jamie L; Durheim, Michael T; Wang, Quanli; Chien, Jason W; Kelly, Fran L; Frankel, Courtney; Mebane, Caroline M; Ren, Zhong; Bridgers, Joshua; Urban, Thomas J; Malone, Colin D; Finlen Copeland, Ashley; Brinkley, Christie; Allen, Andrew S; O'Riordan, Thomas; McHutchison, John G; Palmer, Scott M; Goldstein, David B

2017-07-01

Idiopathic pulmonary fibrosis (IPF) is an increasingly recognized, often fatal lung disease of unknown etiology. The aim of this study was to use whole-exome sequencing to improve understanding of the genetic architecture of pulmonary fibrosis. We performed a case-control exome-wide collapsing analysis including 262 unrelated individuals with pulmonary fibrosis clinically classified as IPF according to American Thoracic Society/European Respiratory Society/Japanese Respiratory Society/Latin American Thoracic Association guidelines (81.3%), usual interstitial pneumonia secondary to autoimmune conditions (11.5%), or fibrosing nonspecific interstitial pneumonia (7.2%). The majority (87%) of case subjects reported no family history of pulmonary fibrosis. We searched 18,668 protein-coding genes for an excess of rare deleterious genetic variation using whole-exome sequence data from 262 case subjects with pulmonary fibrosis and 4,141 control subjects drawn from among a set of individuals of European ancestry. Comparing genetic variation across 18,668 protein-coding genes, we found a study-wide significant (P RTEL1, and PARN. A model qualifying ultrarare, deleterious, nonsynonymous variants implicated TERT and RTEL1, and a model specifically qualifying loss-of-function variants implicated RTEL1 and PARN. A subanalysis of 186 case subjects with sporadic IPF confirmed TERT, RTEL1, and PARN as study-wide significant contributors to sporadic IPF. Collectively, 11.3% of case subjects with sporadic IPF carried a qualifying variant in one of these three genes compared with the 0.3% carrier rate observed among control subjects (odds ratio, 47.7; 95% confidence interval, 21.5-111.6; P = 5.5 × 10^-22). We identified TERT, RTEL1, and PARN (three telomere-related genes previously implicated in familial pulmonary fibrosis) as significant contributors to sporadic IPF. These results support the idea that telomere dysfunction is involved in IPF pathogenesis.
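The carrier-rate comparison in the abstract above is summarized as an odds ratio with a 95% confidence interval. As a rough illustration of how such a figure is computed from a 2×2 table, the Python sketch below uses the log (Woolf) approximation for the interval. The counts are hypothetical and are not the study's data, the function name `odds_ratio_ci` is made up for this example, and the study may well have used an exact method instead.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers, z=1.96):
    """Odds ratio and approximate 95% CI (Woolf's log method) for a 2x2 table."""
    odds_ratio = (case_carriers * control_noncarriers) / (
        case_noncarriers * control_carriers)
    # Standard error of log(OR): square root of the summed reciprocal cell counts.
    se_log_or = math.sqrt(1 / case_carriers + 1 / case_noncarriers
                          + 1 / control_carriers + 1 / control_noncarriers)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
estimate, ci_low, ci_high = odds_ratio_ci(20, 80, 10, 190)
print(f"OR = {estimate:.2f}, 95% CI {ci_low:.2f} to {ci_high:.2f}")
```

The Woolf interval is unreliable when any cell count is very small, which is one reason rare-variant studies often report exact intervals.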
Nucleic acid sequences encoding D1 and D1/D2 domains of human coxsackievirus and adenovirus receptor (CAR)

Science.gov (United States)

Freimuth, Paul I.

2010-04-06

The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.
Molecular characterization of rpoB gene encoding the RNA ...

African Journals Online (AJOL)

Polymerase chain reaction (PCR) mediated direct DNA sequencing was evaluated for rapid detection of Rifampicin resistance (RMPr) of Mycobacterium tuberculosis. After amplification of the rpoB gene, the product was sequenced using ABI 310 Genetic Analyzer and the rifampicin resistance in M. tuberculosis were ...
Conventions and nomenclature for double diffusion encoding NMR and MRI

DEFF Research Database (Denmark)

Shemesh, Noam; Jespersen, Sune N; Alexander, Daniel C

2015-01-01

, such as double diffusion encoding (DDE) NMR and MRI, may provide novel quantifiable metrics that are less easily inferred from conventional diffusion acquisitions. Despite the growing interest on the topic, the terminology for the pulse sequences, their parameters, and the metrics that can be derived from them...
Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

Science.gov (United States)

Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

1999-04-16

A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.
Isolation, nucleotide sequence and expression of a cDNA encoding feline granulocyte colony-stimulating factor.

Science.gov (United States)

Dunham, S P; Onions, D E

2001-06-21

A cDNA encoding feline granulocyte colony stimulating factor (fG-CSF) was cloned from alveolar macrophages using the reverse transcriptase-polymerase chain reaction. The cDNA is 949 bp in length and encodes a predicted mature protein of 174 amino acids. Recombinant fG-CSF was expressed as a glutathione S-transferase fusion and purified by affinity chromatography. Biological activity of the recombinant protein was demonstrated using the murine myeloblastic cell line GNFS-60, which showed an ED50 for fG-CSF of approximately 2 ng/ml. Copyright 2001 Academic Press.
Next-generation sequencing meets genetic diagnostics: development of a comprehensive workflow for the analysis of BRCA1 and BRCA2 genes

Science.gov (United States)

Feliubadaló, Lídia; Lopez-Doriga, Adriana; Castellsagué, Ester; del Valle, Jesús; Menéndez, Mireia; Tornero, Eva; Montes, Eva; Cuesta, Raquel; Gómez, Carolina; Campos, Olga; Pineda, Marta; González, Sara; Moreno, Victor; Brunet, Joan; Blanco, Ignacio; Serra, Eduard; Capellá, Gabriel; Lázaro, Conxi

2013-01-01

Next-generation sequencing (NGS) is changing genetic diagnosis due to its huge sequencing capacity and cost-effectiveness. The aim of this study was to develop an NGS-based workflow for routine diagnostics for hereditary breast and ovarian cancer syndrome (HBOCS), to improve genetic testing for BRCA1 and BRCA2. An NGS-based workflow was designed using BRCA MASTR kit amplicon libraries followed by GS Junior pyrosequencing. Data analysis combined Variant Identification Pipeline freely available software and ad hoc R scripts, including a cascade of filters to generate coverage and variant calling reports. A BRCA homopolymer assay was performed in parallel. A research scheme was designed in two parts. A Training Set of 28 DNA samples containing 23 unique pathogenic mutations and 213 other variants (33 unique) was used. The workflow was validated in a set of 14 samples from HBOCS families in parallel with the current diagnostic workflow (Validation Set). The NGS-based workflow developed permitted the identification of all pathogenic mutations and genetic variants, including those located in or close to homopolymers. The use of NGS for detecting copy-number alterations was also investigated. The workflow meets the sensitivity and specificity requirements for the genetic diagnosis of HBOCS and improves on the cost-effectiveness of current approaches. PMID:23249957
Genetically encoded fluorescent voltage sensors using the voltage-sensing domain of Nematostella and Danio phosphatases exhibit fast kinetics.

Science.gov (United States)

Baker, Bradley J; Jin, Lei; Han, Zhou; Cohen, Lawrence B; Popovic, Marko; Platisa, Jelena; Pieribone, Vincent

2012-07-15

A substantial increase in the speed of the optical response of genetically encoded fluorescent protein voltage sensors (FP voltage sensors) was achieved by using the voltage-sensing phosphatase genes of Nematostella vectensis and Danio rerio. A potential N. vectensis voltage-sensing phosphatase was identified in silico. The voltage-sensing domain (S1-S4) of the N. vectensis homolog was used to create an FP voltage sensor called Nema. By replacing the phosphatase with a cerulean/citrine FRET pair, a new FP voltage sensor was synthesized with fast off kinetics (Tau(off)voltage-sensing phosphatase homolog, designated Zahra and Zahra 2, exhibited fast on and off kinetics within 2ms of the time constants observed with the organic voltage-sensitive dye, di4-ANEPPS. Mutagenesis of the S4 region of the Danio FP voltage sensor shifted the voltage dependence to more negative potentials but did not noticeably affect the kinetics of the optical signal. Copyright © 2012 Elsevier B.V. All rights reserved.
Genetically-encoded fluorescent voltage sensors using the voltage-sensing domain of Nematostella and Danio phosphatases exhibit fast kinetics

Science.gov (United States)

Baker, Bradley J.; Jin, Lei; Han, Zhou; Cohen, Lawrence B.; Popovic, Marko; Platisa, Jelena; Pieribone, Vincent

2012-01-01

A substantial increase in the speed of the optical response of genetically-encoded Fluorescent Protein voltage sensors (FP voltage sensors) was achieved by using the voltage-sensing phosphatase genes of Nematostella vectensis and Danio rerio. A potential N. vectensis voltage-sensing phosphatase was identified in silico. The voltage-sensing domain (S1–S4) of the N. vectensis homolog was used to create an FP voltage sensor called Nema. By replacing the phosphatase with a cerulean/citrine FRET pair, a new FP voltage sensor was synthesized with fast off kinetics (Tauoff voltage-sensing phosphatase homolog, designated Zahra and Zahra 2, exhibited fast on and off kinetics within 2 msec of the time constants observed with the organic voltage-sensitive dye, di4-ANEPPS. Mutagenesis of the S4 region of the Danio FP voltage sensor shifted the voltage dependence to more negative potentials but did not noticeably affect the kinetics of the optical signal. PMID:22634212
Genetic diversity of the Andean tuber-bearing species, oca (Oxalis tuberosa Mol.), investigated by inter-simple sequence repeats.

Science.gov (United States)

Pissard, A; Ghislain, M; Bertin, P

2006-01-01

The Andean tuber-bearing species, Oxalis tuberosa Mol., is a vegetatively propagated crop cultivated in the uplands of the Andes. Its genetic diversity was investigated in the present study using the inter-simple sequence repeat (ISSR) technique. Thirty-two accessions originating from South America (Argentina, Bolivia, Chile, and Peru) and maintained in vitro were chosen to represent the ecogeographic diversity of its cultivation area. Twenty-two primers were tested and 9 were selected according to fingerprinting quality and reproducibility. Genetic diversity analysis was performed with 90 markers. Jaccard's genetic distance between accessions ranged from 0 to 0.49 with an average of 0.28 +/- 0.08 (mean +/- SD). Dendrogram (UPGMA (unweighted pair-group method with arithmetic averaging)) and factorial correspondence analysis (FCA) showed that the genetic structure was influenced by the collection site. The two most distant clusters contained all of the Peruvian accessions, one from Bolivia, none from Argentina or Chile. Analysis by country revealed that Peru presented the greatest genetic distances from the other countries and possessed the highest intra-country genetic distance (0.30 +/- 0.08). This suggests that the Peruvian oca accessions form a distinct genetic group. The relatively low level of genetic diversity in the oca species may be related to its predominating reproduction strategy, i.e., vegetative propagation. The extent and structure of the genetic diversity of the species detailed here should help the establishment of conservation strategies.
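The diversity figures in the abstract above rest on Jaccard's distance between accessions scored for presence (1) or absence (0) of each ISSR band. The Python sketch below shows one common way to compute that distance and its mean over all pairs. The three band profiles are hypothetical, as are the names `jaccard_distance` and `profiles`; the study's own scoring and software are not described in the abstract.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Jaccard distance between two equal-length 0/1 band profiles."""
    shared = sum(1 for x, y in zip(a, b) if x == 1 and y == 1)
    either = sum(1 for x, y in zip(a, b) if x == 1 or y == 1)
    if either == 0:
        return 0.0  # no bands in either profile: treat the profiles as identical
    return 1.0 - shared / either


# Hypothetical band profiles for three accessions (1 = band present).
profiles = {
    "acc1": [1, 1, 0, 1, 0],
    "acc2": [1, 1, 0, 1, 0],
    "acc3": [1, 0, 1, 1, 0],
}

distances = {
    (p, q): jaccard_distance(profiles[p], profiles[q])
    for p, q in combinations(sorted(profiles), 2)
}
mean_distance = sum(distances.values()) / len(distances)

for pair, d in distances.items():
    print(pair, round(d, 2))
print("mean pairwise distance:", round(mean_distance, 2))
```

A distance of 0, as between the first two toy accessions, corresponds to the lower bound of the range reported in the abstract: two accessions with identical band patterns. A matrix of such distances is also the usual input to UPGMA clustering.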
Journal of Genetics | Indian Academy of Sciences

Indian Academy of Sciences (India)

Journal of Genetics. Articles written by Chetana Sachidanandan. Volume 92 Issue 3 December 2013 pp 695-701 Perspectives. Time for the zebrafish ENCODE. Sridhar Sivasubbu, Chetana Sachidanandan, Vinod Scaria ...
D20S16 is a complex interspersed repeated sequence: Genetic and physical analysis of the locus

Energy Technology Data Exchange (ETDEWEB)

Bowden, D.W.; Krawchuk, M.D.; Howard, T.D. [Wake Forest Univ., Winston-Salem, NC (United States)] [and others

1995-01-20

The genomic structure of the D20S16 locus has been evaluated using genetic and physical methods. D20S16, originally detected with the probe CRI-L1214, is a highly informative, complex restriction fragment length polymorphism consisting of two separate allelic systems. The allelic systems have the characteristics of conventional VNTR polymorphisms and are separated by recombination (θ = 0.02, Z_max = 74.82), as demonstrated in family studies. Most of these recombination events are meiotic crossovers and are maternal in origin, but two, including deletion of the locus in a cell line from a CEPH family member, occur without evidence for exchange of flanking markers. DNA sequence analysis suggests that the basis of the polymorphism is variable numbers of a 98-bp sequence tandemly repeated with 87 to 90% sequence similarity between repeats. The 98-bp repeat is a dimer of 49 bp sequence with 45 to 98% identity between the elements. In addition, nonpolymorphic genomic sequences adjacent to the polymorphic 98-bp repeat tracts are also repeated but are not polymorphic, i.e., show no individual to individual variation. Restriction enzyme mapping of cosmids containing the CRI-L1214 sequence suggests that there are multiple interspersed repeats of the CRI-L1214 sequence on chromosome 20. The results of dual-color fluorescence in situ hybridization experiments with interphase nuclei are also consistent with multiple repeats of an interspersed sequence on chromosome 20. 23 refs., 6 figs.
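The abstract above describes repeat units by their percent identity to one another. As a minimal illustration of that measure, the Python sketch below counts matching positions between two already aligned, equal-length sequences. The two 10-base sequences and the function name `percent_identity` are hypothetical; the real 49-bp and 98-bp elements are not given in the abstract, and any gapped alignment the authors used is not modeled here.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that match between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for x, y in zip(seq_a.upper(), seq_b.upper()) if x == y)
    return 100.0 * matches / len(seq_a)


# Hypothetical repeat units differing at a single position.
unit_1 = "ACGTACGTAC"
unit_2 = "ACGTTCGTAC"
identity = percent_identity(unit_1, unit_2)
print(f"{identity:.1f}% identity")
```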
