WorldWideScience

Sample records for i-like proviral genome

  1. Molecular cloning of human T-cell lymphotrophic virus type I-like proviral genome from the peripheral lymphocyte DNA of a patient with chronic neurologic disorders

    International Nuclear Information System (INIS)

    Reddy, E.P.; Mettus, R.V.; DeFreitas, E.; Wroblewska, Z.; Cisco, M.; Koprowski, H.

    1988-01-01

    Human T-cell lymphotropic virus type I (HTLV-I), the etiologic agent of human T-cell leukemia, has recently been shown to be associated with neurologic disorders such as tropical spastic paraparesis, HTLV-associated myelopathy, and possibly with multiple sclerosis. In this communication, the authors have examined one specific case of neurologic disorder that can be classified as multiple sclerosis or tropical spastic paraparesis. The patient suffering from chronic neurologic disorder was found to contain antibodies to HTLV-I envelope and gag proteins in his serum and cerebrospinal fluid. Lymphocytes from peripheral blood and cerebrospinal fluid of the patient were shown to express viral RNA sequences by in situ hybridization. Southern blot analysis of the patient lymphocyte DNA revealed the presence of HTLV-I-related sequences. Blot-hybridization analysis of the RNA from fresh peripheral lymphocytes stimulated with interleukin 2 revealed the presence of abundant amounts of genomic viral RNA with little or no subgenomic RNA. They have cloned the proviral genome from the DNA of the peripheral lymphocytes and determined its restriction map. This analysis shows that this proviral genome is very similar if not identical to that of the prototype HTLV-I genome.

  2. Proviral HIV-genome-wide and pol-gene specific zinc finger nucleases: usability for targeted HIV gene therapy.

    Science.gov (United States)

    Wayengera, Misaki

    2011-07-22

    Infection with HIV, which culminates in the establishment of a latent proviral reservoir, presents formidable challenges for ultimate cure. Building on the hypothesis that ex-vivo or even in-vivo abolition or disruption of HIV-gene/genome-action by target mutagenesis or excision can irreversibly abrogate HIV's innate fitness to replicate and survive, we previously identified the isoschizomeric bacteria restriction enzymes (REases) AcsI and ApoI as potent cleavers of the HIV-pol gene (11 and 9 times in HIV-1 and 2, respectively). However, both enzymes, along with others found to cleave across the entire HIV-1 genome, slice (SX) at palindromic sequences that are prevalent within the human genome and thereby pose the risk of host genome toxicity. A long-term goal in the field of R-M enzymatic therapeutics has thus been to generate synthetic restriction endonucleases with longer recognition sites limited in specificity to HIV. We aimed (i) to assemble and construct zinc finger arrays and nucleases (ZFN) with either proviral-HIV-pol gene or proviral-HIV-1 whole-genome specificity respectively, and (ii) to advance a model for pre-clinically testing lentiviral vectors (LV) that deliver and transduce either ZFN genotype. First, we computationally generated the consensus sequences of (a) 114 dsDNA-binding zinc finger (Zif) arrays (ZFAs or ZifHIV-pol) and (b) two zinc-finger nucleases (ZFNs) which, unlike the AcsI and ApoI homeodomains, possess specificity to >18 base-pair sequences uniquely present within the HIV-pol gene (ZifHIV-polFN). Another 15 ZFNs targeting >18 bp sequences within the complete HIV-1 proviral genome were constructed (ZifHIV-1FN). Second, a model for constructing lentiviral vectors (LVs) that deliver and transduce a diploid copy of either ZifHIV-polFN or ZifHIV-1FN chimeric genes (termed LV- 2xZifHIV-polFN and LV- 2xZifHIV-1FN, respectively) is proposed. 
Third, two preclinical models for controlled testing of the safety and efficacy of either of these
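
    The specificity argument in this abstract (short restriction sites recur throughout a host genome, whereas sites longer than 18 bp should be far rarer) can be illustrated with a minimal sketch. It assumes the commonly cited degenerate recognition sequence RAATTY for ApoI and its isoschizomer AcsI, which the abstract itself does not state, and it assumes uniformly random bases when estimating site frequency. The toy sequence and the function names are hypothetical and are not drawn from the cited work.

```python
import re

# IUPAC nucleotide codes used here, mapped to the bases each one allows.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # purine
    "Y": "CT",    # pyrimidine
    "N": "ACGT",  # any base
}


def site_positions(sequence, site):
    """Return 0-based start positions of all matches (overlaps included)."""
    pattern = "".join(f"[{IUPAC[base]}]" for base in site.upper())
    # A zero-width lookahead reports overlapping occurrences as well.
    return [m.start() for m in re.finditer(f"(?=({pattern}))", sequence.upper())]


def site_probability(site):
    """Chance that a random position matches, assuming uniform random bases."""
    probability = 1.0
    for base in site.upper():
        probability *= len(IUPAC[base]) / 4
    return probability


# Assumption: RAATTY is the ApoI/AcsI site; it is not given in the abstract.
APOI_SITE = "RAATTY"
# Hypothetical toy sequence, not taken from any HIV or human genome.
toy = "CCGAATTCGGAAATTTCCAAATTGG"

print(site_positions(toy, APOI_SITE))
print(1 / site_probability(APOI_SITE))  # average spacing between random hits
print(1 / site_probability("A" * 18))   # same figure for a fixed 18-bp site
```

    Under the uniform-base assumption, a 6-bp degenerate site of this kind is expected about once per 1,024 positions, while a fully specified 18-bp site is expected about once per 4^18 (roughly 6.9 x 10^10) positions. Real genomes are not uniformly random, so these figures are only an order-of-magnitude guide.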

  3. Proviral HIV-genome-wide and pol-gene specific Zinc Finger Nucleases: Usability for targeted HIV gene therapy

    Directory of Open Access Journals (Sweden)

    Wayengera Misaki

    2011-07-01

    Full Text Available Background: Infection with HIV, which culminates in the establishment of a latent proviral reservoir, presents formidable challenges for ultimate cure. Building on the hypothesis that ex-vivo or even in-vivo abolition or disruption of HIV-gene/genome-action by target mutagenesis or excision can irreversibly abrogate HIV's innate fitness to replicate and survive, we previously identified the isoschizomeric bacteria restriction enzymes (REases) AcsI and ApoI as potent cleavers of the HIV-pol gene (11 and 9 times in HIV-1 and 2, respectively). However, both enzymes, along with others found to cleave across the entire HIV-1 genome, slice (SX) at palindromic sequences that are prevalent within the human genome and thereby pose the risk of host genome toxicity. A long-term goal in the field of R-M enzymatic therapeutics has thus been to generate synthetic restriction endonucleases with longer recognition sites limited in specificity to HIV. We aimed (i) to assemble and construct zinc finger arrays and nucleases (ZFN) with either proviral-HIV-pol gene or proviral-HIV-1 whole-genome specificity respectively, and (ii) to advance a model for pre-clinically testing lentiviral vectors (LV) that deliver and transduce either ZFN genotype. Methods and Results: First, we computationally generated the consensus sequences of (a) 114 dsDNA-binding zinc finger (Zif) arrays (ZFAs or ZifHIV-pol) and (b) two zinc-finger nucleases (ZFNs) which, unlike the AcsI and ApoI homeodomains, possess specificity to >18 base-pair sequences uniquely present within the HIV-pol gene (ZifHIV-polFN). Another 15 ZFNs targeting >18 bp sequences within the complete HIV-1 proviral genome were constructed (ZifHIV-1FN). Second, a model for constructing lentiviral vectors (LVs) that deliver and transduce a diploid copy of either ZifHIV-polFN or ZifHIV-1FN chimeric genes (termed LV- 2xZifHIV-polFN and LV- 2xZifHIV-1FN, respectively) is proposed.
Third, two preclinical models for controlled testing of

  4. Specific Destruction of HIV Proviral p17 Gene in T Lymphoid Cells Achieved by the Genome Editing Technology.

    Science.gov (United States)

    Kishida, Tsunao; Ejima, Akika; Mazda, Osam

    2016-01-01

    Recent development in genome editing technologies has enabled site-directed deprivation of a nucleotide sequence in the chromosome in mammalian cells. Human immunodeficiency virus (HIV) infection causes integration of proviral DNA into the chromosome, which potentially leads to re-emergence of the virus, but conventional treatment cannot delete the proviral DNA sequence from the cells infected with HIV. In the present study, the transcription activator-like effector nucleases (TALENs) specific for the HIV p17 gene were constructed, and their activities to destroy the target sequence were evaluated. SSA assay showed a high activity of a pair of p17-specific TALENs. A human T lymphoid cell line, Jurkat, was infected with a lentivirus vector followed by transfection with the TALEN-HIV by electroporation. The target sequence was disrupted in approximately 10-95% of the p17 polymerase chain reaction clones, and the efficiencies depended on the Jurkat-HIV clones. Because p17 plays essential roles for assembly and budding of HIV, and this gene has relatively low nucleotide sequence diversity, genome editing procedures targeting p17 may provide a therapeutic benefit for HIV infection.

  5. Structure and possible function of a G-quadruplex in the long terminal repeat of the proviral HIV-1 genome.

    Science.gov (United States)

    De Nicola, Beatrice; Lech, Christopher J; Heddi, Brahim; Regmi, Sagar; Frasson, Ilaria; Perrone, Rosalba; Richter, Sara N; Phan, Anh Tuân

    2016-07-27

    The long terminal repeat (LTR) of the proviral human immunodeficiency virus (HIV)-1 genome is integral to virus transcription and host cell infection. The guanine-rich U3 region within the LTR promoter, previously shown to form G-quadruplex structures, represents an attractive target to inhibit HIV transcription and replication. In this work, we report the structure of a biologically relevant G-quadruplex within the LTR promoter region of HIV-1. The guanine-rich sequence designated LTR-IV forms a well-defined structure in physiological cationic solution. The nuclear magnetic resonance (NMR) structure of this sequence reveals a parallel-stranded G-quadruplex containing a single-nucleotide thymine bulge, which participates in a conserved stacking interaction with a neighboring single-nucleotide adenine loop. Transcription analysis in a HIV-1 replication competent cell indicates that the LTR-IV region may act as a modulator of G-quadruplex formation in the LTR promoter. Consequently, the LTR-IV G-quadruplex structure presented within this work could represent a valuable target for the design of HIV therapeutics. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. CD4 is expressed on a heterogeneous subset of hematopoietic progenitors, which persistently harbor CXCR4 and CCR5-tropic HIV proviral genomes in vivo.

    Directory of Open Access Journals (Sweden)

    Nadia T Sebastian

    2017-07-01

    Full Text Available Latent HIV infection of long-lived cells is a barrier to viral clearance. Hematopoietic stem and progenitor cells are a heterogeneous population of cells, some of which are long-lived. CXCR4-tropic HIVs infect a broad range of HSPC subtypes, including hematopoietic stem cells, which are multi-potent and long-lived. However, CCR5-tropic HIV infection is limited to more differentiated progenitor cells with life spans that are less well understood. Consistent with emerging data that restricted progenitor cells can be long-lived, we detected persistent HIV in restricted HSPC populations from optimally treated people. Further, genotypic and phenotypic analysis of amplified env alleles from donor samples indicated that both CXCR4- and CCR5-tropic viruses persisted in HSPCs. RNA profiling confirmed expression of HIV receptor RNA in a pattern that was consistent with in vitro and in vivo results. In addition, we characterized a CD4high HSPC sub-population that was preferentially targeted by a variety of CXCR4- and CCR5-tropic HIVs in vitro. Finally, we present strong evidence that HIV proviral genomes of both tropisms can be transmitted to CD4-negative daughter cells of multiple lineages in vivo. In some cases, the transmitted proviral genomes contained signature deletions that inactivated the virus, eliminating the possibility that coincidental infection explains the results. These data support a model in which both stem and non-stem cell progenitors serve as persistent reservoirs for CXCR4- and CCR5-tropic HIV proviral genomes that can be passed to daughter cells.

  7. CD4 is expressed on a heterogeneous subset of hematopoietic progenitors, which persistently harbor CXCR4 and CCR5-tropic HIV proviral genomes in vivo.

    Science.gov (United States)

    Sebastian, Nadia T; Zaikos, Thomas D; Terry, Valeri; Taschuk, Frances; McNamara, Lucy A; Onafuwa-Nuga, Adewunmi; Yucha, Ryan; Signer, Robert A J; Riddell, James; Bixby, Dale; Markowitz, Norman; Morrison, Sean J; Collins, Kathleen L

    2017-07-01

    Latent HIV infection of long-lived cells is a barrier to viral clearance. Hematopoietic stem and progenitor cells are a heterogeneous population of cells, some of which are long-lived. CXCR4-tropic HIVs infect a broad range of HSPC subtypes, including hematopoietic stem cells, which are multi-potent and long-lived. However, CCR5-tropic HIV infection is limited to more differentiated progenitor cells with life spans that are less well understood. Consistent with emerging data that restricted progenitor cells can be long-lived, we detected persistent HIV in restricted HSPC populations from optimally treated people. Further, genotypic and phenotypic analysis of amplified env alleles from donor samples indicated that both CXCR4- and CCR5-tropic viruses persisted in HSPCs. RNA profiling confirmed expression of HIV receptor RNA in a pattern that was consistent with in vitro and in vivo results. In addition, we characterized a CD4high HSPC sub-population that was preferentially targeted by a variety of CXCR4- and CCR5-tropic HIVs in vitro. Finally, we present strong evidence that HIV proviral genomes of both tropisms can be transmitted to CD4-negative daughter cells of multiple lineages in vivo. In some cases, the transmitted proviral genomes contained signature deletions that inactivated the virus, eliminating the possibility that coincidental infection explains the results. These data support a model in which both stem and non-stem cell progenitors serve as persistent reservoirs for CXCR4- and CCR5-tropic HIV proviral genomes that can be passed to daughter cells.

  8. Damaging the Integrated HIV Proviral DNA with TALENs.

    Directory of Open Access Journals (Sweden)

    Christy L Strong

    Full Text Available HIV-1 integrates its proviral DNA genome into the host genome, presenting barriers for virus eradication. Several new gene-editing technologies have emerged that could potentially be used to damage integrated proviral DNA. In this study, we use transcription activator-like effector nucleases (TALENs) to target a highly conserved sequence in the transactivation response element (TAR) of the HIV-1 proviral DNA. We demonstrated that TALENs cleave a DNA template with the HIV-1 proviral target site in vitro. A GFP reporter, under control of HIV-1 TAR, was efficiently inactivated by mutations introduced by transfection of TALEN plasmids. When infected cells containing the full-length integrated HIV-1 proviral DNA were transfected with TALENs, the TAR region accumulated indels. When one of these mutants was tested, the mutated HIV-1 proviral DNA was incapable of producing detectable Gag expression. TALEN variants engineered for degenerate recognition of select nucleotide positions also cleaved proviral DNA in vitro and the full-length integrated proviral DNA genome in living cells. These results suggest a possible design strategy for the therapeutic considerations of incomplete target sequence conservation and acquired resistance mutations. We have established a new strategy for damaging integrated HIV proviral DNA that may have future potential for HIV-1 proviral DNA eradication.

  9. The proviral genome of radiation leukemia virus: Molecular cloning, nucleotide sequence of its long terminal repeat and integration in lymphoma cell DNA

    International Nuclear Information System (INIS)

    Janowski, M.; Merregaert, J.; Boniver, J.; Maisin, J.R.

    1985-01-01

    The proviral genome of a thymotropic and leukemogenic C57BL/Ka mouse retrovirus, RadLV/VL3(T+L+), was cloned as a biologically active PstI insert in the bacterial plasmid pBR322. Its restriction map was compared to those, already known, of two nonthymotropic and nonleukemogenic viruses of the same mouse strain, the ecotropic BL/Ka(B) and the xenotropic constituent of the radiation leukemia virus complex (RadLV). Differences were observed in the pol gene and in the env gene. Moreover, the nucleotide sequence of the RadLV/VL3(T+L+) long terminal repeat revealed the existence of two copies of a 42 bp long sequence, separated by 11 nucleotides and of which BL/Ka(B) possesses only one copy.

  10. The proviral genome of radiation leukemia virus (RadLV): molecular cloning, restriction analysis and integration sites in tumor cell DNA

    International Nuclear Information System (INIS)

    Janowski, M.; Merregaert, J.; Nuyten, J.M.; Maisin, J.R.

    1984-01-01

    An infectious clone of the linear, unintegrated RadLV provirus was obtained by insertion in the plasmid pBR322. Its restriction map was indistinguishable from that of the majority of the multiple proviral copies, which are found apparently at random sites in the DNA of RadLV-induced rat thymic lymphomas.

  11. Contribution of type W human endogenous retroviruses to the human genome: characterization of HERV-W proviral insertions and processed pseudogenes.

    Science.gov (United States)

    Grandi, Nicole; Cadeddu, Marta; Blomberg, Jonas; Tramontano, Enzo

    2016-09-09

    Human endogenous retroviruses (HERVs) are ancient sequences integrated in the germ line cells and vertically transmitted through the offspring, constituting about 8% of our genome. In time, HERVs accumulated mutations that compromised their coding capacity. A prominent exception is HERV-W locus 7q21.2, producing a functional Env protein (Syncytin-1) coopted for placental syncytiotrophoblast formation. While expression of HERV-W sequences has been investigated for their correlation to disease, an exhaustive description of the group composition and characteristics is still not available, and current HERV-W group information derives from studies published a few years ago that used the rough assemblies of the human genome available at that time. This hampers the comparison and correlation with current human genome assemblies. In the present work we identified and described in detail the distribution and genetic composition of 213 HERV-W elements. The bioinformatics analysis led to the characterization of several previously unreported features and provided a phylogenetic classification of two main subgroups with different age and structural characteristics. New facts on HERV-W genomic context of insertion and co-localization with sequences putatively involved in disease development are also reported. The present work is a detailed overview of the HERV-W contribution to the human genome and provides a robust genetic background useful to clarify HERV-W role in pathologies with poorly understood etiology, representing, to our knowledge, the most complete and exhaustive HERV-W dataset to date.

  12. LHC physics? I like it!

    CERN Multimedia

    2013-01-01

    When asked why I called the new particle “Higgs like”, rather than just “Higgs”, I used to joke that it’s because I like it. And indeed I do. But now we can confidently drop the ‘like’: this new particle is almost undoubtedly a Higgs.   What leads me to say that with such confidence is the skill and dedication of the Higgs analysis teams from ATLAS and CMS. Over the last few months they have shown that a number of key properties of the new particle all point to it being a Higgs: the way it interacts with other particles agrees with theoretical predictions for a Higgs particle, and its quantum properties of spin and parity are as required for a Higgs. The question we need to ask now is what kind of Higgs is it? Is it the Higgs of the Standard Model of particle physics? If so, then one of the crowning achievements of 20th century physics will be complete, with a theory that fully explains the behaviour of the particles that m...

  13. Why I like power cuts...

    CERN Multimedia

    Computer Security Team

    2012-01-01

    Accidental power cuts - a permanent nuisance when running accelerators or computing services, since it takes a lot of time to recover from them. While I feel very sorry for those who are under pressure to get their service running again and deeply regret the loss of down-time and availability, I must admit that I like power cuts: power cuts make computers reboot! And rebooting computers at CERN means all the pending software patches are automatically applied.   But don’t think I am egotistic enough to endorse power cuts. Not necessarily! I am already happy if you regularly patch your computer(s) yourself, where regularly means at least once a month: · If you run a centrally or locally managed Windows computer, give that small orange blinking “CMF” icon in the taskbar a chance in the evening to apply all the pending patches. Also, let it initiate a reboot at the end! · If you have a personal computer with your own Windows operating system, ...

  14. Multiple proviral integration events after virological synapse-mediated HIV-1 spread

    International Nuclear Information System (INIS)

    Russell, Rebecca A.; Martin, Nicola; Mitar, Ivonne; Jones, Emma; Sattentau, Quentin J.

    2013-01-01

    HIV-1 can move directly between T cells via virological synapses (VS). Although aspects of the molecular and cellular mechanisms underlying this mode of spread have been elucidated, the outcomes for infection of the target cell remain incompletely understood. We set out to determine whether HIV-1 transfer via VS results in productive, high-multiplicity HIV-1 infection. We found that HIV-1 cell-to-cell spread resulted in nuclear import of multiple proviruses into target cells as seen by fluorescence in-situ hybridization. Proviral integration into the target cell genome was significantly higher than that seen in a cell-free infection system, and consequent de novo viral DNA and RNA production in the target cell detected by quantitative PCR increased over time. Our data show efficient proviral integration across VS, implying the probability of multiple integration events in target cells that drive productive T cell infection. - Highlights: • Cell-to-cell HIV-1 infection delivers multiple vRNA copies to the target cell. • Cell-to-cell infection results in productive infection of the target cell. • Cell-to-cell transmission is more efficient than cell-free HIV-1 infection. • Suggests a mechanism for recombination in cells infected with multiple viral genomes

  15. Using Resurrected Ancestral Proviral Proteins to Engineer Virus Resistance

    Directory of Open Access Journals (Sweden)

    Asunción Delgado

    2017-05-01

    Full Text Available Proviral factors are host proteins hijacked by viruses for processes essential for virus propagation such as cellular entry and replication. Pathogens and their hosts co-evolve. It follows that replacing a proviral factor with a functional ancestral form of the same protein could prevent viral propagation without fatally compromising organismal fitness. Here, we provide proof of concept of this notion. Thioredoxins serve as general oxidoreductases in all known cells. We report that several laboratory resurrections of Precambrian thioredoxins display substantial levels of functionality within Escherichia coli. Unlike E. coli thioredoxin, however, these ancestral thioredoxins are not efficiently recruited by the bacteriophage T7 for its replisome and therefore prevent phage propagation in E. coli. These results suggest an approach to the engineering of virus resistance. Diseases caused by viruses may have a devastating effect in agriculture. We discuss how the suggested approach could be applied to the engineering of plant virus resistance.

  16. In vitro modeling of HIV proviral activity in microglia.

    Science.gov (United States)

    Campbell, Lee A; Richie, Christopher T; Zhang, Yajun; Heathward, Emily J; Coke, Lamarque M; Park, Emily Y; Harvey, Brandon K

    2017-12-01

    Microglia, the resident macrophages of the brain, play a key role in the pathogenesis of HIV-associated neurocognitive disorders (HAND) due to their productive infection by HIV. This results in the release of neurotoxic viral proteins and pro-inflammatory compounds which negatively affect the functionality of surrounding neurons. Because models of HIV infection within the brain are limited, we aimed to create a novel microglia cell line with an integrated HIV provirus capable of recreating several hallmarks of HIV infection. We utilized clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 gene editing technology and integrated a modified HIV provirus into CHME-5 immortalized microglia to create HIV-NanoLuc CHME-5. In the modified provirus, the Gag-Pol region is replaced with the coding region for NanoLuciferase (NanoLuc), which allows for the rapid assay of HIV long terminal repeat activity using a luminescent substrate, while still containing the necessary genetic material to produce established neurotoxic viral proteins (e.g. tat, nef, gp120). We confirmed that HIV-NanoLuc CHME-5 microglia express NanoLuc, along with the HIV viral protein Nef. We subsequently exposed these cells to a battery of experiments to modulate the activity of the provirus. Proviral activity was enhanced by treating the cells with pro-inflammatory factors lipopolysaccharide (LPS) and tumor necrosis factor alpha and by overexpressing the viral regulatory protein Tat. Conversely, genetic modification of the toll-like receptor-4 gene by CRISPR/Cas9 reduced LPS-mediated proviral activation, and pharmacological application of NF-κB inhibitor sulfasalazine similarly diminished proviral activity. Overall, these data suggest that HIV-NanoLuc CHME-5 may be a useful tool in the study of HIV-mediated neuropathology and proviral regulation. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  17. Possible roles of HIV-1 nucleocapsid protein in the specificity of proviral DNA synthesis and in its variability.

    Science.gov (United States)

    Lapadat-Tapolsky, M; Gabus, C; Rau, M; Darlix, J L

    1997-05-02

    Retroviral nucleocapsid (NC) protein is an integral part of the virion nucleocapsid where it coats the dimeric RNA genome. Due to its nucleic acid binding and annealing activities, NC protein directs the annealing of the tRNA primer to the primer binding site and greatly facilitates minus strand DNA elongation and transfer while protecting the nucleic acids against nuclease degradation. To understand the role of NCp7 in viral DNA synthesis, we examined the influence of NCp7 on self-primed versus primer-specific reverse transcription. The results show that HIV-1 NCp7 can extensively inhibit self-primed reverse transcription of viral and cellular RNAs while promoting primer-specific synthesis of proviral DNA. The role of NCp7 vis-a-vis the presence of mutations in the viral DNA during minus strand elongation was examined. NCp7 maximized the annealing between a cDNA(-) primer containing one to five consecutive errors and an RNA representing the 3' end of the genome. The ability of reverse transcriptase (RT) in the presence of NCp7 to subsequently extend the mutated primers depended upon the position of the mismatch within the primer:template complex. When the mutations were at the polymerisation site, primer extension by RT in the presence of NCp7 was very high, about 40% for one mismatch and 3% for five consecutive mismatches. Mutations within the DNA primer or at its 5' end had little effect on the extension of viral DNA by RT. Taken together these results indicate that NCp7 plays major roles in proviral DNA synthesis within the virion core due to its ability to promote prime-specific proviral DNA synthesis while concurrently inhibiting non-specific reverse transcription of viral and cellular RNAs. Moreover, the observation that NCp7 enhances the incorporation of mutations during minus strand DNA elongation favours the notion that NCp7 is a factor contributing to the high mutation rate of HIV-1.

  18. Using Resurrected Ancestral Proviral Proteins to Engineer Virus Resistance.

    Science.gov (United States)

    Delgado, Asunción; Arco, Rocio; Ibarra-Molero, Beatriz; Sanchez-Ruiz, Jose M

    2017-05-09

    Proviral factors are host proteins hijacked by viruses for processes essential for virus propagation such as cellular entry and replication. Pathogens and their hosts co-evolve. It follows that replacing a proviral factor with a functional ancestral form of the same protein could prevent viral propagation without fatally compromising organismal fitness. Here, we provide proof of concept of this notion. Thioredoxins serve as general oxidoreductases in all known cells. We report that several laboratory resurrections of Precambrian thioredoxins display substantial levels of functionality within Escherichia coli. Unlike E. coli thioredoxin, however, these ancestral thioredoxins are not efficiently recruited by the bacteriophage T7 for its replisome and therefore prevent phage propagation in E. coli. These results suggest an approach to the engineering of virus resistance. Diseases caused by viruses may have a devastating effect in agriculture. We discuss how the suggested approach could be applied to the engineering of plant virus resistance. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  19. Number and location of mouse mammary tumor virus proviral DNA in mouse DNA of normal tissue and of mammary tumors.

    Science.gov (United States)

    Groner, B; Hynes, N E

    1980-01-01

    The Southern DNA filter transfer technique was used to characterize the genomic location of the mouse mammary tumor proviral DNA in different inbred strains of mice. Two of the strains (C3H and CBA) arose from a cross of a Bagg albino (BALB/c) mouse and a DBA mouse. The mouse mammary tumor virus-containing restriction enzyme DNA fragments of these strains had similar patterns, suggesting that the proviruses of these mice are in similar genomic locations. Conversely, the pattern arising from the DNA of the GR mouse, a strain genetically unrelated to the others, appeared different, suggesting that its mouse mammary tumor proviruses are located in different genomic sites. The structure of another gene, that coding for beta-globin, was also compared. The mouse strains which we studied can be categorized into two classes, expressing either one or two beta-globin proteins. The macroenvironment of the beta-globin gene appeared similar among the mouse strains belonging to one genetic class. Female mice of the C3H strain exogenously transmit mouse mammary tumor virus via the milk, and their offspring have a high incidence of mammary tumor occurrence. DNA isolated from individual mammary tumors taken from C3H mice or from BALB/c mice foster nursed on C3H mothers was analyzed by the DNA filter transfer technique. Additional mouse mammary tumor virus-containing fragments were found in the DNA isolated from each mammary tumor. These proviral sequences were integrated into different genomic sites in each tumor. PMID: 6245257

  20. Quantification of HTLV-I proviral load in experimentally infected rabbits

    Directory of Open Access Journals (Sweden)

    Kindt Thomas J

    2005-05-01

    Full Text Available Background: Levels of proviral load in HTLV-1 infected patients correlate with clinical outcome and are reasonably prognostic. Adaptation of proviral load measurement techniques is examined here for use in an experimental rabbit model of HTLV-1 infection. Initial efforts sought to correlate proviral load with route and dose of inoculation and with clinical outcome in this model. These methods contribute to our continuing goal of using the model to test treatments that alleviate virus infection. Results: A real-time PCR assay was used to measure proviral load in blood and tissue samples from a series of rabbits infected using HTLV-1 inocula prepared as either cell-free virus particles, infected cells or blood, or by naked DNA injection. Proviral loads from asymptomatically infected rabbits showed levels corresponding to those reported for human patients with clinically silent HTLV-1 infections. Proviral load was comparably increased in 50% of experimentally infected rabbits that developed either spontaneous benign or malignant tumors while infected. Similarly elevated provirus was found in organs of rabbits with experimentally induced acute leukemia/lymphoma-like disease. Levels of provirus in organs taken at necropsy varied widely, suggesting that reservoirs of infections exist in non-lymphoid organs not traditionally thought to be targets for HTLV-1. Conclusion: Proviral load measurement is a valuable enhancement to the rabbit model for HTLV-1 infection, providing a metric to monitor clinical status of the infected animals as well as a means for the testing of treatment to combat infection. In some cases proviral load in blood did not reflect organ proviral levels, revealing a limitation of this method for monitoring health status of HTLV-1 infected individuals.

  1. Improved detection of CXCR4-using HIV by V3 genotyping: application of population-based and "deep" sequencing to plasma RNA and proviral DNA.

    Science.gov (United States)

    Swenson, Luke C; Moores, Andrew; Low, Andrew J; Thielen, Alexander; Dong, Winnie; Woods, Conan; Jensen, Mark A; Wynhoven, Brian; Chan, Dennison; Glascock, Christopher; Harrigan, P Richard

    2010-08-01

    Tropism testing should rule out CXCR4-using HIV before treatment with CCR5 antagonists. Currently, the recombinant phenotypic Trofile assay (Monogram) is most widely utilized; however, genotypic tests may represent alternative methods. Independent triplicate amplifications of the HIV gp120 V3 region were made from either plasma HIV RNA or proviral DNA. These underwent standard, population-based sequencing with an ABI3730 (RNA n = 63; DNA n = 40), or "deep" sequencing with a Roche/454 Genome Sequencer-FLX (RNA n = 12; DNA n = 12). Position-specific scoring matrices (PSSMX4/R5) (-6.96 cutoff) and geno2pheno[coreceptor] (5% false-positive rate) inferred tropism from V3 sequence. These methods were then independently validated with a separate, blinded dataset (n = 278) of screening samples from the maraviroc MOTIVATE trials. Standard sequencing of HIV RNA with PSSM yielded 69% sensitivity and 91% specificity, relative to Trofile. The validation dataset gave 75% sensitivity and 83% specificity. Proviral DNA plus PSSM gave 77% sensitivity and 71% specificity. "Deep" sequencing of HIV RNA detected >2% inferred-CXCR4-using virus in 8/8 samples called non-R5 by Trofile, and <2% in 4/4 samples called R5. Triplicate analyses of V3 standard sequence data detect greater proportions of CXCR4-using samples than previously achieved. Sequencing proviral DNA and "deep" V3 sequencing may also be useful tools for assessing tropism.
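
    The sensitivity and specificity figures in this abstract compare each genotypic call with the Trofile result for the same sample. A minimal sketch of that calculation follows, assuming paired boolean calls in which True means "non-R5" (CXCR4-using). The function name and the toy data are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(reference, test):
    """Return (sensitivity, specificity) of `test` calls against `reference` calls.

    Both arguments are equal-length sequences of booleans; True marks a
    sample called non-R5 (CXCR4-using). Hypothetical helper for illustration.
    """
    if len(reference) != len(test):
        raise ValueError("reference and test must have the same length")
    pairs = list(zip(reference, test))
    tp = sum(1 for r, t in pairs if r and t)          # both call non-R5
    fn = sum(1 for r, t in pairs if r and not t)      # test misses a non-R5 sample
    tn = sum(1 for r, t in pairs if not r and not t)  # both call R5
    fp = sum(1 for r, t in pairs if not r and t)      # test over-calls non-R5
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative reference call")
    return tp / (tp + fn), tn / (tn + fp)


# Toy data (invented): 4 reference-positive and 6 reference-negative samples.
reference = [True, True, True, True, False, False, False, False, False, False]
genotype = [True, True, True, False, False, False, False, False, False, True]
sens, spec = sensitivity_specificity(reference, genotype)
```

    With the toy data, three of four reference-positive samples and five of six reference-negative samples are called correctly, so the helper returns the corresponding fractions.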

  2. Zinc finger nuclease: a new approach for excising HIV-1 proviral DNA from infected human T cells.

    Science.gov (United States)

    Qu, Xiying; Wang, Pengfei; Ding, Donglin; Wang, Xiaohui; Zhang, Gongmin; Zhou, Xin; Liu, Lin; Zhu, Xiaoli; Zeng, Hanxian; Zhu, Huanzhang

    2014-09-01

    A major reason that acquired immune deficiency syndrome (AIDS) cannot be completely cured is that the human immunodeficiency virus 1 (HIV-1) provirus integrates into the human genome. Although existing therapies can inhibit HIV-1 replication, they cannot eradicate the virus. Molecular therapies are gaining popularity because they specifically target HIV-1-infected cells and can remove HIV-1 regardless of whether viral genes are active or dormant. Here, we propose a new method that can delete the HIV provirus from the genome of infected human T cells. First, we designed zinc-finger nucleases (ZFNs) that target a sequence within the long terminal repeat (LTR) U3 region that is highly conserved across the whole clade. We then screened out one ZFN pair and named it ZFN-U3. We found that ZFN-U3 can specifically target and excise the full-length HIV-1 proviral DNA in treated infected human cell lines, with an excision frequency of about 30% and no cytotoxicity. These results show that ZFN-U3 can efficiently excise integrated HIV-1 from the human genome in infected cells. This method of deleting full-length HIV-1 from the human genome may therefore provide a novel approach to curing HIV-infected individuals in the future.

  3. High prevalence of HIV-1 transmitted drug-resistance mutations from proviral DNA massively parallel sequencing data of therapy-naïve chronically infected Brazilian blood donors.

    Directory of Open Access Journals (Sweden)

    Rodrigo Pessôa

    An improved understanding of the prevalence of low-abundance transmitted drug-resistance mutations (TDRM) in therapy-naïve HIV-1-infected patients may help determine which patients are the best candidates for therapy. In this study, we aimed to obtain a comprehensive picture of the evolving HIV-1 TDRM across the massive parallel sequences (MPS) of the entire proviral genome in well-characterized Brazilian blood donors naïve to antiretroviral drugs. The MPS data from 128 samples used in the analysis were sourced from Brazilian blood donors and were previously classified by less-sensitive (LS) or "detuned" enzyme immunoassay as non-recent or longstanding HIV-1 infections. The Stanford HIV Resistance Database (HIVDB v 6.2) and IAS-USA mutation lists were used to interpret the pattern of drug resistance. The minority variants with TDRM were identified using a threshold of ≥ 1.0% and ≤ 20% of the reads sequenced. The rate of TDRM in the MPS data of the proviral genome was compared with the corresponding published consensus sequences of their plasma viruses. No TDRM were detected in the integrase or envelope regions. The overall prevalence of TDRM in the protease (PR) and reverse transcriptase (RT) regions of the HIV-1 pol gene was 44.5% (57/128), including any mutations to the nucleoside analogue reverse transcriptase inhibitors (NRTI) and non-nucleoside analogue reverse transcriptase inhibitors (NNRTI). Of the 57 subjects, 43 (75.4%) harbored a minority variant containing at least one clinically relevant TDRM. Among the 43 subjects, 33 (76.7%) had detectable minority resistant variants to NRTIs, 6 (13.9%) to NNRTIs, and 16 (37.2%) to PR inhibitors. Comparison of viral sequences from both sources, plasma and cells, revealed 48 TDRM disclosed by MPS of proviral DNA that had previously been missed by plasma bulk analysis. Our findings revealed a high prevalence of TDRM found in this group, as the use of MPS drastically increased the detection of these
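
    The minority-variant rule stated in this abstract (a variant counts when it makes up at least 1.0% and at most 20% of the reads) can be sketched as a simple frequency filter. The function names and the read counts below are hypothetical; only the two thresholds and the 57/128 prevalence come from the abstract.

```python
def is_minority_variant(variant_reads, total_reads, low=0.01, high=0.20):
    """True if the variant frequency lies in [low, high] of the reads at a site.

    Defaults mirror the >= 1.0% and <= 20% thresholds quoted in the abstract;
    the helper itself is an illustrative assumption, not the authors' pipeline.
    """
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= variant_reads <= total_reads:
        raise ValueError("variant_reads must be between 0 and total_reads")
    frequency = variant_reads / total_reads
    return low <= frequency <= high


def prevalence_percent(count, total):
    """Prevalence as a percentage, e.g. subjects with TDRM out of all subjects."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


# Invented read counts at one site with 1000 reads of coverage.
calls = {
    "0.5% of reads": is_minority_variant(5, 1000),    # below the lower threshold
    "5% of reads": is_minority_variant(50, 1000),     # inside the window
    "30% of reads": is_minority_variant(300, 1000),   # above the upper threshold
}
overall = round(prevalence_percent(57, 128), 1)  # subjects with TDRM / all subjects
```

    Keeping the thresholds as keyword arguments makes it easy to see how the call set changes if a different detection window is assumed.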

  4. Biochemical characterization of cells transformed via transfection by feline sarcoma virus proviral DNA.

    OpenAIRE

    Rosenberg, Z F; Sahagan, B G; Snyder, H W; Worley, M B; Essex, M; Haseltine, W A

    1981-01-01

    Murine fibroblasts transformed by transfection with DNA from mink cells infected with the Snyder-Theilen strain of feline sarcoma virus and subgroup B feline leukemia virus were analyzed for the presence of integrated proviral DNA and the expression of feline leukemia virus- and feline sarcoma virus-specific proteins. The transformed murine cells harbored at least one intact feline sarcoma virus provirus, but did not contain feline leukemia virus provirus. The transformed murine cells express...

  5. Chromosomal locations of members of a family of novel endogenous human retroviral genomes

    International Nuclear Information System (INIS)

    Horn, T.M.; Huebner, K.; Croce, C.; Callahan, R.

    1986-01-01

    Human cellular DNA contains two distinguishable families of retroviral related sequences. One family shares extensive nucleotide sequence homology with infectious mammalian type C retroviral genomes. The other family contains major regions of homology with the pol genes of infectious type A and B and avian type C and D retroviral genomes. Analysis of the human recombinant clone HLM-2 has shown that the pol gene in the latter family is located within an endogenous proviral genome. The authors show that the proviral genome in HLM-2 and the related recombinant clone HLM-25 are located, respectively, on human chromosomes 1 and 5. Other related proviral genomes are located on chromosomes 7, 8, 11, 14, and 17

  6. Cattle with the BoLA class II DRB3*0902 allele have significantly lower bovine leukemia proviral loads.

    Science.gov (United States)

    Hayashi, Takumi; Mekata, Hirohisa; Sekiguchi, Satoshi; Kirino, Yumi; Mitoma, Shuya; Honkawa, Kazuyuki; Horii, Yoichiro; Norimine, Junzo

    2017-09-12

    The bovine MHC (BoLA) class II DRB3 alleles are associated with polyclonal expansion of lymphocytes caused by bovine leukemia virus (BLV) infection in cattle. To examine whether the DRB3*0902 allele, one of the resistance-associated alleles, is associated with the proviral load, we measured BLV proviral load of BLV-infected cattle and clarified their DRB3 alleles. Fifty-seven animals with DRB3*0902 were identified out of 835 BLV-infected cattle and had significantly lower proviral load (Pclass II DRA/DRB3*0902 molecule plays an important immunological role in suppressing viral replication, resulting in resistance to the disease progression.

  7. Characterization of a ViI-like Phage Specific to Escherichia coli O157:H7

    Directory of Open Access Journals (Sweden)

    Kropinski Andrew M

    2011-09-01

    Phage vB_EcoM_CBA120 (CBA120), isolated against Escherichia coli O157:H7 from a cattle feedlot, is morphologically very similar to the classic phage ViI of Salmonella enterica serovar Typhi. Until recently, little was known genetically or physiologically about the ViI-like phages, and none targeting E. coli have been described in the literature. The genome of CBA120 has been fully sequenced and is highly similar to those of both ViI and the Shigella phage AG3. The core set of structural and replication-related proteins of CBA120 are homologous to those from T-even phages, but generally are more closely related to those from T4-like phages of Vibrio, Aeromonas and cyanobacteria than those of the Enterobacteriaceae. The baseplate and method of adhesion to the host are, however, very different from those of either T4 or the cyanophages. None of the outer baseplate proteins are conserved. Instead of T4's long and short tail fibers, CBA120, like ViI, encodes tail spikes related to those normally seen on podoviruses. The 158 kb genome, like that of T4, is circularly permuted and terminally redundant, but unlike T4, CBA120 does not substitute hmdCyt for cytosine in its DNA. However, in contrast to other coliphages, CBA120 and related coliphages we have isolated cannot incorporate 3H-thymidine (3H-dThd) into their DNA. Protein sequence comparisons cluster the putative "thymidylate synthase" of CBA120, ViI and AG3 much more closely with those of Delftia phage φW-14, Bacillus subtilis phage SPO1, and Pseudomonas phage YuA, all known to produce and incorporate hydroxymethyluracil (hmdUra).

  8. Mutations in Ovis aries TMEM154 are associated with lower small ruminant lentivirus proviral concentration in one sheep flock.

    Science.gov (United States)

    Alshanbari, F A; Mousel, M R; Reynolds, J O; Herrmann-Hoesing, L M; Highland, M A; Lewis, G S; White, S N

    2014-08-01

    Small ruminant lentivirus (SRLV), also called ovine progressive pneumonia virus or maedi-visna, is present in 24% of US sheep. Like human immunodeficiency virus, SRLV is a macrophage-tropic lentivirus that causes lifelong infection. The production impacts from SRLV are due to a range of disease symptoms, including pneumonia, arthritis, mastitis, body condition wasting and encephalitis. There is no cure and no effective vaccine for preventing SRLV infection. However, breed differences in prevalence and proviral concentration indicate a genetic basis for susceptibility to SRLV. Animals with high blood proviral concentration show increased tissue lesion severity, so proviral concentration represents a live animal test for control post-infection in terms of proviral replication and disease severity. Recently, it was found that sheep with two copies of TMEM154 haplotype 1 (encoding lysine at position 35) had lower odds of SRLV infection. In this study, we examined the relationship between SRLV control post-infection and variants in two genes, TMEM154 and CCR5, in four flocks containing 1403 SRLV-positive sheep. We found two copies of TMEM154 haplotype 1 were associated with lower SRLV proviral concentration in one flock (P < 0.02). This identified the same favorable diplotype for SRLV control post-infection as for odds of infection. However, frequencies of haplotypes 2 and 3 were too low in the other three flocks to test. The CCR5 promoter deletion did not have consistent association with SRLV proviral concentration. Future work in flocks with more balanced allele frequencies is needed to confirm or refute TMEM154 association with control of SRLV post-infection. Published 2014. This article is a U.S. Government work and is in the public domain in the USA. Animal Genetics published by John Wiley & Sons Ltd on behalf of Stichting International Foundation for Animal Genetics.

  9. Investigating Signs of Recent Evolution in the Pool of Pro-viral DNA during Years of Successful HAART

    DEFF Research Database (Denmark)

    Mens, H.; Pedersen, Anders Gorm; Jørgensen, L. B.

    2007-01-01

    In order to shed light on the nature of the persistent reservoir of human immunodeficiency virus type 1 (HIV-1), we investigated signs of recent evolution in the pool of proviral DNA in patients on successful HAART. Pro-viral DNA, corresponding to the C2-V3-C3 region of the HIV-1 env gene...... there were temporal trends indicating ongoing replication and evolution. In summary, it was not possible to detect definitive signs of ongoing evolution in either the bulk-sequenced or the clonal data with the methods employed here, but our results could be consistent with localized expression of archival...

  10. Investigating signs of recent evolution in the pool of proviral HIV type 1 DNA during years of successful HAART

    DEFF Research Database (Denmark)

    Mens, Helene; Pedersen, Anders G; Jørgensen, Louise B

    2007-01-01

    In order to shed light on the nature of the persistent reservoir of human immunodeficiency virus type 1 (HIV-1), we investigated signs of recent evolution in the pool of proviral DNA in patients on successful HAART. Pro-viral DNA, corresponding to the C2-V3-C3 region of the HIV-1 env gene...... there were temporal trends indicating ongoing replication and evolution. In summary, it was not possible to detect definitive signs of ongoing evolution in either the bulk-sequenced or the clonal data with the methods employed here, but our results could be consistent with localized expression of archival...

  11. Association of Sicca Syndrome with Proviral Load and Proinflammatory Cytokines in HTLV-1 Infection

    Directory of Open Access Journals (Sweden)

    Clara Mônica Lima

    2016-01-01

    Sjögren syndrome has been diagnosed in patients with HTLV-1-associated myelopathy, and dry mouth and dry eyes are documented in HTLV-1 carriers. However, the diagnosis of Sjögren syndrome in these subjects has been contested. In this cross-sectional study, we evaluated the role of immunological factors and proviral load in sicca syndrome associated with HTLV-1 in patients without myelopathy. Subjects were recruited in the HTLV-1 Clinic from 2009 to 2011. The proviral load and cytokine levels (IFN-γ, TNF-α, IL-5, and IL-10) were obtained from a database containing the values presented by the subjects at admission to the clinic. Of the 272 participants, 59 (21.7%) had sicca syndrome, and in all of them anti-Sjögren syndrome related antigen A (SSA) and antigen B (SSB) were negative. The production of TNF-α and IFN-γ was higher in the group with sicca syndrome (P<0.05) than in HTLV-1 infected subjects without sicca syndrome. Our data indicate that patients with sicca syndrome associated with HTLV-1 do not have Sjögren syndrome. However, the increased production of TNF-α and IFN-γ in this group of patients may contribute to the pathogenesis of sicca syndrome associated with HTLV-1.

  12. Vaccination of rhesus macaques with a vif-deleted simian immunodeficiency virus proviral DNA vaccine

    International Nuclear Information System (INIS)

    Sparger, Ellen E.; Dubie, Robert A.; Shacklett, Barbara L.; Cole, Kelly S.; Chang, W.L.; Luciw, Paul A.

    2008-01-01

    Studies in non-human primates, with simian immunodeficiency virus (SIV) and simian/human immunodeficiency virus (SHIV) have demonstrated that live-attenuated viral vaccines are highly effective; however these vaccine viruses maintain a low level of pathogenicity. Lentivirus attenuation associated with deletion of the viral vif gene carries a significantly reduced risk for pathogenicity, while retaining the potential for virus replication of low magnitude in the host. This report describes a vif-deleted simian immunodeficiency virus (SIV)mac239 provirus that was tested as an attenuated proviral DNA vaccine by inoculation of female rhesus macaques. SIV-specific interferon-γ enzyme-linked immunospot responses of low magnitude were observed after immunization with plasmid containing the vif-deleted SIV provirus. However, vaccinated animals displayed strong sustained virus-specific T cell proliferative responses and increasing antiviral antibody titers. These immune responses suggested either persistent vaccine plasmid expression or low level replication of vif-deleted SIV in the host. Immunized and unvaccinated macaques received a single high dose vaginal challenge with pathogenic SIVmac251. A transient suppression of challenge virus load and a greater median survival time was observed for vaccinated animals. However, virus loads for vaccinated and unvaccinated macaques were comparable by twenty weeks after challenge and overall survival curves for the two groups were not significantly different. Thus, a vif-deleted SIVmac239 proviral DNA vaccine is immunogenic and capable of inducing a transient suppression of pathogenic challenge virus, despite severe attenuation of the vaccine virus

  13. A universal real-time PCR assay for the quantification of group-M HIV-1 proviral load.

    Science.gov (United States)

    Malnati, Mauro S; Scarlatti, Gabriella; Gatto, Francesca; Salvatori, Francesca; Cassina, Giulia; Rutigliano, Teresa; Volpi, Rosy; Lusso, Paolo

    2008-01-01

    Quantification of human immunodeficiency virus type-1 (HIV-1) proviral DNA is increasingly used to measure the HIV-1 cellular reservoirs, a helpful marker to evaluate the efficacy of antiretroviral therapeutic regimens in HIV-1-infected individuals. Furthermore, the proviral DNA load represents a specific marker for the early diagnosis of perinatal HIV-1 infection and might be predictive of HIV-1 disease progression independently of plasma HIV-1 RNA levels and CD4(+) T-cell counts. The high degree of genetic variability of HIV-1 poses a serious challenge for the design of a universal quantitative assay capable of detecting all the genetic subtypes within the main (M) HIV-1 group with similar efficiency. Here, we describe a highly sensitive real-time PCR protocol that allows for the correct quantification of virtually all group-M HIV-1 strains with a higher degree of accuracy compared with other methods. The protocol involves three stages, namely DNA extraction/lysis, cellular DNA quantification and HIV-1 proviral load assessment. Owing to the robustness of the PCR design, this assay can be performed on crude cellular extracts, and therefore it may be suitable for the routine analysis of clinical samples even in developing countries. An accurate quantification of the HIV-1 proviral load can be achieved within 1 d from blood withdrawal.
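
    The last two stages of the protocol described above (cellular DNA quantification, then proviral load assessment) are typically combined into a load expressed per million cells. The sketch below assumes that cell number is estimated from a single-copy reference gene present at two copies per diploid cell; the abstract does not name the reference target, so that convention, the function name and the example copy numbers are all assumptions.

```python
def proviral_load_per_million_cells(proviral_copies, reference_gene_copies,
                                    reference_copies_per_cell=2):
    """Normalize a proviral copy number to copies per 10^6 cells.

    proviral_copies: HIV-1 DNA copies measured in the reaction.
    reference_gene_copies: copies of a cellular reference gene measured in the
        same extract (assumed single-copy, i.e. two copies per diploid cell).
    Hypothetical helper; not the published assay's own calculation.
    """
    if proviral_copies < 0:
        raise ValueError("proviral_copies must be non-negative")
    if reference_gene_copies <= 0 or reference_copies_per_cell <= 0:
        raise ValueError("reference quantities must be positive")
    cell_equivalents = reference_gene_copies / reference_copies_per_cell
    return proviral_copies * 1_000_000 / cell_equivalents


# Invented example: 150 proviral copies in an extract holding 200,000
# reference-gene copies, i.e. 100,000 cell equivalents.
load = proviral_load_per_million_cells(150, 200_000)
```

    Normalizing to cell equivalents rather than to input DNA mass is what allows a crude lysate, whose DNA yield is variable, to be compared across samples.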

  14. Genomes

    National Research Council Canada - National Science Library

    Brown, T. A. (Terence A.)

    2002-01-01

    ... of genome expression and replication processes, and transcriptomics and proteomics. This text is richly illustrated with clear, easy-to-follow, full color diagrams, which are downloadable from the book's website...

  15. Convergent evolution of SIV env after independent inoculation of rhesus macaques with infectious proviral DNA

    International Nuclear Information System (INIS)

    Buckley, Kathleen A.; Li Peilin; Khimani, Anis H.; Hofmann-Lehmann, Regina; Liska, Vladimir; Anderson, Daniel C.; McClure, Harold M.; Ruprecht, Ruth M.

    2003-01-01

    The env gene of three simian immunodeficiency virus (SIV) variants developed convergent mutations during disease progression in six rhesus macaques. The monkeys had been inoculated with supercoiled plasmids encoding infectious proviruses of SIVmac239 (a pathogenic, wild-type strain), SIVΔ3 (the live attenuated vaccine strain derived from SIVmac239), or SIVΔ3+ (a pathogenic progeny virus that had evolved from SIVΔ3). All six monkeys developed immunodeficiency and progressed to fatal disease. Although many divergent mutations arose in env among the different hosts, three regions consistently mutated in all monkeys studied; these similar mutations developed independently even though the animals had received only a single infectious molecular clone rather than standard viral inocula that contain viral quasispecies. Together, these data indicate that the env genes of SIVmac239, SIVΔ3, and SIVΔ3+, in the context of different proviral backbones, evolve similarly in different hosts during disease progression

  16. Telomere Length, Proviral Load and Neurologic Impairment in HTLV-1 and HTLV-2-Infected Subjects

    Directory of Open Access Journals (Sweden)

    Benjamin Usadi

    2016-08-01

    Short or damaged telomeres have been implicated in degenerative conditions. We hypothesized that analysis of telomere length (TL) in human T-cell lymphotropic virus (HTLV) infection and HTLV-associated neuropathy might provide clues to the etiology of HTLV-associated disease and viral dynamics. A subset of 45 human T-cell lymphotropic virus type 1 (HTLV-1), 45 human T-cell lymphotropic virus type 2 (HTLV-2), and 45 seronegative subjects was selected from the larger HTLV Outcomes Study (HOST) cohort, matched on age, sex and race/ethnicity. Telomere-to-single-copy gene (T/S) ratio (a measure of TL) and HTLV-1 and HTLV-2 proviral loads were measured in peripheral blood mononuclear cells (PBMCs) using quantitative PCR (qPCR). Vibration sensation measured by tuning fork during neurologic examinations performed as part of the HOST study allowed for an assessment of peripheral neuropathy. TL was compared between groups using t-tests, linear and logistic regression. Mean T/S ratio was 1.02 ± 0.16 in HTLV-1, 1.03 ± 0.17 in HTLV-2 and 0.99 ± 0.18 in HTLV seronegative subjects (p = 0.322). TL was not associated with HTLV-1 or -2 proviral load. Shorter TL was significantly associated with impaired vibration sense in the HTLV-2 positive group only. Overall, we found no evidence that telomere length was affected by chronic HTLV-1 and HTLV-2 infection. That TL was only associated with peripheral neuropathy in the HTLV-2-positive group is intriguing, but should be interpreted cautiously. Studies with larger sample size and telomere length measurement in lymphocyte subsets may clarify the relationship between TL and HTLV-infection.

  17. Excision of HIV-1 proviral DNA by recombinant cell permeable tre-recombinase.

    Directory of Open Access Journals (Sweden)

    Lakshmikanth Mariyanna

    Over the previous years, comprehensive studies on antiretroviral drugs resulted in the successful introduction of highly active antiretroviral therapy (HAART) into clinical practice for treatment of HIV/AIDS. However, there is still need for new therapeutic approaches, since HAART cannot eradicate HIV-1 from the infected organism and, unfortunately, can be associated with long-term toxicity and the development of drug resistance. In contrast, novel gene therapy strategies may have the potential to reverse the infection by eradicating HIV-1. For example, expression of long terminal repeat (LTR)-specific recombinase (Tre-recombinase) has been shown to result in chromosomal excision of proviral DNA and, in consequence, in the eradication of HIV-1 from infected cell cultures. However, the delivery of Tre-recombinase currently depends on the genetic manipulation of target cells, a process that complicates such therapeutic approaches and, thus, might be undesirable in a clinical setting. In this report we demonstrate that E. coli-expressed Tre-recombinases, tagged either with the protein transduction domain (PTD) from the HIV-1 Tat trans-activator or the translocation motif (TLM) of the Hepatitis B virus PreS2 protein, were able to translocate efficiently into cells and showed significant recombination activity on HIV-1 LTR sequences. Tre activity was observed using episomal and stably integrated reporter constructs in transfected HeLa cells. Furthermore, the TLM-tagged enzyme was able to excise the full-length proviral DNA from chromosomal integration sites of HIV-1-infected HeLa and CEM-SS cells. The presented data confirm Tre-recombinase activity on integrated HIV-1 and provide the basis for the non-genetic transient application of engineered recombinases, which may be a valuable component of future HIV eradication strategies.

  18. Focal glomerulosclerosis in proviral and c-fms transgenic mice links Vpr expression to HIV-associated nephropathy

    International Nuclear Information System (INIS)

    Dickie, Peter; Roberts, Amanda; Uwiera, Richard; Witmer, Jennifer; Sharma, Kirti; Kopp, Jeffrey B.

    2004-01-01

    Clinical and morphologic features of human immunodeficiency virus (HIV)-associated nephropathy (HIVAN), such as proteinuria, sclerosing glomerulopathy, tubular degeneration, and interstitial disease, have been modeled in mice bearing an HIV proviral transgene rendered noninfectious through a deletion in gag/pol. Exploring the genetic basis of HIVAN, HIV transgenic mice bearing mutations in either or both of the accessory genes nef and vpr were created. Proteinuria and focal glomerulosclerosis (FGS) only developed in mice with an intact vpr gene. Transgenic mice bearing a simplified proviral DNA (encoding only Tat and Vpr) developed renal disease characterized by FGS in which Vpr protein was localized to glomerular and tubular epithelia by immunohistochemistry. The dual transgenic progeny of HIV[Tat/Vpr] mice bred to HIV[ΔVpr] proviral transgenic mice displayed a more severe nephropathy with no apparent increase in Vpr expression, implying that multiple viral genes contribute to HIVAN. However, the unique contribution of macrophage-specific Vpr expression in the development of glomerular disease was underscored by the induction of FGS in multiple murine lines bearing a c-fms/vpr transgene

  19. A human genome-wide loss-of-function screen identifies effective chikungunya antiviral drugs.

    Science.gov (United States)

    Karlas, Alexander; Berre, Stefano; Couderc, Thérèse; Varjak, Margus; Braun, Peter; Meyer, Michael; Gangneux, Nicolas; Karo-Astover, Liis; Weege, Friderike; Raftery, Martin; Schönrich, Günther; Klemm, Uwe; Wurzlbauer, Anne; Bracher, Franz; Merits, Andres; Meyer, Thomas F; Lecuit, Marc

    2016-05-12

    Chikungunya virus (CHIKV) is a globally spreading alphavirus against which there is no commercially available vaccine or therapy. Here we use a genome-wide siRNA screen to identify 156 proviral and 41 antiviral host factors affecting CHIKV replication. We analyse the cellular pathways in which human proviral genes are involved and identify druggable targets. Twenty-one small-molecule inhibitors, some of which are FDA approved, targeting six proviral factors or pathways, have high antiviral activity in vitro, with low toxicity. Three identified inhibitors have prophylactic antiviral effects in mouse models of chikungunya infection. Two of them, the calmodulin inhibitor pimozide and the fatty acid synthesis inhibitor TOFA, have a therapeutic effect in vivo when combined. These results demonstrate the value of loss-of-function screening and pathway analysis for the rational identification of small molecules with therapeutic potential and pave the way for the development of new, host-directed, antiviral agents.

  20. Kinetics of HIV-1 CTL epitopes recognized by HLA I alleles in HIV-infected individuals at times near primary infection: the Provir/Latitude45 study.

    Directory of Open Access Journals (Sweden)

    Jennifer Papuchon

    In patients responding successfully to ART, the next therapeutic step is viral cure. An interesting strategy is antiviral vaccination, particularly involving CD8 T cell epitopes. However, attempts at vaccination are dependent on the immunogenetic background of individuals. The Provir/Latitude 45 project aims to investigate which CTL epitopes in proviral HIV-1 will be recognized by the immune system when HLA alleles are taken into consideration. A prior study (Papuchon et al, PLoS ONE 2013) showed that chronically-infected patients under successful ART exhibited variations of proviral CTL epitopes compared to a reference viral strain (HXB2) and that a generic vaccine may not be efficient. Here, we investigated viral and/or proviral CTL epitopes at different time points in recently infected individuals of the Canadian primary HIV infection cohort and assessed the affinity of these epitopes for HLA alleles during the study period. An analysis of the results confirms that it is not possible to fully predict which epitopes will be recognized by the HLA alleles of the patients if the reference sequences and epitopes are taken as the basis of simulation. Epitopes may be seen to vary in circulating RNA and proviral DNA. Despite this confirmation, the overall variability of the epitopes was low in these patients who are temporally close to primary infection.

  1. Kinetics of HIV-1 CTL epitopes recognized by HLA I alleles in HIV-infected individuals at times near primary infection: the Provir/Latitude45 study.

    Science.gov (United States)

    Papuchon, Jennifer; Pinson, Patricia; Guidicelli, Gwenda-Line; Bellecave, Pantxika; Thomas, Réjean; LeBlanc, Roger; Reigadas, Sandrine; Taupin, Jean-Luc; Baril, Jean Guy; Routy, Jean Pierre; Wainberg, Mark; Fleury, Hervé

    2014-01-01

    In patients responding successfully to ART, the next therapeutic step is viral cure. An interesting strategy is antiviral vaccination, particularly involving CD8 T cell epitopes. However, attempts at vaccination are dependent on the immunogenetic background of individuals. The Provir/Latitude 45 project aims to investigate which CTL epitopes in proviral HIV-1 will be recognized by the immune system when HLA alleles are taken into consideration. A prior study (Papuchon et al, PLoS ONE 2013) showed that chronically-infected patients under successful ART exhibited variations of proviral CTL epitopes compared to a reference viral strain (HXB2) and that a generic vaccine may not be efficient. Here, we investigated viral and/or proviral CTL epitopes at different time points in recently infected individuals of the Canadian primary HIV infection cohort and assessed the affinity of these epitopes for HLA alleles during the study period. An analysis of the results confirms that it is not possible to fully predict which epitopes will be recognized by the HLA alleles of the patients if the reference sequences and epitopes are taken as the basis of simulation. Epitopes may be seen to vary in circulating RNA and proviral DNA. Despite this confirmation, the overall variability of the epitopes was low in these patients who are temporally close to primary infection.

  2. Role of the RIG-I-like receptors in antiviral response

    Directory of Open Access Journals (Sweden)

    Agnieszka Jabłońska

    2014-01-01

    Innate nonspecific immunity is the first line of defense against viral infection. Toll-like receptors (TLRs) and retinoic acid-inducible gene I (RIG-I)-like receptors (RLRs) are the two main receptor families that detect viral nucleic acid. So far, three RLR family members have been characterized: RIG-I, MDA5 and LGP2. RLRs constitute a family of cytoplasmic helicases that recognize intracellular single-stranded and double-stranded RNA introduced into the cytosol during viral infection and replication. In this work we review the current knowledge about the mechanisms of viral recognition by RIG-I-like receptors and their signaling pathways for the activation of type I interferon and pro-inflammatory cytokine synthesis.

  3. Comparison of HTLV-I Proviral Load in Adult T Cell Leukemia/Lymphoma (ATL), HTLV-I-Associated Myelopathy (HAM-TSP) and Healthy Carriers.

    Science.gov (United States)

    Akbarin, Mohammad Mehdi; Rahimi, Hossein; Hassannia, Tahereh; Shoja Razavi, Ghazaleh; Sabet, Faezeh; Shirdel, Abbas

    2013-03-01

    Human T Lymphocyte Virus Type one (HTLV-I) is a retrovirus that infects about 10-20 million people worldwide. Khorasan province in Iran is an endemic area. The majority of HTLV-I-infected individuals remain healthy carriers, but a small proportion of the infected population develops one of two progressive diseases: HAM/TSP or ATL. The proviral load could be a virological marker for disease monitoring; therefore, in the present study HTLV-I proviral load was evaluated in ATL and compared with HAM/TSP and healthy carriers. In this case series study, 47 HTLV-I infected individuals including 13 ATL, 23 HAM/TSP and 11 asymptomatic subjects were studied. Peripheral blood mononuclear cells (PBMCs) were investigated for the presence of HTLV-I proviral DNA by PCR using LTR and Tax fragments. Then, in infected subjects, HTLV-I proviral load was measured using the real-time PCR TaqMan method. The average age was 52±8 years in ATL patients, 45.52±15.17 years in HAM/TSP patients and 38.65±14.9 years in carriers; these differences were not statistically significant. Analysis of the data showed a significant difference in mean WBC among study groups (ATL vs HAM/TSP and carriers P=0.0001). Moreover, mean HTLV-I proviral load was 11967.2 ± 5078, 409 ± 71.3 and 373.6 ± 143.3 in ATL, HAM/TSP and healthy carriers, respectively. The highest HTLV-I proviral load was measured in the ATL group, and it had a significant correlation with WBC count (R=0.495, P=0.001). The differences in proviral load between study groups were strongly significant (ATL vs carrier P=0.0001; ATL vs HAM/TSP P=0.0001 and HAM/TSP vs carriers P<0.05). Conclusion: The present study demonstrated that HTLV-I proviral load was higher in the ATL group than in HAM/TSP patients and healthy carriers. Therefore, HTLV-I proviral load is a prognostic factor for the development of HTLV-I associated diseases and can be used as a marker for monitoring the efficacy of therapeutic regimens.

  4. Quantification of bovine leukemia virus proviral DNA using a low-cost real-time polymerase chain reaction.

    Science.gov (United States)

    Petersen, M I; Alvarez, I; Trono, K G; Jaworski, J P

    2018-04-11

    The detection of bovine leukemia virus (BLV) proviral DNA is an important tool to address whether an animal is infected with BLV. Compared with serological assays, real-time PCR accounts for greater sensitivity and can serve as a confirmatory test for the clarification of inconclusive or discordant serological test results. However, the high cost related to real-time PCR assays has limited their systematic inclusion in BLV surveillance and eradication programs. The aim of the present study was to validate a low-cost quantitative real-time PCR. Interestingly, by using SYBR Green detection dye, we were able to reduce the cost of a single reaction by a factor of 5 compared with most common assays based on the use of fluorogenic probes (i.e., TaqMan technology). This approach allowed a highly sensitive and specific detection and quantification of BLV proviral DNA from purified peripheral blood leukocytes and a milk matrix. Due to its simplicity and low cost, our in-house BLV SYBR quantitative real-time PCR might be used either as a screening or as a confirmatory test in BLV control programs. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  5. RIG-I Like Receptors in Antiviral Immunity and Therapeutic Applications

    Directory of Open Access Journals (Sweden)

    Michael Gale Jr.

    2011-06-01

    The RNA helicase family of RIG-I-like receptors (RLRs) is a key component of host defense mechanisms responsible for detecting viruses and triggering innate immune signaling cascades to control viral replication and dissemination. As cytoplasm-based sensors, RLRs recognize foreign RNA in the cell and activate a cascade of antiviral responses including the induction of type I interferons, inflammasome activation, and expression of proinflammatory cytokines and chemokines. This review provides a brief overview of RLR function, ligand interactions, and downstream signaling events with an expanded discussion on the therapeutic potential of targeting RLRs for immune stimulation and treatment of virus infection.

  6. Concise classification of the genomic porcine endogenous retroviral gamma1 load to defined lineages.

    Science.gov (United States)

    Klymiuk, Nikolai; Wolf, Eckhard; Aigner, Bernhard

    2008-02-05

    We investigated the infection history of porcine endogenous retroviruses (PERV) gamma1 by analyzing published env and LTR sequences. PERV sequences from various breeds, porcine cell lines and infected human primary cells were included in the study. We identified a considerable number of retroviral lineages indicating multiple independent colonization events of the porcine genome. A recent boost of the proviral load in an isolated pig herd and exclusive occurrence of distinct lineages in single studies indicated the ongoing colonization of the porcine genome with endogenous retroviruses. Retroviral recombination between co-packaged genomes was a general factor for PERV gamma1 diversity which indicated the simultaneous expression of different proviral loci over a period of time. In total, our detailed description of endogenous retroviral lineages is the prerequisite for breeding approaches to minimize the infectious potential of porcine tissues for the subsequent use in xenotransplantation.

  7. RIG-I-Like Receptor Signaling in Singleton-Merten Syndrome

    Directory of Open Access Journals (Sweden)

    Changming Lu

    2017-09-01

    Singleton-Merten syndrome (SMS) is an autosomal dominant, multi-system innate immune disorder characterized by early and severe aortic and valvular calcification, dental and skeletal abnormalities, psoriasis, glaucoma, and other varying clinical findings. Recently we identified a specific gain-of-function mutation in IFIH1, interferon induced with helicase C domain 1, that segregated with this disease. SMS disease without hallmark dental anomalies, termed atypical SMS, has recently been reported to be caused by variants in DDX58, DEXD/H-box helicase 58. IFIH1 and DDX58 encode the retinoic acid-inducible gene I (RIG-I)-like receptor family members melanoma differentiation-associated gene 5 and RIG-I, respectively. These cytosolic pattern recognition receptors function in viral RNA detection, initiating an innate immune response through independent pathways that promote type I and type III interferon expression and proinflammatory cytokines. In this review, we focus on SMS as an innate immune disorder, summarizing clinical features and molecular aspects of the pathogenetic pathway and discussing underlying mechanisms of the disease.

  8. The Icsbp locus is a common proviral insertion site in mature B-cell lymphomas/plasmacytomas induced by exogenous murine leukemia virus

    International Nuclear Information System (INIS)

    Ma Shiliang; Sorensen, Annette Balle; Kunder, Sandra; Sorensen, Karina Dalsgaard; Quintanilla-Martinez, Leticia; Morris, David W.; Schmidt, Joerg; Pedersen, Finn Skou

    2006-01-01

    ICSBP (interferon consensus sequence binding protein)/IRF8 (interferon regulatory factor 8) is an interferon gamma-inducible transcription factor expressed predominantly in hematopoietic cells, and down-regulation of this factor has been observed in chronic myelogenous leukemia and acute myeloid leukemia in man. By screening about 1200 murine leukemia virus (MLV)-induced lymphomas, we found proviral insertions at the Icsbp locus in 14 tumors, 13 of which were mature B-cell lymphomas or plasmacytomas. Only one was a T-cell lymphoma, although such tumors constituted about half of the samples screened. This indicates that the Icsbp locus can play a specific role in the development of mature B-lineage malignancies. Two proviral insertions in the last Icsbp exon were found to act by a poly(A)-insertion mechanism. The remaining insertions were found within or outside Icsbp. Since our results showed expression of Icsbp RNA and protein in all end-stage tumor samples, a simple tumor suppressor function of ICSBP is not likely. Interestingly, proviral insertions at Icsbp have not been reported from previous extensive screenings of mature B-cell lymphomas induced by endogenous MLVs. We propose that ICSBP might be involved in an early modulation of an immune response to exogenous MLVs that might also play a role in proliferation of the mature B-cell lymphomas.

  9. Absence of A3Z3-Related Hypermutations in the env and vif Proviral Genes in FIV Naturally Infected Cats

    Directory of Open Access Journals (Sweden)

    Lucía Cano-Ortiz

    2018-05-01

    Apolipoprotein B mRNA-editing enzyme catalytic polypeptide-like 3 (APOBEC3; A3) proteins comprise an important family of restriction factors that produce hypermutations on proviral DNA and are able to limit virus replication. Vif, an accessory protein present in almost all lentiviruses, counteracts the antiviral A3 activity. Seven haplotypes of APOBEC3Z3 (A3Z3) were described in domestic cats (hap I–VII), and in-vitro studies have demonstrated that these proteins reduce infectivity of vif-defective feline immunodeficiency virus (FIV). Moreover, hap V is resistant to vif-mediated degradation. However, studies on the effect of A3Z3 in FIV-infected cats have not been carried out. Here, the correlation between APOBEC A3Z3 haplotypes in domestic cats and the frequency of hypermutations in the FIV vif and env genes was assessed in a retrospective cohort study with 30 blood samples collected between 2012 and 2016 from naturally FIV-infected cats in Brazil. The vif and env sequences were analyzed and displayed low or undetectable levels of hypermutations, and could not be associated with any specific A3Z3 haplotype.

  10. RetroTector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences

    Directory of Open Access Journals (Sweden)

    Benachenhou Farid

    2009-06-01

    Background: The rapid accumulation of genomic information in databases necessitates rapid and specific algorithms for extracting biologically meaningful information. More or less complete retroviral sequences, also called proviral or endogenous retroviral sequences (ERVs), constitute at least 5% of vertebrate genomes. After infecting the host, these retroviruses have integrated in germ line cells, and have then been carried in genomes for at least several hundred million years. A better understanding of the structure and function of these sequences can have profound biological and medical consequences. Methods: RetroTector© (ReTe) is a platform-independent Java program for identification and characterization of proviral sequences in vertebrate genomes. The full ReTe requires a local installation with a MySQL database. Although not overly complicated, the installation may take some time. A "light" version of ReTe (RetroTector online; ROL), which does not require specific installation procedures, is provided via the World Wide Web. Results: ROL (http://www.fysiologi.neuro.uu.se/jbgs/) was implemented under the Batchelor web interface (A Lövgren et al). It allows admission of sequences (5 to 10 000 kilobases) by GenBank accession number, by file, and by FASTA cut-and-paste. Up to ten submissions can be done simultaneously, allowing batch analysis of ... Discussion: Proviral sequences can be hard to recognize, especially if the integration occurred many million years ago. Precise delineation of LTR, gag, pro, pol and env can be difficult, requiring manual work. ROL is a way of simplifying these tasks. Conclusion: ROL provides 1. annotation and presentation of known retroviral sequences, 2. detection of proviral chains in unknown genomic sequences, with up to 100 Mbase per submission.

  11. Evaluation of the role of TAX, HBZ, and HTLV-1 proviral load on the survival of ATLL patients.

    Science.gov (United States)

    Akbarin, Mohammad Mehdi; Shirdel, Abbas; Bari, Alireza; Mohaddes, Seyedeh Tahereh; Rafatpanah, Houshang; Karimani, Ehsan Ghayour; Etminani, Kobra; Golabpour, Amin; Torshizi, Reza

    2017-06-01

    Adult T-cell leukemia/lymphoma (ATLL) is an aggressive malignancy with very poor prognosis and short survival, caused by the human T-lymphotropic virus type-1 (HTLV-1). The HTLV-1 biomarkers trans-activator x (TAX) and HTLV-1 basic leucine zipper factor (HBZ) are main oncogenes and life-threatening elements. This study aimed to assess the role of the TAX and HBZ genes and HTLV-1 proviral load (PVL) in the survival of patients with ATLL. Forty-three HTLV-1-infected individuals, including 18 asymptomatic carriers (AC) and 25 ATLL patients (ATLL), were evaluated between 2011 and 2015. The mRNA expression of TAX and HBZ and the HTLV-1 PVL were measured by quantitative PCR. Significant differences in the mean expression levels of TAX and HBZ were observed between the two study groups (ATLL and AC, P = 0.014 and P = 0.000, respectively). In addition, the ATLL group showed a significantly higher PVL than AC (P = 0.000). There was a significant negative relationship between PVL and survival among all study groups (P = 0.047). The HTLV-1 PVL and expression of TAX and HBZ were higher in the ATLL group than in the AC group. Moreover, a higher PVL was associated with shorter survival time among all ATLL subjects. Therefore, measurement of PVL, TAX, and HBZ may be beneficial for monitoring and predicting HTLV-1-infection outcomes, and PVL may be useful for prognosis assessment of ATLL patients. This research demonstrates the possible correlation between these virological markers and survival in ATLL patients.

  12. Genetic modelling of PIM proteins in cancer: proviral tagging, cooperation with oncogenes, tumor suppressor genes and carcinogens.

    Directory of Open Access Journals (Sweden)

    Enara Aguirre

    2014-05-01

    The PIM proteins, which were initially discovered as proviral insertion sites in Moloney murine leukemia virus infection, are a family of highly homologous serine/threonine kinases that have been reported to be overexpressed in hematological malignancies and solid tumors. The PIM proteins have also been associated with metastasis and overall treatment responses and implicated in the regulation of apoptosis, metabolism, the cell cycle, and homing and migration, which makes these proteins interesting targets for anticancer drug discovery. The use of retroviral insertional mutagenesis and refined approaches such as complementation tagging has allowed the identification of myc, pim and a third group of genes (including bmi1 and gfi1) as complementing genes in lymphomagenesis. Moreover, mouse modeling of human cancer has provided an understanding of the molecular pathways that are involved in tumor initiation and progression at the physiological level. In particular, genetically modified mice have allowed researchers to further elucidate the role of each of the Pim isoforms in various tumor types. PIM kinases have been identified as weak oncogenes because experimental overexpression in lymphoid tissue, prostate and liver induces tumors at a relatively low incidence and with a long latency. However, very strong synergistic tumorigenicity between Pim1/2 and c-Myc and other oncogenes has been observed in lymphoid tissues. Mouse models have also been used to study whether the inhibition of specific PIM isoforms is required to prevent carcinogen-induced sarcomas, indicating that the absence of Pim2 and Pim3 greatly reduces sarcoma growth and bone invasion; the extent of this effect is similar to that observed in the absence of all 3 isoforms. This review will summarize some of the animal models that have been used to understand the isoform-specific contribution of PIM kinases to tumorigenesis.

  13. Hybridization Capture Reveals Evolution and Conservation across the Entire Koala Retrovirus Genome

    Science.gov (United States)

    Ishida, Yasuko; Cui, Pin; Vielgrader, Hanna; Helgen, Kristofer M.; Roca, Alfred L.; Greenwood, Alex D.

    2014-01-01

    The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin. PMID:24752422

  14. Hybridization capture reveals evolution and conservation across the entire Koala retrovirus genome.

    Directory of Open Access Journals (Sweden)

    Kyriakos Tsangaras

    The koala retrovirus (KoRV) is the only retrovirus known to be in the midst of invading the germ line of its host species. Hybridization capture and next generation sequencing were used on modern and museum DNA samples of koala (Phascolarctos cinereus) to examine ca. 130 years of evolution across the full KoRV genome. Overall, the entire proviral genome appeared to be conserved across time in sequence, protein structure and transcriptional binding sites. A total of 138 polymorphisms were detected, of which 72 were found in more than one individual. At every polymorphic site in the museum koalas, one of the character states matched that of modern KoRV. Among non-synonymous polymorphisms, radical substitutions involving large physiochemical differences between amino acids were elevated in env, potentially reflecting anti-viral immune pressure or avoidance of receptor interference. Polymorphisms were not detected within two functional regions believed to affect infectivity. Host sequences flanking proviral integration sites were also captured; with few proviral loci shared among koalas. Recently described variants of KoRV, designated KoRV-B and KoRV-J, were not detected in museum samples, suggesting that these variants may be of recent origin.

  15. Dynamic interaction between STLV-1 proviral load and T-cell response during chronic infection and after immunosuppression in non-human primates.

    Directory of Open Access Journals (Sweden)

    Sandrine Souquière

    We used mandrills (Mandrillus sphinx) naturally infected with simian T-cell leukemia virus type 1 (STLV-1) as a model for evaluating the influence of natural STLV-1 infection on the dynamics and evolution of the immune system during chronic infection. Furthermore, in order to evaluate the role of the immune system in controlling the infection during latency, we induced immunosuppression in the infected monkeys. We first showed that the STLV-1 proviral load was higher in males than in females and increased significantly with the duration of infection: mandrills infected for 10-6 years had a significantly higher proviral load than those infected for 2-4 years. Curiously, this observation was associated with a clear reduction in CD4+ T-cell number with age. We also found that the percentage of CD4+ T cells co-expressing the activation marker HLA-DR and the mean percentage of CD25+ in CD4+ and CD8+ T cells were significantly higher in infected than in uninfected animals. Furthermore, the STLV-1 proviral load correlated positively with T-cell activation but not with the frequency of T cells secreting interferon gamma in response to Tax peptides. Lastly, we showed that, during immunosuppression in infected monkeys, the percentages of CD8+ T cells expressing HLA-DR+ and of CD4+ T cells expressing the proliferation marker Ki67 decreased significantly, although the percentage of CD8+ T cells expressing HLA-DR+ and Ki67 increased significantly by the end of treatment. Interestingly, the proviral load increased significantly after immunosuppression in the monkey with the highest load. Our study demonstrates that mandrills naturally infected with STLV-1 could be a suitable model for studying the relations between host and virus. Further studies are needed to determine whether the different compartments of the immune response during infection induce the long latency by controlling viral replication over time. Such studies would provide important...

  16. Expression and Functional Characterization of the RIG-I-Like Receptors MDA5 and LGP2 in Rainbow Trout (Oncorhynchus mykiss)

    Science.gov (United States)

    Chang, Mingxian; Collet, Bertrand; Nie, Pin; Lester, Katherine; Campbell, Scott; Secombes, Christopher J.; Zou, Jun

    2011-01-01

    The retinoic acid-inducible gene I (RIG-I)-like receptors (RLR) comprise three homologues: RIG-I, melanoma differentiation-associated gene 5 (MDA5), and laboratory of genetics and physiology 2 (LGP2). They activate the host interferon (IFN) system upon recognition of viral RNA pathogen-associated molecular patterns (PAMPs) in the cytoplasm. Bioinformatic analysis of the sequenced vertebrate genomes suggests that the cytosolic surveillance system is conserved in lower vertebrates, and recent functional studies have confirmed that RIG-I is important to fish antiviral immunity. In this study, we have identified MDA5 and LGP2 homologues from rainbow trout Oncorhynchus mykiss and an additional LGP2 variant with an incomplete C-terminal domain of RIG-I. Trout MDA5 and LGP2 were constitutively produced in fibroblast and macrophage cell lines and upregulated by poly(I:C), recombinant IFN, or infection by RNA viruses (viral hemorrhagic septicemia virus and salmon alphavirus) with a single-stranded positive or negative genome. Overexpression of MDA5 and LGP2 but not of the LGP2 variant resulted in significant accumulation of Mx transcripts in cultured cells, which correlated with a marked enhancement of protection against viral infection. These results demonstrate that both MDA5 and LGP2 are important RLRs in host surveillance against infection of both negative and positive viruses and that the LGP2 variant with a deletion of 54 amino acids at the C terminus acts as a negative regulator for LGP2-elicited antiviral signaling by competing for the viral RNA PAMPs. Interestingly, MDA5 expression was not affected by overexpressed LGP2 in transfected cells and vice versa, suggesting that they likely act in parallel as positive regulators for IFN production. PMID:21680521

  17. Absence of ultraviolet-inducible DNA polymerase I-like activity in Escherichia coli strains harbouring R plasmids

    International Nuclear Information System (INIS)

    Upton, C.; Pinney, R.J.

    1981-01-01

    No DNA polymerase I-like activity was found associated with the ultraviolet (u.v.)-protecting plasmids R205, R46 or pKM101 in either uninduced or u.v.-induced wild-type or DNA polymerase I-deficient strains of Escherichia coli. Nor was any plasmid-associated polymerase activity detectable in similar systems containing u.v.-irradiated DNA as template. However, plasmids R205, R46 and pKM101 still increased survival and mutagenesis of the polymerase I-deficient E. coli strain after u.v. irradiation. (author)

  18. Stable integration of recombinant adeno-associated virus vector genomes after transduction of murine hematopoietic stem cells.

    Science.gov (United States)

    Han, Zongchao; Zhong, Li; Maina, Njeri; Hu, Zhongbo; Li, Xiaomiao; Chouthai, Nitin S; Bischof, Daniela; Weigel-Van Aken, Kirsten A; Slayton, William B; Yoder, Mervin C; Srivastava, Arun

    2008-03-01

    We previously reported that among single-stranded adeno-associated virus (ssAAV) vectors, serotypes 1 through 5, ssAAV1 is the most efficient in transducing murine hematopoietic stem cells (HSCs), but viral second-strand DNA synthesis remains a rate-limiting step. Subsequently, using double-stranded, self-complementary AAV (scAAV) vectors, serotypes 7 through 10, we observed that scAAV7 vectors also transduce murine HSCs efficiently. In the present study, we used scAAV1 and scAAV7 shuttle vectors to transduce HSCs in a murine bone marrow serial transplant model in vivo, which allowed examination of the AAV proviral integration pattern in the mouse genome, as well as recovery and nucleotide sequence analyses of AAV-HSC DNA junction fragments. The proviral genomes were stably integrated, and integration sites were localized to different mouse chromosomes. None of the integration sites was found to be in a transcribed gene, or near a cellular oncogene. None of the animals, monitored for up to 1 year, exhibited pathological abnormalities. Thus, AAV proviral integration-induced risk of oncogenesis was not found in our study, which provides functional confirmation of stable transduction of self-renewing multipotential HSCs by scAAV vectors as well as promise for the use of these vectors in the potential treatment of disorders of the hematopoietic system.

  19. Diagnosis and surgical treatment of a Chiari I-like malformation in an African lion (Panthera leo).

    Science.gov (United States)

    McCain, Stephanie; Souza, Marcy; Ramsay, Ed; Schumacher, Juergen; Hecht, Silke; Thomas, William

    2008-09-01

    A 13-mo-old intact male African lion (Panthera leo) presented with a 3-mo history of lethargy, ventral flexion of the neck, abnormal vocalization, and ataxia. Hemogram and serum biochemistries were within normal limits except for the presence of hypokalemia (2.7 mEq/L) and hypochloridemia (108 mEq/L). When no improvement was noted with oral potassium gluconate supplementation, a computed tomography scan of the brain and skull was performed, and no abnormalities were noted. However, magnetic resonance imaging detected occipital bone thickening, crowding of the caudal cranial fossa with cerebellar compression and herniation, and cervical syringohydromyelia, which was consistent with a Chiari I-like malformation. Foramen magnum decompression was performed to relieve the compression of the cerebellum. The animal recovered well with subsequent resolution of clinical signs. Hypovitaminosis A has been proposed previously as the underlying etiology for this malformation in lions with similar clinical presentations. This lion's serum and liver vitamin A concentrations were low (100 ng/ml and 25.31 microg/g, respectively) compared to concentrations reported for domestic carnivores and support hypovitaminosis A as the underlying cause of this animal's Chiari I-like malformation.

  20. An in vitro study on the risk of non-allergic type-I like hypersensitivity to Momordica charantia.

    Science.gov (United States)

    Sagkan, Rahsan Ilikci

    2013-10-26

    Momordica charantia (MC) is a tropical plant that is extensively used in folk medicine. However, relatively little is known about the side effects of this plant compared with its therapeutic effects. The aim of this study is to reveal the effects of non-allergic type-I like hypersensitivity to MC in an experiment designed in vitro. In the present study, the expression of CD63 and CD203c on peripheral blood basophils exposed to different dilutions of MC extracts was measured using flow cytometry, and the dilutions were compared with one another. In addition, intra-assay CVs of the tested extracts were calculated to assess the precision and reproducibility of the test results. It was observed that the fruit extract of MC at 1/100 and 1/1000 dilutions significantly increased active basophils compared to the same extract at 1/10000 dilution. In conclusion, Momordica charantia may elicit a non-allergic type-I like hypersensitivity reaction, especially in susceptible individuals.

  1. Communication: I like

    CERN Multimedia

    Staff Association

    2015-01-01

    To fulfill its mission to represent CERN personnel with the Management and the Member States, the Staff Council has set up a series of Commissions: employment conditions, pensions, legal matters, social protection, health and safety, InformAction, CAPA (individual cases) and, more recently, Media-Com. As its name suggests the Media-Com Commission deals with all matters of communication. The mandate of the new Commission is to implement and optimize the communication channels that the Staff Association uses to keep you informed. To attract the greatest number of people, Media-Com operates through multiple communication channels, such as articles in the Echo, the Staff Association information bulletin, the Staff Association website (http://staff-association.web.cern.ch/), Facebook, and, more recently, the intra-CERN Social platform. The Social platform is a discussion forum, for exchanging ideas, expressing views, reacting to, and commenting on current events of the Staff Association. To participa...

  2. Utilization of a DNA enzyme immunoassay for the detection of proviral DNA of human immunodeficiency virus type 1 by polymerase chain reaction.

    Science.gov (United States)

    Zella, D; Cavicchini, A; Cattaneo, E; Cimarelli, A; Bertazzoni, U

    1995-02-01

    The detection of proviral DNA by Polymerase Chain Reaction (PCR) is regarded as an important tool in the diagnosis of HIV-1 infection, especially among adults at risk of AIDS and children born to seropositive mothers. However, application of PCR in routine testing is hampered by the need to use radioactive probes. In this study, a non-radioactive test based on a microtiter plate (DNA Enzyme ImmunoAssay, DEIA) was used for the detection of proviral sequences of HIV-1 in peripheral blood cells of different patients. The results of the PCR-DEIA assay were compared to those obtained by liquid hybridization (PCR-LH), virus isolation (VI) and Western blot (WB). The study population included 92 patients belonging to three different groups: seropositive subjects with a well-defined clinical status and WB profile; adults at risk of infection with negative or indeterminate WB; children born to seropositive mothers with still unestablished HIV-1 infection. In the seropositive subjects, both PCR-LH and PCR-DEIA confirmed infection and gave the same results as WB. In adults at risk of infection, PCR with both methods anticipated the seroconversion in one patient with indeterminate WB and confirmed the absence of infection among seronegative and other indeterminate patients. In children born to seropositive mothers, both PCR systems as well as VI permitted an early diagnosis of infection, as confirmed by the clinical follow-up. This study has shown that in subjects at risk of AIDS and in children born to seropositive mothers, the non-isotopic DEIA method presents the same sensitivity and specificity for the detection of HIV-1 infection as the radioactive procedure. The DEIA method appears to be particularly useful for the detection of PCR products in routine diagnostic analyses.

  3. Proviral amplification of the Gypsy endogenous retrovirus of Drosophila melanogaster involves env-independent invasion of the female germline.

    OpenAIRE

    Chalvet, F; Teysset, L; Terzian, C; Prud'homme, N; Santamaria, P; Bucheton, A; Pélisson, A

    1999-01-01

    Gypsy is an infectious endogenous retrovirus of Drosophila melanogaster. The gypsy proviruses replicate very efficiently in the genome of the progeny of females homozygous for permissive alleles of the flamenco gene. This replicative transposition is correlated with derepression of gypsy expression, specifically in the somatic cells of the ovaries of the permissive mothers. The determinism of this amplification was studied further by making chimeric mothers containing different permissive/res...

  4. UBXN1 Interferes with Rig-I-like Receptor-Mediated Antiviral Immune Response by Targeting MAVS

    Directory of Open Access Journals (Sweden)

    Penghua Wang

    2013-04-01

    Full Text Available RNA viruses are sensed by RIG-I-like receptors (RLRs), which signal through a mitochondria-associated adaptor molecule, MAVS, resulting in systemic antiviral immune responses. Although RLR signaling is essential for limiting RNA virus replication, it must be stringently controlled to prevent damage from inflammation. We demonstrate here that among all tested UBX-domain-containing protein family members, UBXN1 exhibits the strongest inhibitory effect on RNA-virus-induced type I interferon response. UBXN1 potently inhibits RLR- and MAVS-induced, but not TLR3-, TLR4-, or DNA-virus-induced innate immune responses. Depletion of UBXN1 enhances virus-induced innate immune responses, including those resulting from RNA viruses such as vesicular stomatitis, Sendai, West Nile, and dengue virus infection, repressing viral replication. Following viral infection, UBXN1 is induced, binds to MAVS, interferes with intracellular MAVS oligomerization, and disrupts the MAVS/TRAF3/TRAF6 signalosome. These findings underscore a critical role of UBXN1 in the modulation of a major antiviral signaling pathway.

  5. RIG-I-like receptor-induced IRF3 mediated pathway of apoptosis (RIPA): a new antiviral pathway

    Directory of Open Access Journals (Sweden)

    Saurabh Chattopadhyay

    2016-11-01

    Full Text Available The innate immune response is the first line of host defense to eliminate viral infection. Pattern recognition receptors in the cytosol, such as RIG-I-like receptors (RLR) and Nod-like receptors (NLR), and membrane bound Toll like receptors (TLR) detect viral infection and initiate transcription of a cohort of antiviral genes, including interferon (IFN) and interferon stimulated genes (ISGs), which ultimately block viral replication. Another mechanism to reduce viral spread is through RIPA, the RLR-induced IRF3-mediated pathway of apoptosis, which causes infected cells to undergo premature death. The transcription factor IRF3 can mediate cellular antiviral responses by both inducing antiviral genes and triggering apoptosis through the activation of RIPA. The mechanism of IRF3 activation in RIPA is distinct from that of transcriptional activation; it requires linear polyubiquitination of specific lysine residues of IRF3. Using RIPA-active, but transcriptionally inactive, IRF3 mutants, it was shown that RIPA can prevent viral replication and pathogenesis in mice.

  6. RetroTector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences.

    Science.gov (United States)

    Sperber, Göran; Lövgren, Anders; Eriksson, Nils-Einar; Benachenhou, Farid; Blomberg, Jonas

    2009-06-16

    The rapid accumulation of genomic information in databases necessitates rapid and specific algorithms for extracting biologically meaningful information. More or less complete retroviral sequences, also called proviral or endogenous retroviral sequences (ERVs), constitute at least 5% of vertebrate genomes. After infecting the host, these retroviruses have integrated in germ line cells, and have then been carried in genomes for at least several hundred million years. A better understanding of the structure and function of these sequences can have profound biological and medical consequences. RetroTector (ReTe) is a platform-independent Java program for identification and characterization of proviral sequences in vertebrate genomes. The full ReTe requires a local installation with a MySQL database. Although not overly complicated, the installation may take some time. A "light" version of ReTe (RetroTector online; ROL), which does not require specific installation procedures, is provided via the World Wide Web. ROL (http://www.fysiologi.neuro.uu.se/jbgs/) was implemented under the Batchelor web interface (A Lövgren et al). It allows both GenBank accession number, file and FASTA cut-and-paste admission of sequences (5 to 10,000 kilobases). Up to ten submissions can be done simultaneously, allowing batch analysis of genome specific "brooms", which increase specificity. Proviral sequences can be hard to recognize

  7. Systematic investigation of electron impact excitation-autoionization from the ground state of highly charged GaI-like ions through ΔN=1 transitions

    International Nuclear Information System (INIS)

    Oreg, J.; Bar-Shalom, A.; Mandlebaum, P.; Mittnik, D.; Meroz, E.; Schwob, J.L.; Klapisch, M.

    1991-01-01

    A systematic variation in the line intensity ratios of GaI-like and ZnI-like ions of rare earth elements has been recently observed in spectra emitted in a low density, high temperature tokamak plasma. This variation is shown to be correlated with the gradual opening of autoionizing channels through inner-shell excited configurations of the GaI-like charge-state. These channels enhance the indirect ionization rate of GaI-like ions through excitation-autoionization (EA), affecting the ionization balance and temperatures of greatest abundance. We present a systematic investigation of EA and direct impact ionization (DI) in the GaI-like isoelectronic sequence from Mo (Z = 42) to Dy (Z = 66). As Z decreases from Dy to Pr (Z = 59) the levels of the configuration 3d⁹4p4f, which are excited from the ground state by strong dipole collisional transitions, gradually cross the first ionization limit of the ion and are responsible for this ionization enhancement. When Z decreases further an additional channel is opened through the configuration 3d⁹4p4d. 9 refs., 3 figs., 1 tab

  8. Systematic investigation of electron impact excitation-autoionization from the ground state of highly charged GaI-like ions through ΔN = 1 transitions

    International Nuclear Information System (INIS)

    Oreg, J.; Bar-Shalom, A.; Goldstein, W.H.; Mandlebaum, P.; Mittnik, D.; Meroz, E.; Schwob, J.L.; Klapisch, M.

    1991-01-01

    A systematic variation in the line intensity ratios of GaI-like and ZnI-like ions of rare earth elements has been recently observed in spectra emitted in a low density, high temperature tokamak plasma. This variation is shown to be correlated with the gradual opening of autoionizing channels through inner-shell excited configurations of the GaI-like charge-state. These channels enhance the indirect ionization rate of GaI-like ions through excitation-autoionization (EA), affecting the ionization balance and temperatures of greatest abundance. The authors present a systematic investigation of EA and direct impact ionization (DI) in the GaI-like isoelectronic sequence from Mo (Z = 42) to Dy (Z = 66). As Z decreases from Dy to Pr (Z = 59) the levels of the configuration 3d⁹4p4f, which are excited from the ground state by strong dipole collisional transitions, gradually cross the first ionization limit of the ion and are responsible for this ionization enhancement. When Z decreases further an additional channel is opened through the configuration 3d⁹4p4d

  9. Effect of excitation-autoionization processes on the line emission of ZnI- and GaI-like rare-earth ions in hot coronal plasmas

    International Nuclear Information System (INIS)

    Mandelbaum, P.; Finkenthal, M.; Meroz, E.; Schwob, J.L.; Oreg, J.; Goldstein, W.H.; Klapisch, M.; Osterheld, L.; Bar Shalom, A.; Lippman, S.; Huang, L.K.; Moos, H.W.

    1990-01-01

    A systematic variation in the line-intensity ratios of GaI- and ZnI-like Pr (Z=59) to Dy (Z=66) ions has been observed in spectra emitted by atoms injected in a low-density high-temperature tokamak plasma. This variation is shown to be correlated with the progressive closing of the autoionizing channels through the excited 3d⁹4s²4p4f configuration in the GaI-like ionization state as Z increases

  10. Lack of evidence to support the association of a single IL28B genotype SNP rs12979860 with the HTLV-1 clinical outcomes and proviral load

    Directory of Open Access Journals (Sweden)

    Sanabani Sabri Saeed

    2012-12-01

    Full Text Available Background: The interleukin 28B (IL28B) rs12979860 polymorphism was recently reported to be associated with the human T-cell leukemia virus type 1 (HTLV-1) proviral load (PvL) and the development of the HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP). Methods: In an attempt to examine this hypothesis, we assessed the association of the rs12979860 genotypes with HTLV-1 PvL levels and clinical status in 112 unrelated Brazilian subjects (81 HTLV-1 asymptomatic carriers, 24 individuals with HAM/TSP and 7 with Adult T cell Leukemia/Lymphoma (ATLL)). Results: All 112 samples were successfully genotyped and their PvLs compared. Neither the homozygote TT nor the heterozygote CT mutations nor the combination genotypes (TT/CT) were associated with a greater PvL. We also observed no significant difference in allele distribution between asymptomatic carriers and patients with HTLV-1 associated HAM/TSP. Conclusions: Our study failed to support the previously reported positive association between the IL28B rs12979860 polymorphisms and an increased risk of developing HAM/TSP in the Brazilian population.

  11. Evaluation of viremia, proviral load and cytokine profile in naturally feline immunodeficiency virus infected cats treated with two different protocols of recombinant feline interferon omega.

    Science.gov (United States)

    Leal, Rodolfo O; Gil, Solange; Duarte, Ana; McGahie, David; Sepúlveda, Nuno; Niza, Maria M R E; Tavares, Luís

    2015-04-01

    This study assesses viremia, provirus and blood cytokine profile in naturally FIV-infected cats treated with two distinct protocols of interferon omega (rFeIFN-ω). Samples from FIV-cats previously submitted to two single-arm studies were used: 7/18 received the licensed/subcutaneous protocol (SC) while 11/18 were treated orally (PO). Viremia, provirus and blood mRNA expression of interleukin (IL)-1, IL-4, IL-6, IL-10, IL-12p40, Interferon-γ and Tumor Necrosis Factor-α were monitored by Real-Time qPCR. Concurrent plasma levels of IL-6, IL-12p40 and IL-4 were assessed by ELISA. IL-6 plasma levels decreased in the SC group (p = 0.031). IL-6 mRNA expression (p = 0.037) decreased in the PO group, albeit not sufficiently to change concurrent plasma levels. Neither viremia nor other measured cytokines changed with therapy. Proviral load increased in the SC group (p = 0.031), which can be justified by a clinically irrelevant increase of lymphocyte count. Independently of the protocol, rFeIFN-ω seems to act on innate immunity by reducing pro-inflammatory stimulus. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Endogenous retroviruses in fish genomes: from relics of past infections to evolutionary innovations?

    Directory of Open Access Journals (Sweden)

    Magali Naville

    2016-08-01

    Full Text Available The increasing availability of fish genome sequences has provided new insights into the diversity and host distribution of retroviruses in fish and other vertebrates. This distribution can be assessed through the identification and analysis of endogenous retroviruses, which are proviral remnants of past infections integrated in genomes. Retroviral sequences are probably important for evolution through their ability to induce rearrangements and to contribute regulatory and coding sequences; they may also protect their host against new infections. We argue that the current mass of genome sequences will soon strongly improve our understanding of retrovirus diversity and evolution in aquatic animals, with the identification of new/re-emerging elements and host resistance genes that restrict their infectivity.

  13. Measles Virus Suppresses RIG-I-like Receptor Activation in Dendritic Cells via DC-SIGN-Mediated Inhibition of PP1 Phosphatases

    NARCIS (Netherlands)

    Mesman, Annelies W.; Zijlstra-Willems, Esther M.; Kaptein, Tanja M.; de Swart, Rik L.; Davis, Meredith E.; Ludlow, Martin; Duprex, W. Paul; Gack, Michaela U.; Gringhuis, Sonja I.; Geijtenbeek, Teunis B. H.

    2014-01-01

    Dendritic cells (DCs) are targets of measles virus (MV) and play central roles in viral dissemination. However, DCs express the RIG-I-like receptors (RLRs) RIG-I and Mda5 that sense MV and induce type I interferon (IFN) production. Given the potency of this antiviral response, RLRs are tightly

  14. Measles virus suppresses RIG-I-like receptor activation in dendritic cells via DC-SIGN-mediated inhibition of PP1 phosphatases

    NARCIS (Netherlands)

    A.W. Mesman (Annelies ); E.M. Zijlstra-Willems (Esther); T.M. Kaptein (Tanja); R.L. de Swart (Rik); M.E. Davis (Meredith); M. Ludlow (Martin); W.P. Duprex (Paul); M.U. Gack (Michaela); S.I. Gringhuis (Sonja); T.B.H. Geijtenbeek (Teunis)

    2014-01-01

    Dendritic cells (DCs) are targets of measles virus (MV) and play central roles in viral dissemination. However, DCs express the RIG-I-like receptors (RLRs) RIG-I and Mda5 that sense MV and induce type I interferon (IFN) production. Given the potency of this antiviral response, RLRs are

  15. Low Proviral Load is Associated with Indeterminate Western Blot Patterns in Human T-Cell Lymphotropic Virus Type 1 Infected Individuals: Could Punctual Mutations be Related?

    Directory of Open Access Journals (Sweden)

    Camila Cánepa

    2015-10-01

    Full Text Available Background: Indeterminate Western blot (WB) patterns are a major concern for diagnosis of human T-cell lymphotropic virus type 1 (HTLV-1) infection, even in non-endemic areas. Objectives: (a) to define the prevalence of indeterminate WB among different populations from Argentina; (b) to evaluate if low proviral load (PVL) is associated with indeterminate WB profiles; and (c) to describe mutations in LTR and tax sequence of these cases. Results: Among 2031 samples, 294 were reactive by screening. Of them, 48 (16.3%) were WB indeterminate and of those 15 (31.3%) were PCR+. Quantitative real-time PCR (qPCR) was performed on 52 HTLV-1+ samples, classified as Group 1 (G1): 25 WB+ samples from individuals with pathologies; Group 2 (G2): 18 WB+ samples from asymptomatic carriers (AC); and Group 3 (G3): 9 seroindeterminate samples from AC. Median PVL was 4.78, 2.38, and 0.15 HTLV-1 copies/100 PBMCs, respectively; a significant difference (p=0.003) was observed. Age and sex were associated with PVL in G1 and G2, respectively. Mutations in the distal and central regions of Tax Responsive Elements (TRE) 1 and 2 of G3 were observed, though not associated with PVL. The 8403A>G mutation of the distal region, previously related to high PVL, was absent in G3 but present in 50% of WB+ samples (p = 0.03). Conclusions: Indeterminate WB results confirmed later as HTLV-1 positive may be associated with low PVL levels. Mutations in LTR and tax are described; their functional relevance remains to be determined.
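    The proportions quoted in the abstract above can be checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record; the helper name `percent` and the variable names are illustrative assumptions, and the counts are the ones stated in the abstract.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as an unrounded percentage."""
    return 100.0 * part / whole

# Counts quoted in the abstract (variable names are illustrative).
reactive = 294          # samples reactive by screening
wb_indeterminate = 48   # WB-indeterminate samples among the reactive ones
pcr_positive = 15       # PCR+ samples among the WB-indeterminate ones

indeterminate_pct = percent(wb_indeterminate, reactive)
pcr_positive_pct = percent(pcr_positive, wb_indeterminate)

# Sizes of the three qPCR groups; they should add up to the 52 samples tested.
qpcr_groups = {"G1": 25, "G2": 18, "G3": 9}
qpcr_total = sum(qpcr_groups.values())

print(f"WB indeterminate: {indeterminate_pct:.2f}% of reactive samples")
print(f"PCR+: {pcr_positive_pct:.2f}% of WB-indeterminate samples")
print(f"qPCR samples: {qpcr_total}")
```

    Under these assumptions the computed values agree with the reported 16.3% and 31.3% after rounding to one decimal place, and the three group sizes add up to the 52 samples analysed by qPCR.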

  16. T-cell tropism of simian T-cell leukaemia virus type 1 and cytokine profiles in relation to proviral load and immunological changes during chronic infection of naturally infected mandrills (Mandrillus sphinx).

    Science.gov (United States)

    Souquière, Sandrine; Mouinga-Ondeme, Augustin; Makuwa, Maria; Beggio, Paola; Radaelli, Antonia; De Giuli Morghen, Carlo; Mortreux, Franck; Kazanji, Mirdad

    2009-08-01

    Although a wide variety of non-human primates are susceptible to simian T-cell leukaemia virus type 1 (STLV-1), little is known about the virological or molecular determinants of natural STLV-1 infection. We determined STLV-1 virus tropism in vivo and its relation to the immune response by evaluating cytokine production and T-cell subsets in naturally infected and uninfected mandrills. With real-time PCR methods, we found that STLV-1 in mandrills infects both CD4(+) and CD8(+) T cells; however, proviral loads were significantly higher (P = 0.01) in CD4(+) than in CD8(+) cells (the mean STLV-1 copy number per 100 cells (+/- SD) was 7.8 +/- 8 in CD4(+) T cells and 3.9 +/- 4.5 in CD8(+) T cells). After culture, STLV-1 provirus was detected in enriched CD4(+) but not in enriched CD8(+) T cells. After 6 months of culture, STLV-1-transformed cell lines expressing CD3(+), CD4(+) and HLADR(+) were established, and STLV-1 proteins and tax/rex mRNA were detected. In STLV-1 infected monkeys, there was a correlation between high proviral load and elevated levels of interleukin (IL)-2, IL-6, IL-10, interferon-gamma and tumour necrosis factor-alpha. The two monkeys with the highest STLV-1 proviral load had activated CD4(+)HLADR(+) and CD8(+)HLADR(+) T-cell subsets and a high percentage of CD25(+) in CD4(+) and CD8(+) T cells. Our study provides the first cellular, immunological and virological characterization of natural STLV-1 infection in mandrills and shows that they are an appropriate animal model for further physiopathological studies of the natural history of human T-cell leukaemia viruses.

  17. Cytokine profile and proviral load among Japanese immigrants and non-Japanese infected with HTLV-1 in a non-endemic area of Brazil.

    Directory of Open Access Journals (Sweden)

    João Américo Domingos

    Full Text Available The lifetime risk of HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP) development differs among ethnic groups. To better understand these differences, this prospective cohort study was conducted to investigate the cytokine profile and the HTLV-1 proviral load (PVL) in Japanese and non-Japanese populations with HAM/TSP and asymptomatic carriers (ACs). The serum IL-2, IL-4, IL-6, IL-10, IL-17, TNF-α, and IFN-γ levels were quantified using the Cytometric Bead Array in 40 HTLV-1-infected patients (11 HAM/TSP and 29 ACs) and 18 healthy controls (HCs) in Brazil. Among ACs, 15 were Japanese descendants and 14 were non-Japanese. Of 11 patients with HAM/TSP, only one was a Japanese descendant. The HTLV-1 PVL was quantified by real-time PCR. The HTLV-1 PVL was 2.7-fold higher in HAM/TSP patients than in ACs. Regardless of the clinical outcome, the PVL was significantly higher in patients younger than 60 years than in older patients. The HAM/TSP patients and ACs had higher IL-10 serum concentrations than HCs. The ACs also showed higher IL-6 serum levels than HCs. According to age, the IL-10 and IL-6 levels were higher in non-Japanese ACs older than 60 years. HAM/TSP patients showed a positive correlation between IL-6 and IL-17 and a negative correlation between the PVL and IL-17 and IFN-γ. In all ACs, a significant positive correlation was observed between IL-2 and IL-17 and a negative correlation was detected between IL-10 and TNF-α. Only 6.25% of the Japanese patients were symptomatic carriers, compared with 41.67% of the non-Japanese patients. In conclusion, this study showed that high levels of HTLV-1 PVL were intrinsically associated with the development of HAM/TSP. The higher HTLV-1 PVL and IL-10 levels found in non-Japanese ACs over 60 years old, compared with the Japanese group, suggest that ethnic background may influence the host immune status. More research also needs to be undertaken regarding the host
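    The symptomatic-carrier percentages in the abstract above follow from the group counts it reports. A minimal Python sketch of that arithmetic is given below; it is not part of the original record, the function and variable names are illustrative assumptions, and the non-Japanese counts are derived from the stated totals (29 ACs, 11 HAM/TSP).

```python
# Totals quoted in the abstract.
ac_total = 29        # asymptomatic carriers (ACs)
ham_tsp_total = 11   # patients with HAM/TSP

# Japanese counts are stated directly; non-Japanese counts are derived.
japanese = {"AC": 15, "HAM/TSP": 1}
non_japanese = {
    "AC": ac_total - japanese["AC"],
    "HAM/TSP": ham_tsp_total - japanese["HAM/TSP"],
}

def symptomatic_pct(group: dict) -> float:
    """Share of a group with HAM/TSP, as an unrounded percentage."""
    return 100.0 * group["HAM/TSP"] / (group["AC"] + group["HAM/TSP"])

japanese_pct = symptomatic_pct(japanese)
non_japanese_pct = symptomatic_pct(non_japanese)

print(f"Japanese: {japanese_pct:.2f}% symptomatic")
print(f"Non-Japanese: {non_japanese_pct:.2f}% symptomatic")
```

    With these counts, 1 of 16 Japanese patients and 10 of 24 non-Japanese patients had HAM/TSP, which matches the 6.25% and 41.67% given in the abstract.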

  18. Glucocorticoids facilitate the transcription from the human cytomegalovirus major immediate early promoter in glucocorticoid receptor- and nuclear factor-I-like protein-dependent manner

    International Nuclear Information System (INIS)

    Inoue-Toyoda, Maki; Kato, Kohsuke; Nagata, Kyosuke; Yoshikawa, Hiroyuki

    2015-01-01

    Human cytomegalovirus (HCMV) is a common and usually asymptomatic virus agent in healthy individuals. Initiation of HCMV productive infection depends on expression of the major immediate early (MIE) genes. The transcription of HCMV MIE genes is regulated by a diverse set of transcription factors. It was previously reported that productive HCMV infection is triggered probably by elevation of the plasma hydroxycorticoid level. However, it is poorly understood whether the transcription of MIE genes is directly regulated by glucocorticoid. Here, we found that dexamethasone (DEX), a synthetic glucocorticoid, facilitates the transcription of HCMV MIE genes through the MIE promoter and enhancer in a glucocorticoid receptor (GR)-dependent manner. By competitive EMSA and reporter assays, we revealed that an NF-I like protein is involved in DEX-mediated transcriptional activation of the MIE promoter. Thus, this study supports the notion that the increased level of hydroxycorticoid in the third trimester of pregnancy reactivates HCMV virus production from the latent state. - Highlights: • DEX facilitates the transcription from the HCMV MIE promoter. • GR is involved in DEX-dependent transcription from the HCMV MIE promoter. • A 17 bp repeat is responsible for the HCMV MIE promoter activation by DEX. • An NF-I-like protein is involved in the HCMV MIE promoter activation by DEX

  19. I like what I know

    Directory of Open Access Journals (Sweden)

    Onvara Oeusoonthornwattana

    2010-07-01

    Full Text Available What is the role of recognition in consumer choice? The recognition heuristic (RH) proposes that in situations where recognition is correlated with a decision criterion, recognized objects will be chosen more often than unrecognized ones, regardless of any other relevant information available about the recognized object. Past research has investigated this non-compensatory decision heuristic in inference. Here we report two experiments on preference using a naturalistic consumer choice task. Results revealed that, although recognition was a powerful driver of preferences, it was used in a compensatory rather than a non-compensatory way. Specifically, additional information learned about recognized brand objects significantly affected choices. It appears that recognition is processed in a compensatory manner and combined with other attributes in preferential choice.

  20. Extreme genomes

    OpenAIRE

    DeLong, Edward F

    2000-01-01

    The complete genome sequence of Thermoplasma acidophilum, an acid- and heat-loving archaeon, has recently been reported. Comparative genomic analysis of this 'extremophile' is providing new insights into the metabolic machinery, ecology and evolution of thermophilic archaea.

  1. Grass genomes

    OpenAIRE

    Bennetzen, Jeffrey L.; SanMiguel, Phillip; Chen, Mingsheng; Tikhonov, Alexander; Francki, Michael; Avramova, Zoya

    1998-01-01

    For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that in...

  2. De Novo Transcriptome Analysis Shows That SAV-3 Infection Upregulates Pattern Recognition Receptors of the Endosomal Toll-Like and RIG-I-Like Receptor Signaling Pathways in Macrophage/Dendritic Like TO-Cells

    Directory of Open Access Journals (Sweden)

    Cheng Xu

    2016-04-01

    Full Text Available A fundamental step in cellular defense mechanisms is the recognition of “danger signals” made of conserved pathogen associated molecular patterns (PAMPs) expressed by invading pathogens, by host cell germ line coded pattern recognition receptors (PRRs). In this study, we used RNA-seq and the Kyoto encyclopedia of genes and genomes (KEGG) to identify PRRs together with the network pathway of differentially expressed genes (DEGs) that recognize salmonid alphavirus subtype 3 (SAV-3) infection in macrophage/dendritic like TO-cells derived from Atlantic salmon (Salmo salar L.) headkidney leukocytes. Our findings show that recognition of SAV-3 in TO-cells was restricted to endosomal Toll-like receptors (TLRs) 3 and 8 together with RIG-I-like receptors (RLRs) and not the nucleotide-binding oligomerization domain (NOD)-like receptor (NLR) genes. Among the RLRs, upregulated genes included the retinoic acid inducible gene I (RIG-I), melanoma differentiation association 5 (MDA5) and laboratory of genetics and physiology 2 (LGP2). The study points to possible involvement of the tripartite motif containing 25 (TRIM25) and mitochondrial antiviral signaling protein (MAVS) in modulating RIG-I signaling, being the first report that links these genes to the RLR pathway in SAV-3 infection in TO-cells. Downstream signaling suggests that both the TLR and RLR pathways use interferon (IFN) regulatory factors (IRFs) 3 and 7 to produce IFN-a2. The validity of RNA-seq data generated in this study was confirmed by quantitative real time qRT-PCR showing that genes up- or downregulated by RNA-seq were also up- or downregulated by RT-PCR. Overall, this study shows that de novo transcriptome assembly identifies key receptors of the TLR and RLR sensors engaged in host pathogen interaction at cellular level. We envisage that data presented here can open a road map for future intervention strategies in SAV infection of salmon.

  3. Cancer genomics

    DEFF Research Database (Denmark)

    Norrild, Bodil; Guldberg, Per; Ralfkiær, Elisabeth Methner

    2007-01-01

    Almost all cells in the human body contain a complete copy of the genome with an estimated number of 25,000 genes. The sequences of these genes make up about three percent of the genome and comprise the inherited set of genetic information. The genome also contains information that determines whe...

  4. Detection and quantification of proviral HIV-1 184 M/V in circulating CD4(+) T cells of patients on HAART with a viremia less than 1000 copies/ml

    DEFF Research Database (Denmark)

    Mohey, Rajesh; Jørgensen, Anne Louise; Møller, Bjarne K

    2005-01-01

    Background: Highly active anti-retroviral therapy (HAART) effectively reduces HIV replication but does not completely hinder it. Sub-optimal therapy leads to HIV resistance to the drugs administered. However, the role of low-level viremia (viral-load less than 1000 copies/ml) on mutation genesis and incorporation of resistant forms in the long-lived CD4+ T cellular DNA compartment is not clear. Objective: To investigate the relationship between lamivudine associated mutant-type 184V and the wild-type 184M proviral forms in the circulating CD4+ T cells of patients and low-level viremia. Study design: Cross-sectional study of 50 patients on long-term HAART, with a viremia of less than 1000 copies/ml. Patients were stratified into three groups; on lamivudine, group I (viral load

  5. Hepatic expression of proteasome subunit alpha type-6 is upregulated during viral hepatitis and putatively regulates the expression of ISG15 ubiquitin-like modifier, a proviral host gene in hepatitis C virus infection.

    Science.gov (United States)

    Broering, R; Trippler, M; Werner, M; Real, C I; Megger, D A; Bracht, T; Schweinsberg, V; Sitek, B; Eisenacher, M; Meyer, H E; Baba, H A; Weber, F; Hoffmann, A-C; Gerken, G; Schlaak, J F

    2016-05-01

    The interferon-stimulated gene 15 (ISG15) plays an important role in the pathogenesis of hepatitis C virus (HCV) infection. ISG15-regulated proteins have previously been identified that putatively affect this proviral interaction. The present observational study aimed to elucidate the relation between ISG15 and these host factors during HCV infection. Transcriptomic and proteomic analyses were performed using liver samples of HCV-infected (n = 54) and uninfected (n = 10) or HBV-infected controls (n = 23). Primary human hepatocytes (PHH) were treated with Toll-like receptor ligands, interferons and kinase inhibitors. Expression of ISG15 and proteasome subunit alpha type-6 (PSMA6) was suppressed in subgenomic HCV replicon cell lines using specific siRNAs. Comparison of hepatic expression patterns revealed significantly increased signals for ISG15, IFIT1, HNRNPK and PSMA6 on the protein level as well as ISG15, IFIT1 and PSMA6 on the mRNA level in HCV-infected patients. In contrast to interferon-stimulated genes, PSMA6 expression occurred independent of HCV load and genotype. In PHH, the expression of ISG15 and PSMA6 was distinctly induced by poly(I:C), depending on IRF3 activation or PI3K/AKT signalling, respectively. Suppression of PSMA6 in HCV replicon cells led to significant induction of ISG15 expression, thus combined knock-down of both genes abrogated the antiviral effect induced by the separate suppression of ISG15. These data indicate that hepatic expression of PSMA6, which is upregulated during viral hepatitis, likely depends on TLR3 activation. PSMA6 affects the expression of immunoregulatory ISG15, a proviral factor in the pathogenesis of HCV infection. Therefore, the proteasome might be involved in the enigmatic interaction between ISG15 and HCV. © 2016 John Wiley & Sons Ltd.

  6. CRISPR/Cas9-Advancing Orthopoxvirus Genome Editing for Vaccine and Vector Development.

    Science.gov (United States)

    Okoli, Arinze; Okeke, Malachy I; Tryland, Morten; Moens, Ugo

    2018-01-22

    The clustered regularly interspaced short palindromic repeat (CRISPR)/associated protein 9 (Cas9) technology is revolutionizing genome editing approaches. Its high efficiency, specificity, versatility, flexibility, simplicity and low cost have made the CRISPR/Cas9 system preferable to other guided site-specific nuclease-based systems such as TALENs (Transcription Activator-like Effector Nucleases) and ZFNs (Zinc Finger Nucleases) in genome editing of viruses. CRISPR/Cas9 is presently being applied in constructing viral mutants, preventing virus infections, eradicating proviral DNA, and inhibiting viral replication in infected cells. The successful adaptation of CRISPR/Cas9 to editing the genome of Vaccinia virus paves the way for its application in editing other vaccine/vector-relevant orthopoxvirus (OPXV) strains. Thus, CRISPR/Cas9 can be used to resolve some of the major hindrances to the development of OPXV-based recombinant vaccines and vectors, including sub-optimal immunogenicity; transgene and genome instability; reversion of attenuation; potential of spread of transgenes to wildtype strains and close contacts, which are important biosafety and risk assessment considerations. In this article, we review the published literature on the application of CRISPR/Cas9 in virus genome editing and discuss the potentials of CRISPR/Cas9 in advancing OPXV-based recombinant vaccines and vectors. We also discuss the application of CRISPR/Cas9 in combating viruses of clinical relevance, the limitations of CRISPR/Cas9 and the current strategies to overcome them.

  7. CRISPR/Cas9—Advancing Orthopoxvirus Genome Editing for Vaccine and Vector Development

    Science.gov (United States)

    Okoli, Arinze; Okeke, Malachy I.; Tryland, Morten; Moens, Ugo

    2018-01-01

    The clustered regularly interspaced short palindromic repeat (CRISPR)/associated protein 9 (Cas9) technology is revolutionizing genome editing approaches. Its high efficiency, specificity, versatility, flexibility, simplicity and low cost have made the CRISPR/Cas9 system preferable to other guided site-specific nuclease-based systems such as TALENs (Transcription Activator-like Effector Nucleases) and ZFNs (Zinc Finger Nucleases) in genome editing of viruses. CRISPR/Cas9 is presently being applied in constructing viral mutants, preventing virus infections, eradicating proviral DNA, and inhibiting viral replication in infected cells. The successful adaptation of CRISPR/Cas9 to editing the genome of Vaccinia virus paves the way for its application in editing other vaccine/vector-relevant orthopoxvirus (OPXV) strains. Thus, CRISPR/Cas9 can be used to resolve some of the major hindrances to the development of OPXV-based recombinant vaccines and vectors, including sub-optimal immunogenicity; transgene and genome instability; reversion of attenuation; potential of spread of transgenes to wildtype strains and close contacts, which are important biosafety and risk assessment considerations. In this article, we review the published literature on the application of CRISPR/Cas9 in virus genome editing and discuss the potentials of CRISPR/Cas9 in advancing OPXV-based recombinant vaccines and vectors. We also discuss the application of CRISPR/Cas9 in combating viruses of clinical relevance, the limitations of CRISPR/Cas9 and the current strategies to overcome them. PMID:29361752

  8. Apoptosis, Toll-like, RIG-I-like and NOD-like Receptors Are Pathways Jointly Induced by Diverse Respiratory Bacterial and Viral Pathogens

    Science.gov (United States)

    Martínez, Isidoro; Oliveros, Juan C.; Cuesta, Isabel; de la Barrera, Jorge; Ausina, Vicente; Casals, Cristina; de Lorenzo, Alba; García, Ernesto; García-Fojeda, Belén; Garmendia, Junkal; González-Nicolau, Mar; Lacoma, Alicia; Menéndez, Margarita; Moranta, David; Nieto, Amelia; Ortín, Juan; Pérez-González, Alicia; Prat, Cristina; Ramos-Sevillano, Elisa; Regueiro, Verónica; Rodriguez-Frandsen, Ariel; Solís, Dolores; Yuste, José; Bengoechea, José A.; Melero, José A.

    2017-01-01

    Lower respiratory tract infections are among the top five leading causes of human death. Fighting these infections is therefore a world health priority. Searching for induced alterations in host gene expression shared by several relevant respiratory pathogens represents an alternative to identify new targets for wide-range host-oriented therapeutics. With this aim, alveolar macrophages were independently infected with three unrelated bacterial (Streptococcus pneumoniae, Klebsiella pneumoniae, and Staphylococcus aureus) and two dissimilar viral (respiratory syncytial virus and influenza A virus) respiratory pathogens, all of them highly relevant for human health. Cells were also activated with bacterial lipopolysaccharide (LPS) as a prototypical pathogen-associated molecular pattern. Patterns of differentially expressed cellular genes shared by the indicated pathogens were searched by microarray analysis. Most of the commonly up-regulated host genes were related to the innate immune response and/or apoptosis, with Toll-like, RIG-I-like and NOD-like receptors among the top 10 signaling pathways with over-expressed genes. These results identify new potential broad-spectrum targets to fight the important human infections caused by the bacteria and viruses studied here. PMID:28298903

  9. Effects of retinoic acid-inducible gene-I-like receptors activations and ionizing radiation cotreatment on cytotoxicity against human non-small cell lung cancer in vitro.

    Science.gov (United States)

    Yoshino, Hironori; Iwabuchi, Miyu; Kazama, Yuka; Furukawa, Maho; Kashiwakura, Ikuo

    2018-04-01

    Retinoic acid-inducible gene-I (RIG-I)-like receptors (RLRs) are pattern-recognition receptors that recognize pathogen-associated molecular patterns and induce antiviral immune responses. Recent studies have demonstrated that RLR activation induces antitumor immunity and cytotoxicity against different types of cancer, including lung cancer. A previous report has also demonstrated that ionizing radiation exerts a limited effect on RLR in human monocytic cell-derived macrophages, suggesting that RLR agonists may be used as effective immunostimulants during radiation therapy. However, it is unclear whether ionizing radiation affects the cytotoxicity of RLR agonists against cancer cells. Therefore, in the present study, the effects of cotreatment with ionizing radiation and RLR agonists on cytotoxicity against the human non-small cell lung cancer cells A549 and H1299 were investigated. Treatment with the RLR agonist poly(I:C)/LyoVec™ [poly(I:C)] exerted cytotoxic effects against human non-small cell lung cancer. The cytotoxic effects of poly(I:C) were enhanced by cotreatment with ionizing radiation, and poly(I:C) pretreatment resulted in the radiosensitization of non-small cell lung cancer. Furthermore, cotreatment of A549 and H1299 cells with poly(I:C) and ionizing radiation effectively induced apoptosis in a caspase-dependent manner compared with treatment with poly(I:C) or ionizing radiation alone. These results indicate that RLR agonists and ionizing radiation cotreatment effectively exert cytotoxic effects against human non-small cell lung cancer through caspase-mediated apoptosis.

  10. Genome Imprinting

    Indian Academy of Sciences (India)

    the cell nucleus (mitochondrial and chloroplast genomes), and. (3) traits governed ... tively good embryonic development but very poor development of membranes and ... Human homologies for the type of situation described above are naturally ..... imprint; (b) New modifications of the paternal genome in germ cells of each ...

  11. Baculovirus Genomics

    NARCIS (Netherlands)

    Oers, van M.M.; Vlak, J.M.

    2007-01-01

    Baculovirus genomes are covalently closed circles of double-stranded DNA varying in size between 80 and 180 kilobase pairs. The genomes of more than forty-one baculoviruses have been sequenced to date. The majority of these (37) are pathogenic to lepidopteran hosts; three infect sawflies

  12. Genomic Testing

    Science.gov (United States)

    ... this database. Top of Page Evaluation of Genomic Applications in Practice and Prevention (EGAPP™) In 2004, the Centers for Disease Control and Prevention launched the EGAPP initiative to establish and test a ... and other applications of genomic technology that are in transition from ...

  13. Ancient genomes

    OpenAIRE

    Hoelzel, A Rus

    2005-01-01

    Ever since its invention, the polymerase chain reaction has been the method of choice for work with ancient DNA. In an application of modern genomic methods to material from the Pleistocene, a recent study has instead undertaken to clone and sequence a portion of the ancient genome of the cave bear.

  14. Genome-wide association identifies multiple genomic regions associated with susceptibility to and control of ovine lentivirus.

    Directory of Open Access Journals (Sweden)

    Stephen N White

    BACKGROUND: Like human immunodeficiency virus (HIV), ovine lentivirus (OvLV) is macrophage-tropic and causes lifelong infection. OvLV infects one quarter of U.S. sheep and induces pneumonia and body condition wasting. There is no vaccine to prevent OvLV infection and no cost-effective treatment for infected animals. However, breed differences in prevalence and proviral concentration have indicated a genetic basis for susceptibility to OvLV. A recent study identified TMEM154 variants in OvLV susceptibility. The objective here was to identify additional loci associated with odds and/or control of OvLV infection. METHODOLOGY/PRINCIPAL FINDINGS: This genome-wide association study (GWAS) included 964 sheep from Rambouillet, Polypay, and Columbia breeds with serological status and proviral concentration phenotypes. Analytic models accounted for breed and age, as well as genotype. This approach identified TMEM154 (nominal P=9.2×10⁻⁷; empirical P=0.13), provided 12 additional genomic regions associated with odds of infection, and provided 13 regions associated with control of infection (all nominal P<1×10⁻⁵). Rapid decline of linkage disequilibrium with distance suggested many regions included few genes each. Genes in regions associated with odds of infection included DPPA2/DPPA4 (empirical P=0.006) and SYTL3 (P=0.051). Genes in regions associated with control of infection included a zinc finger cluster (ZNF192, ZSCAN16, ZNF389, and ZNF165; P=0.001), C19orf42/TMEM38A (P=0.047), and DLGAP1 (P=0.092). CONCLUSIONS/SIGNIFICANCE: These associations provide targets for mutation discovery in sheep susceptibility to OvLV. Aside from TMEM154, these genes have not been associated previously with lentiviral infection in any species, to our knowledge. Further, data from other species suggest functional hypotheses for future testing of these genes in OvLV and other lentiviral infections. Specifically, SYTL3 binds and may regulate RAB27A, which is required for enveloped

  15. The Genome Landscape of the African Green Monkey Kidney-Derived Vero Cell Line

    Science.gov (United States)

    Osada, Naoki; Kohara, Arihiro; Yamaji, Toshiyuki; Hirayama, Noriko; Kasai, Fumio; Sekizuka, Tsuyoshi; Kuroda, Makoto; Hanada, Kentaro

    2014-01-01

    Continuous cell lines that originate from mammalian tissues serve as not only invaluable tools for life sciences, but also important animal cell substrates for the production of various types of biological pharmaceuticals. Vero cells are susceptible to various types of microbes and toxins and have widely contributed to not only microbiology, but also the production of vaccines for human use. We here showed the genome landscape of a Vero cell line, in which 25,877 putative protein-coding genes were identified in the 2.97-Gb genome sequence. A homozygous ∼9-Mb deletion on chromosome 12 caused the loss of the type I interferon gene cluster and cyclin-dependent kinase inhibitor genes in Vero cells. In addition, an ∼59-Mb loss of heterozygosity around this deleted region suggested that the homozygosity of the deletion was established by a large-scale conversion. Moreover, a genomic analysis of Vero cells revealed a female Chlorocebus sabaeus origin and proviral variations of the endogenous simian type D retrovirus. These results revealed the genomic basis for the non-tumourigenic permanent Vero cell lineage susceptible to various pathogens and will be useful for generating new sub-lines and developing new tools in the quality control of Vero cells. PMID:25267831

  16. Herbarium genomics

    DEFF Research Database (Denmark)

    Bakker, Freek T.; Lei, Di; Yu, Jiaying

    2016-01-01

    Herbarium genomics is proving promising as next-generation sequencing approaches are well suited to deal with the usually fragmented nature of archival DNA. We show that routine assembly of partial plastome sequences from herbarium specimens is feasible, from total DNA extracts and with specimens...... up to 146 years old. We use genome skimming and an automated assembly pipeline, Iterative Organelle Genome Assembly, that assembles paired-end reads into a series of candidate assemblies, the best one of which is selected based on likelihood estimation. We used 93 specimens from 12 different...... correlation between plastome coverage and nuclear genome size (C value) in our samples, but the range of C values included is limited. Finally, we conclude that routine plastome sequencing from herbarium specimens is feasible and cost-effective (compared with Sanger sequencing or plastome...

  17. The Relative Influence of Metal Ion Binding Sites in the I-like Domain and the Interface with the Hybrid Domain on Rolling and Firm Adhesion by Integrin α4β7*

    OpenAIRE

    Chen, JianFeng; Takagi, Junichi; Xie, Can; Xiao, Tsan; Luo, Bing-Hao; Springer, Timothy A.

    2004-01-01

    We examined the effect of conformational change at the β7 I-like/hybrid domain interface on regulating the transition between rolling and firm adhesion by integrin α4β7. An N-glycosylation site was introduced into the I-like/hybrid domain interface to act as a wedge and to stabilize the open conformation of this interface and hence the open conformation of the α4β7 headpiece. Wild-type α4β7 mediates rolling adhesion in Ca2+ and Ca2+/Mg2+ but firm adhesion in Mg2+ and Mn2+. Stabilizing the ope...

  18. Variability of HIV-1 genomes among children and adolescents from Sao Paulo, Brazil.

    Directory of Open Access Journals (Sweden)

    Sabri Saeed Sanabani

    BACKGROUND: Genetic variability is a major feature of the human immunodeficiency virus type 1 (HIV-1) and is considered the key factor frustrating efforts to halt the virus epidemic. In this study, we aimed to investigate the genetic variability of HIV-1 strains among children and adolescents born from 1992 to 2009 in the state of Sao Paulo, Brazil. METHODOLOGY: Plasma and peripheral blood mononuclear cells (PBMC) were collected from 51 HIV-1-positive children and adolescents on ART followed between September 1992 and July 2009. After extraction, the genetic materials were used in a polymerase chain reaction (PCR) to amplify the viral near full length genomes (NFLGs) from 5 overlapping fragments. NFLGs and partial amplicons were directly sequenced and data were phylogenetically inferred. RESULTS: Of the 51 samples studied, the NFLGs and partial fragments of HIV-1 from 42 PBMCs and 25 plasma were successfully subtyped. Results based on proviral DNA revealed that 22 (52.4%) patients were infected with subtype B, 16 (38.1%) were infected with BF1 mosaic variants and 4 (9.5%) were infected with sub-subtype F1. All the BF1 recombinants were unique and distinct from any previously identified unique or circulating recombinant forms in South America. Evidence of dual infections was detected in 3 patients coinfected with the same or distinct HIV-1 subtypes. Ten of the 31 (32.2%) and 12 of the 21 (57.1%) subjects with recovered proviral and plasma protease sequences, respectively, were infected with major mutants resistant to protease inhibitors. The V3 sequences of 14 patients with available sequences from PBMC and/or plasma were predicted to be R5-tropic virus except for two patients who harbored an X4 strain. CONCLUSIONS: The high proportion of HIV-1 BF1 recombinant, coinfection rate and vertical transmission in Brazil merits urgent attention and effective measures to reduce the transmission of HIV among spouses and sex partners.
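
    The subtype proportions quoted in this abstract can be rechecked from the reported counts. The following is only a minimal sketch: the dictionary and the function name `proportions` are illustrative and do not come from the study, and the denominator is assumed to be the 42 PBMC samples with subtyped proviral DNA.

```python
# Counts of proviral subtypes as reported in the abstract
# (assumed denominator: the 42 successfully subtyped PBMC samples).
subtype_counts = {"B": 22, "BF1": 16, "F1": 4}


def proportions(counts):
    """Return each count as a percentage of the total, rounded to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


if __name__ == "__main__":
    for name, pct in proportions(subtype_counts).items():
        print(f"{name}: {pct}%")
```

    Under that assumption the three counts sum to 42, and the recomputed percentages match the figures given for subtype B, BF1 and F1.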

  19. Genomic Amplification of an Endogenous Retrovirus in Zebrafish T-Cell Malignancies

    Directory of Open Access Journals (Sweden)

    J. Kimble Frazer

    2012-01-01

    Genomic instability plays a crucial role in oncogenesis. Somatically acquired mutations can disable some genes and inappropriately activate others. In addition, chromosomal rearrangements can amplify, delete, or even fuse genes, altering their functions and contributing to malignant phenotypes. Using array comparative genomic hybridization (aCGH), a technique to detect numeric variations between different DNA samples, we examined genomes from zebrafish (Danio rerio) T-cell leukemias of three cancer-prone lines. In all malignancies tested, we identified recurring amplifications of a zebrafish endogenous retrovirus. This retrovirus, ZFERV, was first identified due to high expression of proviral transcripts in thymic tissue from larval and adult fish. We confirmed ZFERV amplifications by quantitative PCR analyses of DNA from wild-type fish tissue and normal and malignant D. rerio T cells. We also quantified ZFERV RNA expression and found that normal and neoplastic T cells both produce retrovirally encoded transcripts, but most cancers show dramatically increased transcription. In aggregate, these data imply that ZFERV amplification and transcription may be related to T-cell leukemogenesis. Based on these data and ZFERV’s phylogenetic relation to viruses of the murine-leukemia-related virus class of gammaretroviridae, we posit that ZFERV may be oncogenic via an insertional mutagenesis mechanism.

  20. Cephalopod genomics

    DEFF Research Database (Denmark)

    Albertin, Caroline B.; Bonnaud, Laure; Brown, C. Titus

    2012-01-01

    The Cephalopod Sequencing Consortium (CephSeq Consortium) was established at a NESCent Catalysis Group Meeting, "Paths to Cephalopod Genomics-Strategies, Choices, Organization," held in Durham, North Carolina, USA on May 24-27, 2012. Twenty-eight participants representing nine countries (Austria......, Australia, China, Denmark, France, Italy, Japan, Spain and the USA) met to address the pressing need for genome sequencing of cephalopod mollusks. This group, drawn from cephalopod biologists, neuroscientists, developmental and evolutionary biologists, materials scientists, bioinformaticians and researchers...... active in sequencing, assembling and annotating genomes, agreed on a set of cephalopod species of particular importance for initial sequencing and developed strategies and an organization (CephSeq Consortium) to promote this sequencing. The conclusions and recommendations of this meeting are described...

  1. Genome Sequencing

    DEFF Research Database (Denmark)

    Sato, Shusei; Andersen, Stig Uggerhøj

    2014-01-01

    The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based on transcr...

  2. Comparative Genomics

    Indian Academy of Sciences (India)

    Comparative Genomics - A Powerful New Tool in Biology. Anand K Bachhawat. General Article, Resonance – Journal of Science Education, Volume 11, Issue 8, August 2006, pp 22-40.

  3. Personal genomics services: whose genomes?

    Science.gov (United States)

    Gurwitz, David; Bregman-Eschet, Yael

    2009-07-01

    New companies offering personal whole-genome information services over the internet are dynamic and highly visible players in the personal genomics field. For fees currently ranging from US$399 to US$2500 and a vial of saliva, individuals can now purchase online access to their individual genetic information regarding susceptibility to a range of chronic diseases and phenotypic traits based on a genome-wide SNP scan. Most of the companies offering such services are based in the United States, but their clients may come from nearly anywhere in the world. Although the scientific validity, clinical utility and potential future implications of such services are being hotly debated, several ethical and regulatory questions related to direct-to-consumer (DTC) marketing strategies of genetic tests have not yet received sufficient attention. For example, how can we minimize the risk of unauthorized third parties submitting other people's DNA for testing? Another pressing question concerns the ownership of (genotypic and phenotypic) information, as well as the unclear legal status of customers regarding their own personal information. Current legislation in the US and Europe falls short of providing clear answers to these questions. Until the regulation of personal genomics services catches up with the technology, we call upon commercial providers to self-regulate and coordinate their activities to minimize potential risks to individual privacy. We also point out some specific steps, along the trustee model, that providers of DTC personal genomics services as well as regulators and policy makers could consider for addressing some of the concerns raised below.

  4. Visualization for genomics: the Microbial Genome Viewer.

    NARCIS (Netherlands)

    Kerkhoven, R.; Enckevort, F.H.J. van; Boekhorst, J.; Molenaar, D; Siezen, R.J.

    2004-01-01

    SUMMARY: A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a

  5. The relative influence of metal ion binding sites in the I-like domain and the interface with the hybrid domain on rolling and firm adhesion by integrin alpha4beta7.

    Science.gov (United States)

    Chen, JianFeng; Takagi, Junichi; Xie, Can; Xiao, Tsan; Luo, Bing-Hao; Springer, Timothy A

    2004-12-31

    We examined the effect of conformational change at the beta(7) I-like/hybrid domain interface on regulating the transition between rolling and firm adhesion by integrin alpha(4)beta(7). An N-glycosylation site was introduced into the I-like/hybrid domain interface to act as a wedge and to stabilize the open conformation of this interface and hence the open conformation of the alpha(4) beta(7) headpiece. Wild-type alpha(4)beta(7) mediates rolling adhesion in Ca(2+) and Ca(2+)/Mg(2+) but firm adhesion in Mg(2+) and Mn(2+). Stabilizing the open headpiece resulted in firm adhesion in all divalent cations. The interaction between metal binding sites in the I-like domain and the interface with the hybrid domain was examined in double mutants. Changes at these two sites can either counterbalance one another or be additive, emphasizing mutuality and the importance of multiple interfaces in integrin regulation. A double mutant with counterbalancing deactivating ligand-induced metal ion binding site (LIMBS) and activating wedge mutations could still be activated by Mn(2+), confirming the importance of the adjacent to metal ion-dependent adhesion site (ADMIDAS) in integrin activation by Mn(2+). Overall, the results demonstrate the importance of headpiece allostery in the conversion of rolling to firm adhesion.

  6. The Relative Influence of Metal Ion Binding Sites in the I-like Domain and the Interface with the Hybrid Domain on Rolling and Firm Adhesion by Integrin α4β7*

    Science.gov (United States)

    Chen, JianFeng; Takagi, Junichi; Xie, Can; Xiao, Tsan; Luo, Bing-Hao; Springer, Timothy A.

    2015-01-01

    We examined the effect of conformational change at the β7 I-like/hybrid domain interface on regulating the transition between rolling and firm adhesion by integrin α4β7. An N-glycosylation site was introduced into the I-like/hybrid domain interface to act as a wedge and to stabilize the open conformation of this interface and hence the open conformation of the α4β7 headpiece. Wild-type α4β7 mediates rolling adhesion in Ca2+ and Ca2+/Mg2+ but firm adhesion in Mg2+ and Mn2+. Stabilizing the open headpiece resulted in firm adhesion in all divalent cations. The interaction between metal binding sites in the I-like domain and the interface with the hybrid domain was examined in double mutants. Changes at these two sites can either counterbalance one another or be additive, emphasizing mutuality and the importance of multiple interfaces in integrin regulation. A double mutant with counterbalancing deactivating ligand-induced metal ion binding site (LIMBS) and activating wedge mutations could still be activated by Mn2+, confirming the importance of the adjacent to metal ion-dependent adhesion site (ADMIDAS) in integrin activation by Mn2+. Overall, the results demonstrate the importance of headpiece allostery in the conversion of rolling to firm adhesion. PMID:15448154

  7. Ancient genomics

    DEFF Research Database (Denmark)

    Der Sarkissian, Clio; Allentoft, Morten Erik; Avila Arcos, Maria del Carmen

    2015-01-01

    throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained...... by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans...

  8. Marine genomics

    DEFF Research Database (Denmark)

    Oliveira Ribeiro, Ângela Maria; Foote, Andrew David; Kupczok, Anne

    2017-01-01

    Marine ecosystems occupy 71% of the surface of our planet, yet we know little about their diversity. Although the inventory of species is continually increasing, as registered by the Census of Marine Life program, only about 10% of the estimated two million marine species are known. This lag......-throughput sequencing approaches have been helping to improve our knowledge of marine biodiversity, from the rich microbial biota that forms the base of the tree of life to a wealth of plant and animal species. In this review, we present an overview of the applications of genomics to the study of marine life, from...

  9. Clonal expansion of genome-intact HIV-1 in functionally polarized Th1 CD4+ T cells.

    Science.gov (United States)

    Lee, Guinevere Q; Orlova-Fink, Nina; Einkauf, Kevin; Chowdhury, Fatema Z; Sun, Xiaoming; Harrington, Sean; Kuo, Hsiao-Hsuan; Hua, Stephane; Chen, Hsiao-Rong; Ouyang, Zhengyu; Reddy, Kavidha; Dong, Krista; Ndung'u, Thumbi; Walker, Bruce D; Rosenberg, Eric S; Yu, Xu G; Lichterfeld, Mathias

    2017-06-30

    HIV-1 causes a chronic, incurable disease due to its persistence in CD4+ T cells that contain replication-competent provirus, but exhibit little or no active viral gene expression and effectively resist combination antiretroviral therapy (cART). These latently infected T cells represent an extremely small proportion of all circulating CD4+ T cells but possess a remarkable long-term stability and typically persist throughout life, for reasons that are not fully understood. Here we performed massive single-genome, near-full-length next-generation sequencing of HIV-1 DNA derived from unfractionated peripheral blood mononuclear cells, ex vivo-isolated CD4+ T cells, and subsets of functionally polarized memory CD4+ T cells. This approach identified multiple sets of independent, near-full-length proviral sequences from cART-treated individuals that were completely identical, consistent with clonal expansion of CD4+ T cells harboring intact HIV-1. Intact, near-full-genome HIV-1 DNA sequences that were derived from such clonally expanded CD4+ T cells constituted 62% of all analyzed genome-intact sequences in memory CD4 T cells, were preferentially observed in Th1-polarized cells, were longitudinally detected over a duration of up to 5 years, and were fully replication- and infection-competent. Together, these data suggest that clonal proliferation of Th1-polarized CD4+ T cells encoding for intact HIV-1 represents a driving force for stabilizing the pool of latently infected CD4+ T cells.

  10. Ensembl Genomes 2016: more genomes, more complexity.

    Science.gov (United States)

    Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

    2016-01-04

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Rodent malaria parasites : genome organization & comparative genomics

    NARCIS (Netherlands)

    Kooij, Taco W.A.

    2006-01-01

    The aim of the studies described in this thesis was to investigate the genome organization of rodent malaria parasites (RMPs) and compare the organization and gene content of the genomes of RMPs and the human malaria parasite P. falciparum. The release of the complete genome sequence of P.

  12. Funding Opportunity: Genomic Data Centers

    Science.gov (United States)

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  13. Exploring Other Genomes: Bacteria.

    Science.gov (United States)

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeruginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  14. Genomics With Cloud Computing

    OpenAIRE

    Sukhamrit Kaur; Sandeep Kaur

    2015-01-01

    Genomics is the study of the genome, which produces large amounts of data that require large storage and computation power. These issues are addressed by cloud computing, which provides various cloud platforms for genomics. These platforms provide many services to users, such as easy access to data, easy sharing and transfer, storage in the hundreds of terabytes, and more computational power. Some cloud platforms are Google Genomics, DNAnexus and Globus Genomics. Various features of cloud computin...

  15. Genome Maps, a new generation genome browser.

    Science.gov (United States)

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org.

  16. JGI Fungal Genomics Program

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  17. Genomic Encyclopedia of Fungi

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and the environment are the focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to the development of parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  18. Seroprevalence and genomic divergence of circulating strains of feline immunodeficiency virus among Felidae and Hyaenidae species.

    Science.gov (United States)

    Troyer, Jennifer L; Pecon-Slattery, Jill; Roelke, Melody E; Johnson, Warren; VandeWoude, Sue; Vazquez-Salat, Nuria; Brown, Meredith; Frank, Laurence; Woodroffe, Rosie; Winterbach, Christiaan; Winterbach, Hanlie; Hemson, Graham; Bush, Mitch; Alexander, Kathleen A; Revilla, Eloy; O'Brien, Stephen J

    2005-07-01

    Feline immunodeficiency virus (FIV) infects numerous wild and domestic feline species and is closely related to human immunodeficiency virus (HIV) and simian immunodeficiency virus (SIV). Species-specific strains of FIV have been described for domestic cat (Felis catus), puma (Puma concolor), lion (Panthera leo), leopard (Panthera pardus), and Pallas' cat (Otocolobus manul). Here, we employ a three-antigen Western blot screening (domestic cat, puma, and lion FIV antigens) and PCR analysis to survey worldwide prevalence, distribution, and genomic differentiation of FIV based on 3,055 specimens from 35 Felidae and 3 Hyaenidae species. Although FIV infects a wide variety of host species, it is confirmed to be endemic in free-ranging populations of nine Felidae and one Hyaenidae species. These include the large African carnivores (lion, leopard, cheetah, and spotted hyena), where FIV is widely distributed in multiple populations; most of the South American felids (puma, jaguar, ocelot, margay, Geoffroy's cat, and tigrina), which maintain a lower FIV-positive level throughout their range; and two Asian species, the Pallas' cat, which has a species-specific strain of FIV, and the leopard cat, which has a domestic cat FIV strain in one population. Phylogenetic analysis of FIV proviral sequence demonstrates that most species for which FIV is endemic harbor monophyletic, genetically distinct species-specific FIV strains, suggesting that FIV transfer between cat species has occurred in the past but is quite infrequent today.

  19. Seroprevalence and Genomic Divergence of Circulating Strains of Feline Immunodeficiency Virus among Felidae and Hyaenidae Species

    Science.gov (United States)

    Troyer, Jennifer L.; Pecon-Slattery, Jill; Roelke, Melody E.; Johnson, Warren; VandeWoude, Sue; Vazquez-Salat, Nuria; Brown, Meredith; Frank, Laurence; Woodroffe, Rosie; Winterbach, Christiaan; Winterbach, Hanlie; Hemson, Graham; Bush, Mitch; Alexander, Kathleen A.; Revilla, Eloy; O'Brien, Stephen J.

    2005-01-01

    Feline immunodeficiency virus (FIV) infects numerous wild and domestic feline species and is closely related to human immunodeficiency virus (HIV) and simian immunodeficiency virus (SIV). Species-specific strains of FIV have been described for domestic cat (Felis catus), puma (Puma concolor), lion (Panthera leo), leopard (Panthera pardus), and Pallas' cat (Otocolobus manul). Here, we employ a three-antigen Western blot screening (domestic cat, puma, and lion FIV antigens) and PCR analysis to survey worldwide prevalence, distribution, and genomic differentiation of FIV based on 3,055 specimens from 35 Felidae and 3 Hyaenidae species. Although FIV infects a wide variety of host species, it is confirmed to be endemic in free-ranging populations of nine Felidae and one Hyaenidae species. These include the large African carnivores (lion, leopard, cheetah, and spotted hyena), where FIV is widely distributed in multiple populations; most of the South American felids (puma, jaguar, ocelot, margay, Geoffroy's cat, and tigrina), which maintain a lower FIV-positive level throughout their range; and two Asian species, the Pallas' cat, which has a species-specific strain of FIV, and the leopard cat, which has a domestic cat FIV strain in one population. Phylogenetic analysis of FIV proviral sequence demonstrates that most species for which FIV is endemic harbor monophyletic, genetically distinct species-specific FIV strains, suggesting that FIV transfer between cat species has occurred in the past but is quite infrequent today. PMID:15956574

  20. Genomics With Cloud Computing

    Directory of Open Access Journals (Sweden)

    Sukhamrit Kaur

    2015-04-01

    Genomics is the study of genomes, and it produces large amounts of data that require substantial storage and computational power. Cloud computing addresses these needs through various cloud platforms for genomics. These platforms provide users with services such as easy access to data, easy sharing and transfer, storage in the hundreds of terabytes, and greater computational power. Examples of such platforms are Google Genomics, DNAnexus, and Globus Genomics. Benefits of cloud computing for genomics include easy access to and sharing of data, data security, and lower resource costs, but there are still drawbacks, such as the long time needed to transfer data and limited network bandwidth.

  1. Comparative Genome Analysis and Genome Evolution

    NARCIS (Netherlands)

    Snel, Berend

    2002-01-01

    This thesis describes a collection of bioinformatic analyses on complete genome sequence data. We have studied the evolution of gene content and find that vertical inheritance dominates over horizontal gene transfer, even to the extent that we can use the gene content to make genome phylogenies.

  2. Genomic Data Commons launches

    Science.gov (United States)

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  3. Rat Genome Database (RGD)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Rat Genome Database (RGD) is a collaborative effort between leading research institutions involved in rat genetic and genomic research to collect, consolidate,...

  4. Visualization for genomics: the Microbial Genome Viewer.

    Science.gov (United States)

    Kerkhoven, Robert; van Enckevort, Frank H J; Boekhorst, Jos; Molenaar, Douwe; Siezen, Roland J

    2004-07-22

    A Web-based visualization tool, the Microbial Genome Viewer, is presented that allows the user to combine complex genomic data in a highly interactive way. This Web tool enables the interactive generation of chromosome wheels and linear genome maps from genome annotation data stored in a MySQL database. The generated images are in scalable vector graphics (SVG) format, which is suitable for creating high-quality scalable images and dynamic Web representations. Gene-related data such as transcriptome and time-course microarray experiments can be superimposed on the maps for visual inspection. The Microbial Genome Viewer 1.0 is freely available at http://www.cmbi.kun.nl/MGV

  5. Genomic prediction using subsampling

    OpenAIRE

    Xavier, Alencar; Xu, Shizhong; Muir, William; Rainey, Katy Martin

    2017-01-01

    Background Genome-wide assisted selection is a critical tool for the genetic improvement of plants and animals. Whole-genome regression models in Bayesian framework represent the main family of prediction methods. Fitting such models with a large number of observations involves a prohibitive computational burden. We propose the use of subsampling bootstrap Markov chain in genomic prediction. Such method consists of fitting whole-genome regression models by subsampling observations in each rou...

  6. Genomic organization, sequence divergence, and recombination of feline immunodeficiency virus from lions in the wild

    Science.gov (United States)

    Pecon-Slattery, Jill; McCracken, Carrie L; Troyer, Jennifer L; VandeWoude, Sue; Roelke, Melody; Sondgeroth, Kerry; Winterbach, Christiaan; Winterbach, Hanlie; O'Brien, Stephen J

    2008-01-01

    Background Feline immunodeficiency virus (FIV) naturally infects multiple species of cat and is related to human immunodeficiency virus in humans. FIV infection causes AIDS-like disease and mortality in the domestic cat (Felis catus) and serves as a natural model for HIV infection in humans. In African lions (Panthera leo) and other exotic felid species, disease etiology introduced by FIV infection is less clear, but recent studies indicate that FIV causes moderate to severe CD4 depletion. Results In this study, comparative genomic methods are used to evaluate the full proviral genome of two geographically distinct FIV subtypes isolated from free-ranging lions. Genome organization of FIVPle subtype B (9891 bp) from lions in the Serengeti National Park in Tanzania and FIVPle subtype E (9899 bp) isolated from lions in the Okavango Delta in Botswana both resemble FIV genome sequence from puma, Pallas cat and domestic cat across 5' LTR, gag, pol, vif, orfA, env, rev and 3'LTR regions. Comparative analyses of available full-length FIV consisting of subtypes A, B and C from FIVFca, Pallas cat FIVOma and two puma FIVPco subtypes A and B recapitulate the species-specific monophyly of FIV marked by high levels of genetic diversity both within and between species. Across all FIVPle gene regions except env, lion subtypes B and E are monophyletic, and marginally more similar to Pallas cat FIVOma than to other FIV. Sequence analyses indicate the SU and TM regions of env vary substantially between subtypes, with FIVPle subtype E more related to domestic cat FIVFca than to FIVPle subtype B and FIVOma, likely reflecting recombination between strains in the wild. Conclusion This study demonstrates the necessity of whole-genome analysis to complement population/gene-based studies, which are of limited utility in uncovering complex events such as recombination that may lead to functional differences in virulence and pathogenicity. These full-length lion lentiviruses are integral to

  7. Genomic organization, sequence divergence, and recombination of feline immunodeficiency virus from lions in the wild

    Directory of Open Access Journals (Sweden)

    Sondgeroth Kerry

    2008-02-01

    Background Feline immunodeficiency virus (FIV) naturally infects multiple species of cat and is related to human immunodeficiency virus in humans. FIV infection causes AIDS-like disease and mortality in the domestic cat (Felis catus) and serves as a natural model for HIV infection in humans. In African lions (Panthera leo) and other exotic felid species, disease etiology introduced by FIV infection is less clear, but recent studies indicate that FIV causes moderate to severe CD4 depletion. Results In this study, comparative genomic methods are used to evaluate the full proviral genome of two geographically distinct FIV subtypes isolated from free-ranging lions. Genome organization of FIVPle subtype B (9891 bp) from lions in the Serengeti National Park in Tanzania and FIVPle subtype E (9899 bp) isolated from lions in the Okavango Delta in Botswana both resemble FIV genome sequence from puma, Pallas cat and domestic cat across 5' LTR, gag, pol, vif, orfA, env, rev and 3'LTR regions. Comparative analyses of available full-length FIV consisting of subtypes A, B and C from FIVFca, Pallas cat FIVOma and two puma FIVPco subtypes A and B recapitulate the species-specific monophyly of FIV marked by high levels of genetic diversity both within and between species. Across all FIVPle gene regions except env, lion subtypes B and E are monophyletic, and marginally more similar to Pallas cat FIVOma than to other FIV. Sequence analyses indicate the SU and TM regions of env vary substantially between subtypes, with FIVPle subtype E more related to domestic cat FIVFca than to FIVPle subtype B and FIVOma, likely reflecting recombination between strains in the wild. Conclusion This study demonstrates the necessity of whole-genome analysis to complement population/gene-based studies, which are of limited utility in uncovering complex events such as recombination that may lead to functional differences in virulence and pathogenicity. These full-length lion

  8. Ebolavirus comparative genomics

    DEFF Research Database (Denmark)

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms...

  9. The Sequenced Angiosperm Genomes and Genome Databases.

    Science.gov (United States)

    Chen, Fei; Dong, Wei; Zhang, Jiawei; Guo, Xinyue; Chen, Junhao; Wang, Zhengjia; Lin, Zhenguo; Tang, Haibao; Zhang, Liangsheng

    2018-01-01

    Angiosperms, the flowering plants, provide the essential resources for human life, such as food, energy, oxygen, and materials. They also promoted the evolution of human, animals, and the planet earth. Despite the numerous advances in genome reports or sequencing technologies, no review covers all the released angiosperm genomes and the genome databases for data sharing. Based on the rapid advances and innovations in the database reconstruction in the last few years, here we provide a comprehensive review for three major types of angiosperm genome databases, including databases for a single species, for a specific angiosperm clade, and for multiple angiosperm species. The scope, tools, and data of each type of databases and their features are concisely discussed. The genome databases for a single species or a clade of species are especially popular for specific group of researchers, while a timely-updated comprehensive database is more powerful for address of major scientific mysteries at the genome scale. Considering the low coverage of flowering plants in any available database, we propose construction of a comprehensive database to facilitate large-scale comparative studies of angiosperm genomes and to promote the collaborative studies of important questions in plant biology.

  10. Characterization of a Full-Length Endogenous Beta-Retrovirus, EqERV-Beta1, in the Genome of the Horse (Equus caballus)

    Directory of Open Access Journals (Sweden)

    Antoinette C. van der Kuyl

    2011-06-01

    Information on endogenous retroviruses fixed in the horse (Equus caballus) genome is scarce. The recent availability of a draft sequence of the horse genome enables the detection of such integrated viruses by similarity search. Using translated nucleotide fragments from gamma-, beta-, and delta-retroviral genera for initial searches, a full-length beta-retrovirus genome was retrieved from a horse chromosome 5 contig. The provirus, tentatively named EqERV-beta1 (for the first equine endogenous beta-retrovirus), was 10434 nucleotides (nt) in length with the usual retroviral genome structure of 5’LTR-gag-pro-pol-env-3’LTR. The LTRs were 1361 nt long, and differed approximately 1% from each other, suggestive of a relatively recent integration. Coding sequences for gag, pro and pol were present in three different reading frames, as is common for beta-retroviruses, and the reading frames were completely open, except that the env gene was interrupted by a single stop codon. No reading frame was apparent downstream of the env gene, suggesting that EqERV-beta1 does not encode a superantigen like mouse mammary tumor virus (MMTV). A second proviral genome of EqERV-beta1, with no stop codon in env, is additionally integrated on chromosome 5 downstream of the first virus. Single EqERV-beta1 LTRs were abundantly present on all chromosomes except chromosome 24. Phylogenetically, EqERV-beta1 most closely resembles an unclassified retroviral sequence from cattle (Bos taurus), and the murine beta-retrovirus MMTV.

  11. Bioinformatics decoding the genome

    CERN Multimedia

    CERN. Geneva; Deutsch, Sam; Michielin, Olivier; Thomas, Arthur; Descombes, Patrick

    2006-01-01

    Extracting the fundamental genomic sequence from the DNA. From Genome to Sequence: Biology in the early 21st century has been radically transformed by the availability of the full genome sequences of an ever increasing number of life forms, from bacteria to major crop plants and to humans. The lecture will concentrate on the computational challenges associated with the production, storage and analysis of genome sequence data, with an emphasis on mammalian genomes. The quality and usability of genome sequences is increasingly conditioned by the careful integration of strategies for data collection and computational analysis, from the construction of maps and libraries to the assembly of raw data into sequence contigs and chromosome-sized scaffolds. Once the sequence is assembled, a major challenge is the mapping of biologically relevant information onto this sequence: promoters, introns and exons of protein-encoding genes, regulatory elements, functional RNAs, pseudogenes, transposons, etc. The methodological ...

  12. Genomic research in Eucalyptus.

    Science.gov (United States)

    Poke, Fiona S; Vaillancourt, René E; Potts, Brad M; Reid, James B

    2005-09-01

    Eucalyptus L'Hérit. is a genus comprised of more than 700 species that is of vital importance ecologically to Australia and to the forestry industry world-wide, being grown in plantations for the production of solid wood products as well as pulp for paper. With the sequencing of the genomes of Arabidopsis thaliana and Oryza sativa and the recent completion of the first tree genome sequence, Populus trichocarpa, attention has turned to the current status of genomic research in Eucalyptus. For several eucalypt species, large segregating families have been established, high-resolution genetic maps constructed and large EST databases generated. Collaborative efforts have been initiated for the integration of diverse genomic projects and will provide the framework for future research including exploiting the sequence of the entire eucalypt genome which is currently being sequenced. This review summarises the current position of genomic research in Eucalyptus and discusses the direction of future research.

  13. Genome packaging in viruses

    OpenAIRE

    Sun, Siyang; Rao, Venigalla B.; Rossmann, Michael G.

    2010-01-01

    Genome packaging is a fundamental process in a viral life cycle. Many viruses assemble preformed capsids into which the genomic material is subsequently packaged. These viruses use a packaging motor protein that is driven by the hydrolysis of ATP to condense the nucleic acids into a confined space. How these motor proteins package viral genomes had been poorly understood until recently, when a few X-ray crystal structures and cryo-electron microscopy structures became available. Here we discu...

  14. Between Two Fern Genomes

    Science.gov (United States)

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  15. Causes of genome instability

    DEFF Research Database (Denmark)

    Langie, Sabine A S; Koppen, Gudrun; Desaulniers, Daniel

    2015-01-01

    function, chromosome segregation, telomere length). The purpose of this review is to describe the crucial aspects of genome instability, to outline the ways in which environmental chemicals can affect this cancer hallmark and to identify candidate chemicals for further study. The overall aim is to make ... Genome instability is a prerequisite for the development of cancer. It occurs when genome maintenance systems fail to safeguard the genome's integrity, whether as a consequence of inherited defects or induced via exposure to environmental agents (chemicals, biological agents and radiation). Thus...

  16. Fungal Genomics Program

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor

    2012-03-12

    The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.

  17. MIPS plant genome information resources.

    Science.gov (United States)

    Spannagl, Manuel; Haberer, Georg; Ernst, Rebecca; Schoof, Heiko; Mayer, Klaus F X

    2007-01-01

    The Munich Institute for Protein Sequences (MIPS) has been involved in maintaining plant genome databases since the Arabidopsis thaliana genome project. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable data sets for model plant genomes as a backbone against which experimental data, for example from high-throughput functional genomics, can be organized and evaluated. In addition, model genomes also form a scaffold for comparative genomics, and much can be learned from genome-wide evolutionary studies.

  18. Computational genomics of hyperthermophiles

    NARCIS (Netherlands)

    Werken, van de H.J.G.

    2008-01-01

    With the ever increasing number of completely sequenced prokaryotic genomes and the subsequent use of functional genomics tools, e.g. DNA microarray and proteomics, computational data analysis and the integration of microbial and molecular data is inevitable. This thesis describes the computational

  19. Safeguarding genome integrity

    DEFF Research Database (Denmark)

    Sørensen, Claus Storgaard; Syljuåsen, Randi G

    2012-01-01

    Mechanisms that preserve genome integrity are highly important during the normal life cycle of human cells. Loss of genome protective mechanisms can lead to the development of diseases such as cancer. Checkpoint kinases function in the cellular surveillance pathways that help cells to cope with D...

  20. Human genome I

    International Nuclear Information System (INIS)

    Anon.

    1989-01-01

    An international conference, Human Genome I, was held Oct. 2-4, 1989 in San Diego, Calif. Selected speakers discussed: Current Status of the Genome Project; Technique Innovations; Interesting regions; Applications; and Organization - Different Views of Current and Future Science and Procedures. Posters, consisting of 119 presentations, were displayed during the sessions. 119 were indexed for inclusion to the Energy Data Base

  1. Systems-level comparison of host responses induced by pandemic and seasonal influenza A H1N1 viruses in primary human type I-like alveolar epithelial cells in vitro

    Directory of Open Access Journals (Sweden)

    Guan Yi

    2010-10-01

    Background Pandemic influenza H1N1 (pdmH1N1) virus causes mild disease in humans but occasionally leads to severe complications and even death, especially in those who are pregnant or have underlying disease. Cytokine responses induced by pdmH1N1 viruses in vitro are comparable to those induced by other seasonal influenza viruses, suggesting that the cytokine dysregulation seen in H5N1 infection is not a feature of the pdmH1N1 virus. However, a comprehensive gene expression profile of pdmH1N1 in relevant primary human cells in vitro has not been reported. Type I alveolar epithelial cells are a key target cell in pdmH1N1 pneumonia. Methods We carried out comprehensive gene expression profiling using the Affymetrix microarray platform to compare the transcriptomes of primary human type I-like alveolar epithelial cells infected with pdmH1N1 or seasonal H1N1 virus. Results Overall, we found that most of the genes that were induced by pdmH1N1 were similarly regulated in response to seasonal H1N1 infection with respect to both trend and extent of gene expression. These commonly responsive genes were largely related to the interferon (IFN) response. Expression of the type III IFN IL29 was more prominent than that of the type I IFN IFNβ, and a similar pattern of expression of both IFN genes was seen in pdmH1N1 and seasonal H1N1 infection. Genes that were significantly down-regulated in response to seasonal H1N1 but not in response to pdmH1N1 included the zinc finger proteins and small nucleolar RNAs. Gene Ontology (GO) and pathway over-representation analysis suggested that these genes were associated with DNA binding and transcription/translation related functions. Conclusions Both seasonal H1N1 and pdmH1N1 trigger similar host responses including IFN-based antiviral responses and cytokine responses. Unlike the avian H5N1 virus, pdmH1N1 virus does not have an intrinsic capacity for cytokine dysregulation. The differences between pdmH1N1 and seasonal H1N1 viruses

  2. Rumen microbial genomics

    International Nuclear Information System (INIS)

    Morrison, M.; Nelson, K.E.

    2005-01-01

    Improving microbial degradation of plant cell wall polysaccharides remains one of the highest priority goals for all livestock enterprises, including the cattle herds and draught animals of developing countries. The North American Consortium for Genomics of Fibrolytic Ruminal Bacteria was created to promote the sequencing and comparative analysis of rumen microbial genomes, offering the potential to fully assess the genetic potential in a functional and comparative fashion. It has been found that the Fibrobacter succinogenes genome encodes many more endoglucanases and cellodextrinases than previously isolated, and several new processive endoglucanases have been identified by genome and proteomic analysis of Ruminococcus albus, in addition to a variety of strategies for its adhesion to fibre. The ramifications of acquiring genome sequence data for rumen microorganisms are profound, including the potential to elucidate and overcome the biochemical, ecological or physiological processes that are rate limiting for ruminal fibre degradation. (author)

  3. Microbial Genomes Multiply

    Science.gov (United States)

    Doolittle, Russell F.

    2002-01-01

    The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions may be Yes, Perhaps, and No, respectively.

  4. Musa sebagai Model Genom

    Directory of Open Access Journals (Sweden)

    RITA MEGIA

    2005-12-01

    During a meeting in Arlington, USA, in 2001, the scientists grouped in PROMUSA agreed to launch the Global Musa Genomics Consortium. The Consortium aims to apply genomics technologies to the improvement of this important crop. These genome projects make banana the third model species, after Arabidopsis and rice, to be analyzed and sequenced. Compared with Arabidopsis and rice, the banana genome provides a unique and powerful insight into structural and functional genomics that could not be found in those two species. This paper discusses these subjects, including the importance of banana as the fourth main food in the world, and the evolution and biodiversity of this genetic resource and its parasite.

  5. The genome editing revolution

    DEFF Research Database (Denmark)

    Stella, Stefano; Montoya, Guillermo

    2016-01-01

    -Cas system has become the main tool for genome editing in many laboratories. Currently the targeted genome editing technology has been used in many fields and may be a possible approach for human gene therapy. Furthermore, it can also be used to modify the genomes of model organisms for studying human ... In the last 10 years, we have witnessed a blooming of targeted genome editing systems and applications. The area was revolutionized by the discovery and characterization of the transcription activator-like effector proteins, which are easier to engineer to target new DNA sequences than ... sequence). This ribonucleoprotein complex protects bacteria from invading DNAs, and it was adapted to be used in genome editing. The CRISPR ribonucleic acid (RNA) molecule guides the Cas9 nuclease to the specific DNA site to cleave the DNA target. Two years and more than 1000 publications later, the CRISPR...

  6. Phytozome Comparative Plant Genomics Portal

    Energy Technology Data Exchange (ETDEWEB)

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: (1) understand and accelerate the improvement (domestication) of bioenergy crops; (2) characterize and moderate plant response to climate change; (3) use comparative genomics to identify constrained elements and infer gene function; (4) build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work; (5) expand functional genomic resources for Plant Flagship genomes.

  7. Genome-derived vaccines.

    Science.gov (United States)

    De Groot, Anne S; Rappuoli, Rino

    2004-02-01

    Vaccine research entered a new era when the complete genome of a pathogenic bacterium was published in 1995. Since then, more than 97 bacterial pathogens have been sequenced and at least 110 additional projects are now in progress. Genome sequencing has also dramatically accelerated: high-throughput facilities can draft the sequence of an entire microbe (two to four megabases) in 1 to 2 days. Vaccine developers are using microarrays, immunoinformatics, proteomics and high-throughput immunology assays to reduce the truly unmanageable volume of information available in genome databases to a manageable size. Vaccines composed of novel antigens discovered from genome mining are already in clinical trials. Within 5 years we can expect to see a novel class of vaccines composed of genome-predicted, assembled and engineered T- and B-cell epitopes. This article addresses the convergence of three forces--microbial genome sequencing, computational immunology and new vaccine technologies--that are shifting genome mining for vaccines onto the forefront of immunology research.

  8. The Banana Genome Hub

    Science.gov (United States)

    Droc, Gaëtan; Larivière, Delphine; Guignon, Valentin; Yahiaoui, Nabila; This, Dominique; Garsmeur, Olivier; Dereeper, Alexis; Hamelin, Chantal; Argout, Xavier; Dufayard, Jean-François; Lengelle, Juliette; Baurens, Franc-Christophe; Cenci, Alberto; Pitollat, Bertrand; D’Hont, Angélique; Ruiz, Manuel; Rouard, Mathieu; Bocs, Stéphanie

    2013-01-01

    Banana is one of the world’s favorite fruits and one of the most important crops for developing countries. The banana reference genome sequence (Musa acuminata) was recently released. Given the taxonomic position of Musa, the completed genomic sequence has particular comparative value to provide fresh insights about the evolution of the monocotyledons. The study of the banana genome has been enhanced by a number of tools and resources that allow harnessing its sequence. First, we set up essential tools such as a Community Annotation System, phylogenomics resources and metabolic pathways. Then, to support post-genomic efforts, we improved existing banana systems (e.g. web front end, query builder), we integrated available Musa data into generic systems (e.g. markers and genetic maps, synteny blocks), we made other existing systems containing Musa data (e.g. transcriptomics, rice reference genome, workflow manager) interoperable with the banana hub and finally, we generated new results from sequence analyses (e.g. SNP and polymorphism analysis). Several use cases illustrate how the Banana Genome Hub can be used to study gene families. Overall, with this collaborative effort, we discuss the importance of interoperability for data integration between existing information systems. Database URL: http://banana-genome.cirad.fr/ PMID:23707967

  9. Genomic instability following irradiation

    International Nuclear Information System (INIS)

    Hacker-Klom, U.B.; Goehde, W.

    2001-01-01

    Ionising irradiation may induce genomic instability. The broad spectrum of stress reactions in eukaryotic cells to irradiation complicates the discovery of cellular targets and pathways inducing genomic instability. Irradiation may initiate genomic instability by deletion of genes controlling stability, by induction of genes stimulating instability and/or by activating endogenous cellular viruses. Alternatively or additionally, it is discussed that the initiation of genomic instability may be a consequence of radiation or other agents independently of DNA damage, implying non-nuclear targets, e.g. signal cascades. As a further mechanism possibly involved, our own results may suggest radiation-induced changes in chromatin structure. Once initiated, the process of genomic instability is probably perpetuated by endogenous processes necessary for proliferation. Genomic instability may be a cause or a consequence of the neoplastic phenotype. In conclusion, the data available so far suggest that a new interpretation of low-level radiation effects for radiation protection and in radiotherapy would be useful. The detection of the molecular mechanisms of genomic instability will be important in this context and may contribute to a better understanding of phenomena occurring at low doses <10 cSv, which are not well understood up to now. (orig.)

  10. Traditional medicine and genomics

    Directory of Open Access Journals (Sweden)

    Kalpana Joshi

    2010-01-01

    Full Text Available 'Omics' developments in the form of genomics, proteomics and metabolomics have increased the impetus of traditional medicine research. Studies exploring the genomic, proteomic and metabolomic basis of human constitutional types based on Ayurveda and other systems of oriental medicine are becoming popular. Such studies remain important to developing better understanding of human variations and individual differences. Countries like India, Korea, China and Japan are investing in research on evidence-based traditional medicines and scientific validation of fundamental principles. This review provides an account of studies addressing relationships between traditional medicine and genomics.

  11. Traditional medicine and genomics.

    Science.gov (United States)

    Joshi, Kalpana; Ghodke, Yogita; Shintre, Pooja

    2010-01-01

    'Omics' developments in the form of genomics, proteomics and metabolomics have increased the impetus of traditional medicine research. Studies exploring the genomic, proteomic and metabolomic basis of human constitutional types based on Ayurveda and other systems of oriental medicine are becoming popular. Such studies remain important to developing better understanding of human variations and individual differences. Countries like India, Korea, China and Japan are investing in research on evidence-based traditional medicines and scientific validation of fundamental principles. This review provides an account of studies addressing relationships between traditional medicine and genomics.

  12. Bacillus subtilis genome diversity.

    Science.gov (United States)

    Earl, Ashlee M; Losick, Richard; Kolter, Roberto

    2007-02-01

    Microarray-based comparative genomic hybridization (M-CGH) is a powerful method for rapidly identifying regions of genome diversity among closely related organisms. We used M-CGH to examine the genome diversity of 17 strains belonging to the nonpathogenic species Bacillus subtilis. Our M-CGH results indicate that there is considerable genetic heterogeneity among members of this species; nearly one-third of Bsu168-specific genes exhibited variability, as measured by the microarray hybridization intensities. The variable loci include those encoding proteins involved in antibiotic production, cell wall synthesis, sporulation, and germination. The diversity in these genes may reflect this organism's ability to survive in diverse natural settings.

  13. Genomic taxonomy of vibrios

    Directory of Open Access Journals (Sweden)

    Iida Tetsuya

    2009-10-01

    Full Text Available Abstract. Background: Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of tools to explore the taxonomic relationship between the sequenced genomes, including Multilocus Sequence Analysis (MLSA), supertrees, Average Amino Acid Identity (AAI), genomic signatures, and Genome BLAST atlases. Our aim is to analyse the usefulness of these tools for species identification in vibrios. Results: We have generated four new genome sequences of three Vibrio species, i.e., V. alginolyticus 40B, V. harveyi-like 1DA3, and V. mimicus strains VM573 and VM603, and present a broad analysis of these genomes along with other sequenced Vibrio species. The genome atlas and pangenome plots provide a tantalizing image of the genomic differences that occur between closely related sister species, e.g. V. cholerae and V. mimicus. The vibrio pangenome contains around 26504 genes. The V. cholerae core genome and pangenome consist of 1520 and 6923 genes, respectively. Pangenomes might allow different strains of V. cholerae to occupy different niches. MLSA and supertree analyses resulted in a similar phylogenetic picture, with a clear distinction of four groups (Vibrio core group, V. cholerae-V. mimicus, Aliivibrio spp., and Photobacterium spp.). A Vibrio species is defined as a group of strains that share > 95% DNA identity in MLSA and supertree analysis, > 96% AAI, ≤ 10 genome signature dissimilarity, and > 61% proteome identity. Strains of the same species and species of the same genus will form monophyletic groups on the basis of MLSA and supertree. Conclusion: The combination of different analytical and bioinformatics tools will enable the most accurate species identification through genomic computational analysis. This endeavour will culminate in
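The species thresholds quoted in the record above can be read as a simple conjunction of four pairwise criteria. The following is only a minimal sketch of that reading, not the authors' software: the class name, field names, and the choice to treat all four criteria as jointly required are assumptions made here for illustration, and the monophyly requirement is not modelled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PairwiseMetrics:
    """Hypothetical container for pairwise genome comparison results."""
    mlsa_identity: float            # % DNA identity in MLSA / supertree analysis
    aai: float                      # % Average Amino Acid Identity
    signature_dissimilarity: float  # genome signature dissimilarity
    proteome_identity: float        # % proteome identity


def same_vibrio_species(m: PairwiseMetrics) -> bool:
    """Apply the four thresholds stated in the abstract, all required together."""
    return (
        m.mlsa_identity > 95.0
        and m.aai > 96.0
        and m.signature_dissimilarity <= 10.0
        and m.proteome_identity > 61.0
    )


if __name__ == "__main__":
    # Illustrative values only; these are not measurements from the paper.
    pair = PairwiseMetrics(mlsa_identity=98.0, aai=97.5,
                           signature_dissimilarity=4.0, proteome_identity=80.0)
    print(same_vibrio_species(pair))
```

Note that the comparisons are strict for the identity measures and inclusive for the signature dissimilarity, mirroring the ">" and "≤" signs in the abstract.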

  14. Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Block, S.; Cornwall, J.; Dally, W.; Dyson, F.; Fortson, N.; Joyce, G.; Kimble, H. J.; Lewis, N.; Max, C.; Prince, T.; Schwitters, R.; Weinberger, P.; Woodin, W. H. [The MITRE Corporation, McLean, VA (US). JASON Program Office]

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  15. Human Genome Program

    Energy Technology Data Exchange (ETDEWEB)

    1993-01-01

    The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.

  16. Genomic signal processing

    CERN Document Server

    Shmulevich, Ilya

    2007-01-01

    Genomic signal processing (GSP) can be defined as the analysis, processing, and use of genomic signals to gain biological knowledge, and the translation of that knowledge into systems-based applications that can be used to diagnose and treat genetic diseases. Situated at the crossroads of engineering, biology, mathematics, statistics, and computer science, GSP requires the development of both nonlinear dynamical models that adequately represent genomic regulation, and diagnostic and therapeutic tools based on these models. This book facilitates these developments by providing rigorous mathema

  17. The Sinbad retrotransposon from the genome of the human blood fluke, Schistosoma mansoni, and the distribution of related Pao-like elements

    Directory of Open Access Journals (Sweden)

    Morales Maria E

    2005-02-01

    Full Text Available Abstract. Background: Of the major families of long terminal repeat (LTR) retrotransposons, the Pao/BEL family is probably the least well studied. It is becoming apparent that numerous LTR retrotransposons and other mobile genetic elements have colonized the genome of the human blood fluke, Schistosoma mansoni. Results: A proviral form of Sinbad, a new LTR retrotransposon, was identified in the genome of S. mansoni. Phylogenetic analysis indicated that Sinbad belongs to one of five discrete subfamilies of Pao/BEL-like elements. BLAST searches of whole genomes and EST databases indicated that members of this clade occurred in species of the Insecta, Nematoda, Echinodermata and Chordata, as well as Platyhelminthes, but were absent from all plants, fungi and lower eukaryotes examined. Among the deuterostomes examined, only aquatic species harbored these types of elements. All four species of nematode examined were positive for Sinbad sequences, although among insect and vertebrate genomes, some were positive and some negative. The full length, consensus Sinbad retrotransposon was 6,287 bp long and was flanked at its 5'- and 3'-ends by identical LTRs of 386 bp. Sinbad displayed a triple Cys-His RNA binding motif characteristic of Gag of Pao/BEL-like elements, followed by the enzymatic domains of protease, reverse transcriptase (RT), RNAseH, and integrase, in that order. A phylogenetic tree of deduced RT sequences from 26 elements revealed that Sinbad was most closely related to an unnamed element from the zebrafish Danio rerio and to Saci-1, also from S. mansoni. It was also closely related to Pao from Bombyx mori and to Ninja of Drosophila simulans. Sinbad was only distantly related to the other schistosome LTR retrotransposons Boudicca, Gulliver, Saci-2, Saci-3, and Fugitive, which are gypsy-like. Southern hybridization and bioinformatics analyses indicated that there were about 50 copies of Sinbad in the S. mansoni genome. The presence of ESTs

  18. Genomics and fish adaptation

    Directory of Open Access Journals (Sweden)

    Agostinho Antunes

    2015-12-01

    Full Text Available The completion of the human genome sequencing in 2003 opened a new perspective into the importance of whole genome sequencing projects, and currently multiple species are having their genomes completely sequenced, from simple organisms, such as bacteria, to more complex taxa, such as mammals. The voluminous sequencing data generated across multiple organisms also provides the framework to better understand the genetic makeup of such species and related ones, making it possible to explore the genetic changes underlying the evolution of diverse phenotypic traits. Here, recent results from our group retrieved from comparative evolutionary genomic analyses of varied fish species will be considered to exemplify how gene novelty and gene enhancement by positive selection might have been decisive in the success of adaptive radiations into diverse habitats and lifestyles.

  19. Lophotrochozoan mitochondrial genomes

    Energy Technology Data Exchange (ETDEWEB)

    Valles, Yvonne; Boore, Jeffrey L.

    2005-10-01

    Progress in both molecular techniques and phylogenetic methods has challenged many of the interpretations of traditional taxonomy. One example is in the recognition of the animal superphylum Lophotrochozoa (annelids, mollusks, echiurans, platyhelminthes, brachiopods, and other phyla), although the relationships within this group and the inclusion of some phyla remain uncertain. While much of this progress in phylogenetic reconstruction has been based on comparing single gene sequences, we are beginning to see the potential of comparing large-scale features of genomes, such as the relative order of genes. Even though tremendous progress is being made on the sequence determination of whole nuclear genomes, the dataset of choice for genome-level characters for many animals across a broad taxonomic range remains mitochondrial genomes. We review here what is known about mitochondrial genomes of the lophotrochozoans and discuss the promise that this dataset will enable insight into their relationships.

  20. Mouse Genome Informatics (MGI)

    Data.gov (United States)

    U.S. Department of Health & Human Services — MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human...

  1. Genomic definition of species

    Energy Technology Data Exchange (ETDEWEB)

    Crkvenjakov, R.; Drmanac, R.

    1991-07-01

    The subject of this paper is the definition of species based on the assumption that the genome is the fundamental level for the origin and maintenance of biological diversity. For this view to be logically consistent it is necessary to assume the existence and operation of a new law, which we call the genome law. For this reason the genome law is included in the explanation of the species phenomenon presented here, even if its precise formulation and elaboration are left for the future. The intellectual underpinnings of this definition can be traced to Goldschmidt. We wish to explore some philosophical aspects of the definition of species in terms of the genome. The point of proposing the definition on these grounds is that any real advance in evolutionary theory has to be correct in both its philosophy and its science.

  2. Structural genomics in endocrinology

    NARCIS (Netherlands)

    Smit, J. W.; Romijn, J. A.

    2001-01-01

    Traditionally, endocrine research evolved from the phenotypical characterisation of endocrine disorders to the identification of underlying molecular pathophysiology. This approach has been, and still is, extremely successful. The introduction of genomics and proteomics has resulted in a reversal of

  3. Epidemiology & Genomics Research Program

    Science.gov (United States)

    The Epidemiology and Genomics Research Program, in the National Cancer Institute's Division of Cancer Control and Population Sciences, funds research in human populations to understand the determinants of cancer occurrence and outcomes.

  4. Annotating individual human genomes.

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A; Topol, Eric J; Schork, Nicholas J

    2011-10-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. Copyright © 2011 Elsevier Inc. All rights reserved.

  5. ANNOTATING INDIVIDUAL HUMAN GENOMES*

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A.; Topol, Eric J.; Schork, Nicholas J.

    2014-01-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. PMID:21839162

  6. Yeast genome sequencing:

    DEFF Research Database (Denmark)

    Piskur, Jure; Langkjær, Rikke Breinhold

    2004-01-01

    For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...

  7. Genetical Genomics for Evolutionary Studies

    NARCIS (Netherlands)

    Prins, J.C.P.; Smant, G.; Jansen, R.C.

    2012-01-01

    Genetical genomics combines acquired high-throughput genomic data with genetic analysis. In this chapter, we discuss the application of genetical genomics for evolutionary studies, where new high-throughput molecular technologies are combined with mapping quantitative trait loci (QTL) on the genome

  8. New insights into prevalence, genetic diversity, and proviral load of human T-cell leukemia virus types 1 and 2 in pregnant women in Gabon in equatorial central Africa.

    Science.gov (United States)

    Etenna, Sonia Lekana-Douki; Caron, Mélanie; Besson, Guillaume; Makuwa, Maria; Gessain, Antoine; Mahé, Antoine; Kazanji, Mirdad

    2008-11-01

    Human T-cell leukemia virus type 1 (HTLV-1) is highly endemic in areas of central Africa; mother-to-child transmission and sexual transmission are considered to be the predominant routes. To determine the prevalence and subtypes of HTLV-1/2 in pregnant women in Gabon, we conducted an epidemiological survey in the five main cities of the country. In 907 samples, the HTLV-1 seroprevalence was 2.1%, which is lower than that previously reported. Only one case of HTLV-2 infection was found. The HTLV-1 seroprevalence increased with age and differed between regions (P cosmopolitan subtype A. The new strains of subtype B exhibited wide genetic diversity, but there was no evidence of clustering of specific genomes within geographical regions of the country. Some strains were closely related to simian T-cell leukemia virus type 1 strains of great apes, suggesting that in these areas some HTLV-1 strains could arise from relatively recent interspecies transmission. The sole HTLV-2 strain belonged to subtype B. In this study we showed that the prevalence of HTLV-1 in the southeast is one of the highest in the world for pregnant women.

  9. The human genome project

    International Nuclear Information System (INIS)

    Worton, R.

    1996-01-01

    The Human Genome Project is a massive international research project, costing 3 to 5 billion dollars and expected to take 15 years, which will identify all the genes in the human genome, i.e. the complete sequence of bases in human DNA. The prize will be the ability to identify genes causing or predisposing to disease, and in some cases the development of gene therapy, but this new knowledge will raise important ethical issues.

  10. Decoding the human genome

    CERN Multimedia

    CERN. Geneva. Audiovisual Unit; Antonerakis, S E

    2002-01-01

    Decoding the human genome is a very topical subject that raises several questions beyond the purely scientific, in view of the two competing teams (public and private), the ethics of using the results, and the fact that the project apparently went faster and more easily than expected. The lecture series will address the following chapters: Scientific basis and challenges. Ethical and social aspects of genomics.

  11. Molluscan Evolutionary Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Simison, W. Brian; Boore, Jeffrey L.

    2005-12-01

    In the last 20 years there have been dramatic advances in techniques of high-throughput DNA sequencing, most recently accelerated by the Human Genome Project, a program that has determined the three billion base pair code on which we are based. Now this tremendous capability is being directed at other genome targets that are being sampled across the broad range of life. This opens up opportunities as never before for evolutionary and organismal biologists to address questions of both processes and patterns of organismal change. We stand at the dawn of a new 'modern synthesis' period, paralleling that of the early 20th century when the fledgling field of genetics first identified the underlying basis for Darwin's theory. We must now unite the efforts of systematists, paleontologists, mathematicians, computer programmers, molecular biologists, developmental biologists, and others in the pursuit of discovering what genomics can teach us about the diversity of life. Genome-level sampling for mollusks to date has mostly been limited to mitochondrial genomes and it is likely that these will continue to provide the best targets for broad phylogenetic sampling in the near future. However, we are just beginning to see an inroad into complete nuclear genome sequencing, with several mollusks and other eutrochozoans having been selected for work about to begin. Here, we provide an overview of the state of molluscan mitochondrial genomics, highlight a few of the discoveries from this research, outline the promise of broadening this dataset, describe upcoming projects to sequence whole mollusk nuclear genomes, and challenge the community to prepare for making the best use of these data.

  12. Human Germline Genome Editing

    OpenAIRE

    Ormond, Kelly E.; Mortlock, Douglas P.; Scholes, Derek T.; Bombard, Yvonne; Brody, Lawrence C.; Faucett, W. Andrew; Garrison, Nanibaa’ A.; Hercher, Laura; Isasi, Rosario; Middleton, Anna; Musunuru, Kiran; Shriner, Daniel; Virani, Alice; Young, Caroline E.

    2017-01-01

    With CRISPR/Cas9 and other genome-editing technologies, successful somatic and germline genome editing are becoming feasible. To respond, an American Society of Human Genetics (ASHG) workgroup developed this position statement, which was approved by the ASHG Board in March 2017. The workgroup included representatives from the UK Association of Genetic Nurses and Counsellors, Canadian Association of Genetic Counsellors, International Genetic Epidemiology Society, and US National Society of Gen...

  13. RadGenomics project

    Energy Technology Data Exchange (ETDEWEB)

    Iwakawa, Mayumi; Imai, Takashi; Harada, Yoshinobu [National Inst. of Radiological Sciences, Chiba (Japan). Frontier Research Center] [and others]

    2002-06-01

    Human health is determined by a complex interplay of factors, predominantly between genetic susceptibility, environmental conditions and aging. The ultimate aim of the RadGenomics (Radiation Genomics) project is to understand the implications of heterogeneity in responses to ionizing radiation arising from genetic variation between individuals in the human population. The rapid progress of human genome sequencing and the recent development of new technologies in molecular genetics are providing us with new opportunities to understand the genetic basis of individual differences in susceptibility to natural and/or artificial environmental factors, including radiation exposure. The RadGenomics project will inevitably lead to improved protocols for personalized radiotherapy and reductions in the potential side effects of such treatment. The project will contribute to future research into the molecular mechanisms of radiation sensitivity in humans and will stimulate the development of new high-throughput technologies for a broader application of biological and medical sciences. The staff members are specialists in a variety of fields, including genome science, radiation biology, medical science, molecular biology, and informatics, and have joined the RadGenomics project from various universities, companies, and research institutes. The project started in April 2001. (author)

  14. Comparative Genome Viewer

    International Nuclear Information System (INIS)

    Molineris, I.; Sales, G.

    2009-01-01

    The amount of information about genomes, both in the form of complete sequences and annotations, has been exponentially increasing in the last few years. As a result there is the need for tools providing a graphical representation of such information that should be comprehensive and intuitive. Visual representation is especially important in the comparative genomics field since it should provide a combined view of data belonging to different genomes. We believe that existing tools are limited in this respect as they focus on a single genome at a time (conservation histograms) or compress alignment representation to a single dimension. We have therefore developed a web-based tool called Comparative Genome Viewer (Cgv): it integrates a bidimensional representation of alignments between two regions, both at small and big scales, with the richness of annotations present in other genome browsers. We give access to our system through a web-based interface that provides the user with an interactive representation that can be updated in real time using the mouse to move from region to region and to zoom in on interesting details.

  15. Human social genomics.

    Directory of Open Access Journals (Sweden)

    Steven W Cole

    2014-08-01

    Full Text Available A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA) characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural "social signal transduction" pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving.

  16. RadGenomics project

    International Nuclear Information System (INIS)

    Iwakawa, Mayumi; Imai, Takashi; Harada, Yoshinobu

    2002-01-01

    Human health is determined by a complex interplay of factors, predominantly between genetic susceptibility, environmental conditions and aging. The ultimate aim of the RadGenomics (Radiation Genomics) project is to understand the implications of heterogeneity in responses to ionizing radiation arising from genetic variation between individuals in the human population. The rapid progression of the human genome sequencing and the recent development of new technologies in molecular genetics are providing us with new opportunities to understand the genetic basis of individual differences in susceptibility to natural and/or artificial environmental factors, including radiation exposure. The RadGenomics project will inevitably lead to improved protocols for personalized radiotherapy and reductions in the potential side effects of such treatment. The project will contribute to future research into the molecular mechanisms of radiation sensitivity in humans and will stimulate the development of new high-throughput technologies for a broader application of biological and medical sciences. The staff members are specialists in a variety of fields, including genome science, radiation biology, medical science, molecular biology, and informatics, and have joined the RadGenomics project from various universities, companies, and research institutes. The project started in April 2001. (author)

  17. Ultrafast comparison of personal genomes

    OpenAIRE

    Mauldin, Denise; Hood, Leroy; Robinson, Max; Glusman, Gustavo

    2017-01-01

    We present an ultra-fast method for comparing personal genomes. We transform the standard genome representation (lists of variants relative to a reference) into 'genome fingerprints' that can be readily compared across sequencing technologies and reference versions. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. This enables scaling up a variety of important genome analyses, including quantifying relatedness, recognizing duplicative s...

  18. Genomics using the Assembly of the Mink Genome

    DEFF Research Database (Denmark)

    Guldbrandtsen, Bernt; Cai, Zexi; Sahana, Goutam

    2018-01-01

    The American Mink’s (Neovison vison) genome has recently been sequenced. This opens numerous avenues of research both for studying the basic genetics and physiology of the mink as well as genetic improvement in mink. Using genotyping-by-sequencing (GBS) generated marker data for 2,352 Danish farm...... mink runs of homozygosity (ROH) were detected in mink genomes. Detectable ROH made up on average 1.7% of the genome indicating the presence of at most a moderate level of genomic inbreeding. The fraction of genome regions found in ROH varied. Ten percent of the included regions were never found in ROH....... The ability to detect ROH in the mink genome also demonstrates the general reliability of the new mink genome assembly. Keywords: american mink, run of homozygosity, genome, selection, genomic inbreeding...
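
    The idea of a run of homozygosity can be illustrated with a minimal sketch. It is not the method used in the record above, whose details are not given here. It assumes genotypes coded as 0, 1 or 2 copies of the alternative allele (1 = heterozygous) and a hypothetical minimum run length counted in markers; real ROH callers usually work with physical length and tolerate occasional heterozygous or missing calls.

```python
def find_roh(genotypes, min_len=3):
    """Return (start, end) index pairs, end exclusive, of homozygous runs.

    genotypes: list of 0/1/2 codes; 0 and 2 are homozygous, 1 is heterozygous.
    min_len:   hypothetical minimum number of consecutive markers in a run.
    """
    runs = []
    start = None
    for i, g in enumerate(genotypes):
        if g in (0, 2):
            if start is None:
                start = i          # a new homozygous stretch begins
        else:
            if start is not None and i - start >= min_len:
                runs.append((start, i))
            start = None           # a heterozygous call ends the stretch
    if start is not None and len(genotypes) - start >= min_len:
        runs.append((start, len(genotypes)))
    return runs


def roh_fraction(genotypes, min_len=3):
    """Fraction of markers that fall inside a detected run."""
    if not genotypes:
        return 0.0
    covered = sum(end - start for start, end in find_roh(genotypes, min_len))
    return covered / len(genotypes)


# Toy data, invented for illustration only.
toy = [0, 0, 2, 1, 0, 2, 2, 2, 1, 0]
print(find_roh(toy))
print(roh_fraction(toy))
```

    Averaging such a fraction over animals would give a marker-based analogue of the genome proportion in ROH reported in the abstract.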

  19. Genome size analyses of Pucciniales reveal the largest fungal genomes.

    Science.gov (United States)

    Tavares, Sílvia; Ramos, Ana Paula; Pires, Ana Sofia; Azinheira, Helena G; Caldeirinha, Patrícia; Link, Tobias; Abranches, Rita; Silva, Maria do Céu; Voegele, Ralf T; Loureiro, João; Talhinhas, Pedro

    2014-01-01

    Rust fungi (Basidiomycota, Pucciniales) are biotrophic plant pathogens which exhibit diverse complexities in their life cycles and host ranges. The completion of genome sequencing of a few rust fungi has revealed the occurrence of large genomes. Sequencing efforts for other rust fungi have been hampered by uncertainty concerning their genome sizes. Flow cytometry was recently applied to estimate the genome size of a few rust fungi, and confirmed the occurrence of large genomes in this order (averaging 225.3 Mbp, while the average for Basidiomycota was 49.9 Mbp and was 37.7 Mbp for all fungi). In this work, we have used an innovative and simple approach to simultaneously isolate nuclei from the rust and its host plant in order to estimate the genome size of 30 rust species by flow cytometry. Genome sizes varied over 10-fold, from 70 to 893 Mbp, with an average genome size value of 380.2 Mbp. Compared to the genome sizes of over 1800 fungi, Gymnosporangium confusum possesses the largest fungal genome ever reported (893.2 Mbp). Moreover, even the smallest rust genome determined in this study is larger than the vast majority of fungal genomes (94%). The average genome size of the Pucciniales is now 305.5 Mbp, while the average Basidiomycota genome size has shifted to 70.4 Mbp and the average for all fungi has reached 44.2 Mbp. Although no correlation could be drawn between the genome sizes, the phylogenomics or the life cycle of rust fungi, it is interesting to note that rusts with Fabaceae hosts present genomes clearly larger than those with Poaceae hosts. Although this study comprises only a small fraction of the more than 7000 rust species described, it seems already evident that the Pucciniales represent a group where genome size expansion could be a common characteristic. This is in sharp contrast to sister taxa, placing this order in a relevant position in fungal genomics research.

  20. Comprehensive search for intra- and inter-specific sequence polymorphisms among coding envelope genes of retroviral origin found in the human genome: genes and pseudogenes

    Directory of Open Access Journals (Sweden)

    Vasilescu Alexandre

    2005-09-01

    Background: The human genome carries a high load of proviral-like sequences, called Human Endogenous Retroviruses (HERVs), which are the genomic traces of ancient infections by active retroviruses. These elements are in most cases defective, but open reading frames can still be found for the retroviral envelope gene, with sixteen such genes identified so far. Several of them are conserved during primate evolution, having possibly been co-opted by their host for a physiological role. Results: To further characterize their status, we sequenced 12 of these genes from a panel of 91 Caucasian individuals. Genomic analyses reveal strong sequence conservation (only two non-synonymous Single Nucleotide Polymorphisms [SNPs]) for the two HERV-W and HERV-FRD envelope genes, i.e. for the two genes specifically expressed in the placenta and possibly involved in syncytiotrophoblast formation. We further show – using an ex vivo fusion assay for each allelic form – that none of these SNPs impairs the fusogenic function. The other envelope proteins disclose variable polymorphisms, with the occurrence of a stop codon and/or frameshift for most – but not all – of them. Moreover, the sequence conservation analysis of the orthologous genes that can be found in primates shows that three env genes have been maintained in a fully coding state throughout evolution including envW and envFRD. Conclusion: Altogether, the present study strongly suggests that some but not all envelope encoding sequences are bona fide genes. It also provides new tools to elucidate the possible role of endogenous envelope proteins as susceptibility factors in a number of pathologies where HERVs have been suspected to be involved.

  1. Genomes to Proteomes

    Energy Technology Data Exchange (ETDEWEB)

    Panisko, Ellen A. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Grigoriev, Igor [USDOE Joint Genome Inst., Walnut Creek, CA (United States); Daly, Don S. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Webb-Robertson, Bobbie-Jo [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Baker, Scott E. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

    2009-03-01

    Biologists are awash with genomic sequence data. In large part, this is due to the rapid acceleration in the generation of DNA sequence that occurred as public and private research institutes raced to sequence the human genome. In parallel with the large human genome effort, mostly smaller genomes of other important model organisms were sequenced. Projects following on these initial efforts have made use of technological advances and the DNA sequencing infrastructure that was built for the human and other organism genome projects. As a result, the genome sequences of many organisms are available in high quality draft form. While in many ways this is good news, there are limitations to the biological insights that can be gleaned from DNA sequences alone; genome sequences offer only a bird's eye view of the biological processes endemic to an organism or community. Fortunately, the genome sequences now being produced at such a high rate can serve as the foundation for other global experimental platforms such as proteomics. Proteomic methods offer a snapshot of the proteins present at a point in time for a given biological sample. Current global proteomics methods combine enzymatic digestion, separations, mass spectrometry and database searching for peptide identification. One key aspect of proteomics is the prediction of peptide sequences from mass spectrometry data. Global proteomic analysis uses computational matching of experimental mass spectra with predicted spectra based on databases of gene models that are often generated computationally. Thus, the quality of gene models predicted from a genome sequence is crucial in the generation of high quality peptide identifications. Once peptides are identified they can be assigned to their parent protein. Proteins identified as expressed in a given experiment are most useful when compared to other expressed proteins in a larger biological context or biochemical pathway. In this chapter we will discuss the automatic
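
    The digestion and peptide-to-protein assignment steps named in the abstract above can be sketched as follows. This is a simplified illustration, not the chapter's pipeline: it assumes trypsin as the enzyme (cleavage after K or R unless the next residue is P, the commonly used rule), and it matches peptide sequences directly, whereas real workflows match experimental mass spectra against predicted spectra. The protein names and sequences are invented.

```python
def tryptic_digest(protein, min_len=1):
    """In silico digestion: cut after K or R unless followed by P."""
    peptides = []
    start = 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])   # C-terminal remainder
    return [p for p in peptides if len(p) >= min_len]


def assign_peptides(observed, proteins):
    """Map each observed peptide to the gene models that can produce it."""
    index = {}
    for name, sequence in proteins.items():
        for peptide in tryptic_digest(sequence):
            index.setdefault(peptide, set()).add(name)
    return {pep: sorted(index.get(pep, ())) for pep in observed}


# Hypothetical gene models, for illustration only.
models = {"protA": "MKTAYRPGLKE", "protB": "GGKTAYRPGLK"}
print(tryptic_digest(models["protA"]))
print(assign_peptides(["TAYRPGLK", "GGK", "XXX"], models))
```

    A peptide shared by several gene models cannot identify a single parent protein, and a peptide absent from every predicted model goes unassigned, which is one way to see why gene-model quality limits peptide identification.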

  2. Experimental Induction of Genome Chaos.

    Science.gov (United States)

    Ye, Christine J; Liu, Guo; Heng, Henry H

    2018-01-01

    Genome chaos, or karyotype chaos, represents a powerful survival strategy for somatic cells under high levels of stress/selection. Since the genome context, not the gene content, encodes the genomic blueprint of the cell, stress-induced rapid and massive reorganization of genome topology functions as a very important mechanism for genome (karyotype) evolution. In recent years, the phenomenon of genome chaos has been confirmed by various sequencing efforts, and many different terms have been coined to describe different subtypes of the chaotic genome including "chromothripsis," "chromoplexy," and "structural mutations." To advance this exciting field, we need an effective experimental system to induce and characterize the karyotype reorganization process. In this chapter, an experimental protocol to induce chaotic genomes is described, following a brief discussion of the mechanism and implication of genome chaos in cancer evolution.

  3. Genome Sequences of Oryza Species

    KAUST Repository

    Kumagai, Masahiko; Tanaka, Tsuyoshi; Ohyanagi, Hajime; Hsing, Yue-Ie C.; Itoh, Takeshi

    2018-01-01

    This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.

  4. Genome Sequences of Oryza Species

    KAUST Repository

    Kumagai, Masahiko

    2018-02-14

    This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.

  5. Genome position specific priors for genomic prediction

    DEFF Research Database (Denmark)

    Brøndum, Rasmus Froberg; Su, Guosheng; Lund, Mogens Sandø

    2012-01-01

    causal mutation is different between the populations but affects the same gene. Proportions of a four-distribution mixture for SNP effects in segments of fixed size along the genome are derived from one population and set as location specific prior proportions of distributions of SNP effects...... for the target population. The model was tested using dairy cattle populations of different breeds: 540 Australian Jersey bulls, 2297 Australian Holstein bulls and 5214 Nordic Holstein bulls. The traits studied were protein-, fat- and milk yield. Genotypic data was Illumina 777K SNPs, real or imputed Results...

  6. The frequency of CD127low expressing CD4+CD25high T regulatory cells is inversely correlated with human T lymphotrophic virus type-1 (HTLV-1) proviral load in HTLV-1-infection and HTLV-1-associated myelopathy/tropical spastic paraparesis

    Directory of Open Access Journals (Sweden)

    Chieia Marco

    2008-07-01

    Background: CD4+CD25high regulatory T (TReg) cells modulate antigen-specific T cell responses, and can suppress anti-viral immunity. In HTLV-1 infection, a selective decrease in the function of TReg cell mediated HTLV-1-tax inhibition of FOXP3 expression has been described. The purpose of this study was to assess the frequency and phenotype of TReg cells in HTLV-1 asymptomatic carriers and in HTLV-1-associated neurological disease (HAM/TSP) patients, and to correlate with measures of T cell activation. Results: We were able to confirm that HTLV-1 drives activation, spontaneous IFNγ production, and proliferation of CD4+ T cells. We also observed a significantly lower proportion of CTLA-4+ TReg cells (CD4+CD25high T cells) in subjects with HAM/TSP compared to healthy controls. Ki-67 expression was negatively correlated to the frequency of CTLA-4+ TReg cells in HAM/TSP only, although Ki-67 expression was inversely correlated with the percentage of CD127low TReg cells in healthy control subjects. Finally, the proportion of CD127low TReg cells correlated inversely with HTLV-1 proviral load. Conclusion: Taken together, the results suggest that TReg cells may be subverted in HAM/TSP patients, which could explain the marked cellular activation, spontaneous cytokine production, and proliferation of CD4+ T cells, in particular those expressing the CD25highCD127low phenotype. TReg cells represent a potential target for therapeutic intervention for patients with HTLV-1-related neurological diseases.

  7. Genomics of Volvocine Algae

    Science.gov (United States)

    Umen, James G.; Olson, Bradley J.S.C.

    2015-01-01

    Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411

  8. Genomics of Preterm Birth

    Science.gov (United States)

    Swaggart, Kayleigh A.; Pavlicev, Mihaela; Muglia, Louis J.

    2015-01-01

    The molecular mechanisms controlling human birth timing at term, or resulting in preterm birth, have been the focus of considerable investigation, but limited insights have been gained over the past 50 years. In part, these processes have remained elusive because of divergence in reproductive strategies and physiology shown by model organisms, making extrapolation to humans uncertain. Here, we summarize the evolution of progesterone signaling and variation in pregnancy maintenance and termination. We use this comparative physiology to support the hypothesis that selective pressure on genomic loci involved in the timing of parturition has shaped human birth timing, and that these loci can be identified with comparative genomic strategies. Previous limitations imposed by divergence of mechanisms provide an important new opportunity to elucidate fundamental pathways of parturition control through increasing availability of sequenced genomes and associated reproductive physiology characteristics across diverse organisms. PMID:25646385

  9. Genomics of Salmonella Species

    Science.gov (United States)

    Canals, Rocio; McClelland, Michael; Santiviago, Carlos A.; Andrews-Polymenis, Helene

    Progress in the study of Salmonella survival, colonization, and virulence has increased rapidly with the advent of complete genome sequencing and higher capacity assays for transcriptomic and proteomic analysis. Although many of these techniques have yet to be used to directly assay Salmonella growth on foods, these assays are currently in use to determine Salmonella factors necessary for growth in animal models including livestock animals and in in vitro conditions that mimic many different environments. As sequencing of the Salmonella genome and microarray analysis have revolutionized genomics and transcriptomics of salmonellae over the last decade, so are new high-throughput sequencing technologies currently accelerating the pace of our studies and allowing us to approach complex problems that were not previously experimentally tractable.

  10. Ebolavirus comparative genomics

    Science.gov (United States)

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. PMID:26175035
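
    Oligomer frequency analysis, as mentioned in the abstract above, can be illustrated with a small sketch. The record does not state the oligomer length or the distance measure used, so the choice of k = 3 and Euclidean distance here is an assumption, and the sequences are toy strings rather than viral genomes.

```python
from collections import Counter
import math


def kmer_freqs(seq, k=3):
    """Relative frequencies of overlapping k-mers made only of A, C, G, T."""
    seq = seq.upper()
    alphabet = set("ACGT")
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= alphabet   # skip windows with ambiguous bases
    )
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


def euclidean_distance(freqs_a, freqs_b):
    """Distance between two k-mer frequency profiles (assumed metric)."""
    keys = set(freqs_a) | set(freqs_b)
    return math.sqrt(sum((freqs_a.get(key, 0.0) - freqs_b.get(key, 0.0)) ** 2
                         for key in keys))


# Toy sequences, invented for illustration only.
profile_a = kmer_freqs("ACGTACGTACGT")
profile_b = kmer_freqs("ACGTACGTACGA")
profile_c = kmer_freqs("GGGGCCCCGGGG")
print(euclidean_distance(profile_a, profile_b))
print(euclidean_distance(profile_a, profile_c))
```

    Clustering genomes on pairwise distances of this kind is one way a family could separate from other sequenced genomes, which is the type of result the abstract reports for Filoviridae.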

  11. Brief Guide to Genomics: DNA, Genes and Genomes

    Science.gov (United States)

    ... clinic. Most new drugs based on genome-based research are estimated to be at least 10 to 15 years away, though recent genome-driven efforts in lipid-lowering therapy have considerably shortened that interval. According ...

  12. Genomic Prediction in Barley

    DEFF Research Database (Denmark)

    Edriss, Vahid; Cericola, Fabio; Jensen, Jens D

    2015-01-01

    to next generation. The main goal of this study was to assess the potential of genomic prediction in a commercial barley breeding program. The data used in this study were from the Nordic Seed company, which is located in Denmark. Around 350 advanced lines were genotyped with the 9K barley chip from Illumina....... Traits used in this study were grain yield, plant height and heading date. Heading date is the number of days after 1 June that the plant takes to head. Heritabilities were 0.33, 0.44 and 0.48 for yield, height and heading, respectively, for the average of nine plots. The GBLUP model was used for genomic...

  13. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium.

    Science.gov (United States)

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two misidentified strains. Genome analyses also revealed occurrence of higher and lower GC content clades, correlating with phylogenetic clusters. Pan- and core-genome analysis revealed the conservation of 25% of the genome throughout the genus, with a large and open pan-genome. The major source of genomic diversity could be traced to the smaller chromosome and plasmids. Several of the physiological traits studied in the genus did not correlate with phylogenetic data. Since horizontal gene transfer (HGT) is often suggested as a source of genetic diversity and a potential driver of genomic evolution in bacterial species, we looked for evidence of it in Photobacterium genomes. Genomic islands were the source of genomic differences between strains of the same species. Also, we found transposase genes and CRISPR arrays that suggest multiple encounters with foreign DNA. Presence of genomic exchange traits was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.
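
    The pan- and core-genome notions used in the abstract above reduce, in their simplest form, to a union and an intersection of per-strain gene-family sets. The sketch below assumes that genes have already been clustered into families (the hard part of a real analysis, not shown); the strain names and gene contents are invented.

```python
def pan_and_core(gene_sets):
    """Return (pan, core) for a mapping of strain name -> gene families.

    pan:  families present in at least one strain (union).
    core: families present in every strain (intersection).
    """
    sets = [set(genes) for genes in gene_sets.values()]
    if not sets:
        return set(), set()
    pan = set().union(*sets)
    core = set.intersection(*sets)
    return pan, core


# Hypothetical strains and gene families, for illustration only.
strains = {
    "strain1": {"recA", "gyrB", "luxA"},
    "strain2": {"recA", "gyrB", "tnpA"},
    "strain3": {"recA", "gyrB"},
}
pan, core = pan_and_core(strains)
print(sorted(pan))
print(sorted(core))
print(len(core) / len(pan))   # share of the pan-genome that is core
```

    A pan-genome is usually called open when the union keeps growing as more strains are added, while the intersection shrinks toward a stable core.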

  14. phiGENOME: an integrative navigation throughout bacteriophage genomes.

    Science.gov (United States)

    Stano, Matej; Klucar, Lubos

    2011-11-01

    phiGENOME is a web-based genome browser that generates dynamic and interactive graphical representations of phage genomes stored in phiSITE, a database of gene regulation in bacteriophages. phiGENOME is an integral part of the phiSITE web portal (http://www.phisite.org/phigenome) and it was optimised for visualisation of phage genomes with the emphasis on the gene regulatory elements. phiGENOME consists of three components: (i) genome map viewer built using Adobe Flash technology, providing dynamic and interactive graphical display of phage genomes; (ii) sequence browser based on precisely formatted HTML tags, providing detailed exploration of genome features on the sequence level and (iii) regulation illustrator, based on Scalable Vector Graphics (SVG) and designed for graphical representation of gene regulations. With 542 complete genome sequences accompanied by rich annotations and references, phiGENOME is a unique information resource in the field of phage genomics.

  15. Illuminating the Druggable Genome (IDG)

    Data.gov (United States)

    Federal Laboratory Consortium — Results from the Human Genome Project revealed that the human genome contains 20,000 to 25,000 genes. A gene contains (encodes) the information that each cell uses...

  16. National Human Genome Research Institute

    Science.gov (United States)

    ... Care Genomic Medicine Working Group New Horizons and Research Patient Management Policy and Ethics Issues Quick Links for Patient Care Education All About the Human Genome Project Fact Sheets Genetic Education Resources for ...

  17. Genomic prediction using subsampling.

    Science.gov (United States)

    Xavier, Alencar; Xu, Shizhong; Muir, William; Rainey, Katy Martin

    2017-03-24

    Genome-wide assisted selection is a critical tool for the genetic improvement of plants and animals. Whole-genome regression models in a Bayesian framework represent the main family of prediction methods. Fitting such models with a large number of observations involves a prohibitive computational burden. We propose the use of subsampling bootstrap Markov chain in genomic prediction. The method consists of fitting whole-genome regression models by subsampling observations in each round of a Markov Chain Monte Carlo. We evaluated the effect of subsampling bootstrap on prediction and computational parameters. Across datasets, we observed an optimal subsampling proportion of observations around 50% with replacement, and around 33% without replacement. Subsampling provided a substantial decrease in computation time, reducing the time to fit the model by half. On average, losses on predictive properties imposed by subsampling were negligible, usually below 1%. For each dataset, an optimal subsampling point that improves prediction properties was observed, but the improvements were also negligible. Combining subsampling with Gibbs sampling is an interesting ensemble algorithm. The investigation indicates that the subsampling bootstrap Markov chain algorithm substantially reduces computational burden associated with model fitting, and it may slightly enhance prediction properties.
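
    The subsampling step described above, drawing a fresh subset of observations in each MCMC round, can be sketched as follows. Only the index-drawing part is shown; the Gibbs update itself is left as a placeholder comment, and the function name, seed and round count are hypothetical. The proportions 0.50 (with replacement) and 0.33 (without replacement) are the values reported in the abstract.

```python
import random


def subsample_indices(n_obs, proportion, replace, rng):
    """Draw observation indices for one MCMC round."""
    size = max(1, round(n_obs * proportion))
    if replace:
        # Bootstrap-style draw: the same observation may appear twice.
        return [rng.randrange(n_obs) for _ in range(size)]
    # Draw without replacement: every index appears at most once.
    return rng.sample(range(n_obs), size)


rng = random.Random(0)          # fixed seed so the sketch is repeatable
n_obs, n_rounds = 100, 5
for round_number in range(n_rounds):
    rows = subsample_indices(n_obs, 0.33, replace=False, rng=rng)
    # A real sampler would update marker effects here using only `rows`.
    print(round_number, len(rows), len(set(rows)))
```

    Because each round touches only a fraction of the records, the cost per round falls roughly in proportion, which is consistent with the halved fitting time the abstract reports.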

  18. The Lotus japonicus genome

    DEFF Research Database (Denmark)

    Fabaceae, groundbreaking genetic and genomic research has established a significant body of knowledge on Lotus japonicus, which was adopted as a model species more than 20 years ago. The diverse nature of legumes means that such research has a wide potential and agricultural impact, for example...

  19. Genomic taxonomy of vibrios

    DEFF Research Database (Denmark)

    Thompson, Cristiane C.; Vicente, Ana Carolina P.; Souza, Rangel C.

    2009-01-01

    BACKGROUND: Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of...

  20. The Genome Atlas Resource

    DEFF Research Database (Denmark)

    Azam Qureshi, Matloob; Rotenberg, Eva; Stærfeldt, Hans Henrik

    2010-01-01

    with scripts and algorithms developed in a variety of programming languages at the Centre for Biological Sequence Analysis in order to create a three-tier software application for genome analysis. The results are made available via a web interface developed in Java, PHP and Perl CGI. User...

  1. Genomic Signatures of Reinforcement

    Directory of Open Access Journals (Sweden)

    Austin G. Garner

    2018-04-01

    Reinforcement is the process by which selection against hybridization increases reproductive isolation between taxa. Much research has focused on demonstrating the existence of reinforcement, yet relatively little is known about the genetic basis of reinforcement or the evolutionary conditions under which reinforcement can occur. Inspired by reinforcement’s characteristic phenotypic pattern of reproductive trait divergence in sympatry but not in allopatry, we discuss whether reinforcement also leaves a distinct genomic pattern. First, we describe three patterns of genetic variation we expect as a consequence of reinforcement. Then, we discuss a set of alternative processes and complicating factors that may make the identification of reinforcement at the genomic level difficult. Finally, we consider how genomic analyses can be leveraged to inform if and to what extent reinforcement evolved in the face of gene flow between sympatric lineages and between allopatric and sympatric populations of the same lineage. Our major goals are to understand if genome scans for particular patterns of genetic variation could identify reinforcement, isolate the genetic basis of reinforcement, or infer the conditions under which reinforcement evolved.

  2. Better chocolate through genomics

    Science.gov (United States)

    Theobroma cacao, the cacao or chocolate tree, is a tropical understory tree whose seeds are used to make chocolate. And like any important crop, cacao is the subject of much research. On September 15, 2010, scientists publicly released a preliminary sequence of the cacao genome--which contains all o...

  3. Functional genomics of tomato

    Indian Academy of Sciences (India)

    2014-10-20

    Oct 20, 2014 ... 1Repository of Tomato Genomics Resources, Department of Plant Sciences, School .... Due to its position at the crossroads of Sanger's sequencing .... replacement for the microarray-based expression profiling. .... during RNA fragmentation step prior to library construction, ...... tomato pollen as a test case.

  4. Genomic Signatures of Reinforcement

    Science.gov (United States)

    Goulet, Benjamin E.

    2018-01-01

    Reinforcement is the process by which selection against hybridization increases reproductive isolation between taxa. Much research has focused on demonstrating the existence of reinforcement, yet relatively little is known about the genetic basis of reinforcement or the evolutionary conditions under which reinforcement can occur. Inspired by reinforcement’s characteristic phenotypic pattern of reproductive trait divergence in sympatry but not in allopatry, we discuss whether reinforcement also leaves a distinct genomic pattern. First, we describe three patterns of genetic variation we expect as a consequence of reinforcement. Then, we discuss a set of alternative processes and complicating factors that may make the identification of reinforcement at the genomic level difficult. Finally, we consider how genomic analyses can be leveraged to inform if and to what extent reinforcement evolved in the face of gene flow between sympatric lineages and between allopatric and sympatric populations of the same lineage. Our major goals are to understand if genome scans for particular patterns of genetic variation could identify reinforcement, isolate the genetic basis of reinforcement, or infer the conditions under which reinforcement evolved. PMID:29614048

  5. The Nostoc punctiforme Genome

    Energy Technology Data Exchange (ETDEWEB)

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9 Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.

  6. Comparative Genomics of Eukaryotes.

    NARCIS (Netherlands)

    Noort, V. van

    2007-01-01

    This thesis focuses on developing comparative genomics methods in eukaryotes, with an emphasis on applications for gene function prediction and regulatory element detection. In the past, methods have been developed to predict functional associations between gene pairs in prokaryotes. The challenge

  7. Searching for genomic constraints

    Energy Technology Data Exchange (ETDEWEB)

    Lio', P [Cambridge, Univ. (United Kingdom). Genetics Dept.]; Ruffo, S [Florence, Univ. (Italy). Fac. di Ingegneria. Dipt. di Energetica 'S. Stecco']

    1998-01-01

    The authors have analyzed general properties of very long DNA sequences belonging to simple and complex organisms, by using different correlation methods. They have distinguished those base compositional rules that concern the entire genome, which they call 'genomic constraints', from the rules that depend on the 'external natural selection' acting on single genes, i.e. protein-centered constraints. They show that G + C content, purine/pyrimidine distributions and biological complexity of the organism are the most important factors which determine base compositional rules and genome complexity. Three main facts are reported here: bacteria with high G + C content have more restrictions on base composition than those with low G + C content; at constant G + C content, more complex organisms, ranging from prokaryotes to higher eukaryotes (e.g. human), display an increase of repeats 10-20 nucleotides long, which are also partly responsible for long-range correlations; word selection of length 3 to 10 is stronger in human and in bacteria for two distinct reasons. With respect to previous studies, they have also compared the genomic sequence of the archaeon Methanococcus jannaschii with those of bacteria and eukaryotes: it sometimes shows an intermediate statistical behaviour.

  8. Searching for genomic constraints

    International Nuclear Information System (INIS)

    Lio', P.; Ruffo, S.

    1998-01-01

    The authors have analyzed general properties of very long DNA sequences belonging to simple and complex organisms, by using different correlation methods. They have distinguished those base compositional rules that concern the entire genome, which they call 'genomic constraints', from the rules that depend on the 'external natural selection' acting on single genes, i.e. protein-centered constraints. They show that G + C content, purine/pyrimidine distributions and biological complexity of the organism are the most important factors which determine base compositional rules and genome complexity. Three main facts are reported here: bacteria with high G + C content have more restrictions on base composition than those with low G + C content; at constant G + C content, more complex organisms, ranging from prokaryotes to higher eukaryotes (e.g. human), display an increase of repeats 10-20 nucleotides long, which are also partly responsible for long-range correlations; word selection of length 3 to 10 is stronger in human and in bacteria for two distinct reasons. With respect to previous studies, they have also compared the genomic sequence of the archaeon Methanococcus jannaschii with those of bacteria and eukaryotes: it sometimes shows an intermediate statistical behaviour.

  9. Genomic sequencing in clinical trials

    OpenAIRE

    Mestan, Karen K; Ilkhanoff, Leonard; Mouli, Samdeep; Lin, Simon

    2011-01-01

    Abstract Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to fin...

  10. Statistical Methods in Integrative Genomics

    Science.gov (United States)

    Richardson, Sylvia; Tseng, George C.; Sun, Wei

    2016-01-01

    Statistical methods in integrative genomics aim to answer important biology questions by jointly analyzing multiple types of genomic data (vertical integration) or aggregating the same type of data across multiple studies (horizontal integration). In this article, we introduce different types of genomic data and data resources, and then review statistical methods of integrative genomics, with emphasis on the motivation and rationale of these methods. We conclude with some summary points and future research directions. PMID:27482531

  11. From plant genomes to phenotypes

    OpenAIRE

    Bolger, Marie; Gundlach, Heidrun; Scholz, Uwe; Mayer, Klaus; Usadel, Björn; Schwacke, Rainer; Schmutzer, Thomas; Chen, Jinbo; Arend, Daniel; Oppermann, Markus; Weise, Stephan; Lange, Matthias; Fiorani, Fabio; Spannagl, Manuel

    2017-01-01

    Recent advances in sequencing technologies have greatly accelerated the rate of plant genome and applied breeding research. Despite this advancing trend, plant genomes continue to present numerous difficulties to the standard tools and pipelines, not only for genome assembly but also for gene annotation and downstream analysis. Here we give a perspective on tools, resources and services necessary to assemble and analyze plant genomes and link them to plant phenotypes.

  12. A Thousand Fly Genomes: An Expanded Drosophila Genome Nexus.

    Science.gov (United States)

    Lack, Justin B; Lange, Jeremy D; Tang, Alison D; Corbett-Detig, Russell B; Pool, John E

    2016-12-01

    The Drosophila Genome Nexus is a population genomic resource that provides D. melanogaster genomes from multiple sources. To facilitate comparisons across data sets, genomes are aligned using a common reference alignment pipeline which involves two rounds of mapping. Regions of residual heterozygosity, identity-by-descent, and recent population admixture are annotated to enable data filtering based on the user's needs. Here, we present a significant expansion of the Drosophila Genome Nexus, which brings the current data object to a total of 1,121 wild-derived genomes. New additions include 305 previously unpublished genomes from inbred lines representing six population samples in Egypt, Ethiopia, France, and South Africa, along with another 193 genomes added from recently published data sets. We also provide an aligned D. simulans genome to facilitate divergence comparisons. This improved resource will broaden the range of population genomic questions that can be addressed from multi-population allele frequencies and haplotypes in this model species. The larger set of genomes will also enhance the discovery of functionally relevant natural variation that exists within and between populations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  13. The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

    Science.gov (United States)

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-02-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.

  14. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

    Science.gov (United States)

    Manolio, Teri A

    2016-10-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. Published by Elsevier Ireland Ltd.

  15. Applied Genomics of Foodborne Pathogens

    DEFF Research Database (Denmark)

    This book provides a timely and thorough snapshot into the emerging and fast evolving area of applied genomics of foodborne pathogens. Driven by the drastic advance of whole genome shotgun sequencing (WGS) technologies, genomics applications are becoming increasingly valuable and even essential... and customized source of information designed for and accessible to microbiologists interested in applying cutting-edge genomics in food safety and public health research. This book fills this void with a well-selected collection of topics, case studies, and bioinformatics tools contributed by experts at the forefront of foodborne pathogen genomics research.

  16. Chromatin dynamics in genome stability

    DEFF Research Database (Denmark)

    Nair, Nidhi; Shoaib, Muhammad; Sørensen, Claus Storgaard

    2017-01-01

    Genomic DNA is compacted into chromatin through packaging with histone and non-histone proteins. Importantly, DNA accessibility is dynamically regulated to ensure genome stability. This is exemplified in the response to DNA damage where chromatin relaxation near genomic lesions serves to promote access of relevant enzymes to specific DNA regions for signaling and repair. Furthermore, recent data highlight genome maintenance roles of chromatin through the regulation of endogenous DNA-templated processes including transcription and replication. Here, we review research that shows the importance of chromatin structure regulation in maintaining genome integrity by multiple mechanisms including facilitating DNA repair and directly suppressing endogenous DNA damage.

  17. Evolution of small prokaryotic genomes

    Directory of Open Access Journals (Sweden)

    David José Martínez-Cano

    2015-01-01

    Full Text Available As revealed by genome sequencing, the biology of prokaryotes with reduced genomes is strikingly diverse. These include free-living prokaryotes with ~800 genes as well as endosymbiotic bacteria with as few as ~140 genes. Comparative genomics is revealing the evolutionary mechanisms that led to these small genomes. In the case of free-living prokaryotes, natural selection directly favored genome reduction, while in the case of endosymbiotic prokaryotes neutral processes played a more prominent role. However, new experimental data suggest that selective processes may be in operation as well for endosymbiotic prokaryotes, at least during the first stages of genome reduction. Endosymbiotic prokaryotes have evolved diverse strategies for living with reduced gene sets inside a host-defined medium. These include utilization of host-encoded functions (some of them coded by genes acquired by gene transfer from the endosymbiont and/or other bacteria); metabolic complementation between co-symbionts; and forming consortia with other bacteria within the host. Recent genome sequencing projects of intracellular mutualistic bacteria showed that evolutionary trends previously believed to be universal, like reduced G+C content and conservation of genome synteny, are not always present in highly reduced genomes. Finally, the simplified molecular machinery of some of these organisms with small genomes may be used to aid in the design of artificial minimal cells. Here we review recent genomic discoveries of the biology of prokaryotes endowed with small gene sets and discuss the evolutionary mechanisms that have been proposed to explain their peculiar nature.

  18. Informational laws of genome structures

    Science.gov (United States)

    Bonnici, Vincenzo; Manca, Vincenzo

    2016-06-01

    In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = log2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.
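
    As a rough illustration of choosing k as the base-2 logarithm of the genome length, the Python sketch below computes k for a sequence and the empirical Shannon entropy of its k-mer distribution. The function name, the rounding of the logarithm to the nearest integer, and the use of overlapping k-mers are assumptions made for illustration only; the informational indexes defined in the study itself are not reproduced here.

```python
import math
from collections import Counter


def kmer_entropy(genome: str):
    """Return (k, entropy in bits) for the overlapping k-mers of `genome`.

    Assumption: k is log2(n) rounded to the nearest integer, at least 1.
    """
    n = len(genome)
    if n == 0:
        raise ValueError("genome must not be empty")
    k = max(1, round(math.log2(n)))

    # Count every overlapping window of length k.
    total = n - k + 1
    counts = Counter(genome[i:i + k] for i in range(total))

    # Empirical Shannon entropy of the k-mer frequency distribution.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
    return k, entropy


if __name__ == "__main__":
    k, h = kmer_entropy("ACGT" * 4)
    print(f"k = {k}, k-mer entropy = {h:.3f} bits")
```

    A sequence made of a single repeated base yields an entropy of zero, while a sequence in which every k-mer is distinct reaches the maximum, log2 of the number of k-mers.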

  19. Toward genome-enabled mycology.

    Science.gov (United States)

    Hibbett, David S; Stajich, Jason E; Spatafora, Joseph W

    2013-01-01

    Genome-enabled mycology is a rapidly expanding field that is characterized by the pervasive use of genome-scale data and associated computational tools in all aspects of fungal biology. Genome-enabled mycology is integrative and often requires teams of researchers with diverse skills in organismal mycology, bioinformatics and molecular biology. This issue of Mycologia presents the first complete fungal genomes in the history of the journal, reflecting the ongoing transformation of mycology into a genome-enabled science. Here, we consider the prospects for genome-enabled mycology and the technical and social challenges that will need to be overcome to grow the database of complete fungal genomes and enable all fungal biologists to make use of the new data.

  20. Genomic research perspectives in Kazakhstan

    Directory of Open Access Journals (Sweden)

    Ainur Akilzhanova

    2014-01-01

    Full Text Available Introduction: Technological advancements rapidly propel the field of genome research. Advances in genetics and genomics such as the sequence of the human genome, the human haplotype map, open access databases, cheaper genotyping and chemical genomics, have transformed basic and translational biomedical research. Several projects in the field of genomic and personalized medicine have been conducted at the Center for Life Sciences in Nazarbayev University. The prioritized areas of research include: genomics of multifactorial diseases, cancer genomics, bioinformatics, genetics of infectious diseases and population genomics. At present, DNA-based risk assessment for common complex diseases, application of molecular signatures for cancer diagnosis and prognosis, genome-guided therapy, and dose selection of therapeutic drugs are the important issues in personalized medicine. Results: To further develop genomic and biomedical projects at the Center for Life Sciences, the development of bioinformatics research and infrastructure and the establishment of new collaborations in the field are essential. Widespread use of genetic tools will allow the identification of diseases before the onset of clinical symptoms, the individualization of drug treatment, and could induce individual behavioral changes on the basis of calculated disease risk. However, many challenges remain for the successful translation of genomic knowledge and technologies into health advances, such as medicines and diagnostics. It is important to integrate research and education in the fields of genomics, personalized medicine, and bioinformatics, which will be possible with the opening of the new Medical Faculty at Nazarbayev University. People in practice and training need to be educated about the key concepts of genomics and engaged so they can effectively apply their knowledge in a manner that will bring the era of genomic medicine to patient care. This requires the development of well

  1. Mycobacteriophage genome database.

    Science.gov (United States)

    Joseph, Jerrine; Rajendran, Vasanthi; Hassan, Sameer; Kumar, Vanaja

    2011-01-01

    Mycobacteriophage genome database (MGDB) is an exclusive repository of the 64 completely sequenced mycobacteriophages with annotated information. It is a comprehensive compilation of the various gene parameters captured from several databases pooled together to empower mycobacteriophage researchers. The MGDB (Version No. 1.0) comprises 6086 genes from 64 mycobacteriophages classified into 72 families based on the ACLAME database. Manual curation was aided by information available from public databases, which was enriched further by analysis. Its web interface allows browsing as well as querying the classification. The main objective is to collect and organize the complexity inherent to mycobacteriophage protein classification in a rational way. The other objective is to browse the existing and new genomes and describe their functional annotation. The database is available for free at http://mpgdb.ibioinformatics.org/mpgdb.php.

  2. Precision genome editing

    DEFF Research Database (Denmark)

    Steentoft, Catharina; Bennett, Eric P; Schjoldager, Katrine Ter-Borch Gram

    2014-01-01

    Precise and stable gene editing in mammalian cell lines has until recently been hampered by the lack of efficient targeting methods. While different gene silencing strategies have had tremendous impact on many biological fields, they have generally not been applied with wide success in the field of glycobiology, primarily due to their low efficiencies, with resultant failure to impose substantial phenotypic consequences upon the final glycosylation products. Here, we review novel nuclease-based precision genome editing techniques enabling efficient and stable gene editing, including gene disruption... by introducing single or double-stranded breaks at a defined genomic sequence. We here compare and contrast the different techniques and summarize their current applications, highlighting cases from the field of glycobiology as well as pointing to future opportunities. The emerging potential of precision gene...

  3. Alignment of whole genomes.

    Science.gov (United States)

    Delcher, A L; Kasif, S; Fleischmann, R D; Peterson, J; White, O; Salzberg, S L

    1999-01-01

    A new system for aligning whole genome sequences is described. Using an efficient data structure called a suffix tree, the system is able to rapidly align sequences containing millions of nucleotides. Its use is demonstrated on two strains of Mycobacterium tuberculosis, on two less similar species of Mycoplasma bacteria and on two syntenic sequences from human chromosome 12 and mouse chromosome 6. In each case it found an alignment of the input sequences, using between 30 s and 2 min of computation time. From the system output, information on single nucleotide changes, translocations and homologous genes can easily be extracted. Use of the algorithm should facilitate analysis of syntenic chromosomal regions, strain-to-strain comparisons, evolutionary comparisons and genomic duplications. PMID:10325427
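
    To give a feel for the exact-match anchors that a whole-genome aligner of this kind builds on, here is a minimal Python sketch. It is not the suffix tree method named in the abstract: it swaps in a plain k-mer index, which is far simpler and far less efficient, and it only reports matches that contain at least one k-mer occurring exactly once in each sequence. The function names and the seed length k are hypothetical choices for illustration.

```python
from collections import defaultdict


def unique_kmer_positions(seq: str, k: int) -> dict:
    """Map each k-mer that occurs exactly once in `seq` to its start index."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {kmer: pos[0] for kmer, pos in positions.items() if len(pos) == 1}


def anchored_unique_matches(a: str, b: str, k: int) -> list:
    """Return sorted (start_a, start_b, length) tuples for exact matches.

    Each match is seeded by a k-mer unique in both sequences and then
    extended left and right until the sequences disagree, so it cannot be
    lengthened in either direction.
    """
    unique_a = unique_kmer_positions(a, k)
    unique_b = unique_kmer_positions(b, k)
    matches = set()
    for kmer, i in unique_a.items():
        j = unique_b.get(kmer)
        if j is None:
            continue
        start_a, start_b = i, j
        # Extend to the left while the preceding characters agree.
        while start_a > 0 and start_b > 0 and a[start_a - 1] == b[start_b - 1]:
            start_a -= 1
            start_b -= 1
        end_a, end_b = i + k, j + k
        # Extend to the right while the following characters agree.
        while end_a < len(a) and end_b < len(b) and a[end_a] == b[end_b]:
            end_a += 1
            end_b += 1
        matches.add((start_a, start_b, end_a - start_a))
    return sorted(matches)


if __name__ == "__main__":
    print(anchored_unique_matches("GATTACAGG", "CCGATTACATT", 4))
```

    The set removes duplicates that arise when several seeds extend to the same match. A suffix tree finds such matches without fixing a seed length in advance, which is part of what makes it practical for sequences millions of nucleotides long.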

  4. eGenomics: Cataloguing Our Complete Genome Collection III

    Directory of Open Access Journals (Sweden)

    Dawn Field

    2007-01-01

    Full Text Available This meeting report summarizes the proceedings of the “eGenomics: Cataloguing our Complete Genome Collection III” workshop held September 11–13, 2006, at the National Institute for Environmental eScience (NIEeS), Cambridge, United Kingdom. This 3rd workshop of the Genomic Standards Consortium was divided into two parts. The first half of the three-day workshop was dedicated to reviewing the genomic diversity of our current and future genome and metagenome collection, and exploring linkages to a series of existing projects through formal presentations. The second half was dedicated to strategic discussions. Outcomes of the workshop include a revised “Minimum Information about a Genome Sequence” (MIGS) specification (v1.1), consensus on a variety of features to be added to the Genome Catalogue (GCat), agreement by several researchers to adopt MIGS for imminent genome publications, and an agreement by the EBI and NCBI to input their genome collections into GCat for the purpose of quantifying the amount of optional data already available (e.g., for geographic location coordinates) and working towards a single, global list of all public genomes and metagenomes.

  5. Genomics Portals: integrative web-platform for mining genomics data.

    Science.gov (United States)

    Shinde, Kaustubh; Phatak, Mukta; Johannes, Freudenberg M; Chen, Jing; Li, Qian; Vineet, Joshi K; Hu, Zhen; Ghosh, Krishnendu; Meller, Jaroslaw; Medvedovic, Mario

    2010-01-13

    A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems. Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc), and the integration with an extensive knowledge base that can be used in such analysis. The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.

  6. Genomics Portals: integrative web-platform for mining genomics data

    Directory of Open Access Journals (Sweden)

    Ghosh Krishnendu

    2010-01-01

    Full Text Available Background: A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems. Results: Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc.), and the integration with an extensive knowledge base that can be used in such analysis. Conclusion: The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.

  7. Family genome browser: visualizing genomes with pedigree information.

    Science.gov (United States)

    Juan, Liran; Liu, Yongzhuang; Wang, Yongtian; Teng, Mingxiang; Zang, Tianyi; Wang, Yadong

    2015-07-15

    Families with inherited diseases are widely used in Mendelian/complex disease studies. Owing to the advances in high-throughput sequencing technologies, family genome sequencing is becoming more and more prevalent. Visualizing family genomes can greatly facilitate human genetics studies and personalized medicine. However, due to the complex genetic relationships and high similarities among genomes of consanguineous family members, family genomes are difficult to visualize in traditional genome visualization frameworks. How to visualize the family genome variants and their functions with integrated pedigree information remains a critical challenge. We developed the Family Genome Browser (FGB) to provide comprehensive analysis and visualization for family genomes. The FGB can visualize family genomes effectively at both the individual level and the variant level, through integrating genome data with pedigree information. Family genome analysis, including determination of parental origin of the variants, detection of de novo mutations, identification of potential recombination events and identical-by-descent segments, etc., can be performed flexibly. Diverse annotations for the family genome variants, such as dbSNP memberships, linkage disequilibriums, genes, variant effects, potential phenotypes, etc., are illustrated as well. Moreover, the FGB can automatically search de novo mutations and compound heterozygous variants for a selected individual, and guide investigators to find high-risk genes with flexible navigation options. These features enable users to investigate and understand family genomes intuitively and systematically. The FGB is available at http://mlg.hit.edu.cn/FGB/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Human Germline Genome Editing.

    Science.gov (United States)

    Ormond, Kelly E; Mortlock, Douglas P; Scholes, Derek T; Bombard, Yvonne; Brody, Lawrence C; Faucett, W Andrew; Garrison, Nanibaa' A; Hercher, Laura; Isasi, Rosario; Middleton, Anna; Musunuru, Kiran; Shriner, Daniel; Virani, Alice; Young, Caroline E

    2017-08-03

    With CRISPR/Cas9 and other genome-editing technologies, successful somatic and germline genome editing are becoming feasible. To respond, an American Society of Human Genetics (ASHG) workgroup developed this position statement, which was approved by the ASHG Board in March 2017. The workgroup included representatives from the UK Association of Genetic Nurses and Counsellors, Canadian Association of Genetic Counsellors, International Genetic Epidemiology Society, and US National Society of Genetic Counselors. These groups, as well as the American Society for Reproductive Medicine, Asia Pacific Society of Human Genetics, British Society for Genetic Medicine, Human Genetics Society of Australasia, Professional Society of Genetic Counselors in Asia, and Southern African Society for Human Genetics, endorsed the final statement. The statement includes the following positions. (1) At this time, given the nature and number of unanswered scientific, ethical, and policy questions, it is inappropriate to perform germline gene editing that culminates in human pregnancy. (2) Currently, there is no reason to prohibit in vitro germline genome editing on human embryos and gametes, with appropriate oversight and consent from donors, to facilitate research on the possible future clinical applications of gene editing. There should be no prohibition on making public funds available to support this research. (3) Future clinical application of human germline genome editing should not proceed unless, at a minimum, there is (a) a compelling medical rationale, (b) an evidence base that supports its clinical use, (c) an ethical justification, and (d) a transparent public process to solicit and incorporate stakeholder input. Copyright © 2017 American Society of Human Genetics. All rights reserved.

  9. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project

    DEFF Research Database (Denmark)

    Hayes, Benjamin J; MacLeod, Iona M; Daetwyler, Hans D

    Advantages of using whole genome sequence data to predict genomic estimated breeding values (GEBV) include better persistence of accuracy of GEBV across generations and more accurate GEBV across breeds. The 1000 Bull Genomes Project provides a database of whole genome sequenced key ancestor bulls....... In a dairy data set, predictions using BayesRC and imputed sequence data from 1000 Bull Genomes were 2% more accurate than with 800k data. We could demonstrate the method identified causal mutations in some cases. Further improvements will come from more accurate imputation of sequence variant genotypes...

  10. Genomic technologies in neonatology

    Directory of Open Access Journals (Sweden)

    L. N. Chernova

    2017-01-01

    In recent years, there has been a tremendous trend toward personalized medicine. Advances in the field have forced clinicians, including neonatologists, to take a fresh look at the prevention, management and therapy of various diseases. Individual genomic data are at the center of attention of foreign, and increasingly Russian, researchers and doctors, because they make it possible not only to assess the risk of a given pathology but also to apply personalized strategies of prediction, prevention and targeted treatment successfully. This article provides a brief review of the latest achievements of genomic technologies in newborns, and examines the problems and potential applications of genomics in promoting the concept of personalized medicine in neonatology. The increasing amount of personalized data is simply impossible to analyze by the human mind alone, so the need for computers and bioinformatics is obvious. The article reveals the role of translational bioinformatics in analyzing the results of accumulated fundamental research and integrating them into complete clinical decisions. The latest advances in neonatal translational bioinformatics, such as clinical decision support systems, are considered. Such systems help to monitor the vital parameters of newborns that influence the course of a particular disease, to calculate increased risks of developing various pathologies, and to select drugs.

  11. Value-based genomics.

    Science.gov (United States)

    Gong, Jun; Pan, Kathy; Fakih, Marwan; Pal, Sumanta; Salgia, Ravi

    2018-03-20

    Advancements in next-generation sequencing have greatly enhanced the development of biomarker-driven cancer therapies. The affordability and availability of next-generation sequencers have allowed for the commercialization of next-generation sequencing platforms that have found widespread use for clinical-decision making and research purposes. Despite the greater availability of tumor molecular profiling by next-generation sequencing at our doorsteps, the achievement of value-based care, or improving patient outcomes while reducing overall costs or risks, in the era of precision oncology remains a looming challenge. In this review, we highlight available data through a pre-established and conceptualized framework for evaluating value-based medicine to assess the cost (efficiency), clinical benefit (effectiveness), and toxicity (safety) of genomic profiling in cancer care. We also provide perspectives on future directions of next-generation sequencing from targeted panels to whole-exome or whole-genome sequencing and describe potential strategies needed to attain value-based genomics.

  12. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

    DEFF Research Database (Denmark)

    Machado, Henrique; Gram, Lone

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationships using several analyses (16S rRNA, MLSA, fur, amino-acid usage, ANI), which allowed us to identify two... was widespread and abundant in the genus, suggesting a role in genomic evolution. The high genetic variability and indications of genetic exchange make it difficult to elucidate genome evolutionary paths and raise the awareness of the roles of foreign DNA in the genomic evolution of environmental organisms.

  13. Genome update: the 1000th genome - a cautionary tale

    DEFF Research Database (Denmark)

    Lagesen, Karin; Ussery, David; Wassenaar, Gertrude Maria

    2010-01-01

    There are now more than 1000 sequenced prokaryotic genomes deposited in public databases and available for analysis. Currently, although the sequence databases GenBank, DNA Database of Japan and EMBL are synchronized continually, there are slight differences in content at the genomes level for a variety of logistical reasons, including differences in format and loading errors, such as those caused by file transfer protocol interruptions. This means that the 1000th genome will be different in the various databases. Some of the data on the highly accessed web pages are inaccurate, leading to false conclusions, for example about the largest bacterial genome sequenced. Biological diversity is far greater than many have thought. For example, analysis of multiple Escherichia coli genomes has led to an estimate of around 45 000 gene families, more genes than are recognized in the human genome. Moreover...

  14. Efficient Breeding by Genomic Mating.

    Science.gov (United States)

    Akdemir, Deniz; Sánchez, Julio I

    2016-01-01

    Selection in breeding programs can be done by using phenotypes (phenotypic selection), pedigree relationship (breeding value selection) or molecular markers (marker assisted selection or genomic selection). All these methods are based on truncation selection, focusing on the best performance of parents before mating. In this article we propose an approach to breeding, named genomic mating, which focuses on mating instead of truncation selection. Genomic mating uses information in a similar fashion to genomic selection but includes information on complementation of parents to be mated. Following the efficiency frontier surface, genomic mating uses concepts of estimated breeding values, risk (usefulness) and coefficient of ancestry to optimize mating between parents. We used a genetic algorithm to find solutions to this optimization problem, and the results from our simulations comparing genomic selection, phenotypic selection and the mating approach indicate that the current approach for breeding complex traits is more favorable than phenotypic and genomic selection. Genomic mating is similar to genomic selection in terms of estimating marker effects, but in genomic mating the genetic information and the estimated marker effects are used to decide which genotypes should be crossed to obtain the next breeding population.

  15. Comparative Genomics Reveals High Genomic Diversity in the Genus Photobacterium

    OpenAIRE

    Henrique Machado; Lone Gram

    2017-01-01

    Vibrionaceae is a large marine bacterial family, which can constitute up to 50% of the prokaryotic population in marine waters. Photobacterium is the second largest genus in the family and we used comparative genomics on 35 strains representing 16 of the 28 species described so far, to understand the genomic diversity present in the Photobacterium genus. Such understanding is important for ecophysiology studies of the genus. We used whole genome sequences to evaluate phylogenetic relationship...

  16. Genome Surfing As Driver of Microbial Genomic Diversity.

    Science.gov (United States)

    Choudoir, Mallory J; Panke-Buisse, Kevin; Andam, Cheryl P; Buckley, Daniel H

    2017-08-01

    Historical changes in population size, such as those caused by demographic range expansions, can produce nonadaptive changes in genomic diversity through mechanisms such as gene surfing. We propose that demographic range expansion of a microbial population capable of horizontal gene exchange can result in genome surfing, a mechanism that can cause widespread increase in the pan-genome frequency of genes acquired by horizontal gene exchange. We explain that patterns of genetic diversity within Streptomyces are consistent with genome surfing, and we describe several predictions for testing this hypothesis both in Streptomyces and in other microorganisms. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Genome U-Plot: a whole genome visualization.

    Science.gov (United States)

    Gaitatzes, Athanasios; Johnson, Sarah H; Smadbeck, James B; Vasmatzis, George

    2018-05-15

    The ability to produce and analyze whole genome sequencing (WGS) data from samples with structural variations (SV) has generated the need to visualize such abnormalities in simplified plots. Conventional two-dimensional representations of WGS data frequently use either circular or linear layouts. Both representations have several advantages, but their major disadvantage is that they do not use the two-dimensional space very efficiently. We propose a layout, termed the Genome U-Plot, which spreads the chromosomes on a two-dimensional surface and essentially quadruples the spatial resolution. We present the Genome U-Plot for producing clear and intuitive graphs that allow researchers to generate novel insights and hypotheses by visualizing SVs such as deletions, amplifications, and chromoanagenesis events. The main features of the Genome U-Plot are its layered layout, its high spatial resolution and its improved aesthetic qualities. We compare conventional visualization schemas with the Genome U-Plot using visualization metrics such as number of line crossings and crossing angle resolution measures. Based on our metrics, we improve the readability of the resulting graph by at least 2-fold, making important features apparent and making it easy to identify important genomic changes. The result is a whole genome visualization tool with high spatial resolution and improved aesthetic qualities. An implementation and documentation of the Genome U-Plot are publicly available at https://github.com/gaitat/GenomeUPlot. Contact: vasmatzis.george@mayo.edu. Supplementary data are available at Bioinformatics online.

  18. Genomic Data Commons and Genomic Cloud Pilots - Google Hangout

    Science.gov (United States)

    Join us for a live, moderated discussion about two NCI efforts to expand access to cancer genomics data: the Genomic Data Commons and Genomic Cloud Pilots. NCI subject matter experts will include Louis M. Staudt, M.D., Ph.D., Director, Center for Cancer Genomics, and Warren Kibbe, Ph.D., Director, NCI Center for Biomedical Informatics and Information Technology. The discussion will be moderated by Anthony Kerlavage, Ph.D., Chief, Cancer Informatics Branch, Center for Biomedical Informatics and Information Technology. We welcome your questions before and during the Hangout on Twitter using the hashtag #AskNCI.

  19. Ensembl 2002: accommodating comparative genomics.

    Science.gov (United States)

    Clamp, M; Andrews, D; Barker, D; Bevan, P; Cameron, G; Chen, Y; Clark, L; Cox, T; Cuff, J; Curwen, V; Down, T; Durbin, R; Eyras, E; Gilbert, J; Hammond, M; Hubbard, T; Kasprzyk, A; Keefe, D; Lehvaslaiho, H; Iyer, V; Melsopp, C; Mongin, E; Pettett, R; Potter, S; Rust, A; Schmidt, E; Searle, S; Slater, G; Smith, J; Spooner, W; Stabenau, A; Stalker, J; Stupka, E; Ureta-Vidal, A; Vastrik, I; Birney, E

    2003-01-01

    The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of human, mouse and other genome sequences, available as either an interactive web site or as flat files. Ensembl also integrates manually annotated gene structures from external sources where available. As well as being one of the leading sources of genome annotation, Ensembl is an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements. These range from sequence analysis to data storage and visualisation, and installations exist around the world in both companies and at academic sites. With both human and mouse genome sequences available and more vertebrate sequences to follow, many of the recent developments in Ensembl have focused on developing automatic comparative genome analysis and visualisation.

  20. The Ensembl genome database project.

    Science.gov (United States)

    Hubbard, T; Barker, D; Birney, E; Cameron, G; Chen, Y; Clark, L; Cox, T; Cuff, J; Curwen, V; Down, T; Durbin, R; Eyras, E; Gilbert, J; Hammond, M; Huminiecki, L; Kasprzyk, A; Lehvaslaiho, H; Lijnzaad, P; Melsopp, C; Mongin, E; Pettett, R; Pocock, M; Potter, S; Rust, A; Schmidt, E; Searle, S; Slater, G; Smith, J; Spooner, W; Stabenau, A; Stalker, J; Stupka, E; Ureta-Vidal, A; Vastrik, I; Clamp, M

    2002-01-01

    The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.

  1. Comparative Genomics in Homo sapiens.

    Science.gov (United States)

    Oti, Martin; Sammeth, Michael

    2018-01-01

    Genomes can be compared at different levels of divergence, either between species or within species. Within species genomes can be compared between different subpopulations, such as human subpopulations from different continents. Investigating the genomic differences between different human subpopulations is important when studying complex diseases that are affected by many genetic variants, as the variants involved can differ between populations. The 1000 Genomes Project collected genome-scale variation data for 2504 human individuals from 26 different populations, enabling a systematic comparison of variation between human subpopulations. In this chapter, we present step-by-step a basic protocol for the identification of population-specific variants employing the 1000 Genomes data. These variants are subsequently further investigated for those that affect the proteome or RNA splice sites, to investigate potentially biologically relevant differences between the populations.

  2. The genome of Arabidopsis thaliana.

    OpenAIRE

    Goodman, H M; Ecker, J R; Dean, C

    1995-01-01

    Arabidopsis thaliana is a small flowering plant that is a member of the family Cruciferae. It has many characteristics--diploid genetics, rapid growth cycle, relatively low repetitive DNA content, and small genome size--that recommend it as the model for a plant genome project. The current status of the genetic and physical maps, as well as efforts to sequence the genome, are presented. Examples are given of genes isolated by using map-based cloning. The importance of the Arabidopsis project ...

  3. Advances in editing microalgae genomes

    OpenAIRE

    Daboussi, Fayza

    2017-01-01

    There have been significant advances in microalgal genomics over the last decade. Nevertheless, there are still insufficient tools for the manipulation of microalgae genomes and the development of microalgae as industrial biofactories. Several research groups have recently contributed to progress by demonstrating that particular nucleases can be used for targeted and stable modifications of the genomes of some microalgae species. The nucleases include Meganucleases, Zinc Finger nucleases, TAL...

  4. Genomic selection in plant breeding.

    Science.gov (United States)

    Newell, Mark A; Jannink, Jean-Luc

    2014-01-01

    Genomic selection (GS) is a method to predict the genetic value of selection candidates based on the genomic estimated breeding value (GEBV) predicted from high-density markers positioned throughout the genome. Unlike marker-assisted selection, the GEBV is based on all markers including both minor and major marker effects. Thus, the GEBV may capture more of the genetic variation for the particular trait under selection.
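
    The idea that a GEBV sums the effects of all markers can be written out in a few lines. The sketch below assumes that marker effects have already been estimated on a training population (by whatever model) and that genotypes are coded as 0, 1 or 2 copies of the counted allele; the function names, candidate labels and numbers are illustrative and do not come from the record above.

```python
def gebv(genotypes, marker_effects):
    """Genomic estimated breeding value as the sum of
    allele dosage (0/1/2) times estimated marker effect over all markers."""
    if len(genotypes) != len(marker_effects):
        raise ValueError("one effect is needed per marker")
    return sum(g * e for g, e in zip(genotypes, marker_effects))


def rank_candidates(candidates, marker_effects):
    """Return candidate names ordered from highest to lowest GEBV."""
    return sorted(
        candidates,
        key=lambda name: gebv(candidates[name], marker_effects),
        reverse=True,
    )


# Hypothetical effects for three markers and two selection candidates.
effects = [0.5, -0.2, 0.1]
lines = {"lineA": [2, 1, 0], "lineB": [0, 2, 2]}
print(rank_candidates(lines, effects))
```

    Because every marker contributes, small-effect loci that marker-assisted selection would ignore still shift the ranking.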

  5. Genomic Feature Models

    DEFF Research Database (Denmark)

    Sørensen, Peter; Edwards, Stefan McKinnon; Rohde, Palle Duun

    Whole-genome sequences and multiple trait phenotypes from large numbers of individuals will soon be available in many populations. Well established statistical modeling approaches enable the genetic analyses of complex trait phenotypes while accounting for a variety of additive and non-additive genetic mechanisms. These modeling approaches have proven to be highly useful to determine population genetic parameters as well as prediction of genetic risk or value. We present a series of statistical modelling approaches that use prior biological information for evaluating the collective action... regions and gene ontologies) that provide better model fit and increase predictive ability of the statistical model for this trait...

  6. Genomic dairy cattle breeding

    DEFF Research Database (Denmark)

    Mark, Thomas; Sandøe, Peter

    2010-01-01

    ...the thoughts of breeders and other stakeholders on how to best make use of genomic breeding in the future. Intensive breeding has played a major role in securing dramatic increases in milk yield since the Second World War. Until recently, the main focus in dairy cattle breeding was on production traits... it less accountable to the concern of private farmers for the welfare of their animals. It is argued that there is a need to mobilise a wide range of stakeholders to monitor developments and maintain pressure on breeding companies so that they are aware of the need to take precautionary measures to avoid...

  7. Organizational heterogeneity of vertebrate genomes.

    Science.gov (United States)

    Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

    2012-01-01

    Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
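
    The k-mer scoring that underlies this kind of analysis can be sketched as follows. This is plain exact k-mer counting normalized to frequencies, plus a simple sum-of-absolute-differences distance between two spectra; it is an assumption-laden stand-in and not necessarily the scoring scheme or distance used in the Compositional Spectra method itself. The function names are invented for the example.

```python
from collections import Counter


def compositional_spectrum(seq, k):
    """Relative frequency of every k-mer observed in seq.

    Windows containing characters other than A, C, G, T (e.g. N) are skipped.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {kmer: n / total for kmer, n in counts.items()}


def spectrum_distance(a, b):
    """Sum of absolute frequency differences over the union of k-mers."""
    return sum(abs(a.get(kmer, 0.0) - b.get(kmer, 0.0)) for kmer in set(a) | set(b))


# Compare two toy segments; on real data the segments would be chromosome
# regions or genome fractions such as coding, repetitive and remaining DNA.
left = compositional_spectrum("ACGTACGTAC", 2)
right = compositional_spectrum("AAAAAAAAAA", 2)
print(spectrum_distance(left, right))
```

    Comparing the spectra of different segments of one genome gives one crude measure of heterogeneity, independent of segment length because frequencies are normalized.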

  8. Organizational heterogeneity of vertebrate genomes.

    Directory of Open Access Journals (Sweden)

    Svetlana Frenkel

    Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.

  9. Genome engineering in Vibrio cholerae

    DEFF Research Database (Denmark)

    Val, Marie-Eve; Skovgaard, Ole; Ducos-Galand, Magaly

    2012-01-01

    Although bacteria with multipartite genomes are prevalent, our knowledge of the mechanisms maintaining their genome is very limited, and much remains to be learned about the structural and functional interrelationships of multiple chromosomes. Owing to its bi-chromosomal genome architecture and its....... This difficulty was surmounted using a unique and powerful strategy based on massive rearrangement of prokaryotic genomes. We developed a site-specific recombination-based engineering tool, which allows targeted, oriented, and reciprocal DNA exchanges. Using this genetic tool, we obtained a panel of V. cholerae...

  10. Genome Writing: Current Progress and Related Applications

    Directory of Open Access Journals (Sweden)

    Yueqiang Wang

    2018-02-01

    The ultimate goal of synthetic biology is to build customized cells or organisms to meet specific industrial or medical needs. The most important part of the customized cell is a synthetic genome. Advanced genomic writing technologies are required to build such an artificial genome. The recent, partially completed synthetic yeast genome project represents a milestone in this field. In this mini review, we briefly introduce the techniques for de novo genome synthesis and genome editing. Furthermore, we summarize recent research progress and highlight several applications in the synthetic genome field. Finally, we discuss current challenges and future prospects. Keywords: Synthetic biology, Genome writing, Genome editing, Bioethics, Biosafety

  11. Comparative genomics of Lactobacillus and other LAB

    DEFF Research Database (Denmark)

    Wassenaar, Trudy M.; Lukjancenko, Oksana

    2014-01-01

    The genomes of 66 LABs, belonging to five different genera, were compared for genome size and gene content. The analyzed genomes included 37 Lactobacillus genomes of 17 species, six Lactococcus lactis genomes, four Leuconostoc genomes of three species, six Streptococcus genomes of two species... that of the others, with the two Streptococcus species having the shortest genomes. The widest distribution in genome content was observed for Lactobacillus. The number of tRNA and rRNA gene copies varied considerably, with exceptionally high numbers observed for Lb. delbrueckii, while these numbers were relatively...

  12. Genome Update: alignment of bacterial chromosomes

    DEFF Research Database (Denmark)

    Ussery, David; Jensen, Mette; Poulsen, Tine Rugh

    2004-01-01

    There are four new microbial genomes listed in this month's Genome Update, three belonging to Gram-positive bacteria and one belonging to an archaeon that lives at pH 0; all of these genomes are listed in Table 1. The method of genome comparison this month is that of genome alignment and, as an ...

  13. Insights into structural variations and genome rearrangements in prokaryotic genomes.

    Science.gov (United States)

    Periwal, Vinita; Scaria, Vinod

    2015-01-01

    Structural variations (SVs) are genomic rearrangements that affect fairly large fragments of DNA. Most SVs, such as inversions, deletions and translocations, have been studied largely in the context of genetic diseases in eukaryotes. However, recent studies demonstrate that genome rearrangements can also have a profound impact on prokaryotic genomes, leading to altered cell phenotype. In contrast to single-nucleotide variations, SVs provide a much deeper insight into the organization of bacterial genomes at a much better resolution. SVs can confer change in gene copy number, creation of new genes, altered gene expression and many other functional consequences. High-throughput technologies have now made it possible to explore SVs at a much more refined resolution in bacterial genomes. Through this review, we aim to highlight the importance of the less explored field of SVs in prokaryotic genomes and their impact. We also discuss their potential applicability in the emerging fields of synthetic biology and genome engineering, where targeted SVs could serve to create sophisticated and accurate genome editing. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. Comparative genomics reveals insights into avian genome evolution and adaptation

    DEFF Research Database (Denmark)

    Zhang, Guojie; Li, Cai; Li, Qiye

    2014-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, ...

  15. Pre-genomic, genomic and post-genomic study of microbial communities involved in bioenergy.

    Science.gov (United States)

    Rittmann, Bruce E; Krajmalnik-Brown, Rosa; Halden, Rolf U

    2008-08-01

    Microorganisms can produce renewable energy in large quantities and without damaging the environment or disrupting food supply. The microbial communities must be robust and self-stabilizing, and their essential syntrophies must be managed. Pre-genomic, genomic and post-genomic tools can provide crucial information about the structure and function of these microbial communities. Applying these tools will help accelerate the rate at which microbial bioenergy processes move from intriguing science to real-world practice.

  16. Genomic instability and radiation

    Energy Technology Data Exchange (ETDEWEB)

    Little, John B [Harvard School of Public Health, Boston, MA 02115 (United States)

    2003-06-01

    Genomic instability is a hallmark of cancer cells, and is thought to be involved in the process of carcinogenesis. Indeed, a number of rare genetic disorders associated with a predisposition to cancer are characterised by genomic instability occurring in somatic cells. Of particular interest is the observation that transmissible instability can be induced in somatic cells from normal individuals by exposure to ionising radiation, leading to a persistent enhancement in the rate at which mutations and chromosomal aberrations arise in the progeny of the irradiated cells after many generations of replication. If such induced instability is involved in radiation carcinogenesis, it would imply that the initial carcinogenic event may not be a rare mutation occurring in a specific gene or set of genes. Rather, radiation may induce a process of instability in many cells in a population, enhancing the rate at which the multiple gene mutations necessary for the development of cancer may arise in a given cell lineage. Furthermore, radiation could act at any stage in the development of cancer by facilitating the accumulation of the remaining genetic events required to produce a fully malignant tumour. The experimental evidence for such induced instability is reviewed. (review)

  17. Genomic instability and radiation

    International Nuclear Information System (INIS)

    Little, John B

    2003-01-01

    Genomic instability is a hallmark of cancer cells, and is thought to be involved in the process of carcinogenesis. Indeed, a number of rare genetic disorders associated with a predisposition to cancer are characterised by genomic instability occurring in somatic cells. Of particular interest is the observation that transmissible instability can be induced in somatic cells from normal individuals by exposure to ionising radiation, leading to a persistent enhancement in the rate at which mutations and chromosomal aberrations arise in the progeny of the irradiated cells after many generations of replication. If such induced instability is involved in radiation carcinogenesis, it would imply that the initial carcinogenic event may not be a rare mutation occurring in a specific gene or set of genes. Rather, radiation may induce a process of instability in many cells in a population, enhancing the rate at which the multiple gene mutations necessary for the development of cancer may arise in a given cell lineage. Furthermore, radiation could act at any stage in the development of cancer by facilitating the accumulation of the remaining genetic events required to produce a fully malignant tumour. The experimental evidence for such induced instability is reviewed. (review)

  18. Theory of microbial genome evolution

    Science.gov (United States)

    Koonin, Eugene

    Bacteria and archaea have small genomes tightly packed with protein-coding genes. This compactness is commonly perceived as evidence of adaptive genome streamlining caused by strong purifying selection in large microbial populations. In such populations, even the small cost incurred by nonfunctional DNA because of extra energy and time expenditure is thought to be sufficient for this extra genetic material to be eliminated by selection. However, contrary to the predictions of this model, there exists a consistent, positive correlation between the strength of selection at the protein sequence level, measured as the ratio of nonsynonymous to synonymous substitution rates, and microbial genome size. By fitting the genome size distributions in multiple groups of prokaryotes to predictions of mathematical models of population evolution, we show that only models in which acquisition of additional genes is, on average, slightly beneficial yield a good fit to genomic data. Thus, the number of genes in prokaryotic genomes seems to reflect the equilibrium between the benefit of additional genes that diminishes as the genome grows and deletion bias. New genes acquired by microbial genomes, on average, appear to be adaptive. Evolution of bacterial and archaeal genomes involves extensive horizontal gene transfer and gene loss. Many microbes have open pangenomes, where each newly sequenced genome contains more than 10% 'ORFans', genes without detectable homologues in other species. A simple, steady-state evolutionary model reveals two sharply distinct classes of microbial genes, one of which (ORFans) is characterized by effectively instantaneous gene replacement, whereas the other consists of genes with finite, distributed replacement rates. These findings imply a conservative estimate of at least a billion distinct genes in the prokaryotic genomic universe.

  19. Genomic selection: genome-wide prediction in plant improvement.

    Science.gov (United States)

    Desta, Zeratsion Abera; Ortiz, Rodomiro

    2014-09-01

    Association analysis is used to measure relations between markers and quantitative trait loci (QTL). Their estimation ignores genes with small effects that trigger underpinning quantitative traits. By contrast, genome-wide selection estimates marker effects across the whole genome on the target population based on a prediction model developed in the training population (TP). Whole-genome prediction models estimate all marker effects in all loci and capture small QTL effects. Here, we review several genomic selection (GS) models with respect to both the prediction accuracy and genetic gain from selection. Phenotypic selection or marker-assisted breeding protocols can be replaced by selection, based on whole-genome predictions in which phenotyping updates the model to build up the prediction accuracy. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Parasite Genome Projects and the Trypanosoma cruzi Genome Initiative

    Directory of Open Access Journals (Sweden)

    Wim Degrave

    1997-11-01

    Since the start of the human genome project, a great number of genome projects on other "model" organisms have been initiated, some of them already completed. Several initiatives have also been started on parasite genomes, mainly through support from WHO/TDR, involving North-South and South-South collaborations, and there are great hopes that these initiatives will lead to new tools for disease control and prevention, as well as to the establishment of genomic research technology in developing countries. The Trypanosoma cruzi genome project, using the clone CL-Brener as starting point, has made considerable progress through the concerted action of more than 20 laboratories, most of them in the South. A brief overview of the current state of the project is given.

  1. A Taste of Algal Genomes from the Joint Genome Institute

    Energy Technology Data Exchange (ETDEWEB)

    Kuo, Alan; Grigoriev, Igor

    2012-06-17

    Algae play profound roles in aquatic food chains and the carbon cycle, can impose health and economic costs through toxic blooms, provide models for the study of symbiosis, photosynthesis, and eukaryotic evolution, and are candidate sources for bio-fuels; all of these research areas are part of the mission of DOE's Joint Genome Institute (JGI). To date JGI has sequenced, assembled, annotated, and released to the public the genomes of 18 species and strains of algae, sampling almost all of the major clades of photosynthetic eukaryotes. With more algal genomes currently undergoing analysis, JGI continues its commitment to driving forward basic and applied algal science. Among these ongoing projects are the pan-genome of the dominant coccolithophore Emiliania huxleyi, the interrelationships between the 4 genomes in the nucleomorph-containing Bigelowiella natans and Guillardia theta, and the search for symbiosis genes of lichens.

  2. I Like Chocolate Ice Cream: A Lesson in Thinking Civics

    Science.gov (United States)

    Waterson, Robert A.

    2012-01-01

    In curricula that encourage philosophy as having an integral role in educational programs, students get the opportunity to wonder and speculate, in a natural state surrounded by questions. A. K. Salmon notes that when thinking becomes a part of a young child's routine, the child becomes more open and responsive to situations that require thinking…

  3. I Imagine, I Experience, I Like: The False Experience Effect

    OpenAIRE

    Priyali Rajagopal; Nicole Votolato Montgomery

    2011-01-01

    False memories refer to the mistaken belief that an event that did not occur did occur. Much of the research on false memories has focused on the antecedents to and the characteristics of such memories, with little focus on the consequences of false memories. In this research, we show that exposure to an imagery-evoking ad can result in an erroneous belief that an individual has experienced the advertised brand. We also demonstrate that such false experiential beliefs function akin to genuine...

  4. OryzaGenome: Genome Diversity Database of Wild Oryza Species

    KAUST Repository

    Ohyanagi, Hajime

    2015-11-18

    The species in the genus Oryza, encompassing nine genome types and 23 species, are a rich genetic resource and may have applications in deeper genomic analyses aiming to understand the evolution of plant genomes. With the advancement of next-generation sequencing (NGS) technology, a flood of Oryza species reference genomes and genomic variation information has become available in recent years. This genomic information, combined with the comprehensive phenotypic information that we are accumulating in our Oryzabase, can serve as an excellent genotype-phenotype association resource for analyzing rice functional and structural evolution, and the associated diversity of the Oryza genus. Here we integrate our previous and future phenotypic/habitat information and newly determined genotype information into a united repository, named OryzaGenome, providing the variant information with hyperlinks to Oryzabase. The current version of OryzaGenome includes genotype information of 446 O. rufipogon accessions derived by imputation and of 17 accessions derived by imputation-free deep sequencing. Two variant viewers are implemented: SNP Viewer as a conventional genome browser interface and Variant Table as a text-based browser for precise inspection of each variant one by one. Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all of the closely and distantly related wild Oryza species from the NIG Wild Rice Collection will be available in future releases. All of the resources can be accessed through http://viewer.shigen.info/oryzagenome/.

  5. Cocoa/Cotton Comparative Genomics

    Science.gov (United States)

    With genome sequence from two members of the Malvaceae family recently made available, we are exploring syntenic relationships, gene content, and evolutionary trajectories between the cacao and cotton genomes. An assembly of cacao (Theobroma cacao) using Illumina and 454 sequence technology yielded ...

  6. Genomic selection in dairy cattle

    NARCIS (Netherlands)

    Roos, de A.P.W.

    2011-01-01

    The objectives of this Ph.D. thesis were (1) to optimise genomic selection in dairy cattle with respect to the accuracy of predicting total genetic merit and (2) to optimise a dairy cattle breeding program using genomic selection. The study was performed using a combination of real data sets and

  7. Cloud computing for comparative genomics

    Directory of Open Access Journals (Sweden)

    Pivovarov Rimma

    2010-05-01

    Background: Large comparative genomics studies and tools are becoming increasingly more compute-expensive as the number of available genome sequences continues to rise. The capacity and cost of local computing infrastructures are likely to become prohibitive with the increase, especially as the breadth of questions continues to rise. Alternative computing architectures, in particular cloud computing environments, may help alleviate this increasing pressure and enable fast, large-scale, and cost-effective comparative genomics strategies going forward. To test this, we redesigned a typical comparative genomics algorithm, the reciprocal smallest distance algorithm (RSD), to run within Amazon's Elastic Computing Cloud (EC2). We then employed the RSD-cloud for ortholog calculations across a wide selection of fully sequenced genomes. Results: We ran more than 300,000 RSD-cloud processes within the EC2. These jobs were farmed simultaneously to 100 high capacity compute nodes using the Amazon Web Service Elastic Map Reduce and included a wide mix of large and small genomes. The total computation time took just under 70 hours and cost a total of $6,302 USD. Conclusions: The effort to transform existing comparative genomics algorithms from local compute infrastructures is not trivial. However, the speed and flexibility of cloud computing environments provides a substantial boost with manageable cost. The procedure designed to transform the RSD algorithm into a cloud-ready application is readily adaptable to similar comparative genomics problems.

  8. Cloud computing for comparative genomics.

    Science.gov (United States)

    Wall, Dennis P; Kudtarkar, Parul; Fusaro, Vincent A; Pivovarov, Rimma; Patil, Prasad; Tonellato, Peter J

    2010-05-18

    Large comparative genomics studies and tools are becoming increasingly more compute-expensive as the number of available genome sequences continues to rise. The capacity and cost of local computing infrastructures are likely to become prohibitive with the increase, especially as the breadth of questions continues to rise. Alternative computing architectures, in particular cloud computing environments, may help alleviate this increasing pressure and enable fast, large-scale, and cost-effective comparative genomics strategies going forward. To test this, we redesigned a typical comparative genomics algorithm, the reciprocal smallest distance algorithm (RSD), to run within Amazon's Elastic Computing Cloud (EC2). We then employed the RSD-cloud for ortholog calculations across a wide selection of fully sequenced genomes. We ran more than 300,000 RSD-cloud processes within the EC2. These jobs were farmed simultaneously to 100 high capacity compute nodes using the Amazon Web Service Elastic Map Reduce and included a wide mix of large and small genomes. The total computation time took just under 70 hours and cost a total of $6,302 USD. The effort to transform existing comparative genomics algorithms from local compute infrastructures is not trivial. However, the speed and flexibility of cloud computing environments provides a substantial boost with manageable cost. The procedure designed to transform the RSD algorithm into a cloud-ready application is readily adaptable to similar comparative genomics problems.
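
    The figures quoted in this abstract can be turned into rough per-unit costs. The sketch below is only back-of-envelope arithmetic on the rounded numbers above; it assumes all 100 nodes ran for the full 70 hours and that exactly 300,000 processes were run, neither of which the abstract states precisely ("just under" and "more than"). The function and variable names are illustrative and do not come from the paper.

```python
# Back-of-envelope unit costs from the figures quoted in the abstract.
# Assumptions (not from the source): every node ran for the whole wall-clock
# time, and the rounded counts are treated as exact.

def unit_costs(processes, nodes, wall_hours, total_cost_usd):
    """Return node-hours and approximate per-unit costs for a cloud run."""
    node_hours = nodes * wall_hours
    return {
        "node_hours": node_hours,
        "usd_per_node_hour": total_cost_usd / node_hours,
        "usd_per_process": total_cost_usd / processes,
        "processes_per_node_hour": processes / node_hours,
    }

# Values as reported: >300,000 processes, 100 nodes, <70 hours, $6,302.
rsd_run = unit_costs(processes=300_000, nodes=100, wall_hours=70,
                     total_cost_usd=6302)

for name, value in rsd_run.items():
    print(f"{name}: {value:.3f}")
```

    Under these assumptions the run comes to 7,000 node-hours, roughly $0.90 per node-hour and about two cents per RSD process. Because the process count is a lower bound and the run time an upper bound, the true per-process cost would be somewhat lower.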

  9. The promise of insect genomics

    DEFF Research Database (Denmark)

    Grimmelikhuijzen, Cornelis J P; Cazzamali, Giuseppe; Williamson, Michael

    2007-01-01

    Insects are the largest animal group in the world and are ecologically and economically extremely important. This importance of insects is reflected by the existence of currently 24 insect genome projects. Our perspective discusses the state-of-the-art of these genome projects and the impacts...

  10. Bioinformatics of genomic association mapping

    NARCIS (Netherlands)

    Vaez Barzani, Ahmad

    2015-01-01

    In this thesis we present an overview of bioinformatics-based approaches for genomic association mapping, with emphasis on human quantitative traits and their contribution to complex diseases. We aim to provide a comprehensive walk-through of the classic steps of genomic association mapping

  11. Molecular characterization of human T-cell lymphotropic virus type 1 full and partial genomes by Illumina massively parallel sequencing technology.

    Directory of Open Access Journals (Sweden)

    Rodrigo Pessôa

    BACKGROUND: Here, we report on the partial and full-length genomic (FLG) variability of HTLV-1 sequences from 90 well-characterized subjects, including 48 HTLV-1 asymptomatic carriers (ACs), 35 HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP) and 7 adult T-cell leukemia/lymphoma (ATLL) patients, using an Illumina paired-end protocol. METHODS: Blood samples were collected from 90 individuals, and DNA was extracted from the PBMCs to measure the proviral load and to amplify the HTLV-1 FLG from two overlapping fragments. The amplified PCR products were subjected to deep sequencing. The sequencing data were assembled, aligned, and mapped against the HTLV-1 genome with sufficient genetic resemblance and utilized for further phylogenetic analysis. RESULTS: A high-throughput sequencing-by-synthesis instrument was used to obtain an average of 3210- and 5200-fold coverage of the partial (n = 14) and FLG (n = 76) data from the HTLV-1 strains, respectively. The results based on the phylogenetic trees of consensus sequences from partial and FLGs revealed that 86 (95.5%) individuals were infected with the transcontinental sub-subtypes of the cosmopolitan subtype (aA) and that 4 individuals (4.5%) were infected with the Japanese sub-subtypes (aB). A comparison of the nucleotide and amino acids of the FLG between the three clinical settings yielded no correlation between the sequenced genotype and clinical outcomes. The evolutionary relationships among the HTLV sequences were inferred from nucleotide sequence, and the results are consistent with the hypothesis that there were multiple introductions of the transcontinental subtype in Brazil. CONCLUSIONS: This study has increased the number of subtype aA full-length genomes from 8 to 81 and HTLV-1 aB from 2 to 5 sequences. The overall data confirmed that the cosmopolitan transcontinental sub-subtypes were the most prevalent in the Brazilian population. It is hoped that this valuable genomic data

  12. Molecular characterization of human T-cell lymphotropic virus type 1 full and partial genomes by Illumina massively parallel sequencing technology.

    Science.gov (United States)

    Pessôa, Rodrigo; Watanabe, Jaqueline Tomoko; Nukui, Youko; Pereira, Juliana; Casseb, Jorge; Kasseb, Jorge; de Oliveira, Augusto César Penalva; Segurado, Aluisio Cotrim; Sanabani, Sabri Saeed

    2014-01-01

    Here, we report on the partial and full-length genomic (FLG) variability of HTLV-1 sequences from 90 well-characterized subjects, including 48 HTLV-1 asymptomatic carriers (ACs), 35 HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP) and 7 adult T-cell leukemia/lymphoma (ATLL) patients, using an Illumina paired-end protocol. Blood samples were collected from 90 individuals, and DNA was extracted from the PBMCs to measure the proviral load and to amplify the HTLV-1 FLG from two overlapping fragments. The amplified PCR products were subjected to deep sequencing. The sequencing data were assembled, aligned, and mapped against the HTLV-1 genome with sufficient genetic resemblance and utilized for further phylogenetic analysis. A high-throughput sequencing-by-synthesis instrument was used to obtain an average of 3210- and 5200-fold coverage of the partial (n = 14) and FLG (n = 76) data from the HTLV-1 strains, respectively. The results based on the phylogenetic trees of consensus sequences from partial and FLGs revealed that 86 (95.5%) individuals were infected with the transcontinental sub-subtypes of the cosmopolitan subtype (aA) and that 4 individuals (4.5%) were infected with the Japanese sub-subtypes (aB). A comparison of the nucleotide and amino acids of the FLG between the three clinical settings yielded no correlation between the sequenced genotype and clinical outcomes. The evolutionary relationships among the HTLV sequences were inferred from nucleotide sequence, and the results are consistent with the hypothesis that there were multiple introductions of the transcontinental subtype in Brazil. This study has increased the number of subtype aA full-length genomes from 8 to 81 and HTLV-1 aB from 2 to 5 sequences. The overall data confirmed that the cosmopolitan transcontinental sub-subtypes were the most prevalent in the Brazilian population. It is hoped that this valuable genomic data will add to our current understanding of the

  13. Allele coding in genomic evaluation

    DEFF Research Database (Denmark)

    Strandén, Ismo; Christensen, Ole Fredslund

    2011-01-01

    Genomic data are used in animal breeding to assist genetic evaluation. Several models to estimate genomic breeding values have been studied. In general, two approaches have been used. One approach estimates the marker effects first and then, genomic breeding values are obtained by summing marker effects. In the second approach, genomic breeding values are estimated directly using an equivalent model with a genomic relationship matrix. Allele coding is the method chosen to assign values to the regression coefficients in the statistical model. A common allele coding is zero for the homozygous genotype of the first allele, one for the heterozygote, and two for the homozygous genotype for the other allele. Another common allele coding changes these regression coefficients by subtracting a value from each marker such that the mean of regression coefficients is zero within each marker. We call...
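
    The two allele codings described in this abstract, and the first approach of summing marker effects, can be illustrated with a small sketch. It is not the authors' implementation: the genotypes, allele labels, and marker effects below are made up for illustration, and the function names are hypothetical. The centered coding here subtracts the per-marker mean of the 0/1/2 codes, which is one way to make the coefficients average to zero within each marker.

```python
# Illustrative sketch of 0/1/2 allele coding and mean-centered coding.
# All data and names are hypothetical, not taken from the paper.

def code_012(genotype, first_allele):
    """Count copies of the allele that is NOT first_allele (0, 1 or 2)."""
    return sum(1 for allele in genotype if allele != first_allele)

def center_by_marker(coded):
    """Subtract each marker's (column's) mean so it averages to zero."""
    n_animals = len(coded)
    means = [sum(column) / n_animals for column in zip(*coded)]
    return [[x - m for x, m in zip(row, means)] for row in coded]

def breeding_value(coded_row, marker_effects):
    """First approach in the abstract: sum of coded genotype x marker effect."""
    return sum(x * b for x, b in zip(coded_row, marker_effects))

# Four animals genotyped at two markers (alleles A/B and G/T).
genotypes = [["AA", "GT"], ["AB", "TT"], ["BB", "TT"], ["AA", "GG"]]
first_alleles = ["A", "G"]

coded = [[code_012(g, a) for g, a in zip(row, first_alleles)]
         for row in genotypes]
centered = center_by_marker(coded)

effects = [0.5, -0.25]  # hypothetical marker effects
values = [breeding_value(row, effects) for row in centered]

print(coded)     # 0/1/2 coding
print(centered)  # each column now has mean zero
print(values)    # summed marker effects per animal
```

    Centering shifts every animal's summed value by the same constant, so the differences between animals are the same under either coding in this simple sum.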

  14. Pathophysiology of MDS: genomic aberrations.

    Science.gov (United States)

    Ichikawa, Motoshi

    2016-01-01

    Myelodysplastic syndromes (MDS) are characterized by clonal proliferation of hematopoietic stem/progenitor cells and their apoptosis, and show a propensity to progress to acute myelogenous leukemia (AML). Although MDS are recognized as neoplastic diseases caused by genomic aberrations of hematopoietic cells, the details of the genetic abnormalities underlying disease development have not as yet been fully elucidated due to difficulties in analyzing chromosomal abnormalities. Recent advances in comprehensive analyses of disease genomes including whole-genome sequencing technologies have revealed the genomic abnormalities in MDS. Surprisingly, gene mutations were found in approximately 80-90% of cases with MDS, and the novel mutations discovered with these technologies included previously unknown, MDS-specific, mutations such as those of the genes in the RNA-splicing machinery. It is anticipated that these recent studies will shed new light on the pathophysiology of MDS due to genomic aberrations.

  15. Chemical biology on the genome.

    Science.gov (United States)

    Balasubramanian, Shankar

    2014-08-15

    In this article I discuss studies towards understanding the structure and function of DNA in the context of genomes from the perspective of a chemist. The first area I describe concerns the studies that led to the invention and subsequent development of a method for sequencing DNA on a genome scale at high speed and low cost, now known as Solexa/Illumina sequencing. The second theme will feature the four-stranded DNA structure known as a G-quadruplex with a focus on its fundamental properties, its presence in cellular genomic DNA and the prospects for targeting such a structure in cells with small molecules. The final topic for discussion is naturally occurring chemically modified DNA bases with an emphasis on chemistry for decoding (or sequencing) such modifications in genomic DNA. The genome is a fruitful topic to be further elucidated by the creation and application of chemical approaches. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. IMA Genome-F 5G

    OpenAIRE

    Wingfield, Brenda D.; Barnes, Irene; Wilhelm de Beer, Z.; De Vos, Lieschen; Duong, Tuan A.; Kanzi, Aquillah M.; Naidoo, Kershney; Nguyen, Hai D.T.; Santana, Quentin C.; Sayari, Mohammad; Seifert, Keith A.; Steenkamp, Emma T.; Trollip, Conrad; van der Merwe, Nicolaas A.; van der Nest, Magriet A.

    2015-01-01

    The genomes of Ceratocystis eucalypticola, Chrysoporthe cubensis, Chrysoporthe deuterocubensis, Davidsoniella virescens, Fusarium temperatum, Graphilbum fragrans, Penicillium nordicum and Thielaviopsis musarum are presented in this genome announcement. These seven genomes are from plant pathogens and otherwise economically important fungal species. The genome sizes range from 28 Mb in the case of T. musarum to 45 Mb for Fusarium temperatum. These genomes include the first reports of genomes f...

  17. [Preface for genome editing special issue].

    Science.gov (United States)

    Gu, Feng; Gao, Caixia

    2017-10-25

    Genome editing technology, as an innovative biotechnology, has been widely used for editing the genomes of model organisms, animals, plants and microbes. CRISPR/Cas9-based genome editing technology shows great value and potential in functional genomics, improved breeding and the treatment of genetic diseases. The present special issue summarizes the principles and applications of genome editing techniques. The advantages and disadvantages of current genome editing technology and its future prospects are also highlighted.

  18. Privacy in the Genomic Era.

    Science.gov (United States)

    Naveed, Muhammad; Ayday, Erman; Clayton, Ellen W; Fellay, Jacques; Gunter, Carl A; Hubaux, Jean-Pierre; Malin, Bradley A; Wang, Xiaofeng

    2015-09-01

    Genome sequencing technology has advanced at a rapid pace and it is now possible to generate highly-detailed genotypes inexpensively. The collection and analysis of such data has the potential to support various applications, including personalized medical services. While the benefits of the genomics revolution are trumpeted by the biomedical community, the increased availability of such data has major implications for personal privacy; notably because the genome has certain essential features, which include (but are not limited to) (i) an association with traits and certain diseases, (ii) identification capability (e.g., forensics), and (iii) revelation of family relationships. Moreover, direct-to-consumer DNA testing increases the likelihood that genome data will be made available in less regulated environments, such as the Internet and for-profit companies. The problem of genome data privacy thus resides at the crossroads of computer science, medicine, and public policy. While the computer scientists have addressed data privacy for various data types, there has been less attention dedicated to genomic data. Thus, the goal of this paper is to provide a systematization of knowledge for the computer science community. In doing so, we address some of the (sometimes erroneous) beliefs of this field and we report on a survey we conducted about genome data privacy with biomedical specialists. Then, after characterizing the genome privacy problem, we review the state-of-the-art regarding privacy attacks on genomic data and strategies for mitigating such attacks, as well as contextualizing these attacks from the perspective of medicine and public policy. This paper concludes with an enumeration of the challenges for genome data privacy and presents a framework to systematize the analysis of threats and the design of countermeasures as the field moves forward.

  19. Privacy in the Genomic Era

    Science.gov (United States)

    NAVEED, MUHAMMAD; AYDAY, ERMAN; CLAYTON, ELLEN W.; FELLAY, JACQUES; GUNTER, CARL A.; HUBAUX, JEAN-PIERRE; MALIN, BRADLEY A.; WANG, XIAOFENG

    2015-01-01

    Genome sequencing technology has advanced at a rapid pace and it is now possible to generate highly-detailed genotypes inexpensively. The collection and analysis of such data has the potential to support various applications, including personalized medical services. While the benefits of the genomics revolution are trumpeted by the biomedical community, the increased availability of such data has major implications for personal privacy; notably because the genome has certain essential features, which include (but are not limited to) (i) an association with traits and certain diseases, (ii) identification capability (e.g., forensics), and (iii) revelation of family relationships. Moreover, direct-to-consumer DNA testing increases the likelihood that genome data will be made available in less regulated environments, such as the Internet and for-profit companies. The problem of genome data privacy thus resides at the crossroads of computer science, medicine, and public policy. While the computer scientists have addressed data privacy for various data types, there has been less attention dedicated to genomic data. Thus, the goal of this paper is to provide a systematization of knowledge for the computer science community. In doing so, we address some of the (sometimes erroneous) beliefs of this field and we report on a survey we conducted about genome data privacy with biomedical specialists. Then, after characterizing the genome privacy problem, we review the state-of-the-art regarding privacy attacks on genomic data and strategies for mitigating such attacks, as well as contextualizing these attacks from the perspective of medicine and public policy. This paper concludes with an enumeration of the challenges for genome data privacy and presents a framework to systematize the analysis of threats and the design of countermeasures as the field moves forward. PMID:26640318

  20. The genomes and comparative genomics of Lactobacillus delbrueckii phages.

    Science.gov (United States)

    Riipinen, Katja-Anneli; Forsman, Päivi; Alatossava, Tapani

    2011-07-01

    Lactobacillus delbrueckii phages are a great source of genetic diversity. Here, the genome sequences of Lb. delbrueckii phages LL-Ku, c5 and JCL1032 were analyzed in detail, and the genetic diversity of Lb. delbrueckii phages belonging to different taxonomic groups was explored. The lytic isometric group b phages LL-Ku (31,080 bp) and c5 (31,841 bp) showed a minimum nucleotide sequence identity of 90% over about three-fourths of their genomes. The genomic locations of their lysis modules were unique, and the genomes featured several putative overlapping transcription units of genes. LL-Ku and c5 virions displayed peptidoglycan hydrolytic activity associated with a ~36-kDa protein similar in size to the endolysin. Unexpectedly, the 49,433-bp genome of the prolate phage JCL1032 (temperate, group c) revealed a conserved gene order within its structural genes. Lb. delbrueckii phages representing groups a (a phage LL-H), b and c possessed only limited protein sequence homology. Genomic comparison of LL-Ku and c5 suggested that diversification of Lb. delbrueckii phages is mainly due to insertions, deletions and recombination. For the first time, the complete genome sequences of group b and c Lb. delbrueckii phages are reported.

  1. Genomics technologies to study structural variations in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Cardone Maria Francesca

    2016-01-01

    Grapevine is one of the most important crop plants in the world. Genomics resources for the grapevine genome have recently expanded greatly, supporting growing efforts in molecular breeding. Current cultivars display a great level of inter-specific differentiation that needs to be investigated to reach a comprehensive understanding of the genetic basis of phenotypic differences, and to find responsible genes selected by cross breeding programs. While there have been significant advances in resolving the pattern and nature of single nucleotide polymorphisms (SNPs) on plant genomes, few data are available on copy number variation (CNV). Furthermore, association between structural variations and phenotypes has been described in only a few cases. We combined high-throughput biotechnologies and bioinformatics tools to reveal the first inter-varietal atlas of structural variation (SV) for the grapevine genome. We sequenced and compared four table grape cultivars with the Pinot noir inbred line PN40024 genome as the reference. We found that roughly 8% of the grapevine genome is affected by genomic variation. Taking into account the phenotypic differences among the studied varieties, we compared SVs among them and the reference, and then performed an in-depth analysis of the gene content of polymorphic regions. This allowed us to identify genes showing differences in copy number as putative functional candidates for important traits in grapevine cultivation.

  2. Marine Bacterial Genomics

    DEFF Research Database (Denmark)

    Machado, Henrique

    For decades, terrestrial microorganisms have been used as sources of countless enzymes and chemical compounds that have been produced by pharmaceutical and biotech companies and used by mankind. There is a need for new chemical compounds, including antibiotics, new enzymatic activities and new microorganisms to be used as cell factories for production. Therefore exploitation of new microbial niches and use of different strategies is an opportunity to boost discoveries. Even though scientists have started to explore several habitats other than the terrestrial ones, the marine environment stands out as a hitherto under-explored niche. This thesis work uses high-throughput sequencing technologies on a collection of marine bacteria established during the Galathea 3 expedition, with the purpose of unraveling new biodiversity and new bioactivities. Several tools were used for genomic analysis in order...

  3. The South Asian genome.

    Directory of Open Access Journals (Sweden)

    John C Chambers

    The genetic sequence variation of people from the Indian subcontinent, who comprise one-quarter of the world's population, is not well described. We carried out whole genome sequencing of 168 South Asians, along with whole-exome sequencing of 147 South Asians to provide deeper characterisation of coding regions. We identify 12,962,155 autosomal sequence variants, including 2,946,861 new SNPs and 312,738 novel indels. This catalogue of SNPs and indels amongst South Asians provides the first comprehensive map of genetic variation in this major human population, and reveals evidence for selective pressures on genes involved in skin biology, metabolism, infection and immunity. Our results will accelerate the search for the genetic variants underlying susceptibility to disorders such as type-2 diabetes and cardiovascular disease which are highly prevalent amongst South Asians.

  4. Comparative RNA genomics

    DEFF Research Database (Denmark)

    Backofen, Rolf; Gorodkin, Jan; Hofacker, Ivo L.

    2018-01-01

    Over the last two decades it has become clear that RNA is much more than just a boring intermediate in protein expression. Ancient RNAs still appear in the core information metabolism and comprise a surprisingly large component in bacterial gene regulation. A common theme with these types of mostly small RNAs is their reliance on conserved secondary structures. Large scale sequencing projects, on the other hand, have profoundly changed our understanding of eukaryotic genomes. Pervasively transcribed, they give rise to a plethora of large and evolutionarily extremely flexible noncoding RNAs that exert a vastly diverse array of molecule functions. In this chapter we provide a necessarily incomplete overview of the current state of comparative analysis of noncoding RNAs, emphasizing computational approaches as a means to gain a global picture of the modern RNA world....

  5. Materials Genome Initiative

    Science.gov (United States)

    Vickers, John

    2015-01-01

    The Materials Genome Initiative (MGI) project element is a cross-Center effort that is focused on the integration of computational tools to simulate manufacturing processes and materials behavior. These computational simulations will be utilized to gain understanding of processes and materials behavior to accelerate process development and certification to more efficiently integrate new materials in existing NASA projects and to lead to the design of new materials for improved performance. This NASA effort looks to collaborate with efforts at other government agencies and universities working under the national MGI. MGI plans to develop integrated computational/experimental/processing methodologies for accelerating discovery and insertion of materials to satisfy NASA's unique mission demands. The challenges include validated design tools that incorporate materials properties, processes, and design requirements; and materials process control to rapidly mature emerging manufacturing methods and develop certified manufacturing processes.

  6. Inheritance of the yeast mitochondrial genome

    DEFF Research Database (Denmark)

    Piskur, Jure

    1994-01-01

    Mitochondrion, extrachromosomal genetics, intergenic sequences, genome size, mitochondrial DNA, petite mutation, yeast...

  7. Plantagora: modeling whole genome sequencing and assembly of plant genomes.

    Directory of Open Access Journals (Sweden)

    Roger Barthelson

    BACKGROUND: Genomics studies are being revolutionized by the next generation sequencing technologies, which have made whole genome sequencing much more accessible to the average researcher. Whole genome sequencing with the new technologies is a developing art that, despite the large volumes of data that can be produced, may still fail to provide a clear and thorough map of a genome. The Plantagora project was conceived to address specifically the gap between having the technical tools for genome sequencing and knowing precisely the best way to use them. METHODOLOGY/PRINCIPAL FINDINGS: For Plantagora, a platform was created for generating simulated reads from several different plant genomes of different sizes. The resulting read files mimicked either 454 or Illumina reads, with varying paired end spacing. Thousands of datasets of reads were created, most derived from our primary model genome, rice chromosome one. All reads were assembled with different software assemblers, including Newbler, Abyss, and SOAPdenovo, and the resulting assemblies were evaluated by an extensive battery of metrics chosen for these studies. The metrics included both statistics of the assembly sequences and fidelity-related measures derived by alignment of the assemblies to the original genome source for the reads. The results were presented in a website, which includes a data graphing tool, all created to help the user compare rapidly the feasibility and effectiveness of different sequencing and assembly strategies prior to testing an approach in the lab. Some of our own conclusions regarding the different strategies were also recorded on the website. CONCLUSIONS/SIGNIFICANCE: Plantagora provides a substantial body of information for comparing different approaches to sequencing a plant genome, and some conclusions regarding some of the specific approaches. Plantagora also provides a platform of metrics and tools for studying the process of sequencing and assembly.

  8. Genomes in turmoil: quantification of genome dynamics in prokaryote supergenomes.

    Science.gov (United States)

    Puigbò, Pere; Lobkovsky, Alexander E; Kristensen, David M; Wolf, Yuri I; Koonin, Eugene V

    2014-08-21

    Genomes of bacteria and archaea (collectively, prokaryotes) appear to exist in incessant flux, expanding via horizontal gene transfer and gene duplication, and contracting via gene loss. However, the actual rates of genome dynamics and relative contributions of different types of event across the diversity of prokaryotes are largely unknown, as are the sizes of microbial supergenomes, i.e. pools of genes that are accessible to the given microbial species. We performed a comprehensive analysis of the genome dynamics in 35 groups (34 bacterial and one archaeal) of closely related microbial genomes using a phylogenetic birth-and-death maximum likelihood model to quantify the rates of gene family gain and loss, as well as expansion and reduction. The results show that loss of gene families dominates the evolution of prokaryotes, occurring at approximately three times the rate of gain. The rates of gene family expansion and reduction are typically seven and twenty times less than the gain and loss rates, respectively. Thus, the prevailing mode of evolution in bacteria and archaea is genome contraction, which is partially compensated by the gain of new gene families via horizontal gene transfer. However, the rates of gene family gain, loss, expansion and reduction vary within wide ranges, with the most stable genomes showing rates about 25 times lower than the most dynamic genomes. For many groups, the supergenome estimated from the fraction of repetitive gene family gains includes about tenfold more gene families than the typical genome in the group although some groups appear to have vast, 'open' supergenomes. Reconstruction of evolution for groups of closely related bacteria and archaea reveals an extremely rapid and highly variable flux of genes in evolving microbial genomes, demonstrates that extensive gene loss and horizontal gene transfer leading to innovation are the two dominant evolutionary processes, and yields robust estimates of the supergenome size.
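    The relative rates quoted in this abstract can be put on a common scale with a little arithmetic. The sketch below is only an illustration: it assumes the "approximately three times", "seven times less" and "twenty times less" figures can be treated as exact ratios, and the function name and the unit gain rate are hypothetical, not taken from the study.

```python
from fractions import Fraction

def relative_rates(gain=Fraction(1)):
    """Express the four event rates relative to a chosen gene-family gain rate.

    Assumed ratios (read off the abstract, treated as exact for illustration):
      loss      = 3 x gain
      expansion = gain / 7
      reduction = loss / 20
    """
    loss = 3 * gain
    expansion = gain / 7
    reduction = loss / 20
    return {
        "gain": gain,
        "loss": loss,
        "expansion": expansion,
        "reduction": reduction,
        # Negative value means families are lost faster than they are gained.
        "net_family_change": gain - loss,
    }

if __name__ == "__main__":
    for name, value in relative_rates().items():
        print(f"{name}: {float(value):.3f}")
```

    Under these assumed ratios the net change in gene families is negative, which is the arithmetic behind the abstract's statement that genome contraction is the prevailing mode.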

  9. 1000 Bull Genomes - Toward genomic Selection from whole genome sequence Data in Dairy and Beef Cattle

    NARCIS (Netherlands)

    Hayes, B.; Daetwyler, H.D.; Fries, R.; Guldbrandtsen, B.; Lund, M.S.; Boichard, D.A.; Stothard, P.; Veerkamp, R.F.; Hulsegge, B.; Rocha, D.; Tassell, C.; Mullaart, E.; Gredler, B.; Druet, T.; Bagnato, A.; Goddard, M.E.; Chamberlain, H.L.

    2013-01-01

    Genomic prediction of breeding values is now used as the basis for selection of dairy cattle, and in some cases beef cattle, in a number of countries. When genomic prediction was introduced most of the information was thought to be derived from linkage disequilibrium between markers and causative

  10. Comparing Mycobacterium tuberculosis genomes using genome topology networks.

    Science.gov (United States)

    Jiang, Jianping; Gu, Jianlei; Zhang, Liang; Zhang, Chenyi; Deng, Xiao; Dou, Tonghai; Zhao, Guoping; Zhou, Yan

    2015-02-14

    Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene duplication and genome rearrangement, can lead to different phenotypes among strains, and an investigation of genes affected by SVs may extend our knowledge of the relationships between SVs and phenotypes in microbes, especially in pathogenic bacteria. In this work, we introduce a 'Genome Topology Network' (GTN) method based on gene homology and gene locations to analyze genomic SVs and perform phylogenetic analysis. Furthermore, the concept of 'unfixed ortholog' has been proposed, whose members are affected by SVs in genome topology among close species. To improve the precision of 'unfixed ortholog' recognition, a strategy to detect annotation differences and complete gene annotation was applied. To assess the GTN method, a set of thirteen complete M. tuberculosis genomes was analyzed as a case study. GTNs with two different gene homology-assigning methods were built, the Clusters of Orthologous Groups (COG) method and the orthoMCL clustering method, and two phylogenetic trees were constructed accordingly, which may provide additional insights into whole genome-based phylogenetic analysis. We obtained 24 unfixable COG groups, of which most members were related to immunogenicity and drug resistance, such as PPE-repeat proteins (COG5651) and transcriptional regulator TetR gene family members (COG1309). The GTN method has been implemented in PERL and released on our website. The tool can be downloaded from http://homepage.fudan.edu.cn/zhouyan/gtn/, and allows re-annotating the 'lost' genes among closely related genomes, analyzing genes affected by SVs, and performing phylogenetic analysis. With this tool, many immunogenic-related and drug resistance-related genes
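    The idea of a network built from gene homology and gene locations can be illustrated with a toy example. The sketch below is not the published GTN tool (which the abstract says is written in PERL); it is a simplified Python illustration under stated assumptions: each genome is treated as a linear gene order, every gene is already assigned to a homology group, and all function names and identifiers are hypothetical.

```python
def topology_edges(gene_order, group_of):
    """Return undirected edges between the homology groups of adjacent genes.

    gene_order: list of gene identifiers in chromosomal order (treated as
                linear here for simplicity, although bacterial chromosomes
                are usually circular).
    group_of:   dict mapping gene identifier -> homology group label.
    """
    groups = [group_of[g] for g in gene_order if g in group_of]
    return {
        frozenset((left, right))
        for left, right in zip(groups, groups[1:])
        if left != right
    }

def groups_with_changed_neighbors(edges_a, edges_b):
    """Homology groups that take part in an adjacency present in only one genome."""
    changed = edges_a ^ edges_b  # symmetric difference of the two edge sets
    return set().union(*changed) if changed else set()

if __name__ == "__main__":
    # Two toy genomes: the genes in groups G3 and G4 are swapped in genome B.
    groups_a = {"a1": "G1", "a2": "G2", "a3": "G3", "a4": "G4", "a5": "G5"}
    groups_b = {"b1": "G1", "b2": "G2", "b3": "G4", "b4": "G3", "b5": "G5"}
    edges_a = topology_edges(["a1", "a2", "a3", "a4", "a5"], groups_a)
    edges_b = topology_edges(["b1", "b2", "b3", "b4", "b5"], groups_b)
    print(sorted(groups_with_changed_neighbors(edges_a, edges_b)))
```

    Groups whose neighbourhood differs between genomes are candidates for having been affected by a structural variation; how the actual tool defines 'unfixed orthologs' may differ from this simplification.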

  11. A universal genomic coordinate translator for comparative genomics.

    Science.gov (United States)

    Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G

    2014-06-30

    Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases as N^2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across
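    The scaling argument in this abstract (all-to-all alignments grow quadratically, whereas a connected graph of pairwise alignments needs only about N of them) can be sketched in a few lines. The code below is not Kraken's implementation; it is a minimal illustration that assumes each pairwise alignment is available as a function mapping a position in one genome to a position in the neighbouring genome (or None where nothing aligns). All names and the toy offsets are hypothetical.

```python
from collections import deque

def all_to_all_alignments(n):
    """Number of genome pairs that an all-to-all comparison must align."""
    return n * (n - 1) // 2

def spanning_alignments(n):
    """Minimum number of pairwise alignments that connects n genomes."""
    return max(n - 1, 0)

def translate(graph, source, target, position):
    """Carry a coordinate from source to target through pairwise alignments.

    graph: dict genome -> dict neighbour -> callable(position) returning the
           aligned position in the neighbour, or None if unaligned.
    Uses breadth-first search, so the route with the fewest hops is tried first.
    """
    queue = deque([(source, position)])
    seen = {source}
    while queue:
        genome, pos = queue.popleft()
        if genome == target:
            return pos
        for neighbour, mapper in graph.get(genome, {}).items():
            if neighbour in seen:
                continue
            mapped = mapper(pos)
            if mapped is None:
                continue  # this alignment does not cover the position
            seen.add(neighbour)
            queue.append((neighbour, mapped))
    return None

if __name__ == "__main__":
    # Toy alignments: simple offsets stand in for real synteny maps.
    graph = {
        "human": {"mouse": lambda p: p + 100},
        "mouse": {"rat": lambda p: p - 30 if p >= 30 else None},
    }
    print(all_to_all_alignments(12), spanning_alignments(12))
    print(translate(graph, "human", "rat", 500))
```

    With 12 genomes, as in the Drosophila evaluation, an all-to-all design needs 66 pairwise alignments while a connected chain needs 11; the human to rat lookup in the toy graph is resolved indirectly through mouse.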

  12. The Perennial Ryegrass GenomeZipper – Targeted Use of Genome Resources for Comparative Grass Genomics

    DEFF Research Database (Denmark)

    Pfeiffer, Matthias; Martis, Mihaela; Asp, Torben

    2013-01-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous...

  13. Components of Adenovirus Genome Packaging

    Science.gov (United States)

    Ahi, Yadvinder S.; Mittal, Suresh K.

    2016-01-01

    Adenoviruses (AdVs) are icosahedral viruses with double-stranded DNA (dsDNA) genomes. Genome packaging in AdV is thought to be similar to that seen in dsDNA containing icosahedral bacteriophages and herpesviruses. Specific recognition of the AdV genome is mediated by a packaging domain located close to the left end of the viral genome and is mediated by the viral packaging machinery. Our understanding of the role of various components of the viral packaging machinery in AdV genome packaging has greatly advanced in recent years. Characterization of empty capsids assembled in the absence of one or more components involved in packaging, identification of the unique vertex, and demonstration of the role of IVa2, the putative packaging ATPase, in genome packaging have provided compelling evidence that AdVs follow a sequential assembly pathway. This review provides a detailed discussion on the functions of the various viral and cellular factors involved in AdV genome packaging. We conclude by briefly discussing the roles of the empty capsids, assembly intermediates, scaffolding proteins, portal vertex and DNA encapsidating enzymes in AdV assembly and packaging. PMID:27721809

  14. The genome of Eucalyptus grandis

    Energy Technology Data Exchange (ETDEWEB)

    Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.; Hellsten, Uffe; Hayes, Richard D.; Grimwood, Jane; Jenkins, Jerry; Lindquist, Erika; Tice, Hope; Bauer, Diane; Goodstein, David M.; Dubchak, Inna; Poliakov, Alexandre; Mizrachi, Eshchar; Kullan, Anand R. K.; Hussey, Steven G.; Pinard, Desre; van der Merwe, Karen; Singh, Pooja; van Jaarsveld, Ida; Silva-Junior, Orzenil B.; Togawa, Roberto C.; Pappas, Marilia R.; Faria, Danielle A.; Sansaloni, Carolina P.; Petroli, Cesar D.; Yang, Xiaohan; Ranjan, Priya; Tschaplinski, Timothy J.; Ye, Chu-Yu; Li, Ting; Sterck, Lieven; Vanneste, Kevin; Murat, Florent; Soler, Marçal; Clemente, Hélène San; Saidi, Naijib; Cassan-Wang, Hua; Dunand, Christophe; Hefer, Charles A.; Bornberg-Bauer, Erich; Kersting, Anna R.; Vining, Kelly; Amarasinghe, Vindhya; Ranik, Martin; Naithani, Sushma; Elser, Justin; Boyd, Alexander E.; Liston, Aaron; Spatafora, Joseph W.; Dharmwardhana, Palitha; Raja, Rajani; Sullivan, Christopher; Romanel, Elisson; Alves-Ferreira, Marcio; Külheim, Carsten; Foley, William; Carocha, Victor; Paiva, Jorge; Kudrna, David; Brommonschenkel, Sergio H.; Pasquali, Giancarlo; Byrne, Margaret; Rigault, Philippe; Tibbits, Josquin; Spokevicius, Antanas; Jones, Rebecca C.; Steane, Dorothy A.; Vaillancourt, René E.; Potts, Brad M.; Joubert, Fourie; Barry, Kerrie; Pappas, Georgios J.; Strauss, Steven H.; Jaiswal, Pankaj; Grima-Pettenati, Jacqueline; Salse, Jérôme; Van de Peer, Yves; Rokhsar, Daniel S.; Schmutz, Jeremy

    2014-06-11

    Eucalypts are the world's most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood, have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein coding genes were predicted revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defence against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.

  15. [Genome editing of industrial microorganism].

    Science.gov (United States)

    Zhu, Linjiang; Li, Qi

    2015-03-01

    Genome editing is defined as highly effective and precise modification of the cellular genome on a large scale. In recent years, such genome-editing methods have been rapidly developed in the field of industrial strain improvement. These quickly evolving methods thoroughly change the old mode of inefficient genetic modification, which is "one modification, one selection marker, and one target site". Highly effective modification modes in genome editing have been developed, including simultaneous modification of multiple genes; highly effective insertion, replacement, and deletion of target genes at the genome scale; and cut-paste of a large DNA fragment. These new tools for microbial genome editing will certainly be applied widely, increase the efficiency of industrial strain improvement, and promote the revolution of the traditional fermentation industry and the rapid development of novel industrial biotechnology such as the production of biofuels and biomaterials. The technological principles of these genome-editing methods and their applications are summarized in this review, which can benefit the engineering and construction of industrial microorganisms.

  16. Genome size variation in the genus Avena.

    Science.gov (United States)

    Yan, Honghai; Martin, Sara L; Bekele, Wubishet A; Latta, Robert G; Diederichsen, Axel; Peng, Yuanying; Tinker, Nicholas A

    2016-03-01

    Genome size is an indicator of evolutionary distance and a metric for genome characterization. Here, we report accurate estimates of genome size in 99 accessions from 26 species of Avena. We demonstrate that the average genome size of C genome diploid species (2C = 10.26 pg) is 15% larger than that of A genome species (2C = 8.95 pg), and that this difference likely accounts for a progression of size among tetraploid species, where AB genome configuration had similar genome sizes (average 2C = 25.74 pg). Genome size was mostly consistent within species and in general agreement with current information about evolutionary distance among species. Results also suggest that most of the polyploid species in Avena have experienced genome downsizing in relation to their diploid progenitors. Genome size measurements could provide additional quality control for species identification in germplasm collections, especially in cases where diploid and polyploid species have similar morphology.
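    The "15% larger" figure can be checked directly from the two 2C values quoted in the abstract. The snippet below is a small worked example; the function name is hypothetical and only the two diploid averages stated above are used.

```python
def percent_larger(value, reference):
    """Percentage by which value exceeds reference."""
    return (value - reference) / reference * 100.0

if __name__ == "__main__":
    c_genome_2c_pg = 10.26  # average 2C of C genome diploids, from the abstract
    a_genome_2c_pg = 8.95   # average 2C of A genome diploids, from the abstract
    print(round(percent_larger(c_genome_2c_pg, a_genome_2c_pg), 1))
```

    The difference works out to roughly 14.6%, which the abstract rounds to 15%.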

  17. OryzaGenome: Genome Diversity Database of Wild Oryza Species

    KAUST Repository

    Ohyanagi, Hajime; Ebata, Toshinobu; Huang, Xuehui; Gong, Hao; Fujita, Masahiro; Mochizuki, Takako; Toyoda, Atsushi; Fujiyama, Asao; Kaminuma, Eli; Nakamura, Yasukazu; Feng, Qi; Wang, Zi Xuan; Han, Bin; Kurata, Nori

    2015-01-01

    . Portable VCF (variant call format) file or tab-delimited file download is also available. Following these SNP (single nucleotide polymorphism) data, reference pseudomolecules/scaffolds/contigs and genome-wide variation information for almost all

  18. Genome Modeling System: A Knowledge Management Platform for Genomics.

    Directory of Open Access Journals (Sweden)

    Malachi Griffith

    2015-07-01

    In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms.

  19. Comparative genomics reveals insights into avian genome evolution and adaptation

    Science.gov (United States)

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  20. The bonobo genome compared with the chimpanzee and human genomes

    Science.gov (United States)

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R.; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R.; Mullikin, James C.; Meader, Stephen J.; Ponting, Chris P.; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E.; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M.; Fischer, Anne; Ptak, Susan E.; Lachmann, Michael; Symer, David E.; Mailund, Thomas; Schierup, Mikkel H.; Andrés, Aida M.; Kelso, Janet; Pääbo, Svante

    2012-01-01

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other. PMID:22722832

  1. Capturing prokaryotic dark matter genomes.

    Science.gov (United States)

    Gasc, Cyrielle; Ribière, Céline; Parisot, Nicolas; Beugnot, Réjane; Defois, Clémence; Petit-Biderre, Corinne; Boucher, Delphine; Peyretaillade, Eric; Peyret, Pierre

    2015-12-01

    Prokaryotes are the most diverse and abundant cellular life forms on Earth. Most of them, identified by indirect molecular approaches, belong to microbial dark matter. The advent of metagenomic and single-cell genomic approaches has highlighted the metabolic capabilities of numerous members of this dark matter through genome reconstruction. Thus, linking functions back to the species has revolutionized our understanding of how ecosystem function is sustained by the microbial world. This review will present discoveries acquired through the illumination of prokaryotic dark matter genomes by these innovative approaches.

  2. Human genome. 1993 Program report

    Energy Technology Data Exchange (ETDEWEB)

    1994-03-01

    The purpose of this report is to update the Human Genome 1991-92 Program Report and provide new information on the DOE genome program to researchers, program managers, other government agencies, and the interested public. This FY 1993 supplement includes abstracts of 60 new or renewed projects and listings of 112 continuing and 28 completed projects. These two reports, taken together, present the most complete published view of the DOE Human Genome Program through FY 1993. Research is progressing rapidly toward 15-year goals of mapping and sequencing the DNA of each of the 24 different human chromosomes.

  3. Implementing Genome-Driven Oncology

    Science.gov (United States)

    Hyman, David M.; Taylor, Barry S.; Baselga, José

    2017-01-01

    Early successes in identifying and targeting individual oncogenic drivers, together with the increasing feasibility of sequencing tumor genomes, have brought forth the promise of genome-driven oncology care. As we expand the breadth and depth of genomic analyses, the biological and clinical complexity of its implementation will be unparalleled. Challenges include target credentialing and validation, implementing drug combinations, clinical trial designs, targeting tumor heterogeneity, and deploying technologies beyond DNA sequencing, among others. We review how contemporary approaches are tackling these challenges and will ultimately serve as an engine for biological discovery and increase our insight into cancer and its treatment. PMID:28187282

  4. Deep whole-genome sequencing of 90 Han Chinese genomes.

    Science.gov (United States)

    Lan, Tianming; Lin, Haoxiang; Zhu, Wenjuan; Laurent, Tellier Christian Asker Melchior; Yang, Mengcheng; Liu, Xin; Wang, Jun; Wang, Jian; Yang, Huanming; Xu, Xun; Guo, Xiaosen

    2017-09-01

    Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000

  5. Identification of genomic sites for CRISPR/Cas9-based genome editing in the Vitis vinifera genome

    Science.gov (United States)

    CRISPR/Cas9 has been recently demonstrated as an effective and popular genome editing tool for modifying genomes of human, animals, microorganisms, and plants. Success of such genome editing is highly dependent on the availability of suitable target sites in the genomes to be edited. Many specific t...

  6. Genomics and the human genome project: implications for psychiatry

    OpenAIRE

    Kelsoe, J R

    2004-01-01

    In the past decade the Human Genome Project has made extraordinary strides in understanding of fundamental human genetics. The complete human genetic sequence has been determined, and the chromosomal location of almost all human genes identified. Presently, a large international consortium, the HapMap Project, is working to identify a large portion of genetic variation in different human populations and the structure and relationship of these variants to each other. The Human Genome Project h...

  7. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~10k genes. ~12 percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~9 percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also, >90 percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. We conclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  8. Genome projects and the functional-genomic era.

    Science.gov (United States)

    Sauer, Sascha; Konthur, Zoltán; Lehrach, Hans

    2005-12-01

    The problems we face today in public health as a result of the -- fortunately -- increasing age of people and the requirements of developing countries create an urgent need for new and innovative approaches in medicine and in agronomics. Genomic and functional genomic approaches have a great potential to at least partially solve these problems in the future. Important progress has been made by procedures to decode genomic information of humans, but also of other key organisms. The basic comprehension of genomic information (and its transfer) should now give us the possibility to pursue the next important step in life science eventually leading to a basic understanding of biological information flow; the elucidation of the function of all genes and correlative products encoded in the genome, as well as the discovery of their interactions in a molecular context and the response to environmental factors. As a result of the sequencing projects, we are now able to ask important questions about sequence variation and can start to comprehensively study the function of expressed genes on different levels such as RNA, protein or the cell in a systematic context including underlying networks. In this article we review and comment on current trends in large-scale systematic biological research. A particular emphasis is put on technology developments that can provide means to accomplish the tasks of future lines of functional genomics.

  9. The Amaranth Genome: Genome, Transcriptome, and Physical Map Assembly

    Directory of Open Access Journals (Sweden)

    J. W. Clouse

    2016-03-01

    Full Text Available Amaranth ( L. is an emerging pseudocereal native to the New World that has garnered increased attention in recent years because of its nutritional quality, in particular its seed protein and more specifically its high levels of the essential amino acid lysine. It belongs to the Amaranthaceae family, is an ancient paleopolyploid that shows disomic inheritance (2n = 32), and has an estimated genome size of 466 Mb. Here we present a high-quality draft genome sequence of the grain amaranth. The genome assembly consisted of 377 Mb in 3518 scaffolds with an N50 of 371 kb. Repetitive element analysis predicted that 48% of the genome is comprised of repeat sequences, of which -like elements were the most commonly classified retrotransposon. A de novo transcriptome consisting of 66,370 contigs was assembled from eight different amaranth tissue and abiotic stress libraries. Annotation of the genome identified 23,059 protein-coding genes. Seven grain amaranths (, , and and their putative progenitor ( were resequenced. A single nucleotide polymorphism (SNP) phylogeny supported the classification of as the progenitor species of the grain amaranths. Lastly, we generated a de novo physical map for using the BioNano Genomics’ Genome Mapping platform. The physical map spanned 340 Mb and a hybrid assembly using the BioNano physical maps nearly doubled the N50 of the assembly to 697 kb. Moreover, we analyzed synteny between amaranth and sugar beet ( L. and estimated, using analysis, the age of the most recent polyploidization event in amaranth.
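
    Assembly contiguity figures such as the scaffold statistics quoted in this abstract are conventionally reported as N50: the length L such that scaffolds of length L or longer together cover at least half of the assembled bases. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name and the toy scaffold lengths are hypothetical and are not taken from the amaranth assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths.

    N50 is the length of the shortest scaffold in the smallest set of
    longest scaffolds that together cover at least half of the assembly.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold downwards, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Hypothetical toy assembly (lengths in kb), for illustration only.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))
```

    Because N50 depends only on the length distribution, joining scaffolds (for example with a physical map) raises it without adding any new sequence.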

  10. Whole genome sequencing and bioinformatics analysis of two Egyptian genomes.

    Science.gov (United States)

    ElHefnawi, Mahmoud; Jeon, Sungwon; Bhak, Youngjune; ElFiky, Asmaa; Horaiz, Ahmed; Jun, JeHoon; Kim, Hyunho; Bhak, Jong

    2018-05-15

    We report two Egyptian male genomes (EGP1 and EGP2) sequenced at ~30× sequencing depth. EGP1 had 4.7 million variants, of which 198,877 were novel, while EGP2 had 209,109 novel variants out of 4.8 million variants. The mitochondrial haplogroups of the two individuals were identified as H7b1 and L2a1c, respectively. We also identified the Y haplogroup of EGP1 (R1b) and EGP2 (J1a2a1a2 > P58 > FGC11). EGP1 had a mutation in the NADH gene of the mitochondrial genome ND4 (m.11778 G > A) that causes Leber's hereditary optic neuropathy. Some SNPs shared by the two genomes were associated with an increased level of cholesterol and triglycerides, probably related to obesity in Egyptians. Comparison of these genomes with African and Western-Asian genomes can provide insights into Egyptian ancestry and genetic history. This resource can be used to further understand genomic diversity and functional classification of variants as well as human migration and evolution across Africa and Western-Asia. Copyright © 2017. Published by Elsevier B.V.

  11. Genome epidemiology and tropical spastic paraparesis associated with human T-cell lymphotropic virus type 1

    Directory of Open Access Journals (Sweden)

    Mercedes Salcedo-Cifuentes

    2011-11-01

    Full Text Available OBJECTIVE: Characterize the genomic environment of the sequences adjacent to human T-cell lymphotropic virus type 1 (HTLV-1) in patients with HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP) in different regions of Colombia and Japan. METHODS: A total of 71 recombinant clones with human genome sequences adjacent to the 5' LTR in patients with HAM/TSP were compared to the Genome Browser and GenBank databases. Sixteen structural and compositional genome variables were identified, and statistical analysis was conducted in the R computer program, version 2.8.1, in a 0.5 Mb window. RESULTS: A total of 43.0% of the proviruses were located in the group C chromosomes; 74% of the sequences were located in the telomeric and subtelomeric regions (P < 0.05). A cluster analysis was used to establish the hierarchical relations between the genome characteristics included in the study. The analysis of principal components identified the components that defined the preferred genome environments for proviral integration in cases of HAM/TSP. CONCLUSIONS: HTLV-1 was integrated more often in chromatin regions rich in CpG islands with a high density

  12. Comparative genomic hybridization.

    Science.gov (United States)

    Pinkel, Daniel; Albertson, Donna G

    2005-01-01

    Altering DNA copy number is one of the many ways that gene expression and function may be modified. Some variations are found among normal individuals ( 14, 35, 103 ), others occur in the course of normal processes in some species ( 33 ), and still others participate in causing various disease states. For example, many defects in human development are due to gains and losses of chromosomes and chromosomal segments that occur prior to or shortly after fertilization, whereas DNA dosage alterations that occur in somatic cells are frequent contributors to cancer. Detecting these aberrations, and interpreting them within the context of broader knowledge, facilitates identification of critical genes and pathways involved in biological processes and diseases, and provides clinically relevant information. Over the past several years array comparative genomic hybridization (array CGH) has demonstrated its value for analyzing DNA copy number variations. In this review we discuss the state of the art of array CGH and its applications in medical genetics and cancer, emphasizing general concepts rather than specific results.

  13. Functional Genomics Group. Program Description

    National Research Council Canada - National Science Library

    Burian, Dennis

    2008-01-01

    .... This article reviews mechanisms of gene regulation and discusses how genomics is changing the way medicine is practiced today as a means of demonstrating that molecular medicine is here to stay...

  14. Genomic Resources for Cancer Epidemiology

    Science.gov (United States)

    This page provides links to research resources, compiled by the Epidemiology and Genomics Research Program, that may be of interest to genetic epidemiologists conducting cancer research, but is not exhaustive.

  15. Fungal genomics beyond Saccharomyces cerevisiae?

    DEFF Research Database (Denmark)

    Hofmann, Gerald; Mcintyre, Mhairi; Nielsen, Jens

    2003-01-01

    Fungi are used extensively in both fundamental research and industrial applications. Saccharomyces cerevisiae has been the model organism for fungal research for many years, particularly in functional genomics. However, considering the diversity within the fungal kingdom, it is obvious...

  16. Genome engineering in human cells.

    Science.gov (United States)

    Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

    2014-01-01

    Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 giga base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.

  17. [Advances in microbial genome reduction and modification].

    Science.gov (United States)

    Wang, Jianli; Wang, Xiaoyuan

    2013-08-01

    Microbial genome reduction and modification are important strategies for constructing cellular chassis used for synthetic biology. This article summarized the essential genes and the methods to identify them in microorganisms, compared various strategies for microbial genome reduction, and analyzed the characteristics of some microorganisms with the minimized genome. This review shows the important role of genome reduction in constructing cellular chassis.

  18. 2004 Structural, Function and Evolutionary Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Douglas L. Brutlag; Nancy Ryan Gray

    2005-03-23

    This Gordon conference will cover the areas of structural, functional and evolutionary genomics. It will take a systematic approach to genomics, examining the evolution of proteins, protein functional sites, protein-protein interactions, regulatory networks, and metabolic networks. Emphasis will be placed on what we can learn from comparative genomics and entire genomes and proteomes.

  19. Draft Genome Sequence of Lactobacillus rhamnosus 2166.

    OpenAIRE

    Karlyshev, Andrey V.; Melnikov, Vyacheslav G.; Kosarev, Igor V.; Abramov, Vyacheslav M.

    2014-01-01

    In this report, we present a draft sequence of the genome of Lactobacillus rhamnosus strain 2166, a potential novel probiotic. Genome annotation and read mapping onto a reference genome of L. rhamnosus strain GG allowed for the identification of the differences and similarities in the genomic contents and gene arrangements of these strains.

  20. Genomic instability and radiation effects

    International Nuclear Information System (INIS)

    Christian Streffer

    2007-01-01

    Complete text of publication follows. Cancer, genetic mutations and developmental abnormalities are apparently associated with an increased genomic instability. Such phenomena have been frequently shown in human cancer cells in vitro and in situ. It is also well-known that individuals with a genetic predisposition for cancer proneness, such as ataxia telangiectasia, Fanconi anaemia etc., demonstrate a general high genomic instability, e.g. in peripheral lymphocytes, before a cancer has developed. Analogous data have been found in mice which develop a specific congenital malformation which has a genetic background. Under these aspects it is of high interest that ionising radiation can increase the genomic instability of mammalian cells after exposures in vitro and in vivo. This phenomenon is expressed 20 to 40 cell cycles after the exposure, e.g. by de novo chromosomal aberrations. Such effects have been observed with high and low LET radiation; high LET radiation is more efficient. With low LET radiation a good dose response is observed in the dose range 0.2 to 2.0 Gy. Recently it has been reported that senescence and genomic instability were induced in human fibroblasts after 1 mGy carbon ions (1 in 18 cells are hit); apparently bystander effects also occurred under these conditions. The instability has been shown with DNA damage, chromosomal aberrations, gene mutation and cell death. It is also transferred to the next generation of mice with respect to gene mutations, chromosomal aberrations and congenital malformations. Several mechanisms have been discussed. The involvement of telomeres has gained interest. Genomic instability seems to be induced by a general lesion to the whole genome. The transmission of one chromosome from an irradiated cell to a non-irradiated cell leads to genomic instability in the untreated cells. Genomic instability increases mutation rates in the affected cells in general. As radiation late effects (cancer, gene mutations and congenital

  1. Genomic diversity of Lactobacillus salivarius

    OpenAIRE

    Raftis, Emma J.

    2015-01-01

    Lactobacillus salivarius is unusual among the lactobacilli due to its multireplicon genome architecture. The circular megaplasmids harboured by L. salivarius strains encode strain-specific traits for intestinal survival and probiotic activity. L. salivarius strains are increasingly being exploited for their probiotic properties in humans and animals. In terms of probiotic strain selection, it is important to have an understanding of the level of genomic diversity present in this species. Comp...

  2. Population Genomics of Paramecium Species.

    Science.gov (United States)

    Johri, Parul; Krenek, Sascha; Marinov, Georgi K; Doak, Thomas G; Berendonk, Thomas U; Lynch, Michael

    2017-05-01

    Population-genomic analyses are essential to understanding factors shaping genomic variation and lineage-specific sequence constraints. The dearth of such analyses for unicellular eukaryotes prompted us to assess genomic variation in Paramecium, one of the most well-studied ciliate genera. The Paramecium aurelia complex consists of ∼15 morphologically indistinguishable species that diverged subsequent to two rounds of whole-genome duplications (WGDs, as long as 320 MYA) and possess extremely streamlined genomes. We examine patterns of both nuclear and mitochondrial polymorphism, by sequencing whole genomes of 10-13 worldwide isolates of each of three species belonging to the P. aurelia complex: P. tetraurelia, P. biaurelia, P. sexaurelia, as well as two outgroup species that do not share the WGDs: P. caudatum and P. multimicronucleatum. An apparent absence of global geographic population structure suggests continuous or recent dispersal of Paramecium over long distances. Intergenic regions are highly constrained relative to coding sequences, especially in P. caudatum and P. multimicronucleatum that have shorter intergenic distances. Sequence diversity and divergence are reduced up to ∼100-150 bp both upstream and downstream of genes, suggesting strong constraints imposed by the presence of densely packed regulatory modules. In addition, comparison of sequence variation at non-synonymous and synonymous sites suggests similar recent selective pressures on paralogs within and orthologs across the deeply diverging species. This study presents the first genome-wide population-genomic analysis in ciliates and provides a valuable resource for future studies in evolutionary and functional genetics in Paramecium. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  3. Reconstructing ancient genomes and epigenomes

    DEFF Research Database (Denmark)

    Orlando, Ludovic Antoine Alexandre; Gilbert, M. Thomas P.; Willerslev, Eske

    2015-01-01

    DNA studies have now progressed to whole-genome sequencing for an increasing number of ancient individuals and extinct species, as well as to epigenomic characterization. Such advances have enabled the sequencing of specimens of up to 1 million years old, which, owing to their extensive DNA damage...... and contamination, were previously not amenable to genetic analyses. In this Review, we discuss these varied technical challenges and solutions for sequencing ancient genomes and epigenomes....

  4. GOBASE: an organelle genome database

    OpenAIRE

    O'Brien, Emmet A.; Zhang, Yue; Wang, Eric; Marie, Veronique; Badejoko, Wole; Lang, B. Franz; Burger, Gertraud

    2008-01-01

    The organelle genome database GOBASE, now in its 21st release (June 2008), contains all published mitochondrion-encoded sequences (∼913 000) and chloroplast-encoded sequences (∼250 000) from a wide range of eukaryotic taxa. For all sequences, information on related genes, exons, introns, gene products and taxonomy is available, as well as selected genome maps and RNA secondary structures. Recent major enhancements to database functionality include: (i) addition of an interface for RNA editing...

  5. The fishes of Genome 10K

    KAUST Repository

    Bernardi, Giacomo

    2012-09-01

    The Genome 10K project aims to sequence the genomes of 10,000 vertebrates, representing approximately one genome for each vertebrate genus. Since fishes (cartilaginous fishes, ray-finned fishes and lobe-finned fishes) represent more than 50% of extant vertebrates, it is planned to target 4,000 fish genomes. At present, nearly 60 fish genomes are being sequenced at various publicly funded labs, and under a Genome 10K and BGI pilot project. An additional 100 fishes have been identified for sequencing in the next phase of the Genome 10K project. © 2012 Elsevier B.V.

  6. The integrated microbial genome resource of analysis.

    Science.gov (United States)

    Checcucci, Alice; Mengoni, Alessio

    2015-01-01

    Integrated Microbial Genomes and Metagenomes (IMG) is a biocomputational system that provides information and support for annotation and comparative analysis of microbial genomes and metagenomes. IMG has been developed by the US Department of Energy (DOE) Joint Genome Institute (JGI). The IMG platform contains both draft and complete genomes, sequenced by the Joint Genome Institute, as well as other publicly available genomes. Genomes of strains belonging to the Archaea, Bacteria, and Eukarya domains are present, as well as those of viruses and plasmids. Here, we describe some essential features of the IMG system and a case study for pangenome analysis.

  7. The fishes of Genome 10K

    KAUST Repository

    Bernardi, Giacomo; Wiley, Edward O.; Mansour, Hicham; Miller, Michael R.; Ortí , Guillermo; Haussler, David H.; O'Brien, Stephen J O; Ryder, Oliver A.; Venkatesh, Byrappa

    2012-01-01

    The Genome 10K project aims to sequence the genomes of 10,000 vertebrates, representing approximately one genome for each vertebrate genus. Since fishes (cartilaginous fishes, ray-finned fishes and lobe-finned fishes) represent more than 50% of extant vertebrates, it is planned to target 4,000 fish genomes. At present, nearly 60 fish genomes are being sequenced at various publicly funded labs, and under a Genome 10K and BGI pilot project. An additional 100 fishes have been identified for sequencing in the next phase of the Genome 10K project. © 2012 Elsevier B.V.

  8. The dynamic genome of Hydra

    Science.gov (United States)

    Chapman, Jarrod A.; Kirkness, Ewen F.; Simakov, Oleg; Hampson, Steven E.; Mitros, Therese; Weinmaier, Therese; Rattei, Thomas; Balasubramanian, Prakash G.; Borman, Jon; Busam, Dana; Disbennett, Kathryn; Pfannkoch, Cynthia; Sumin, Nadezhda; Sutton, Granger G.; Viswanathan, Lakshmi Devi; Walenz, Brian; Goodstein, David M.; Hellsten, Uffe; Kawashima, Takeshi; Prochnik, Simon E.; Putnam, Nicholas H.; Shu, Shengquiang; Blumberg, Bruce; Dana, Catherine E.; Gee, Lydia; Kibler, Dennis F.; Law, Lee; Lindgens, Dirk; Martinez, Daniel E.; Peng, Jisong; Wigge, Philip A.; Bertulat, Bianca; Guder, Corina; Nakamura, Yukio; Ozbek, Suat; Watanabe, Hiroshi; Khalturin, Konstantin; Hemmrich, Georg; Franke, André; Augustin, René; Fraune, Sebastian; Hayakawa, Eisuke; Hayakawa, Shiho; Hirose, Mamiko; Hwang, Jung Shan; Ikeo, Kazuho; Nishimiya-Fujisawa, Chiemi; Ogura, Atshushi; Takahashi, Toshio; Steinmetz, Patrick R. H.; Zhang, Xiaoming; Aufschnaiter, Roland; Eder, Marie-Kristin; Gorny, Anne-Kathrin; Salvenmoser, Willi; Heimberg, Alysha M.; Wheeler, Benjamin M.; Peterson, Kevin J.; Böttger, Angelika; Tischler, Patrick; Wolf, Alexander; Gojobori, Takashi; Remington, Karin A.; Strausberg, Robert L.; Venter, J. Craig; Technau, Ulrich; Hobmayer, Bert; Bosch, Thomas C. G.; Holstein, Thomas W.; Fujisawa, Toshitaka; Bode, Hans R.; David, Charles N.; Rokhsar, Daniel S.; Steele, Robert E.

    2015-01-01

    The freshwater cnidarian Hydra was first described in 1702 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals. Today, Hydra is an important model for studies of axial patterning, stem cell biology and regeneration. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann–Mangold organizer, pluripotency genes and the neuromuscular junction. PMID:20228792

  9. Universal pacemaker of genome evolution.

    Science.gov (United States)

    Snir, Sagi; Wolf, Yuri I; Koonin, Eugene V

    2012-01-01

    A fundamental observation of comparative genomics is that the distribution of evolution rates across the complete sets of orthologous genes in pairs of related genomes remains virtually unchanged throughout the evolution of life, from bacteria to mammals. The most straightforward explanation for the conservation of this distribution appears to be that the relative evolution rates of all genes remain nearly constant, or in other words, that evolutionary rates of different genes are strongly correlated within each evolving genome. This correlation could be explained by a model that we denoted Universal PaceMaker (UPM) of genome evolution. The UPM model posits that the rate of evolution changes synchronously across genome-wide sets of genes in all evolving lineages. Alternatively, however, the correlation between the evolutionary rates of genes could be a simple consequence of molecular clock (MC). We sought to differentiate between the MC and UPM models by fitting thousands of phylogenetic trees for bacterial and archaeal genes to supertrees that reflect the dominant trend of vertical descent in the evolution of archaea and bacteria and that were constrained according to the two models. The goodness of fit for the UPM model was better than the fit for the MC model, with overwhelming statistical significance, although similarly to the MC, the UPM is strongly overdispersed. Thus, the results of this analysis reveal a universal, genome-wide pacemaker of evolution that could have been in operation throughout the history of life.

  10. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    Science.gov (United States)

    Navarro, Fábio C P; Galante, Pedro A F

    2015-07-29

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old World Monkeys, including humans), but a surprisingly large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and show no expression correlation with their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  11. HGVA: the Human Genome Variation Archive

    OpenAIRE

    Lopez, Javier; Coll, Jacobo; Haimel, Matthias; Kandasamy, Swaathi; Tarraga, Joaquin; Furio-Tari, Pedro; Bari, Wasim; Bleda, Marta; Rueda, Antonio; Gräf, Stefan; Rendon, Augusto; Dopazo, Joaquin; Medina, Ignacio

    2017-01-01

    High-profile genomic variation projects like the 1000 Genomes project or the Exome Aggregation Consortium are generating a wealth of human genomic variation knowledge which can be used as an essential reference for identifying disease-causing genotypes. However, accessing these data, contrasting the various studies and integrating those data in downstream analyses remains cumbersome. The Human Genome Variation Archive (HGVA) tackles these challenges and facilitates access to genomic...

  12. The Genomic Code: Genome Evolution and Potential Applications

    KAUST Repository

    Bernardi, Giorgio

    2016-01-25

    The genome of metazoans is organized according to a genomic code which comprises three laws: 1) Compositional correlations hold between contiguous coding and non-coding sequences, as well as among the three codon positions of protein-coding genes; these correlations are the consequence of the fact that the genomes under consideration consist of fairly homogeneous, long (≥200Kb) sequences, the isochores; 2) Although isochores are defined on the basis of purely compositional properties, GC levels of isochores are correlated with all tested structural and functional properties of the genome; 3) GC levels of isochores are correlated with chromosome architecture from interphase to metaphase; in the case of interphase the correlation concerns isochores and the three-dimensional “topological associated domains” (TADs); in the case of mitotic chromosomes, the correlation concerns isochores and chromosomal bands. Finally, the genomic code is the fourth and last pillar of molecular biology, the first three pillars being 1) the double helix structure of DNA; 2) the regulation of gene expression in prokaryotes; and 3) the genetic code.
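
    The compositional quantity underlying the isochore description above is the GC level of long sequence stretches. As a rough illustration only, the sketch below computes GC fractions over fixed, non-overlapping windows. The function names and the toy sequence are hypothetical, and real isochore analyses work on windows of hundreds of kilobases with dedicated segmentation methods rather than this simple tiling.

```python
def gc_fraction(seq):
    """Return the G+C fraction of a sequence, ignoring non-ACGT symbols.

    Returns None when the sequence contains no A, C, G or T at all.
    """
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        return None
    return (seq.count("G") + seq.count("C")) / counted


def windowed_gc(seq, window):
    """GC fraction of consecutive non-overlapping windows of a fixed size.

    A trailing stretch shorter than one full window is dropped.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    return [
        gc_fraction(seq[start:start + window])
        for start in range(0, len(seq) - window + 1, window)
    ]


# Hypothetical toy sequence: a GC-rich stretch followed by an AT-rich one.
print(windowed_gc("GGCCGCGCAATTATAT", 8))
```

    Plotting such window values along a chromosome is one simple way to see the fairly homogeneous compositional blocks that the abstract refers to as isochores.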

  13. Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome

    NARCIS (Netherlands)

    Sharp, Andrew J.; Hansen, Sierra; Selzer, Rebecca R.; Cheng, Ze; Regan, Regina; Hurst, Jane A.; Stewart, Helen; Price, Sue M.; Blair, Edward; Hennekam, Raoul C.; Fitzpatrick, Carrie A.; Segraves, Rick; Richmond, Todd A.; Guiver, Cheryl; Albertson, Donna G.; Pinkel, Daniel; Eis, Peggy S.; Schwartz, Stuart; Knight, Samantha J. L.; Eichler, Evan E.

    2006-01-01

    Genomic disorders are characterized by the presence of flanking segmental duplications that predispose these regions to recurrent rearrangement. Based on the duplication architecture of the genome, we investigated 130 regions that we hypothesized as candidates for previously undescribed genomic

  14. Supplementary Material for: Whole genome sequencing reveals genomic heterogeneity and antibiotic purification in Mycobacterium tuberculosis isolates

    KAUST Repository

    Black, PA; Vos, M. de; Louw, GE; Merwe, RG van der; Dippenaar, A.; Streicher, EM; Abdallah, AM; Sampson, SL; Victor, TC; Dolby, T.; Simpson, JA; Helden, PD van; Warren, RM; Pain, Arnab

    2015-01-01

    Background: Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug

  15. Human Genome Education Program

    Energy Technology Data Exchange (ETDEWEB)

    Richard Myers; Lane Conn

    2000-05-01

    The funds from the DOE Human Genome Program, for the project period 2/1/96 through 1/31/98, have provided major support for the curriculum development and field testing efforts for two high school level instructional units: Unit 1, "Exploring Genetic Conditions: Genes, Culture and Choices"; and Unit 2, "DNA Snapshots: Peeking at Your DNA". In the original proposal, they requested DOE support for the partial salary and benefits of a Field Test Coordinator position to: (1) complete the field testing and revision of two high school curriculum units, and (2) initiate the education of teachers using these units. During the project period of this two-year DOE grant, a part-time Field-Test Coordinator was hired (Ms. Geraldine Horsma) and significant progress has been made in both of the original proposal objectives. Field testing for Unit 1 has occurred in over 12 schools (local and non-local sites with diverse student populations). Field testing for Unit 2 has occurred in over 15 schools (local and non-local sites) and will continue in 12-15 schools during the 96-97 school year. For both curricula, field-test sites and site teachers were selected for their interest in genetics education and in hands-on science education. Many of the site teachers had no previous experience with HGEP or the unit under development. Both of these first-year biology curriculum units, which contain genetics, biotechnology, societal, ethical and cultural issues related to HGP, are being implemented in many local and non-local schools (SF Bay Area, Southern California, Nebraska, Hawaii, and Texas) and in programs for teachers. These units will reach over 10,000 students in the SF Bay Area and continue to receive support from local corporate and private philanthropic organizations. Although HGEP unit development is nearing completion for both units, data is still being gathered and analyzed on unit effectiveness and student learning. The final field

  16. Allele coding in genomic evaluation

    Directory of Open Access Journals (Sweden)

    Christensen Ole F

    2011-06-01

    Background: Genomic data are used in animal breeding to assist genetic evaluation. Several models to estimate genomic breeding values have been studied. In general, two approaches have been used. One approach estimates the marker effects first and then, genomic breeding values are obtained by summing marker effects. In the second approach, genomic breeding values are estimated directly using an equivalent model with a genomic relationship matrix. Allele coding is the method chosen to assign values to the regression coefficients in the statistical model. A common allele coding is zero for the homozygous genotype of the first allele, one for the heterozygote, and two for the homozygous genotype for the other allele. Another common allele coding changes these regression coefficients by subtracting a value from each marker such that the mean of regression coefficients is zero within each marker. We call this centered allele coding. This study considered effects of different allele coding methods on inference. Both marker-based and equivalent models were considered, and restricted maximum likelihood and Bayesian methods were used in inference. Results: Theoretical derivations showed that parameter estimates and estimated marker effects in marker-based models are the same irrespective of the allele coding, provided that the model has a fixed general mean. For the equivalent models, the same results hold, even though different allele coding methods lead to different genomic relationship matrices. Calculated genomic breeding values are independent of allele coding when the estimate of the general mean is included into the values. Reliabilities of estimated genomic breeding values calculated using elements of the inverse of the coefficient matrix depend on the allele coding because different allele coding methods imply different models. Finally, allele coding affects the mixing of Markov chain Monte Carlo algorithms, with the centered coding being
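
    The two allele codings described in this abstract can be illustrated with a minimal sketch. The genotype strings, the function names `code_012` and `center`, and the four-animal example are assumptions made for illustration only; they are not taken from the paper.

```python
from statistics import mean

def code_012(genotypes, first_allele):
    """0/1/2 coding: count the copies of the allele that is NOT the first allele."""
    return [sum(1 for allele in g if allele != first_allele) for g in genotypes]

def center(codes):
    """Centered coding: subtract the marker mean so the codes average to zero."""
    marker_mean = mean(codes)
    return [c - marker_mean for c in codes]

# Hypothetical genotypes of four animals at one biallelic marker.
genotypes = ["AA", "AG", "GG", "AG"]

raw = code_012(genotypes, first_allele="A")  # homozygote A=0, heterozygote=1, homozygote G=2
centered = center(raw)                       # same spacing between genotypes, mean shifted to 0

print(raw)
print(centered)
```

    Centering shifts every code at a marker by the same constant, so differences between genotypes are unchanged; this is consistent with the abstract's statement that estimated marker effects do not depend on the coding when the model contains a fixed general mean.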

  17. Genomic selection in maritime pine.

    Science.gov (United States)

    Isik, Fikret; Bartholomé, Jérôme; Farjat, Alfredo; Chancerel, Emilie; Raffin, Annie; Sanchez, Leopoldo; Plomion, Christophe; Bouffier, Laurent

    2016-01-01

    A two-generation maritime pine (Pinus pinaster Ait.) breeding population (n=661) was genotyped using 2500 SNP markers. The extent of linkage disequilibrium and utility of genomic selection for growth and stem straightness improvement were investigated. The overall intra-chromosomal linkage disequilibrium was r²=0.01. Linkage disequilibrium corrected for genomic relationships derived from markers was smaller (rV²=0.006). Genomic BLUP, Bayesian ridge regression and Bayesian LASSO regression statistical models were used to obtain genomic estimated breeding values. Two validation methods (random sampling 50% of the population and 10% of the progeny generation as validation sets) were used with 100 replications. The average predictive ability across statistical models and validation methods was about 0.49 for stem sweep, and 0.47 and 0.43 for total height and tree diameter, respectively. The sensitivity analysis suggested that prior densities (variance explained by markers) had little or no discernible effect on posterior means (residual variance) in Bayesian prediction models. Sampling from the progeny generation for model validation increased the predictive ability of markers for tree diameter and stem sweep but not for total height. The results are promising despite low linkage disequilibrium and low marker coverage of the genome (∼1.39 markers/cM).

  18. The genome of Prunus mume.

    Science.gov (United States)

    Zhang, Qixiang; Chen, Wenbin; Sun, Lidan; Zhao, Fangying; Huang, Bangqing; Yang, Weiru; Tao, Ye; Wang, Jia; Yuan, Zhiqiong; Fan, Guangyi; Xing, Zhen; Han, Changlei; Pan, Huitang; Zhong, Xiao; Shi, Wenfang; Liang, Xinming; Du, Dongliang; Sun, Fengming; Xu, Zongda; Hao, Ruijie; Lv, Tian; Lv, Yingmin; Zheng, Zequn; Sun, Ming; Luo, Le; Cai, Ming; Gao, Yike; Wang, Junyi; Yin, Ye; Xu, Xun; Cheng, Tangren; Wang, Jun

    2012-01-01

    Prunus mume (mei), which was domesticated in China more than 3,000 years ago as an ornamental plant and fruit, is one of the first genomes among Prunus subfamilies of Rosaceae to be sequenced. Here, we assemble a 280M genome by combining 101-fold next-generation sequencing and optical mapping data. We further anchor 83.9% of scaffolds to eight chromosomes with a genetic map constructed by restriction-site-associated DNA sequencing. Combining the P. mume genome with available data, we succeed in reconstructing nine ancestral chromosomes of the Rosaceae family, as well as depicting chromosome fusion, fission and duplication history in three major subfamilies. We sequence the transcriptome of various tissues and perform genome-wide analysis to reveal the characteristics of P. mume, including its regulation of early blooming in endodormancy, immune response against bacterial infection and biosynthesis of flower scent. The P. mume genome sequence adds to our understanding of Rosaceae evolution and provides important data for improvement of fruit trees.

  19. The genome of Chenopodium quinoa

    KAUST Repository

    Jarvis, David Erwin; Ho, Yung Shwen; Lightfoot, Damien; Schmöckel, Sandra M.; Li, Bo; Borm, Theo J. A.; Ohyanagi, Hajime; Mineta, Katsuhiko; Michell, Craig; Saber, Noha; Kharbatia, Najeh M.; Rupper, Ryan R.; Sharp, Aaron R.; Dally, Nadine; Boughton, Berin A.; Woo, Yong; Gao, Ge; Schijlen, Elio G. W. M.; Guo, Xiujie; Momin, Afaque Ahmad Imtiyaz; Negrão, Sónia; Al-Babili, Salim; Gehring, Christoph A; Roessner, Ute; Jung, Christian; Murphy, Kevin; Arold, Stefan T.; Gojobori, Takashi; Linden, C. Gerard van der; Loo, Eibertus N. van; Jellen, Eric N.; Maughan, Peter J.; Tester, Mark A.

    2017-01-01

    Chenopodium quinoa (quinoa) is a highly nutritious grain identified as an important crop to improve world food security. Unfortunately, few resources are available to facilitate its genetic improvement. Here we report the assembly of a high-quality, chromosome-scale reference genome sequence for quinoa, which was produced using single-molecule real-time sequencing in combination with optical, chromosome-contact and genetic maps. We also report the sequencing of two diploids from the ancestral gene pools of quinoa, which enables the identification of sub-genomes in quinoa, and reduced-coverage genome sequences for 22 other samples of the allotetraploid goosefoot complex. The genome sequence facilitated the identification of the transcription factor likely to control the production of anti-nutritional triterpenoid saponins found in quinoa seeds, including a mutation that appears to cause alternative splicing and a premature stop codon in sweet quinoa strains. These genomic resources are an important first step towards the genetic improvement of quinoa.

  20. The genome of Chenopodium quinoa

    KAUST Repository

    Jarvis, David Erwin

    2017-02-08

    Chenopodium quinoa (quinoa) is a highly nutritious grain identified as an important crop to improve world food security. Unfortunately, few resources are available to facilitate its genetic improvement. Here we report the assembly of a high-quality, chromosome-scale reference genome sequence for quinoa, which was produced using single-molecule real-time sequencing in combination with optical, chromosome-contact and genetic maps. We also report the sequencing of two diploids from the ancestral gene pools of quinoa, which enables the identification of sub-genomes in quinoa, and reduced-coverage genome sequences for 22 other samples of the allotetraploid goosefoot complex. The genome sequence facilitated the identification of the transcription factor likely to control the production of anti-nutritional triterpenoid saponins found in quinoa seeds, including a mutation that appears to cause alternative splicing and a premature stop codon in sweet quinoa strains. These genomic resources are an important first step towards the genetic improvement of quinoa.

  1. The genome of Chenopodium quinoa.

    Science.gov (United States)

    Jarvis, David E; Ho, Yung Shwen; Lightfoot, Damien J; Schmöckel, Sandra M; Li, Bo; Borm, Theo J A; Ohyanagi, Hajime; Mineta, Katsuhiko; Michell, Craig T; Saber, Noha; Kharbatia, Najeh M; Rupper, Ryan R; Sharp, Aaron R; Dally, Nadine; Boughton, Berin A; Woo, Yong H; Gao, Ge; Schijlen, Elio G W M; Guo, Xiujie; Momin, Afaque A; Negrão, Sónia; Al-Babili, Salim; Gehring, Christoph; Roessner, Ute; Jung, Christian; Murphy, Kevin; Arold, Stefan T; Gojobori, Takashi; Linden, C Gerard van der; van Loo, Eibertus N; Jellen, Eric N; Maughan, Peter J; Tester, Mark

    2017-02-16

    Chenopodium quinoa (quinoa) is a highly nutritious grain identified as an important crop to improve world food security. Unfortunately, few resources are available to facilitate its genetic improvement. Here we report the assembly of a high-quality, chromosome-scale reference genome sequence for quinoa, which was produced using single-molecule real-time sequencing in combination with optical, chromosome-contact and genetic maps. We also report the sequencing of two diploids from the ancestral gene pools of quinoa, which enables the identification of sub-genomes in quinoa, and reduced-coverage genome sequences for 22 other samples of the allotetraploid goosefoot complex. The genome sequence facilitated the identification of the transcription factor likely to control the production of anti-nutritional triterpenoid saponins found in quinoa seeds, including a mutation that appears to cause alternative splicing and a premature stop codon in sweet quinoa strains. These genomic resources are an important first step towards the genetic improvement of quinoa.

  2. Genomics of Arctic cod

    Science.gov (United States)

    Wilson, Robert E.; Sage, George K.; Sonsthagen, Sarah A.; Gravley, Megan C.; Menning, Damian; Talbot, Sandra L.

    2017-01-01

    The Arctic cod (Boreogadus saida) is an abundant marine fish that plays a vital role in the marine food web. To better understand the population genetic structure and the role of natural selection acting on the maternally-inherited mitochondrial genome (mitogenome), a molecule often associated with adaptations to temperature, we analyzed genetic data collected from 11 biparentally-inherited nuclear microsatellite DNA loci and nucleotide sequence data from the mitochondrial DNA (mtDNA) cytochrome b (cytb) gene and, for a subset of individuals, the entire mitogenome. In addition, due to the potential for species misidentification with the morphologically similar Polar cod (Arctogadus glacialis), we used ddRAD-Seq data to determine the level of divergence between species and identify species-specific markers. Based on the findings presented here, Arctic cod across the Pacific Arctic (Bering, Chukchi, and Beaufort Seas) comprise a single panmictic population with high genetic diversity compared to other gadids. High genetic diversity was indicated across all 13 protein-coding genes in the mitogenome. In addition, we found moderate levels of genetic diversity in the nuclear microsatellite loci, with highest diversity found in the Chukchi Sea. Our analyses of markers from both marker classes (nuclear microsatellite fragment data and mtDNA cytb sequence data) failed to uncover a signal of microgeographic genetic structure within Arctic cod across the three regions, within the Alaskan Beaufort Sea, or between near-shore or offshore habitats. Further, data from a subset of mitogenomes revealed no genetic differentiation between Bering, Chukchi, and Beaufort seas populations for Arctic cod, Saffron cod (Eleginus gracilis), or Walleye pollock (Gadus chalcogrammus). However, we uncovered significant differences in the distribution of microsatellite alleles between the southern Chukchi and central and eastern Beaufort Sea samples of Arctic cod. Finally, using ddRAD-Seq data, we

  3. The Pediatric Cancer Genome Project

    Science.gov (United States)

    Downing, James R; Wilson, Richard K; Zhang, Jinghui; Mardis, Elaine R; Pui, Ching-Hon; Ding, Li; Ley, Timothy J; Evans, William E

    2013-01-01

    The St. Jude Children’s Research Hospital–Washington University Pediatric Cancer Genome Project (PCGP) is participating in the international effort to identify somatic mutations that drive cancer. These cancer genome sequencing efforts will not only yield an unparalleled view of the altered signaling pathways in cancer but should also identify new targets against which novel therapeutics can be developed. Although these projects are still deep in the phase of generating primary DNA sequence data, important results are emerging and valuable community resources are being generated that should catalyze future cancer research. We describe here the rationale for conducting the PCGP, present some of the early results of this project and discuss the major lessons learned and how these will affect the application of genomic sequencing in the clinic. PMID:22641210

  4. Integrating genomics into evolutionary medicine.

    Science.gov (United States)

    Rodríguez, Juan Antonio; Marigorta, Urko M; Navarro, Arcadi

    2014-12-01

    The application of the principles of evolutionary biology into medicine was suggested long ago and is already providing insight into the ultimate causes of disease. However, a full systematic integration of medical genomics and evolutionary medicine is still missing. Here, we briefly review some cases where the combination of the two fields has proven profitable and highlight two of the main issues hindering the development of evolutionary genomic medicine as a mature field, namely the dissociation between fitness and health and the still considerable difficulties in predicting phenotypes from genotypes. We use publicly available data to illustrate both problems and conclude that new approaches are needed for evolutionary genomic medicine to overcome these obstacles.

  5. Enhancer Identification through Comparative Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Visel, Axel; Bristow, James; Pennacchio, Len A.

    2006-10-01

    With the availability of genomic sequence from numerous vertebrates, a paradigm shift has occurred in the identification of distant-acting gene regulatory elements. In contrast to traditional gene-centric studies in which investigators randomly scanned genomic fragments that flank genes of interest in functional assays, the modern approach begins electronically with publicly available comparative sequence datasets that provide investigators with prioritized lists of putative functional sequences based on their evolutionary conservation. However, although a large number of tools and resources are now available, application of comparative genomic approaches remains far from trivial. In particular, it requires users to dynamically consider the species and methods for comparison depending on the specific biological question under investigation. While there is currently no single general rule to this end, it is clear that when applied appropriately, comparative genomic approaches exponentially increase our power in generating biological hypotheses for subsequent experimental testing.

  6. Genomics of Escherichia and Shigella

    Science.gov (United States)

    Perna, Nicole T.

    The laboratory workhorse Escherichia coli K-12 is among the most intensively studied living organisms on earth, and this single strain serves as the model system behind much of our understanding of prokaryotic molecular biology. Dense genome sequencing and recent insightful comparative analyses are making the species E. coli, as a whole, an emerging system for studying prokaryotic population genetics and the relationship between system-scale, or genome-scale, molecular evolution and complex traits like host range and pathogenic potential. Genomic perspective has revealed a coherent but dynamic species united by intraspecific gene flow via homologous lateral or horizontal transfer and differentiated by content flux mediated by acquisition of DNA segments from interspecies transfers.

  7. Genomic composition factors affect codon usage in porcine genome ...

    African Journals Online (AJOL)

    ... be explored for designing degenerate primers, necessitate selecting appropriate hosts expression systems to manipulate the expression of target genes in vivo or in vitro and improve the accuracy of gene prediction from genomic sequences thus maximizing the effectiveness of genetic manipulations in synthetic biology.

  8. Multiplexed precision genome editing with trackable genomic barcodes in yeast.

    Science.gov (United States)

    Roy, Kevin R; Smith, Justin D; Vonesch, Sibylle C; Lin, Gen; Tu, Chelsea Szu; Lederer, Alex R; Chu, Angela; Suresh, Sundari; Nguyen, Michelle; Horecka, Joe; Tripathi, Ashutosh; Burnett, Wallace T; Morgan, Maddison A; Schulz, Julia; Orsley, Kevin M; Wei, Wu; Aiyar, Raeka S; Davis, Ronald W; Bankaitis, Vytas A; Haber, James E; Salit, Marc L; St Onge, Robert P; Steinmetz, Lars M

    2018-07-01

    Our understanding of how genotype controls phenotype is limited by the scale at which we can precisely alter the genome and assess the phenotypic consequences of each perturbation. Here we describe a CRISPR-Cas9-based method for multiplexed accurate genome editing with short, trackable, integrated cellular barcodes (MAGESTIC) in Saccharomyces cerevisiae. MAGESTIC uses array-synthesized guide-donor oligos for plasmid-based high-throughput editing and features genomic barcode integration to prevent plasmid barcode loss and to enable robust phenotyping. We demonstrate that editing efficiency can be increased more than fivefold by recruiting donor DNA to the site of breaks using the LexA-Fkh1p fusion protein. We performed saturation editing of the essential gene SEC14 and identified amino acids critical for chemical inhibition of lipid signaling. We also constructed thousands of natural genetic variants, characterized guide mismatch tolerance at the genome scale, and ascertained that cryptic Pol III termination elements substantially reduce guide efficacy. MAGESTIC will be broadly useful to uncover the genetic basis of phenotypes in yeast.

  9. Genomic composition factors affect codon usage in porcine genome

    African Journals Online (AJOL)

    j.khobondo

    2015-01-28

    Jan 28, 2015 ... The mutational bias hypothesis predicted that genes in the GC-rich regions of the genome ... observed codon divided by its expected frequency at equilibrium. An RSCU value close to 1 indicates lack of bias, ..... study our results points to preferred usage of both C or G and A or T at the synonyms sites as ...
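
    The RSCU measure mentioned in this excerpt is conventionally computed as the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below follows that standard definition; the two-family codon table, the function name `rscu`, and the test sequence are illustrative assumptions and are not taken from the article.

```python
from collections import Counter

# Assumption: a deliberately tiny synonymous-codon table (two amino acids only).
SYNONYMS = {
    "Phe": ["TTT", "TTC"],
    "Lys": ["AAA", "AAG"],
}

def rscu(cds):
    """Return RSCU per codon: observed count / mean count within its synonymous family."""
    usable = len(cds) - len(cds) % 3          # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    values = {}
    for family in SYNONYMS.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue                           # amino acid absent: RSCU undefined
        expected = total / len(family)         # equal usage of all synonyms
        for codon in family:
            values[codon] = counts[codon] / expected
    return values

# Hypothetical coding sequence: TTC x3, TTT x1, AAA x1, AAG x1.
result = rscu("TTCTTCTTCTTTAAAAAG")
print(result)
```

    In this toy sequence the two Lys codons are used equally and both get an RSCU of 1 (no bias), while TTC is over-represented relative to TTT.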

  10. The Phaeodactylum genome reveals the evolutionary history of diatom genomes

    Czech Academy of Sciences Publication Activity Database

    Bowler, Ch.; Allen, A. E.; Badger, J. H.; Grimwood, J.; Jabbari, K.; Kuo, A.; Maheswari, U.; Martens, C.; Maumus, F.; Otillar, R. P.; Rayko, E.; Salamov, A.; Vandepoele, K.; Beszteri, B.; Gruber, A.; Heijde, M.; Katinka, M.; Mock, T.; Valentin, K.; Verret, F.; Berges, J. A.; Brownlee, C.; Cadoret, J.-P.; Chiovitti, A.; Choi, Ch. J.; Coesel, S.; De Martino, A.; Detter, J. Ch.; Durkin, C.; Falciatore, A.; Fournet, J.; Haruta, M.; Huysman, M. J. J.; Jenkins, B. D.; Jiroutová, Kateřina; Jorgensen, R. E.; Joubert, Y.; Kaplan, A.; Kröger, N.; Kroth, P. G.; La Roche, J.; Lindquist, E.; Lommer, M.; Martin–Jézéquel, V.; Lopez, P. J.; Lucas, S.; Mangogna, M.; McGinnis, K.; Medlin, L. K.; Montsant, A.; Oudot–Le Secq, M.-P.; Napoli, C.; Oborník, Miroslav; Schnitzler Parker, M.; Petit, J.-L.; Porcel, B. M.; Poulsen, N.; Robison, M.; Rychlewski, L.; Rynearson, T. A.; Schmutz, J.; Shapiro, H.; Siaut, M.; Stanley, M.; Sussman, M. R.; Taylor, A. R.; Vardi, A.; von Dassow, P.; Vyverman, W.; Willis, A.; Wyrwicz, L. S.; Rokhsar, D. S.; Weissenbach, J.; Armbrust, E. V.; Green, B. R.; Van de Peer, Y.; Grigoriev, I. V.

    2008-01-01

    Vol. 456, 13-11-2008 (2008), pp. 239-244 ISSN 0028-0836 Institutional research plan: CEZ:AV0Z60220518 Keywords: Phaeodactylum * genome * evolution * diatom Subject RIV: EB - Genetics; Molecular Biology Impact factor: 31.434, year: 2008

  11. Genome Size Dynamics and Evolution in Monocots

    Directory of Open Access Journals (Sweden)

    Ilia J. Leitch

    2010-01-01

    Monocot genomic diversity includes striking variation at many levels. This paper compares various genomic characters (e.g., range of chromosome numbers and ploidy levels, occurrence of endopolyploidy, GC content, chromosome packaging and organization, genome size) between monocots and the remaining angiosperms to discern just how distinctive monocot genomes are. One of the most notable features of monocots is their wide range and diversity of genome sizes, including the species with the largest genome so far reported in plants. This genomic character is analysed in greater detail, within a phylogenetic context. By surveying available genome size and chromosome data it is apparent that different monocot orders follow distinctive modes of genome size and chromosome evolution. Further insights into genome size-evolution and dynamics were obtained using statistical modelling approaches to reconstruct the ancestral genome size at key nodes across the monocot phylogenetic tree. Such approaches reveal that while the ancestral genome size of all monocots was small (1C=1.9 pg), there have been several major increases and decreases during monocot evolution. In addition, notable increases in the rates of genome size-evolution were found in Asparagales and Poales compared with other monocot lineages.

  12. The Perennial Ryegrass GenomeZipper: Targeted Use of Genome Resources for Comparative Grass Genomics

    Science.gov (United States)

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F.X.; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-01-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species. PMID:23184232

  13. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute’s genomic medicine portfolio

    Science.gov (United States)

    Manolio, Teri A.

    2016-01-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual’s genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of “Genomic Medicine Meetings,” under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI’s genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. PMID:27612677

  14. The Chlamydomonas genome project: a decade on

    Science.gov (United States)

    Blaby, Ian K.; Blaby-Haas, Crysten; Tourasse, Nicolas; Hom, Erik F. Y.; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George; Stanke, Mario; Harris, Elizabeth H.; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S.; Prochnik, Simon

    2014-01-01

    The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis and micronutrient homeostasis. Ten years since its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the “omics” era. Housed at Phytozome, the Joint Genome Institute’s (JGI) plant genomics portal, the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of RNA-Seq data. Here, we present the past, present and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. PMID:24950814

  15. Big Data Analysis of Human Genome Variations

    KAUST Repository

    Gojobori, Takashi

    2016-01-01

    Since the human genome draft sequence was made public for the first time in 2000, genomic analyses have been intensively extended to the population level. The following three international projects are good examples for large-scale studies of human

  16. V-GAP: Viral genome assembly pipeline

    KAUST Repository

    Nakamura, Yoji

    2015-10-22

    Next-generation sequencing technologies have allowed the rapid determination of the complete genomes of many organisms. Although it is still difficult to reconstruct shotgun sequences from large-genome organisms into perfect contigs, each of which represents a full chromosome, those from small genomes have been assembled successfully into a very small number of contigs. In this study, we show that shotgun reads from phage genomes can be reconstructed into a single contig by controlling the number of read sequences used in de novo assembly. We have developed a pipeline to assemble small viral genomes with good reliability using a resampling method from shotgun data. This pipeline, named V-GAP (Viral Genome Assembly Pipeline), will contribute to the rapid genome typing of viruses, which are highly divergent, and thus will meet the increasing need for viral genome comparisons in metagenomic studies.
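
    The idea of controlling the number of reads by resampling can be sketched as follows. This is not V-GAP's actual code: the function names, the subset sizes, the seeding scheme, and the placeholder read identifiers are all assumptions, and the assembly step itself is left out.

```python
import random

def subsample_reads(reads, n, seed):
    """Draw n reads without replacement; a fixed seed makes the draw repeatable."""
    rng = random.Random(seed)
    return rng.sample(reads, min(n, len(reads)))

def resampling_plan(reads, sizes, replicates):
    """Yield (size, replicate, subset) for every combination of subset size and replicate."""
    for size in sizes:
        for rep in range(replicates):
            # Hypothetical seeding scheme: one distinct seed per size/replicate pair.
            yield size, rep, subsample_reads(reads, size, seed=size * 1000 + rep)

# Placeholder identifiers standing in for parsed shotgun reads.
reads = [f"read_{i}" for i in range(1000)]
plan = list(resampling_plan(reads, sizes=[100, 200], replicates=3))

for size, rep, subset in plan:
    # Each subset would be passed to a de novo assembler here (not shown).
    print(size, rep, len(subset))
```

    Running several replicates per subset size makes it possible to compare the resulting assemblies and keep the read number that most consistently yields a single contig.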

  17. All about the Human Genome Project (HGP)

    Science.gov (United States)


  18. Hapsembler: An Assembler for Highly Polymorphic Genomes

    Science.gov (United States)

    Donmez, Nilgun; Brudno, Michael

    As whole genome sequencing has become a routine biological experiment, algorithms for assembly of whole genome shotgun data have become a topic of extensive research, with a plethora of off-the-shelf methods that can reconstruct the genomes of many organisms. Simultaneously, several recently sequenced genomes exhibit very high polymorphism rates. For these organisms genome assembly remains a challenge as most assemblers are unable to handle highly divergent haplotypes in a single individual. In this paper we describe Hapsembler, an assembler for highly polymorphic genomes, which makes use of paired reads. Our experiments show that Hapsembler produces accurate and contiguous assemblies of highly polymorphic genomes, while performing on par with the leading tools on haploid genomes. Hapsembler is available for download at http://compbio.cs.toronto.edu/hapsembler.

  19. Collaborative Genomics Study Advances Precision Oncology

    Science.gov (United States)

    A collaborative study conducted by two Office of Cancer Genomics (OCG) initiatives highlights the importance of integrating structural and functional genomics programs to improve cancer therapies, and more specifically, contribute to precision oncology treatments for children.

  20. Unsupervised statistical identification of genomic islands using ...

    Indian Academy of Sciences (India)

    Vibrio species. These investigations lead to observations that are of evolutionary ... Identification of genomic islands in prokaryotic genomes has received considerable attention in the literature due to .... For instance, selective pressures as a ...

  1. V-GAP: Viral genome assembly pipeline

    KAUST Repository

    Nakamura, Yoji; Yasuike, Motoshige; Nishiki, Issei; Iwasaki, Yuki; Fujiwara, Atushi; Kawato, Yasuhiko; Nakai, Toshihiro; Nagai, Satoshi; Kobayashi, Takanori; Gojobori, Takashi; Ototake, Mitsuru

    2015-01-01

    Next-generation sequencing technologies have allowed the rapid determination of the complete genomes of many organisms. Although shotgun sequences from organisms with large genomes are still difficult to reconstruct into perfect contigs, each of which represents a full chromosome, those from small genomes have been assembled successfully into a very small number of contigs. In this study, we show that shotgun reads from phage genomes can be reconstructed into a single contig by controlling the number of read sequences used in de novo assembly. We have developed a pipeline to assemble small viral genomes with good reliability using a resampling method from shotgun data. This pipeline, named V-GAP (Viral Genome Assembly Pipeline), will contribute to the rapid genome typing of viruses, which are highly divergent, and thus will meet the increasing need for viral genome comparisons in metagenomic studies.
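
    The idea of controlling the number of reads by resampling can be illustrated with a minimal sketch. This is not the V-GAP implementation; the function names, the fixed subset size, and the seeding scheme are assumptions made only for illustration, and the assembly step itself is left out.

```python
import random


def subsample_reads(reads, n, seed=0):
    """Draw n reads without replacement; a fixed seed makes the draw repeatable."""
    if n > len(reads):
        raise ValueError("cannot sample more reads than are available")
    return random.Random(seed).sample(reads, n)


def resampled_subsets(reads, n, replicates, seed=0):
    """Yield `replicates` independently drawn subsets, each holding n reads.

    Each subset would be handed to a de novo assembler of choice
    (hypothetical step, not shown here).
    """
    for i in range(replicates):
        yield subsample_reads(reads, n, seed=seed + i)


if __name__ == "__main__":
    # Toy read identifiers standing in for real shotgun reads.
    reads = [f"read{i}" for i in range(100)]
    for subset in resampled_subsets(reads, n=10, replicates=3):
        print(len(subset), subset[:3])
```

    Drawing several subsets of the same size lets the resulting assemblies be compared with one another, which is one plausible way to judge reliability.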

  2. The 1000 bull genome project

    Science.gov (United States)

    To meet growing global demands for high value protein from milk and meat, rates of genetic gain in domestic cattle must be accelerated. At the same time, animal health and welfare must be considered. The 1000 bull genomes project supports these goals by providing annotated sequence variants and ge...

  3. Computational genomics of specialized metabolism

    NARCIS (Netherlands)

    Medema, Marnix H.

    2018-01-01

    Microbial and plant specialized metabolites, also known as natural products, are key mediators of microbe-microbe and host-microbe interactions and constitute a rich resource for drug development. In the past decade, genome mining has emerged as a prominent strategy for natural product discovery.

  4. Genomic Heritability: What Is It?

    DEFF Research Database (Denmark)

    de los Campos, Gustavo; Sorensen, Daniel; Gianola, Daniel

    2015-01-01

    Whole-genome regression methods are being increasingly used for the analysis and prediction of complex traits and diseases. In human genetics, these methods are commonly used for inferences about genetic parameters, such as the amount of genetic variance among individuals or the proportion of phe...

  5. Targeted sequencing of plant genomes

    Science.gov (United States)

    Mark D. Huynh

    2014-01-01

    Next-generation sequencing (NGS) has revolutionized the field of genetics by providing a means for fast and relatively affordable sequencing. With the advancement of NGS, whole-genome sequencing (WGS) has become more commonplace. However, sequencing an entire genome is still not cost effective or even beneficial in all cases. In studies that do not require a whole-...

  6. Fungal genome resources at NCBI

    Science.gov (United States)

    Robbertse, B.; Tatusova, T.

    2011-01-01

    The National Center for Biotechnology Information (NCBI) is well known for the nucleotide sequence archive GenBank and the sequence analysis tool BLAST. However, NCBI integrates many types of biomolecular data from a variety of sources and makes them available to the scientific community as interactive web resources as well as organized releases of bulk data. These tools are available to explore and compare fungal genomes. Searching all databases with Fungi [organism] at http://www.ncbi.nlm.nih.gov/ is the quickest way to find resources of interest with fungal entries. Some tools, though, are resource-specific and can be accessed indirectly from a particular database in the Entrez system. These include graphical viewers and comparative analysis tools such as TaxPlot, TaxMap and UniGene DDD (found via UniGene Homepage). Gene and BioProject pages also serve as portals to external data such as community annotation websites, BioGrid and UniProt. There are many different ways of accessing genomic data at NCBI. Depending on the focus and goal of research projects or the level of interest, a user would select a particular route for accessing genomic databases and resources. This review article describes methods of accessing fungal genome data and provides examples that illustrate the use of analysis tools. PMID:22737589

  7. Genomic methods take the plunge

    DEFF Research Database (Denmark)

    Cammen, Kristina M.; Andrews, Kimberly R.; Carroll, Emma L.

    2016-01-01

    The dramatic increase in the application of genomic techniques to non-model organisms (NMOs) over the past decade has yielded numerous valuable contributions to evolutionary biology and ecology, many of which would not have been possible with traditional genetic markers. We review this recent...

  8. Ecological Genomics of Marine Picocyanobacteria

    Science.gov (United States)

    Scanlan, D. J.; Ostrowski, M.; Mazard, S.; Dufresne, A.; Garczarek, L.; Hess, W. R.; Post, A. F.; Hagemann, M.; Paulsen, I.; Partensky, F.

    2009-01-01

    Summary: Marine picocyanobacteria of the genera Prochlorococcus and Synechococcus numerically dominate the picophytoplankton of the world ocean, making a key contribution to global primary production. Prochlorococcus was isolated around 20 years ago and is probably the most abundant photosynthetic organism on Earth. The genus comprises specific ecotypes which are phylogenetically distinct and differ markedly in their photophysiology, allowing growth over a broad range of light and nutrient conditions within the 45°N to 40°S latitudinal belt that they occupy. Synechococcus and Prochlorococcus are closely related, together forming a discrete picophytoplankton clade, but are distinguishable by their possession of dissimilar light-harvesting apparatuses and differences in cell size and elemental composition. Synechococcus strains have a ubiquitous oceanic distribution compared to that of Prochlorococcus strains and are characterized by phylogenetically discrete lineages with a wide range of pigmentation. In this review, we put our current knowledge of marine picocyanobacterial genomics into an environmental context and present previously unpublished genomic information arising from extensive genomic comparisons in order to provide insights into the adaptations of these marine microbes to their environment and how they are reflected at the genomic level. PMID:19487728

  9. Genomic applications in forensic medicine

    DEFF Research Database (Denmark)

    Børsting, Claus; Morling, Niels

    2016-01-01

    Since the 1980s, advances in DNA technology have revolutionized the scope and practice of forensic medicine. From the days of restriction fragment length polymorphisms (RFLPs) to short tandem repeats (STRs), the current focus is on the next generation genome sequencing. It has been almost a decad...

  10. Human Genome Research: Decoding DNA

    Science.gov (United States)

    ... of the DNA double helix during April 2003. James D. Watson, Francis Crick, and Maurice Wilkins were ... company Celera announced the completion of a "working draft" reference DNA sequence of the human ...

  11. The genome of Chenopodium quinoa

    NARCIS (Netherlands)

    Jarvis, D.E.; Shwen Ho, Yung; Lightfoot, Damien J.; Schmöckel, Sandra M.; Li, Bo; Borm, T.J.A.; Ohyanagi, Hajime; Mineta, Katsuhiko; Mitchell, Craig T.; Saber, Noha; Kharbatia, Najeh M.; Rupper, Ryan R.; Sharp, Aaron R.; Dally, Nadine; Boughton, Berin A.; Woo, Yong H.; Gao, Ge; Schijlen, E.G.W.M.; Guo, Xiujie; Momin, Afaque A.; Negräo, Sónia; Al-Babili, Salim; Gehring, Christoph; Roessner, Ute; Jung, Christian; Murphy, Kevin; Arold, Stefan T.; Gojobori, Takashi; Linden, van der C.G.; Loo, van E.N.; Jellen, Eric N.; Maughan, Peter J.; Tester, Mark

    2017-01-01

    Chenopodium quinoa (quinoa) is a highly nutritious grain identified as an important crop to improve world food security. Unfortunately, few resources are available to facilitate its genetic improvement. Here we report the assembly of a high-quality, chromosome-scale reference genome sequence for

  12. Genome position and gene amplification

    Czech Academy of Sciences Publication Activity Database

    Jirsová, Pavla; Snijders, A.M.; Kwek, S.; Roydasgupta, R.; Fridlyand, J.; Tokuyasu, T.; Pinkel, D.; Albertson, D. G.

    2007-01-01

    Vol. 8, No. 6 (2007), r120. ISSN 1474-760X. Institutional research plan: CEZ:AV0Z50040507; CEZ:AV0Z50040702. Keywords: gene amplification; array comparative genomic hybridization; oncogene. Subject RIV: BO - Biophysics. Impact factor: 6.589 (2007)

  13. India, Genomic diversity & Disease susceptibility

    Indian Academy of Sciences (India)

    Table of contents. India, Genomic diversity & Disease susceptibility · India, a paradise for Genetic Studies · Involved in earlier stages of Immune response protecting us from Diseases, Responsible for kidney and other transplant rejections Inherited from our parents.

  14. GENOMIC FEATURES OF COTESIA PLUTELLAE POLYDNAVIRUS

    Institute of Scientific and Technical Information of China (English)

    LIU Cai-ling; ZHU Xiang-xiong; FU Wen-jun; ZHAO Mu-jun

    2003-01-01

    Polydnavirus was purified from the calyx fluid of Cotesia plutellae ovary. The genomic features of C. plutellae polydnavirus (CpPDV) were investigated. The viral genome consists of at least 12 different segments, and the aggregate genome size is estimated to be at least 80 kbp. By partial digestion of CpPDV DNA with BamHI and subsequent ligation with BamHI-cut plasmid Bluescript, a representative library of the CpPDV genome was obtained.

  15. Reconciling Utility with Privacy in Genomics

    OpenAIRE

    Humbert, Mathias; Ayday, Erman; Hubaux, Jean-Pierre; Telenti, Amalio

    2014-01-01

    Direct-to-consumer genetic testing makes it possible for everyone to learn their genome sequences. In order to contribute to medical research, a growing number of people publish their genomic data on the Web, sometimes under their real identities. However, this is at odds not only with their own privacy but also with the privacy of their relatives. The genomes of relatives being highly correlated, some family members might be opposed to revealing any of the family's genomic data. In this pape...

  16. Genome chaos: survival strategy during crisis.

    Science.gov (United States)

    Liu, Guo; Stevens, Joshua B; Horne, Steven D; Abdallah, Batoul Y; Ye, Karen J; Bremer, Steven W; Ye, Christine J; Chen, David J; Heng, Henry H

    2014-01-01

    Genome chaos, a process of complex, rapid genome re-organization, results in the formation of chaotic genomes, which is followed by the potential to establish stable genomes. It was initially detected through cytogenetic analyses, and recently confirmed by whole-genome sequencing efforts which identified multiple subtypes including "chromothripsis", "chromoplexy", "chromoanasynthesis", and "chromoanagenesis". Although genome chaos occurs commonly in tumors, both the mechanism and detailed aspects of the process are unknown due to the inability to observe its evolution over time in clinical samples. Here, an experimental system to monitor the evolutionary process of genome chaos was developed to elucidate its mechanisms. Genome chaos occurs following exposure to chemotherapeutics with different mechanisms, which act collectively as stressors. Characterization of the karyotype and its dynamic changes prior to, during, and after induction of genome chaos demonstrates that chromosome fragmentation (C-Frag) occurs just prior to chaotic genome formation. Chaotic genomes seem to form by random rejoining of chromosomal fragments, in part through non-homologous end joining (NHEJ). Stress-induced genome chaos results in increased karyotypic heterogeneity. Such increased evolutionary potential is demonstrated by the identification of increased transcriptome dynamics associated with high levels of karyotypic variance. In contrast to impacting on a limited number of cancer genes, re-organized genomes lead to new system dynamics essential for cancer evolution. Genome chaos acts as a mechanism of rapid, adaptive, genome-based evolution that plays an essential role in promoting rapid macroevolution of new genome-defined systems during crisis, which may explain some unwanted consequences of cancer treatment.

  17. Genomics-assisted breeding in fruit trees

    OpenAIRE

    Iwata, Hiroyoshi; Minamikawa, Mai F.; Kajiya-Kanegae, Hiromi; Ishimori, Motoyuki; Hayashi, Takeshi

    2016-01-01

    Recent advancements in genomic analysis technologies have opened up new avenues to promote the efficiency of plant breeding. Novel genomics-based approaches for plant breeding and genetics research, such as genome-wide association studies (GWAS) and genomic selection (GS), are useful, especially in fruit tree breeding. The breeding of fruit trees is hindered by their long generation time, large plant size, long juvenile phase, and the necessity to wait for the physiological maturity of the pl...

  18. insights from the genome of Melitaea cinxia

    OpenAIRE

    Ahola, Virpi; Wahlberg, Niklas; Frilander, Mikko J.

    2017-01-01

    The first lepidopteran genome (Bombyx mori) was published in 2004. Ten years later the genome of Melitaea cinxia came out as the third butterfly genome published, and the first eukaryotic genome sequenced in Finland. Owing to Ilkka Hanski, the M. cinxia system in the Åland Islands has become a famous model for metapopulation biology. More than 20 years of research on this system provides a strong ecological basis upon which a genetic framework could be built. Genetic knowledge is an e...

  19. GRAbB : Selective Assembly of Genomic Regions, a New Niche for Genomic Research

    NARCIS (Netherlands)

    Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

    GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often

  20. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

    OpenAIRE

    Wolf Yuri I; Novichkov Pavel S; Sorokin Alexander V; Makarova Kira S; Koonin Eugene V

    2007-01-01

    Background: An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs ...

  1. Accounting for discovery bias in genomic EPD

    Science.gov (United States)

    Genomics has contributed substantially to genetic improvement of beef cattle. The implementation is through computation of genomically enhanced expected progeny differences (GE-EPD), which are predictions of genetic merit of individual animals based on genomic information, pedigree, and data on the ...

  2. The UCSC Genome Browser Database: update 2006

    DEFF Research Database (Denmark)

    Hinrichs, A S; Karolchik, D; Baertsch, R

    2006-01-01

    The University of California Santa Cruz Genome Browser Database (GBD) contains sequence and annotation data for the genomes of about a dozen vertebrate species and several major model organisms. Genome annotations typically include assembly data, sequence composition, genes and gene predictions, ...

  3. The UCSC genome browser database: update 2007

    DEFF Research Database (Denmark)

    Kuhn, R M; Karolchik, D; Zweig, A S

    2006-01-01

    The University of California, Santa Cruz Genome Browser Database contains, as of September 2006, sequence and annotation data for the genomes of 13 vertebrate and 19 invertebrate species. The Genome Browser displays a wide variety of annotations at all scales from the single nucleotide level up t...

  4. Impact of genomics on microbial food safety

    NARCIS (Netherlands)

    Abee, T.; Schaik, van W.; Siezen, R.J.

    2004-01-01

    Genome sequences are now available for many of the microbes that cause food-borne diseases. The information contained in pathogen genome sequences, together with the development of themed and whole-genome DNA microarrays and improved proteomics techniques, might provide tools for the rapid detection

  5. Genomic individuality and its biological implications.

    Science.gov (United States)

    Zhao, J

    1996-06-01

    It is a widely accepted fundamental concept that all somatic genomes of a human individual are identical to each other. The theoretical basis of this concept is that all of these somatic genomes are the descendants of the genome of a single fertilized cell as well as the simple replicated products of asexual reproduction, thus not forming any new recombined genomes. The question here is whether such a concept might only represent one side of somatic genome biology and, even worse, whether it has perhaps already led to a very prevalent misconception that within the organism body, there exists no variability among individual somatic genomes. A hypothesis, called genomic individuality, is proposed, simply saying that every individual somatic genome, perhaps with rare exceptions, has its own unique or individual 'genetic identity' or 'fingerprint', which is characterized by its distinctive sequences or patterns of deoxyribonucleic acid molecules, or both. Thus, no two somatic genomes can be identical to each other in every or all aspects, and consequently, there must be a great deal of genomic variation present within the body of any multicellular organism. The concept or hypothesis of genomic individuality would not only provide a more complete understanding of genome biology, but also suggest a new insight into the studies of the biology of cells and organisms.

  6. Leaner and meaner genomes in Escherichia coli

    DEFF Research Database (Denmark)

    Ussery, David

    2006-01-01

    A 'better' Escherichia coli K-12 genome has recently been engineered in which about 15% of the genome has been removed by planned deletions. Comparison with related bacterial genomes that have undergone a natural reduction in size suggests that there is plenty of scope for yet more deletions....

  7. Comparative Genomics of Carp Herpesviruses

    Science.gov (United States)

    Kurobe, Tomofumi; Gatherer, Derek; Cunningham, Charles; Korf, Ian; Fukuda, Hideo; Hedrick, Ronald P.; Waltzek, Thomas B.

    2013-01-01

    Three alloherpesviruses are known to cause disease in cyprinid fish: cyprinid herpesviruses 1 and 3 (CyHV1 and CyHV3) in common carp and koi and cyprinid herpesvirus 2 (CyHV2) in goldfish. We have determined the genome sequences of CyHV1 and CyHV2 and compared them with the published CyHV3 sequence. The CyHV1 and CyHV2 genomes are 291,144 and 290,304 bp, respectively, in size, and thus the CyHV3 genome, at 295,146 bp, remains the largest recorded among the herpesviruses. Each of the three genomes consists of a unique region flanked at each terminus by a sizeable direct repeat. The CyHV1, CyHV2, and CyHV3 genomes are predicted to contain 137, 150, and 155 unique, functional protein-coding genes, respectively, of which six, four, and eight, respectively, are duplicated in the terminal repeat. The three viruses share 120 orthologous genes in a largely colinear arrangement, of which up to 55 are also conserved in the other member of the genus Cyprinivirus, anguillid herpesvirus 1. Twelve genes are conserved convincingly in all sequenced alloherpesviruses, and two others are conserved marginally. The reference CyHV3 strain has been reported to contain five fragmented genes that are presumably nonfunctional. The CyHV2 strain has two fragmented genes, and the CyHV1 strain has none. CyHV1, CyHV2, and CyHV3 have five, six, and five families of paralogous genes, respectively. One family unique to CyHV1 is related to cellular JUNB, which encodes a transcription factor involved in oncogenesis. To our knowledge, this is the first time that JUNB-related sequences have been reported in a herpesvirus. PMID:23269803

  8. Characterization of partial and near full-length genomes of HIV-1 strains sampled from recently infected individuals in São Paulo, Brazil.

    Directory of Open Access Journals (Sweden)

    Sabri Saeed Sanabani

    BACKGROUND: Genetic variability is a major feature of human immunodeficiency virus type 1 (HIV-1) and is considered the key factor frustrating efforts to halt the HIV epidemic. A proper understanding of HIV-1 genomic diversity is a fundamental prerequisite for proper epidemiology, genetic diagnosis, and successful drug and vaccine design. Here, we report on the partial and near full-length genomic (NFLG) variability of HIV-1 isolates from a well-characterized cohort of recently infected patients in São Paulo, Brazil. METHODOLOGY: HIV-1 proviral DNA was extracted from the peripheral blood mononuclear cells of 113 participants. The NFLG and partial fragments were determined by overlapping nested PCR and direct sequencing. The data were phylogenetically analyzed. RESULTS: Of the 113 samples (90.3% male; median age 31 years; 79.6% homosexual men) studied, 77 (68.1%) NFLGs and 32 (29.3%) partial fragments were successfully subtyped. Of the successfully subtyped sequences, 88 (80.7%) were subtype B sequences, 12 (11%) BF1 recombinants, 3 (2.8%) subtype C sequences, 2 (1.8%) each BC recombinants and subclade F1, 1 (0.9%) CRF02_AG, and 1 (0.9%) CRF31_BC. Primary drug resistance mutations were observed in 14/101 (13.9%) of samples, with 5.9% being resistant to protease inhibitors and nucleoside reverse transcriptase inhibitors (NRTIs) and 4.9% resistant to non-NRTIs. Predictions of viral tropism were determined for 86 individuals. X4 or X4 dual or mixed-tropic viruses (X4/DM) were seen in 26 (30.2%) of subjects. The proportion of X4 viruses in homosexuals was detected in 19/69 (27.5%). CONCLUSIONS: Our results confirm the existence of various HIV-1 subtypes circulating in São Paulo, and indicate that subtype B accounts for the majority of infections. Antiretroviral (ARV) drug resistance is relatively common among recently infected patients. The proportion of X4 viruses in homosexuals was significantly higher than the proportion seen in other study populations.
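
    The subtype percentages in this abstract can be rechecked from the reported counts. The sketch below assumes the denominator for the subtype breakdown is the 109 successfully subtyped sequences (77 NFLGs plus 32 partial fragments); the dictionary keys and the helper name `pct` are illustrative only.

```python
def pct(count, total):
    """Percentage rounded to one decimal place, as reported in the abstract."""
    return round(100 * count / total, 1)


# Counts taken from the abstract; the grouping into a dict is an assumption.
subtype_counts = {
    "B": 88,
    "BF1 recombinant": 12,
    "C": 3,
    "BC recombinant": 2,
    "F1": 2,
    "CRF02_AG": 1,
    "CRF31_BC": 1,
}

subtyped_total = 77 + 32  # NFLGs plus partial fragments

if __name__ == "__main__":
    print("total subtyped:", sum(subtype_counts.values()), "of", subtyped_total)
    for name, count in subtype_counts.items():
        print(f"{name}: {count} ({pct(count, subtyped_total)}%)")
    print("primary resistance:", pct(14, 101), "%")
    print("X4 or X4/DM tropism:", pct(26, 86), "%")
    print("X4 among homosexual men:", pct(19, 69), "%")
```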

  9. Genome Variation Map: a data repository of genome variations in BIG Data Center

    OpenAIRE

    Song, Shuhui; Tian, Dongmei; Li, Cuiping; Tang, Bixia; Dong, Lili; Xiao, Jingfa; Bao, Yiming; Zhao, Wenming; He, Hang; Zhang, Zhang

    2017-01-01

    The Genome Variation Map (GVM; http://bigd.big.ac.cn/gvm/) is a public data repository of genome variations. As a core resource in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, GVM is dedicated to collecting, integrating and visualizing genome variations for a wide range of species, accepts submissions of different types of genome variations from all over the world and provides free open access to all publicly available data in support of worldwide research a...

  10. Discovering regulatory motifs in the Plasmodium genome using comparative genomics

    OpenAIRE

    Wu, Jie; Sieglaff, Douglas H.; Gervin, Joshua; Xie, Xiaohui S.

    2008-01-01

    Motivation: Understanding gene regulation in Plasmodium, the causative agent of malaria, is an important step in deciphering its complex life cycle as well as leading to possible new targets for therapeutic applications. Very little is known about gene regulation in Plasmodium, and in particular, few regulatory elements have been identified. Such discovery has been significantly hampered by the high A-T content of some of the genomes of Plasmodium species, as well as the challenge in associat...

  11. Prospects for Genomic Research in Forestry

    Directory of Open Access Journals (Sweden)

    K. V. Krutovsky

    2014-08-01

    Conifers are keystone species of boreal forests. Their whole genome sequencing, assembly and annotation will allow us to understand the evolution of the complex ancient giant conifer genomes, which are 4 times larger in larch and 7–9 times larger in pines than the human genome. Genomic studies will also make it possible to obtain important whole genome sequence data and develop highly polymorphic and informative genetic markers, such as microsatellites and single nucleotide polymorphisms (SNPs), that can be efficiently used in timber origin identification, for genetic variation monitoring, to study local and climate change adaptation, and in tree improvement and conservation programs.

  12. Value of a newly sequenced bacterial genome

    DEFF Research Database (Denmark)

    Barbosa, Eudes; Aburjaile, Flavia F; Ramos, Rommel Tj

    2014-01-01

    and annotation will not be undertaken. It is important to know what is lost when we settle for a draft genome and to determine the "scientific value" of a newly sequenced genome. This review addresses the expected impact of newly sequenced genomes on antibacterial discovery and vaccinology. Also, it discusses...... heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting...

  13. Software for computing and annotating genomic ranges.

    Directory of Open Access Journals (Sweden)

    Michael Lawrence

    We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.

  14. Resolution effects in reconstructing ancestral genomes.

    Science.gov (United States)

    Zheng, Chunfang; Jeong, Yuji; Turcotte, Madisyn Gabrielle; Sankoff, David

    2018-05-09

    The reconstruction of ancestral genomes must deal with the problem of resolution, necessarily involving a trade-off between trying to identify genomic details and being overwhelmed by noise at higher resolutions. We use the median reconstruction at the synteny block level, of the ancestral genome of the order Gentianales, based on coffee, Rhazya stricta and grape, to exemplify the effects of resolution (granularity) on comparative genomic analyses. We show how decreased resolution blurs the differences between evolving genomes, with respect to rate, mutational process and other characteristics.

  15. The ecoresponsive genome of Daphnia pulex

    Energy Technology Data Exchange (ETDEWEB)

    Colbourne, John K.; Pfrender, Michael E.; Gilbert, Donald; Thomas, W. Kelley; Tucker, Abraham; Oakley, Todd H.; Tokishita, Shinichi; Aerts, Andrea; Arnold, Georg J.; Basu, Malay Kumar; Bauer, Darren J.; Caceres, Carla E.; Carmel, Liran; Casola, Claudio; Choi, Jeong-Hyeon; Detter, John C.; Dong, Qunfeng; Dusheyko, Serge; Eads, Brian D.; Frohlich, Thomas; Geiler-Samerotte, Kerry A.; Gerlach, Daniel; Hatcher, Phil; Jogdeo, Sanjuro; Krijgsveld, Jeroen; Kriventseva, Evgenia V; Kültz, Dietmar; Laforsch, Christian; Lindquist, Erika; Lopez, Jacqueline; Manak, Robert; Muller, Jean; Pangilinan, Jasmyn; Patwardhan, Rupali P.; Pitluck, Samuel; Pritham, Ellen J.; Rechtsteiner, Andreas; Rho, Mina; Rogozin, Igor B.; Sakarya, Onur; Salamov, Asaf; Schaack, Sarah; Shapiro, Harris; Shiga, Yasuhiro; Skalitzky, Courtney; Smith, Zachary; Souvorov, Alexander; Sung, Way; Tang, Zuojian; Tsuchiya, Dai; Tu, Hank; Vos, Harmjan; Wang, Mei; Wolf, Yuri I.; Yamagata, Hideo; Yamada, Takuji; Ye, Yuzhen; Shaw, Joseph R.; Andrews, Justen; Crease, Teresa J.; Tang, Haixu; Lucas, Susan M.; Robertson, Hugh M.; Bork, Peer; Koonin, Eugene V.; Zdobnov, Evgeny M.; Grigoriev, Igor V.; Lynch, Michael; Boore, Jeffrey L.

    2011-02-04

    This document provides supporting material related to the sequencing of the ecoresponsive genome of Daphnia pulex. This material includes information on materials and methods and supporting text, as well as supplemental figures, tables, and references. The coverage of materials and methods addresses genome sequence, assembly, and mapping to chromosomes, gene inventory, attributes of a compact genome, the origin and preservation of Daphnia pulex genes, implications of Daphnia's genome structure, evolutionary diversification of duplicated genes, functional significance of expanded gene families, and ecoresponsive genes. Supporting text covers chromosome studies, gene homology among Daphnia genomes, micro-RNA and transposable elements and the 46 Daphnia pulex opsins. 36 figures, 50 tables, 183 references.

  16. Software for computing and annotating genomic ranges.

    Science.gov (United States)

    Lawrence, Michael; Huber, Wolfgang; Pagès, Hervé; Aboyoun, Patrick; Carlson, Marc; Gentleman, Robert; Morgan, Martin T; Carey, Vincent J

    2013-01-01

    We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.
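
    The range operations named in this abstract (overlap detection and coverage calculation) can be illustrated with a small, language-neutral sketch. It does not use or reproduce the Bioconductor API; the `Range` type, the function names, and the 1-based closed coordinate convention are assumptions chosen for illustration, and the quadratic overlap search is far simpler than the efficient algorithms the packages provide.

```python
from collections import namedtuple

# Hypothetical range type: chromosome name plus 1-based, closed start/end.
Range = namedtuple("Range", "chrom start end")


def overlaps(a, b):
    """True when two ranges share at least one position on the same chromosome."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


def find_overlaps(query, subject):
    """Return (query_index, subject_index) pairs for every overlapping pair."""
    return [
        (i, j)
        for i, q in enumerate(query)
        for j, s in enumerate(subject)
        if overlaps(q, s)
    ]


def coverage(ranges, chrom, length):
    """Per-position depth on one chromosome of the given length."""
    depth = [0] * length
    for r in ranges:
        if r.chrom != chrom:
            continue
        # Clip the range to the chromosome before counting.
        for pos in range(max(r.start, 1), min(r.end, length) + 1):
            depth[pos - 1] += 1
    return depth


if __name__ == "__main__":
    query = [Range("chr1", 1, 10), Range("chr1", 20, 30), Range("chr2", 5, 8)]
    subject = [Range("chr1", 8, 22), Range("chr2", 9, 12)]
    print(find_overlaps(query, subject))
    print(coverage(query + subject, "chr1", 12))
```

    A production implementation would typically sort the ranges or use an interval tree so that overlap queries do not require comparing every pair.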

  17. Genomic alterations detected by comparative genomic hybridization in ovarian endometriomas

    Directory of Open Access Journals (Sweden)

    L.C. Veiga-Castelli

    2010-08-01

    Endometriosis is a complex and multifactorial disease. Chromosomal imbalance screening in endometriotic tissue can be used to detect hot-spot regions in the search for a possible genetic marker for endometriosis. The objective of the present study was to detect chromosomal imbalances by comparative genomic hybridization (CGH) in ectopic tissue samples from ovarian endometriomas and eutopic tissue from the same patients. We evaluated 10 ovarian endometriotic tissues and 10 eutopic endometrial tissues by metaphase CGH. CGH was prepared with normal and test DNA enzymatically digested, ligated to adaptors and amplified by PCR. A second PCR was performed for DNA labeling. Equal amounts of both normal and test-labeled DNA were hybridized in human normal metaphases. The Isis FISH Imaging System V 5.0 software was used for chromosome analysis. In both eutopic and ectopic groups, 4/10 samples presented chromosomal alterations, mainly chromosomal gains. CGH identified 11q12.3-q13.1, 17p11.1-p12, 17q25.3-qter, and 19p as critical regions. Genomic imbalances in 11q, 17p, 17q, and 19p were detected in normal eutopic and/or ectopic endometrium from women with ovarian endometriosis. These regions contain genes such as POLR2G, MXRA7 and UBA52 involved in biological processes that may lead to the establishment and maintenance of endometriotic implants. This genomic imbalance may affect genes in which dysregulation impacts both eutopic and ectopic endometrium.

  18. Fungal Genomics for Energy and Environment

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor V.

    2013-03-11

    Genomes of fungi relevant to energy and environment are the focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). One of its projects, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts) by means of genome sequencing and analysis. New chapters of the Encyclopedia can be opened with user proposals to the JGI Community Sequencing Program (CSP). Another JGI project, the 1000 fungal genomes, explores fungal diversity at the genome level at scale and is open for users to nominate new species for sequencing. Over 200 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web portal that integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics leads to the development of parts lists for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  19. Insights into Conifer Giga-Genomes

    Science.gov (United States)

    De La Torre, Amanda R.; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K.; Jansson, Stefan; Jones, Steven J.M.; Keeling, Christopher I.; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

    2014-01-01

    Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world’s forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20–30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325

  20. Genome Improvement at JGI-HAGSC

    Energy Technology Data Exchange (ETDEWEB)

    Grimwood, Jane; Schmutz, Jeremy J.; Myers, Richard M.

    2012-03-03

    Since the completion of the sequencing of the human genome, the Joint Genome Institute (JGI) has rapidly expanded its scientific goals in several DOE mission-relevant areas. At the JGI-HAGSC, we have kept pace with this rapid expansion of projects with our focus on assessing, assembling, improving and finishing eukaryotic whole genome shotgun (WGS) projects for which the shotgun sequence is generated at the Production Genomic Facility (JGI-PGF). We follow this by combining the draft WGS with genomic resources generated at JGI-HAGSC or in collaborator laboratories (including BAC end sequences, genetic maps and FLcDNA sequences) to produce an improved draft sequence. For eukaryotic genomes important to the DOE mission, we then add further information from directed experiments to produce reference genomic sequences that are publicly available for any scientific researcher. Also, we have continued our program for producing BAC-based finished sequence, both for adding information to JGI genome projects and for small BAC-based sequencing projects proposed through any of the JGI sequencing programs. We have now built our computational expertise in WGS assembly and analysis and have moved eukaryotic genome assembly from the JGI-PGF to JGI-HAGSC. We have concentrated our assembly development work on large plant genomes and complex fungal and algal genomes.

  1. Genomic Diversity and Evolution of the Lyssaviruses

    Science.gov (United States)

    Delmas, Olivier; Holmes, Edward C.; Talbi, Chiraz; Larrous, Florence; Dacheux, Laurent; Bouchier, Christiane; Bourhy, Hervé

    2008-01-01

    Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as ‘Lagos Bat’. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses. PMID:18446239

  2. Genomic diversity and evolution of the lyssaviruses.

    Directory of Open Access Journals (Sweden)

    Olivier Delmas

    2008-04-01

    Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as 'Lagos Bat'. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses.

  3. Snake Genome Sequencing: Results and Future Prospects.

    Science.gov (United States)

    Kerkkamp, Harald M I; Kini, R Manjunatha; Pospelov, Alexey S; Vonk, Freek J; Henkel, Christiaan V; Richardson, Michael K

    2016-12-01

    Snake genome sequencing is in its infancy, very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.

  4. Insights from Human/Mouse genome comparisons

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.

    2003-03-30

    Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). Although only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.

  5. Sequencing intractable DNA to close microbial genomes.

    Directory of Open Access Journals (Sweden)

    Richard A Hurt

    Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable", resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  6. Snake Genome Sequencing: Results and Future Prospects

    Directory of Open Access Journals (Sweden)

    Harald M. I. Kerkkamp

    2016-12-01

    Snake genome sequencing is in its infancy, very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.

  7. Sequencing Intractable DNA to Close Microbial Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Hurt, Jr., Richard Ashley [ORNL; Brown, Steven D [ORNL; Podar, Mircea [ORNL; Palumbo, Anthony Vito [ORNL; Elias, Dwayne A [ORNL

    2012-01-01

    Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled intractable, resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such difficult regions in the non-contiguous finished Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. These developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  8. Current development and application of soybean genomics

    Institute of Scientific and Technical Information of China (English)

    Lingli HE; Jing ZHAO; Man ZHAO; Chaoying HE

    2011-01-01

    Soybean (Glycine max), an important domesticated species that originated in China, constitutes a major source of edible oils and high-quality plant proteins worldwide. In spite of its complex genome, a consequence of an ancient tetraploidization, platforms for map-based genomics, sequence-based genomics, comparative genomics and functional genomics have been well developed in the last decade; thus rich repertoires of genomic tools and resources are available, which have been influencing soybean genetic improvement. Here we mainly review the progress of soybean (including its wild relative Glycine soja) genomics and its impetus for soybean breeding, and raise the major biological questions that need to be addressed. Genetic maps, physical maps, QTL and EST mapping have been so well developed that marker-assisted selection and positional cloning in soybean are feasible and even routine. Whole genome sequencing and transcriptomic analyses provide a large collection of molecular markers and predicted genes, which are instrumental to comparative genomics and functional genomics. Comparative genomics has started to reveal the evolution of the soybean genome and the molecular basis of the soybean domestication process. Microarray resources, mutagenesis and efficient transformation systems have become essential components of soybean functional genomics. Furthermore, phenotypic functional genomics via both forward and reverse genetic approaches has inferred functions of many genes involved in plant and seed development, in response to abiotic stresses, in plant-pathogenic microbe interactions, and in controlling the oil and protein content of seed. These achievements have paved the way for generation of transgenic or genetically modified (GM) soybean crops.

  9. One bacterial cell, one complete genome.

    Directory of Open Access Journals (Sweden)

    Tanja Woyke

    2010-04-01

    While the bulk of the finished microbial genomes sequenced to date are derived from cultured bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes from these environmental species. Single cell genomics is a novel culture-independent approach, which enables access to the genetic material of an individual cell. No single cell genome has to our knowledge been closed and finished to date. Here we report the completed genome from an uncultured single cell of Candidatus Sulcia muelleri DMIN. Digital PCR on single symbiont cells isolated from the bacteriome of the green sharpshooter Draeculacephala minerva allowed us to assess that this bacterium is polyploid with genome copies ranging from approximately 200-900 per cell, making it a most suitable target for single cell finishing efforts. For single cell shotgun sequencing, an individual Sulcia cell was isolated and whole genome amplified by multiple displacement amplification (MDA). Sanger-based finishing methods allowed us to close the genome. To verify the correctness of our single cell genome and exclude MDA-derived artifacts, we independently shotgun sequenced and assembled the Sulcia genome from pooled bacteriomes using a metagenomic approach, yielding a nearly identical genome. Four variations we detected appear to be genuine biological differences between the two samples. Comparison of the single cell genome with bacteriome metagenomic sequence data detected two single nucleotide polymorphisms (SNPs), indicating extremely low genetic diversity within a Sulcia population. This study demonstrates the power of single cell genomics to generate a complete, high quality, non-composite reference genome within an environmental sample, which can be used for population genetic analyses.

  10. One Bacterial Cell, One Complete Genome

    Energy Technology Data Exchange (ETDEWEB)

    Woyke, Tanja; Tighe, Damon; Mavrommatis, Konstantinos; Clum, Alicia; Copeland, Alex; Schackwitz, Wendy; Lapidus, Alla; Wu, Dongying; McCutcheon, John P.; McDonald, Bradon R.; Moran, Nancy A.; Bristow, James; Cheng, Jan-Fang

    2010-04-26

    While the bulk of the finished microbial genomes sequenced to date are derived from cultured bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes from these environmental species. Single cell genomics is a novel culture-independent approach, which enables access to the genetic material of an individual cell. No single cell genome has to our knowledge been closed and finished to date. Here we report the completed genome from an uncultured single cell of Candidatus Sulcia muelleri DMIN. Digital PCR on single symbiont cells isolated from the bacteriome of the green sharpshooter Draeculacephala minerva allowed us to assess that this bacterium is polyploid with genome copies ranging from approximately 200-900 per cell, making it a most suitable target for single cell finishing efforts. For single cell shotgun sequencing, an individual Sulcia cell was isolated and whole genome amplified by multiple displacement amplification (MDA). Sanger-based finishing methods allowed us to close the genome. To verify the correctness of our single cell genome and exclude MDA-derived artifacts, we independently shotgun sequenced and assembled the Sulcia genome from pooled bacteriomes using a metagenomic approach, yielding a nearly identical genome. Four variations we detected appear to be genuine biological differences between the two samples. Comparison of the single cell genome with bacteriome metagenomic sequence data detected two single nucleotide polymorphisms (SNPs), indicating extremely low genetic diversity within a Sulcia population. This study demonstrates the power of single cell genomics to generate a complete, high quality, non-composite reference genome within an environmental sample, which can be used for population genetic analyses.

  11. 10. international mouse genome conference

    Energy Technology Data Exchange (ETDEWEB)

    Meisler, M.H.

    1996-12-31

    Ten years after hosting the First International Mammalian Genome Conference in Paris in 1986, Dr. Jean-Louis Guenet presided over the Tenth Conference at the Pasteur Institute, October 7-10, 1996. The 1986 conference was a satellite to the Human Gene Mapping Workshop and had approximately 50 attendees. The 1996 meeting was attended by 300 scientists from around the world. In the interim, the number of mapped loci in the mouse increased from 1,000 to over 20,000. This report contains a listing of the program and its participants, and two articles that review the meeting and the role of the laboratory mouse in the Human Genome project. More than 200 papers were presented at the conference covering the following topics: International mouse chromosome committee meetings; Mutant generation and identification; Physical and genetic maps; New technology and resources; Chromatin structure and gene regulation; Rat and hamster genetic maps; Informatics and databases; and Quantitative trait analysis.

  12. Genome Organization Drives Chromosome Fragility.

    Science.gov (United States)

    Canela, Andres; Maman, Yaakov; Jung, Seolkyoung; Wong, Nancy; Callen, Elsa; Day, Amanda; Kieffer-Kwon, Kyong-Rim; Pekowska, Aleksandra; Zhang, Hongliang; Rao, Suhas S P; Huang, Su-Chen; Mckinnon, Peter J; Aplan, Peter D; Pommier, Yves; Aiden, Erez Lieberman; Casellas, Rafael; Nussenzweig, André

    2017-07-27

    In this study, we show that evolutionarily conserved chromosome loop anchors bound by CCCTC-binding factor (CTCF) and cohesin are vulnerable to DNA double strand breaks (DSBs) mediated by topoisomerase 2B (TOP2B). Polymorphisms in the genome that redistribute CTCF/cohesin occupancy rewire DNA cleavage sites to novel loop anchors. While transcription- and replication-coupled genomic rearrangements have been well documented, we demonstrate that DSBs formed at loop anchors are largely transcription-, replication-, and cell-type-independent. DSBs are continuously formed throughout interphase, are enriched on both sides of strong topological domain borders, and frequently occur at breakpoint clusters commonly translocated in cancer. Thus, loop anchors serve as fragile sites that generate DSBs and chromosomal rearrangements. Published by Elsevier Inc.

  13. Mathematical Analysis of Genomic Evolution

    Directory of Open Access Journals (Sweden)

    Cedric Green

    2011-01-01

    Changes in nucleotide sequences, or mutations, accumulate from generation to generation in the genomes of all living organisms. The mutations can be advantageous, deleterious, or neutral. The goal of this project is to determine the number of advantageous mutations it takes to get human (Homo sapiens) DNA from the DNA of genetically distinct organisms. We do this by collecting the genomic data of such organisms, and estimating the number of mutations it takes to transform yeast (Saccharomyces cerevisiae) DNA to the DNA of a human. We calculate the typical number of mutations occurring annually through the organism's average life span and the average mutation rate. This allows us to determine the total number of mutations as well as the probability of advantageous mutations. Not surprisingly, this probability proves to be fairly small. A more precise estimate can be determined by accounting for the differences in the chromosomal structure and phenomena like horizontal gene transfer.

  14. A Million Cancer Genome Warehouse

    Science.gov (United States)

    2012-11-20

    of a national program for Cancer Information Donors, the American Society for Clinical Oncology (ASCO) has proposed a rapid learning system for...or Scala and Spark; “scrum” organization of small programming teams; calculating “velocity” to predict time to develop new features; and Agile...

  15. Microbial genomes: Blueprints for life

    Energy Technology Data Exchange (ETDEWEB)

    Relman, David A.; Strauss, Evelyn

    2000-12-31

    Complete microbial genome sequences hold the promise of profound new insights into microbial pathogenesis, evolution, diagnostics, and therapeutics. From these insights will come a new foundation for understanding the evolution of single-celled life, as well as the evolution of more complex life forms. This report is an in-depth analysis of scientific issues that provides recommendations and will be widely disseminated to the scientific community, federal agencies, industry and the public.

  16. Genomic Signatures of Sexual Conflict.

    Science.gov (United States)

    Kasimatis, Katja R; Nelson, Thomas C; Phillips, Patrick C

    2017-10-30

    Sexual conflict is a specific class of intergenomic conflict that describes the reciprocal sex-specific fitness costs generated by antagonistic reproductive interactions. The potential for sexual conflict is an inherent property of having a shared genome between the sexes and, therefore, is an extreme form of an environment-dependent fitness effect. In this way, many of the predictions from environment-dependent selection can be used to formulate expected patterns of genome evolution under sexual conflict. However, the pleiotropic and transmission constraints inherent to having alleles move across sex-specific backgrounds from generation to generation further modulate the anticipated signatures of selection. We outline methods for detecting candidate sexual conflict loci both across and within populations. Additionally, we consider the ability of genome scans to identify sexually antagonistic loci by modeling allele frequency changes within males and females due to a single generation of selection. In particular, we highlight the need to integrate genotype, phenotype, and functional information to truly distinguish sexual conflict from other forms of sexual differentiation. © The American Genetic Association 2017.
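
    The idea of allele frequency change within males and females after a single generation of selection can be illustrated with the textbook one-locus, two-allele viability model. This is a sketch under stated assumptions, not the authors' model: genotypes are taken to be in Hardy-Weinberg proportions before selection, and the function names and fitness values are hypothetical.

```python
from typing import Tuple

# Assumption: relative viabilities for genotypes (AA, Aa, aa) in one sex.
Fitness = Tuple[float, float, float]


def freq_after_selection(p: float, w: Fitness) -> float:
    """Frequency of allele A after one round of viability selection.

    Assumes Hardy-Weinberg genotype proportions before selection.
    """
    q = 1.0 - p
    w_AA, w_Aa, w_aa = w
    mean_w = p * p * w_AA + 2 * p * q * w_Aa + q * q * w_aa
    return (p * p * w_AA + p * q * w_Aa) / mean_w


def sex_specific_shift(p: float, w_male: Fitness, w_female: Fitness) -> Tuple[float, float]:
    """Post-selection frequency of A in males and in females."""
    return freq_after_selection(p, w_male), freq_after_selection(p, w_female)


# Hypothetical sexually antagonistic locus: A is favored in males, a in females.
p_males, p_females = sex_specific_shift(0.5, (1.0, 0.9, 0.8), (0.8, 0.9, 1.0))
```

    With these illustrative fitness values the frequency of A rises above 0.5 in males and falls below 0.5 in females, which is the kind of between-sex divergence a genome scan would look for.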

  17. Multiple models for Rosaceae genomics.

    Science.gov (United States)

    Shulaev, Vladimir; Korban, Schuyler S; Sosinski, Bryon; Abbott, Albert G; Aldwinckle, Herb S; Folta, Kevin M; Iezzoni, Amy; Main, Dorrie; Arús, Pere; Dandekar, Abhaya M; Lewers, Kim; Brown, Susan K; Davis, Thomas M; Gardiner, Susan E; Potter, Daniel; Veilleux, Richard E

    2008-07-01

    The plant family Rosaceae consists of over 100 genera and 3,000 species that include many important fruit, nut, ornamental, and wood crops. Members of this family provide high-value nutritional foods and contribute desirable aesthetic and industrial products. Most rosaceous crops have been enhanced by human intervention through sexual hybridization, asexual propagation, and genetic improvement since ancient times, 4,000 to 5,000 B.C. Modern breeding programs have contributed to the selection and release of numerous cultivars having significant economic impact on the U.S. and world markets. In recent years, the Rosaceae community, both in the United States and internationally, has benefited from newfound organization and collaboration that have hastened progress in developing genetic and genomic resources for representative crops such as apple (Malus spp.), peach (Prunus spp.), and strawberry (Fragaria spp.). These resources, including expressed sequence tags, bacterial artificial chromosome libraries, physical and genetic maps, and molecular markers, combined with genetic transformation protocols and bioinformatics tools, have rendered various rosaceous crops highly amenable to comparative and functional genomics studies. This report serves as a synopsis of the resources and initiatives of the Rosaceae community, recent developments in Rosaceae genomics, and plans to apply newly accumulated knowledge and resources toward breeding and crop improvement.

  18. Group normalization for genomic data.

    Science.gov (United States)

    Ghandi, Mahmoud; Beer, Michael A

    2012-01-01

    Data normalization is a crucial preliminary step in analyzing genomic datasets. The goal of normalization is to remove global variation to make readings across different experiments comparable. In addition, most genomic loci have non-uniform sensitivity to any given assay because of variation in local sequence properties. In microarray experiments, this non-uniform sensitivity is due to different DNA hybridization and cross-hybridization efficiencies, known as the probe effect. In this paper we introduce a new scheme, called Group Normalization (GN), to remove both global and local biases in one integrated step, whereby we determine the normalized probe signal by finding a set of reference probes with similar responses. Compared to conventional normalization methods such as Quantile normalization and physically motivated probe effect models, our proposed method is general in the sense that it does not require the assumption that the underlying signal distribution be identical for the treatment and control, and is flexible enough to correct for nonlinear and higher order probe effects. The Group Normalization algorithm is computationally efficient and easy to implement. We also describe a variant of the Group Normalization algorithm, called Cross Normalization, which efficiently amplifies biologically relevant differences between any two genomic datasets.
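
    For context, the conventional quantile normalization that this abstract compares against can be sketched as follows. This is not the Group Normalization algorithm, whose details are not given here; the function name and the list-of-lists data layout are assumptions, and ties are broken by original order for simplicity.

```python
from typing import List


def quantile_normalize(samples: List[List[float]]) -> List[List[float]]:
    """Force every experiment to share one signal distribution.

    samples[j][i] is the signal of probe i in experiment j; all
    experiments are assumed to measure the same set of probes.
    """
    n_probes = len(samples[0])
    sorted_cols = [sorted(col) for col in samples]
    # Reference distribution: mean of the k-th smallest value across experiments.
    reference = [
        sum(col[k] for col in sorted_cols) / len(samples) for k in range(n_probes)
    ]
    normalized = []
    for col in samples:
        # Probe indices ordered from lowest to highest signal.
        order = sorted(range(n_probes), key=lambda i: col[i])
        out = [0.0] * n_probes
        for rank, probe in enumerate(order):
            out[probe] = reference[rank]
        normalized.append(out)
    return normalized
```

    Because every experiment is mapped onto the same reference distribution, the sketch makes explicit the assumption the abstract says Group Normalization avoids: that treatment and control share an identical underlying signal distribution.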

  19. Group normalization for genomic data.

    Directory of Open Access Journals (Sweden)

    Mahmoud Ghandi

    Data normalization is a crucial preliminary step in analyzing genomic datasets. The goal of normalization is to remove global variation to make readings across different experiments comparable. In addition, most genomic loci have non-uniform sensitivity to any given assay because of variation in local sequence properties. In microarray experiments, this non-uniform sensitivity is due to different DNA hybridization and cross-hybridization efficiencies, known as the probe effect. In this paper we introduce a new scheme, called Group Normalization (GN), to remove both global and local biases in one integrated step, whereby we determine the normalized probe signal by finding a set of reference probes with similar responses. Compared to conventional normalization methods such as Quantile normalization and physically motivated probe effect models, our proposed method is general in the sense that it does not require the assumption that the underlying signal distribution be identical for the treatment and control, and is flexible enough to correct for nonlinear and higher order probe effects. The Group Normalization algorithm is computationally efficient and easy to implement. We also describe a variant of the Group Normalization algorithm, called Cross Normalization, which efficiently amplifies biologically relevant differences between any two genomic datasets.

  20. Parallel processing of genomics data

    Science.gov (United States)

    Agapito, Giuseppe; Guzzi, Pietro Hiram; Cannataro, Mario

    2016-10-01

    The availability of high-throughput experimental platforms for the analysis of biological samples, such as mass spectrometry, microarrays and Next Generation Sequencing, has made it possible to analyze a whole genome in a single experiment. Such platforms produce an enormous volume of data per experiment, so the analysis of this flow of data poses several challenges in terms of data storage, preprocessing, and analysis. To face those issues, efficient, possibly parallel, bioinformatics software needs to be used to preprocess and analyze data, for instance to highlight genetic variation associated with complex diseases. In this paper we present a parallel algorithm for the preprocessing and statistical analysis of genomics data, able to handle high-dimensional data with good response time. The proposed system is able to find statistically significant biological markers that discriminate classes of patients who respond to drugs in different ways. Experiments performed on real and synthetic genomic datasets show good speed-up and scalability.
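
    The general pattern described here, scoring many markers independently and in parallel to find those that separate two patient classes, can be sketched as below. This is not the authors' algorithm: the Welch t statistic, the function names, and the dictionary layout are assumptions chosen for illustration, and a thread pool is used only to keep the sketch self-contained.

```python
from concurrent.futures import ThreadPoolExecutor
from math import sqrt
from statistics import mean, variance
from typing import Dict, List, Tuple

# Assumption: each marker maps to (values in class 1, values in class 2).
Groups = Tuple[List[float], List[float]]


def welch_t(a: List[float], b: List[float]) -> float:
    """Welch's t statistic for two samples; needs non-zero pooled variance."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


def score_marker(item: Tuple[str, Groups]) -> Tuple[str, float]:
    """Score a single marker; markers are independent, so this parallelizes."""
    name, (class_one, class_two) = item
    return name, welch_t(class_one, class_two)


def score_all(markers: Dict[str, Groups], workers: int = 4) -> Dict[str, float]:
    """Score every marker concurrently and collect the statistics by name."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(score_marker, markers.items()))


# Hypothetical toy data: m2 separates the two classes, m1 does not.
scores = score_all({
    "m1": ([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]),
    "m2": ([4.0, 5.0, 6.0], [1.0, 2.0, 3.0]),
})
```

    For CPU-bound scoring in CPython, a process pool or a distributed framework would be the more usual choice; the structure of the map over independent markers stays the same.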

  1. The wolf reference genome sequence (Canis lupus lupus) and its implications for Canis spp. population genomics

    DEFF Research Database (Denmark)

    Gopalakrishnan, Shyam; Samaniego Castruita, Jose Alfredo; Sinding, Mikkel Holger Strander

    2017-01-01

    Background: An increasing number of studies are addressing the evolutionary genomics of dog domestication, principally through resequencing dog, wolf and related canid genomes. There is, however, only one de novo assembled canid genome currently available against which to map such data - that of a... ...that regardless of the reference genome choice, most evolutionary genomic analyses yield qualitatively similar results, including those exploring the structure between the wolves and dogs using admixture and principal component analysis. However, we do observe differences in the genomic coverage of re-mapped...

  2. The Small Nuclear Genomes of Selaginella Are Associated with a Low Rate of Genome Size Evolution.

    Science.gov (United States)

    Baniaga, Anthony E; Arrigo, Nils; Barker, Michael S

    2016-06-03

    The haploid nuclear genome size (1C DNA) of vascular land plants varies over several orders of magnitude. Much of this observed diversity in genome size is due to the proliferation and deletion of transposable elements. To date, all vascular land plant lineages with extremely small nuclear genomes represent recently derived states, having ancestors with much larger genome sizes. The Selaginellaceae represent an ancient lineage with extremely small genomes. It is unclear how small nuclear genomes evolved in Selaginella. We compared the rates of nuclear genome size evolution in Selaginella and major vascular plant clades in a comparative phylogenetic framework. For the analyses, we collected 29 new flow cytometry estimates of haploid genome size in Selaginella to augment publicly available data. Selaginella possess some of the smallest known haploid nuclear genome sizes, as well as the lowest rate of genome size evolution observed across all vascular land plants included in our analyses. Additionally, our analyses provide strong support for a history of haploid nuclear genome size stasis in Selaginella. Our results indicate that Selaginella, similar to other early diverging lineages of vascular land plants, has relatively low rates of genome size evolution. Further, our analyses highlight that a rapid transition to a small genome size is only one route to an extremely small genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. St2-80: a new FISH marker for St genome and genome analysis in Triticeae.

    Science.gov (United States)

    Wang, Long; Shi, Qinghua; Su, Handong; Wang, Yi; Sha, Lina; Fan, Xing; Kang, Houyang; Zhang, Haiqin; Zhou, Yonghong

    2017-07-01

    The St genome is one of the most fundamental genomes in Triticeae. Repetitive sequences are widely used to distinguish different genomes or species. The primary objectives of this study were to (i) screen a new sequence that could easily distinguish the chromosome of the St genome from those of other genomes by fluorescence in situ hybridization (FISH) and (ii) investigate the genome constitution of some species that remain uncertain and controversial. We used degenerated oligonucleotide primer PCR (Dop-PCR), Dot-blot, and FISH to screen for a new marker of the St genome and to test the efficiency of this marker in the detection of the St chromosome at different ploidy levels. Signals produced by a new FISH marker (denoted St2-80) were present on the entire arm of chromosomes of the St genome, except in the centromeric region. On the contrary, St2-80 signals were present in the terminal region of chromosomes of the E, H, P, and Y genomes. No signal was detected in the A and B genomes, and only weak signals were detected in the terminal region of chromosomes of the D genome. St2-80 signals were obvious and stable in chromosomes of different genomes, whether diploid or polyploid. Therefore, St2-80 is a potential and useful FISH marker that can be used to distinguish the St genome from those of other genomes in Triticeae.

  4. Unleashing the genome of Brassica rapa

    Directory of Open Access Journals (Sweden)

    Haibao eTang

    2012-07-01

    Full Text Available The completion and release of the Brassica rapa genome is of great benefit to researchers of the Brassicas, Arabidopsis, and genome evolution. While its lineage is closely related to the model organism Arabidopsis thaliana, the Brassicas experienced a whole genome triplication subsequent to their divergence. This event contemporaneously created three copies of its ancestral genome, which had diploidized through the process of homeologous gene loss known as fractionation. By the fractionation of homeologous gene content and genetic regulatory binding sites, Brassica’s genome is well placed to use comparative genomic techniques to identify syntenic regions, homeologous gene duplications, and putative regulatory sequences. Here, we use the comparative genomics platform CoGe to perform several different genomic analyses with which to study structural changes of its genome and dynamics of various genetic elements. Starting with whole genome comparisons, the Brassica paleohexaploidy is characterized, syntenic regions with Arabidopsis thaliana are identified, and the TOC1 gene in the circadian rhythm pathway from Arabidopsis thaliana is used to find duplicated orthologs in Brassica rapa. These TOC1 genes are further analyzed to identify conserved noncoding sequences that contain cis-acting regulatory elements and promoter sequences previously implicated in circadian rhythmicity. Each 'cookbook style' analysis includes a step-by-step walkthrough with links to CoGe to quickly reproduce each step of the analytical process.

  5. Human Genome Sequencing in Health and Disease

    Science.gov (United States)

    Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

    2013-01-01

    Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320

  6. Comparative Reannotation of 21 Aspergillus Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Salamov, Asaf; Riley, Robert; Kuo, Alan; Grigoriev, Igor

    2013-03-08

    We used comparative gene modeling to reannotate 21 Aspergillus genomes. Initial automatic annotation of individual genomes may contain some errors of different nature, e.g. missing genes, incorrect exon-intron structures, 'chimeras', which fuse 2 or more real genes or alternatively splitting some real genes into 2 or more models. The main premise behind the comparative modeling approach is that for closely related genomes most orthologous families have the same conserved gene structure. The algorithm maps all gene models predicted in each individual Aspergillus genome to the other genomes and, for each locus, selects from potentially many competing models, the one which most closely resembles the orthologous genes from other genomes. This procedure is iterated until no further change in gene models is observed. For Aspergillus genomes we predicted in total 4503 new gene models (~2% per genome), supported by comparative analysis, additionally correcting ~18% of old gene models. This resulted in a total of 4065 more genes with annotated PFAM domains (~3% increase per genome). Analysis of a few genomes with EST/transcriptomics data shows that the new annotation sets also have a higher number of EST-supported splice sites at exon-intron boundaries.

  7. Fueling the Future with Fungal Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Grigoriev, Igor V.

    2014-10-27

    Genomes of fungi relevant to energy and environment are the focus of the JGI Fungal Genomic Program. One of its projects, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts and pathogens) and biorefinery processes (cellulose degradation and sugar fermentation) by means of genome sequencing and analysis. New chapters of the Encyclopedia can be opened with user proposals to the JGI Community Science Program (CSP). Another JGI project, the 1000 fungal genomes, explores fungal diversity at the genome level at scale and is open for users to nominate new species for sequencing. Over 400 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for the user community. Sequence analysis supported by functional genomics will lead to developing a parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such ‘parts’ suggested by comparative genomics and functional analysis in these areas are presented here.

  8. The wolf reference genome sequence (Canis lupus lupus) and its implications for Canis spp. population genomics

    DEFF Research Database (Denmark)

    Gopalakrishnan, Shyam; Samaniego Castruita, Jose Alfredo; Sinding, Mikkel Holger Strander

    2017-01-01

    Background An increasing number of studies are addressing the evolutionary genomics of dog domestication, principally through resequencing dog, wolf and related canid genomes. There is, however, only one de novo assembled canid genome currently available against which to map such data - that of a...

  9. Genome plasticity and systems evolution in Streptomyces

    Science.gov (United States)

    2012-01-01

    Background Streptomycetes are filamentous soil-dwelling bacteria. They are best known as the producers of a great variety of natural products such as antibiotics, antifungals, antiparasitics, and anticancer agents and the decomposers of organic substances for carbon recycling. They are also model organisms for the studies of gene regulatory networks, morphological differentiation, and stress response. The availability of sets of genomes from closely related Streptomyces strains makes it possible to assess the mechanisms underlying genome plasticity and systems adaptation. Results We present the results of a comprehensive analysis of the genomes of five Streptomyces species with distinct phenotypes. These streptomycetes have a pan-genome comprised of 17,362 orthologous families which includes 3,096 components in the core genome, 5,066 components in the dispensable genome, and 9,200 components that are uniquely present in only one species. The core genome makes up about 33%-45% of each genome repertoire. It contains important genes for Streptomyces biology including those involved in gene regulation, secretion, secondary metabolism and morphological differentiation. Abundant duplicate genes have been identified, with 4%-11% of the whole genomes composed of lineage-specific expansions (LSEs), suggesting that frequent gene duplication or lateral gene transfer events play a role in shaping the genome diversification within this genus. Two patterns of expansion, single gene expansion and chromosome block expansion are observed, representing different scales of duplication. Conclusions Our results provide a catalog of genome components and their potential functional roles in gene regulatory networks and metabolic networks. The core genome components reveal the minimum requirement for streptomycetes to sustain a successful lifecycle in the soil environment, reflecting the effects of both genome evolution and environmental stress acting upon the expressed phenotypes. A

  10. Rice Genome Research: Current Status and Future Perspectives

    Directory of Open Access Journals (Sweden)

    Bin Han

    2008-11-01

    Full Text Available Rice (Oryza sativa L.) is the leading genomics system among the crop plants. The sequence of the rice genome, the first cereal plant genome, was published in 2005. This review summarizes progress made in rice genome annotation, comparative genomics, and functional genomics research. It also maps out the status of rice genomics globally and provides a vision of future research directions and resource building.

  11. Genome digging: insight into the mitochondrial genome of Homo.

    Directory of Open Access Journals (Sweden)

    Igor V Ovchinnikov

    2010-12-01

    Full Text Available A fraction of the Neanderthal mitochondrial genome sequence has a similarity with a 5,839-bp nuclear DNA sequence of mitochondrial origin (numt) on human chromosome 1. This fact has never been interpreted. Although this phenomenon may be attributed to contamination and mosaic assembly of Neanderthal mtDNA from short sequencing reads, we explain the mysterious similarity by integration of this numt (mtAncestor-1) into the nuclear genome of the common ancestor of Neanderthals and modern humans not long before their reproductive split. Exploiting bioinformatics, we uncovered an additional numt (mtAncestor-2) with a high similarity to the Neanderthal mtDNA and indicated that both numts represent almost identical replicas of the mtDNA sequences ancestral to the mitochondrial genomes of Neanderthals and modern humans. In the proteins encoded by mtDNA, the majority of amino acids distinguishing chimpanzees from humans and Neanderthals were acquired by the ancestral hominins. The overall rate of nonsynonymous evolution in Neanderthal mitochondrial protein-coding genes is not higher than in other lineages. The model incorporating the ancestral hominin mtDNA sequences estimates the average divergence age of the mtDNAs of Neanderthals and modern humans to be 450,000-485,000 years. The mtAncestor-1 and mtAncestor-2 sequences were incorporated into the nuclear genome approximately 620,000 years and 2,885,000 years ago, respectively. This study provides the first insight into the evolution of the mitochondrial DNA in hominins ancestral to Neanderthals and humans. We hypothesize that mtAncestor-1 and mtAncestor-2 are likely to be molecular fossils of the mtDNAs of Homo heidelbergensis and a stem Homo lineage. The dN/dS dynamics suggest that the effective population size of extinct hominins was low. However, the hominin lineage ancestral to humans, Neanderthals and H. heidelbergensis, had a larger effective population size and possessed genetic diversity

  12. Genome-wide identification of significant aberrations in cancer genome.

    Science.gov (United States)

    Yuan, Xiguo; Yu, Guoqiang; Hou, Xuchu; Shih, Ie-Ming; Clarke, Robert; Zhang, Junying; Hoffman, Eric P; Wang, Roger R; Zhang, Zhen; Wang, Yue

    2012-07-27

    Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. 
    Open-source and platform-independent SAIC software is

  13. Genome-wide identification of significant aberrations in cancer genome

    Directory of Open Access Journals (Sweden)

    Yuan Xiguo

    2012-07-01

    Full Text Available Abstract Background Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme. Results We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e.g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e.g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies. Conclusions Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes
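
    The general idea of scoring recurrent aberrations against a permutation-based null distribution can be illustrated with a minimal sketch. This is not the SAIC algorithm: it assumes, for simplicity, that aberration calls are already binary and that units within a sample can be shuffled independently, whereas the abstract describes permutations that preserve correlations among probes and an iterative SCA-exclusive scheme. The function name `recurrence_pvalues` and its parameters are hypothetical.

```python
import random


def recurrence_pvalues(calls, n_perm=1000, seed=0):
    """Permutation p-values for how often each genomic unit is aberrant.

    calls: list of samples, each a list of 0/1 flags (one flag per unit).
    Assumption (not from the abstract): units are exchangeable within a
    sample, so the null is built by shuffling each sample independently.
    """
    rng = random.Random(seed)
    n_units = len(calls[0])

    # Observed recurrence: number of samples aberrant at each unit.
    observed = [sum(sample[j] for sample in calls) for j in range(n_units)]

    # Null distribution of the maximum recurrence over all units,
    # which controls for testing many units at once.
    null_max = []
    for _ in range(n_perm):
        shuffled = [rng.sample(sample, len(sample)) for sample in calls]
        null_max.append(
            max(sum(s[j] for s in shuffled) for j in range(n_units))
        )

    # Add-one correction keeps every p-value strictly positive.
    return [
        (1 + sum(m >= obs for m in null_max)) / (n_perm + 1)
        for obs in observed
    ]


# Toy data: 20 samples, 50 units; unit 0 is aberrant in every sample,
# and each sample carries one additional private aberration.
toy_calls = []
for i in range(20):
    sample = [0] * 50
    sample[0] = 1
    sample[1 + i] = 1
    toy_calls.append(sample)

toy_p = recurrence_pvalues(toy_calls, n_perm=200, seed=1)
```

    In this toy example the shared unit receives a small p-value while the private aberrations do not, which is the distinction between consensus events and background aberrations that the abstract refers to.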

  14. Molecular cytogenetic and genomic analyses reveal new insights into the origin of the wheat B genome.

    Science.gov (United States)

    Zhang, Wei; Zhang, Mingyi; Zhu, Xianwen; Cao, Yaping; Sun, Qing; Ma, Guojia; Chao, Shiaoman; Yan, Changhui; Xu, Steven S; Cai, Xiwen

    2018-02-01

    This work pinpointed the goatgrass chromosomal segment in the wheat B genome using modern cytogenetic and genomic technologies, and provided novel insights into the origin of the wheat B genome. Wheat is a typical allopolyploid with three homoeologous subgenomes (A, B, and D). The donors of the subgenomes A and D had been identified, but not for the subgenome B. The goatgrass Aegilops speltoides (genome SS) has been controversially considered a possible candidate for the donor of the wheat B genome. However, the relationship of the Ae. speltoides S genome with the wheat B genome remains largely obscure. The present study assessed the homology of the B and S genomes using an integrative cytogenetic and genomic approach, and revealed the contribution of Ae. speltoides to the origin of the wheat B genome. We discovered noticeable homology between wheat chromosome 1B and Ae. speltoides chromosome 1S, but not between other chromosomes in the B and S genomes. An Ae. speltoides-originated segment spanning a genomic region of approximately 10.46 Mb was detected on the long arm of wheat chromosome 1B (1BL). The Ae. speltoides-originated segment on 1BL was found to co-evolve with the rest of the B genome. Evidently, Ae. speltoides had been involved in the origin of the wheat B genome, but should not be considered an exclusive donor of this genome. The wheat B genome might have a polyphyletic origin with multiple ancestors involved, including Ae. speltoides. These novel findings will facilitate genome studies in wheat and other polyploids.

  15. RPAN: rice pan-genome browser for ∼3000 rice genomes.

    Science.gov (United States)

    Sun, Chen; Hu, Zhiqiang; Zheng, Tianqing; Lu, Kuangchen; Zhao, Yue; Wang, Wensheng; Shi, Jianxin; Wang, Chunchao; Lu, Jinyuan; Zhang, Dabing; Li, Zhikang; Wei, Chaochun

    2017-01-25

    A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
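
    The definition above (a pan-genome as the union of per-individual gene sets, described by presence/absence variations) can be sketched in a few lines. This is an illustrative sketch only, not code from RPAN; the accession and gene names are made up, and the split into core, dispensable and unique genes is one common convention rather than something the abstract specifies.

```python
from collections import Counter


def partition_pan_genome(gene_sets):
    """Split the pan-genome into core, dispensable and unique genes.

    gene_sets: dict mapping an accession name to its set of gene IDs.
    Core genes occur in every accession, unique genes in exactly one,
    and dispensable genes in more than one but not all.
    """
    n = len(gene_sets)
    counts = Counter(g for genes in gene_sets.values() for g in set(genes))
    pan = set(counts)
    core = {g for g, c in counts.items() if c == n}
    unique = {g for g, c in counts.items() if c == 1}
    dispensable = pan - core - unique
    return pan, core, dispensable, unique


def pav_matrix(gene_sets):
    """Return the sorted gene list and a 0/1 presence row per accession."""
    genes = sorted(set().union(*gene_sets.values()))
    rows = {
        acc: [int(g in present) for g in genes]
        for acc, present in gene_sets.items()
    }
    return genes, rows


# Hypothetical toy input: three accessions, five genes in total.
toy = {
    "acc1": {"g1", "g2", "g3"},
    "acc2": {"g1", "g2", "g4"},
    "acc3": {"g1", "g5"},
}

pan, core, dispensable, unique = partition_pan_genome(toy)
genes, rows = pav_matrix(toy)
```

    Here `g1` is the only core gene, `g2` is dispensable, and `g3`, `g4` and `g5` are each unique to one accession; the rows of the PAV matrix are what a browser of this kind would search and display per accession.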

  16. GenColors-based comparative genome databases for small eukaryotic genomes.

    Science.gov (United States)

    Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

    2013-01-01

    Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.

  17. PGSB/MIPS Plant Genome Information Resources and Concepts for the Analysis of Complex Grass Genomes.

    Science.gov (United States)

    Spannagl, Manuel; Bader, Kai; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus F X

    2016-01-01

    PGSB (Plant Genome and Systems Biology; formerly MIPS-Munich Institute for Protein Sequences) has been involved in developing, implementing and maintaining plant genome databases for more than a decade. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable datasets for model plant genomes as a backbone against which experimental data, e.g., from high-throughput functional genomics, can be organized and analyzed. In addition, genomes from both model and crop plants form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny) between related species on macro- and micro-levels. The genomes of many economically important Triticeae plants such as wheat, barley, and rye present a great challenge for sequence assembly and bioinformatic analysis due to their enormous complexity and large genome size. Novel concepts and strategies have been developed to deal with these difficulties and have been applied to the genomes of wheat, barley, rye, and other cereals. This includes the GenomeZipper concept, reference-guided exome assembly, and "chromosome genomics" based on flow cytometry sorted chromosomes.

  18. Big Data Analytics for Genomic Medicine.

    Science.gov (United States)

    He, Karen Y; Ge, Dongliang; He, Max M

    2017-02-15

    Genomic medicine attempts to build individualized strategies for diagnostic or therapeutic decision-making by utilizing patients' genomic information. Big Data analytics uncovers hidden patterns, unknown correlations, and other insights through examining large-scale various data sets. While integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a Big Data infrastructure exhibit challenges, they also provide a feasible opportunity to develop an efficient and effective approach to identify clinically actionable genetic variants for individualized diagnosis and therapy. In this paper, we review the challenges of manipulating large-scale next-generation sequencing (NGS) data and diverse clinical data derived from the EHRs for genomic medicine. We introduce possible solutions for different challenges in manipulating, managing, and analyzing genomic and clinical data to implement genomic medicine. Additionally, we also present a practical Big Data toolset for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs.

  19. Body maps on the human genome.

    Science.gov (United States)

    Cherniak, Christopher; Rodriguez-Esteban, Raul

    2013-12-20

    Chromosomes have territories, or preferred locales, in the cell nucleus. When these sites are taken into account, some large-scale structure of the human genome emerges. The synoptic picture is that genes highly expressed in particular topologically compact tissues are not randomly distributed on the genome. Rather, such tissue-specific genes tend to map somatotopically onto the complete chromosome set. They seem to form a "genome homunculus": a multi-dimensional, genome-wide body representation extending across chromosome territories of the entire sperm cell nucleus. The antero-posterior axis of the body significantly corresponds to the head-tail axis of the nucleus, and the dorso-ventral body axis to the central-peripheral nucleus axis. This large-scale genomic structure includes thousands of genes. One rationale for a homuncular genome structure would be to minimize connection costs in genetic networks. Somatotopic maps in cerebral cortex have been reported for over a century.

  20. Genome aliquoting with double cut and join

    Directory of Open Access Journals (Sweden)

    Sankoff David

    2008-01-01

    Full Text Available Abstract Background The genome aliquoting problem is, given an observed genome A with n copies of each gene, presumed to descend from an n-way polyploidization event from an ordinary diploid genome B, followed by a history of chromosomal rearrangements, to reconstruct the identity of the original genome B'. The idea is to construct B', containing exactly one copy of each gene, so as to minimize the number of rearrangements d(A, B' ⊕ B' ⊕ ... ⊕ B') necessary to convert the observed genome B' ⊕ B' ⊕ ... ⊕ B' into A. Results In this paper we make the first attempt to define and solve the genome aliquoting problem. We present a heuristic algorithm for the problem as well as the data from our experiments demonstrating its validity. Conclusion The heuristic performs well, consistently giving a non-trivial result. The question as to the existence or non-existence of an exact solution to this problem remains open.

  1. The Saccharomyces Genome Database Variant Viewer.

    Science.gov (United States)

    Sheppard, Travis K; Hitz, Benjamin C; Engel, Stacia R; Song, Giltae; Balakrishnan, Rama; Binkley, Gail; Costanzo, Maria C; Dalusag, Kyla S; Demeter, Janos; Hellerstedt, Sage T; Karra, Kalpana; Nash, Robert S; Paskov, Kelley M; Skrzypek, Marek S; Weng, Shuai; Wong, Edith D; Cherry, J Michael

    2016-01-04

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. REGEN: Ancestral Genome Reconstruction for Bacteria

    Directory of Open Access Journals (Sweden)

    João C. Setubal

    2012-07-01

    Full Text Available Ancestral genome reconstruction can be understood as a phylogenetic study with more details than a traditional phylogenetic tree reconstruction. We present a new computational system called REGEN for ancestral bacterial genome reconstruction at both the gene and replicon levels. REGEN reconstructs gene content, contiguous gene runs, and replicon structure for each ancestral genome. Along each branch of the phylogenetic tree, REGEN infers evolutionary events, including gene creation and deletion and replicon fission and fusion. The reconstruction can be performed by either a maximum parsimony or a maximum likelihood method. Gene content reconstruction is based on the concept of neighboring gene pairs. REGEN was designed to be used with any set of genomes that are sufficiently related, which will usually be the case for bacteria within the same taxonomic order. We evaluated REGEN using simulated genomes and genomes in the Rhizobiales order.

  3. Big Data Analytics for Genomic Medicine

    Science.gov (United States)

    He, Karen Y.; Ge, Dongliang; He, Max M.

    2017-01-01

    Genomic medicine attempts to build individualized strategies for diagnostic or therapeutic decision-making by utilizing patients’ genomic information. Big Data analytics uncovers hidden patterns, unknown correlations, and other insights by examining large-scale, varied data sets. While the integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a Big Data infrastructure present challenges, they also provide a feasible opportunity to develop an efficient and effective approach to identifying clinically actionable genetic variants for individualized diagnosis and therapy. In this paper, we review the challenges of manipulating large-scale next-generation sequencing (NGS) data and diverse clinical data derived from EHRs for genomic medicine. We introduce possible solutions for different challenges in manipulating, managing, and analyzing genomic and clinical data to implement genomic medicine. We also present a practical Big Data toolset for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs. PMID:28212287

  4. [Ethical considerations in genomic cohort study].

    Science.gov (United States)

    Choi, Eun Kyung; Kim, Ock-Joo

    2007-03-01

    During the last decade, genomic cohort studies have been developed in many countries by linking health data with genetic data from stored samples. Genomic cohort studies are expected to find key genetic components that contribute to common diseases, thereby promising great advances in genome medicine. While many countries endeavor to build biobank systems, biobank-based genome research has raised important ethical concerns, including genetic privacy, confidentiality, discrimination, and informed consent. Informed consent for biobanks poses an important question: whether true informed consent is possible in population-based genomic cohort research, where the nature of future studies is unforeseeable when consent is obtained. Because of the sensitive character of genetic information, protecting privacy and maintaining confidentiality are important topics. To minimize ethical problems and achieve scientific goals to the fullest degree, each country strives to build population-based genomic cohort research projects by organizing public consultation, seeking public and expert consensus on research, and providing safeguards to protect privacy and confidentiality.

  5. Genome Evolution of Plant-Parasitic Nematodes.

    Science.gov (United States)

    Kikuchi, Taisei; Eves-van den Akker, Sebastian; Jones, John T

    2017-08-04

    Plant parasitism has evolved independently on at least four separate occasions in the phylum Nematoda. The application of next-generation sequencing (NGS) to plant-parasitic nematodes has allowed a wide range of genome- or transcriptome-level comparisons, and these have identified genome adaptations that enable parasitism of plants. Current genome data suggest that horizontal gene transfer, gene family expansions, evolution of new genes that mediate interactions with the host, and parasitism-specific gene regulation are important adaptations that allow nematodes to parasitize plants. Sequencing of a larger number of nematode genomes, including plant parasites that show different modes of parasitism or that have evolved in currently unsampled clades, and using free-living taxa as comparators would allow more detailed analysis and a better understanding of the organization of key genes within the genomes. This would facilitate a more complete understanding of the way in which parasitism has shaped the genomes of plant-parasitic nematodes.

  6. The characterization of twenty sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Kimberly Pelak

    2010-09-01

    We present the analysis of twenty human genomes to evaluate the prospects for identifying rare functional variants that contribute to a phenotype of interest. We sequenced at high coverage ten "case" genomes from individuals with severe hemophilia A and ten "control" genomes. We summarize the number of genetic variants emerging from a study of this magnitude, and provide a proof of concept for the identification of rare and highly penetrant functional variants by confirming that the cause of hemophilia A is easily recognizable in this data set. We also show that the number of novel single nucleotide variants (SNVs) discovered per genome seems to stabilize at about 144,000 new variants per genome after the first 15 individuals have been sequenced. Finally, we find that, on average, each genome carries 165 homozygous protein-truncating or stop-loss variants in genes representing a diverse set of pathways.

  7. Extreme-Scale De Novo Genome Assembly

    Energy Technology Data Exchange (ETDEWEB)

    Georganas, Evangelos [Intel Corporation, Santa Clara, CA (United States); Hofmeyr, Steven [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Egan, Rob [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Buluc, Aydin [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Oliker, Leonid [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Rokhsar, Daniel [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Yelick, Katherine [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.

    2017-09-26

    De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMer, a high-quality end-to-end de novo assembler designed for extreme-scale analysis, via efficient parallelization of the Meraculous code. Genome assembly software has many components, each of which stresses different components of a computer system. This chapter explains the computational challenges involved in each step of the HipMer pipeline, the key distributed data structures, and communication costs in detail. We present performance results of assembling the human genome and the large hexaploid wheat genome on large supercomputers up to tens of thousands of cores.

  8. Multiscale modeling of three-dimensional genome

    Science.gov (United States)

    Zhang, Bin; Wolynes, Peter

    The genome, the blueprint of life, contains nearly all the information needed to build and maintain an entire organism. A comprehensive understanding of the genome is of paramount interest to human health and will advance progress in many areas, including life sciences, medicine, and biotechnology. The overarching goal of my research is to understand the structure-dynamics-function relationships of the human genome. In this talk, I will present our efforts toward that goal, with a particular emphasis on studying the three-dimensional organization and structure of the genome with multi-scale approaches. Specifically, I will discuss the reconstruction of genome structures at both interphase and metaphase using data from chromosome conformation capture experiments. Computational modeling of the chromatin fiber at the atomistic level from first principles will also be presented as our effort to study genome structure from the bottom up.

  9. Open Access Data Sharing in Genomic Research

    Directory of Open Access Journals (Sweden)

    Stacey Pereira

    2014-08-01

    The current emphasis on broad sharing of human genomic data generated in research in order to maximize utility and public benefit is a significant legacy of the Human Genome Project. Concerns about privacy and discrimination have led to policy responses that restrict access to genomic data as the means for protecting research participants. Our research and experience show, however, that a considerable number of research participants agree to open access sharing of their genomic data when given the choice. General policies that limit access to all genomic data fail to respect the autonomy of these participants and, at the same time, unnecessarily limit the utility of the data. We advocate instead a more balanced approach that allows for individual choice and encourages informed decision making, while protecting against the misuse of genomic data through enhanced legislation.

  10. Human Contamination in Public Genome Assemblies.

    Science.gov (United States)

    Kryukov, Kirill; Imanishi, Tadashi

    2016-01-01

    Contamination in a genome assembly can lead to wrong or confusing results when that genome is used as a reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination has received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that, with high confidence, originate as contamination from human DNA. The majority of contaminating human sequences had been present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes be revised to remove contaminated sequence, and that new assemblies be thoroughly checked for the presence of human DNA before they are submitted to public databases.

  11. Data Mining Supercomputing with SAS JMP® Genomics

    Directory of Open Access Journals (Sweden)

    Richard S. Segall

    2011-02-01

    JMP® Genomics is statistical discovery software that can uncover meaningful patterns in high-throughput genomics and proteomics data. JMP® Genomics is designed for biologists, biostatisticians, statistical geneticists, and those engaged in analyzing the vast stores of data that are common in genomic research (SAS, 2009). Data mining was performed using JMP® Genomics on the two collections of microarray databases available from the National Center for Biotechnology Information (NCBI) for lung cancer and breast cancer. The Gene Expression Omnibus (GEO) of NCBI serves as a public repository for a wide range of high-throughput experimental data, including the two collections of lung cancer and breast cancer data that were used for this research. The results of applying data mining with JMP® Genomics are shown in this paper with numerous screenshots.

  12. REGEN: Ancestral Genome Reconstruction for Bacteria.

    Science.gov (United States)

    Yang, Kuan; Heath, Lenwood S; Setubal, João C

    2012-07-18

    Ancestral genome reconstruction can be understood as a phylogenetic study with more details than a traditional phylogenetic tree reconstruction. We present a new computational system called REGEN for ancestral bacterial genome reconstruction at both the gene and replicon levels. REGEN reconstructs gene content, contiguous gene runs, and replicon structure for each ancestral genome. Along each branch of the phylogenetic tree, REGEN infers evolutionary events, including gene creation and deletion and replicon fission and fusion. The reconstruction can be performed by either a maximum parsimony or a maximum likelihood method. Gene content reconstruction is based on the concept of neighboring gene pairs. REGEN was designed to be used with any set of genomes that are sufficiently related, which will usually be the case for bacteria within the same taxonomic order. We evaluated REGEN using simulated genomes and genomes in the Rhizobiales order.

  13. On the Epistemological Crisis in Genomics

    Science.gov (United States)

    Dougherty, Edward R

    2008-01-01

    There is an epistemological crisis in genomics. At issue is what constitutes scientific knowledge in genomic science, or systems biology in general. Does this crisis require a new perspective on knowledge heretofore absent from science or is it merely a matter of interpreting new scientific developments in an existing epistemological framework? This paper discusses the manner in which the experimental method, as developed and understood over recent centuries, leads naturally to a scientific epistemology grounded in an experimental-mathematical duality. It places genomics into this epistemological framework and examines the current situation in genomics. Meaning and the constitution of scientific knowledge are key concerns for genomics, and the nature of the epistemological crisis in genomics depends on how these are understood. PMID:19440447

  14. Application of Genomic Tools in Plant Breeding

    OpenAIRE

    Pérez-de-Castro, A.M.; Vilanova, S.; Cañizares, J.; Pascual, L.; Blanca, J.M.; Díez, M.J.; Prohens, J.; Picó, B.

    2012-01-01

    Plant breeding has been very successful in developing improved varieties using conventional tools and methodologies. Nowadays, the availability of genomic tools and resources is leading to a new revolution of plant breeding, as they facilitate the study of the genotype and its relationship with the phenotype, in particular for complex traits. Next Generation Sequencing (NGS) technologies are allowing the mass sequencing of genomes and transcriptomes, which is producing a vast array of genomic...

  15. The Genomic Evolution of Prostate Cancer

    Science.gov (United States)

    2017-06-01

    the proposed project: 1. To continue to acquire a comprehensive understanding of prostate cancer genomics. 2. To develop an understanding of... Genetics I • ECEV 35901 Evolutionary Genomics • Fundamentals of Clinical Research • HGEN 47400 Introduction to Probability and Statistics for Geneticists... Marc Gillard,2 David M. Hatcher,5 Westin R. Tom,5 Walter M. Stadler2 and Kevin P. White1,2,3 1Institute for Genomics and Systems Biology, Departments of

  16. Genome-scale neurogenetics: methodology and meaning.

    Science.gov (United States)

    McCarroll, Steven A; Feng, Guoping; Hyman, Steven E

    2014-06-01

    Genetic analysis is currently offering glimpses into molecular mechanisms underlying such neuropsychiatric disorders as schizophrenia, bipolar disorder and autism. After years of frustration, success in identifying disease-associated DNA sequence variation has followed from new genomic technologies, new genome data resources, and global collaborations that could achieve the scale necessary to find the genes underlying highly polygenic disorders. Here we describe early results from genome-scale studies of large numbers of subjects and the emerging significance of these results for neurobiology.

  17. Personalized medicine: new genomics, old lessons

    OpenAIRE

    Offit, Kenneth

    2011-01-01

    Personalized medicine uses traditional, as well as emerging concepts of the genetic and environmental basis of disease to individualize prevention, diagnosis and treatment. Personalized genomics plays a vital, but not exclusive role in this evolving model of personalized medicine. The distinctions between genetic and genomic medicine are more quantitative than qualitative. Personalized genomics builds on principles established by the integration of genetics into medical practice. Principles s...

  18. Discovering Complete Quasispecies In Bacterial Genomes

    OpenAIRE

    Bertels, Frederic; Gokhale, Chaitanya; Traulsen, Arne

    2017-01-01

    Mobile genetic elements can be found in almost all genomes. Possibly the most common nonautonomous mobile genetic elements in bacteria are repetitive extragenic palindromic doublets forming hairpins (REPINs) that can occur hundreds of times within a genome. The sum of all REPINs in a genome can be viewed as an evolving population because REPINs replicate and mutate. In contrast to most other biological populations, we know the exact composition of the REPIN population and the sequence of each...

  19. REGEN: Ancestral Genome Reconstruction for Bacteria

    OpenAIRE

    Yang, Kuan; Heath, Lenwood S.; Setubal, João C.

    2012-01-01

    Ancestral genome reconstruction can be understood as a phylogenetic study with more details than a traditional phylogenetic tree reconstruction. We present a new computational system called REGEN for ancestral bacterial genome reconstruction at both the gene and replicon levels. REGEN reconstructs gene content, contiguous gene runs, and replicon structure for each ancestral genome. Along each branch of the phylogenetic tree, REGEN infers evolutionary events, including gene creation and deleti...

  20. Genomic V exons from whole genome shotgun data in reptiles.

    Science.gov (United States)

    Olivieri, D N; von Haeften, B; Sánchez-Espinel, C; Faro, J; Gambón-Deza, F

    2014-08-01

    Reptiles and mammals diverged over 300 million years ago, creating two parallel evolutionary lineages amongst terrestrial vertebrates. In reptiles, two main evolutionary lines emerged: one gave rise to Squamata, while the other gave rise to Testudines, Crocodylia, and Aves. In this study, we determined the genomic variable (V) exons from whole genome shotgun sequencing (WGS) data in reptiles corresponding to the three main immunoglobulin (IG) loci and the four main T cell receptor (TR) loci. We show that Squamata lack the TRG and TRD genes, and snakes lack the IGKV genes. In representative species of Testudines and Crocodylia, the seven major IG and TR loci are maintained. As in mammals, genes of the IG loci can be grouped into well-defined IMGT clans through a multi-species phylogenetic analysis. We show that the reptilian IGHV and IGLV genes are distributed amongst the established mammalian clans, while their IGKV genes are found within a single clan, nearly exclusive from the mammalian sequences. The reptilian and mammalian TRAV genes cluster into six common evolutionary clades (since IMGT clans have not been defined for TR). In contrast, the reptilian TRBV genes cluster into three clades, which have few mammalian members. In this locus, the V exon sequences from mammals appear to have undergone different evolutionary diversification processes that occurred outside these shared reptilian clans. These sequences can be obtained in a freely available public repository (http://vgenerepertoire.org).

  1. What does it mean to be genomically literate?: National Human Genome Research Institute Meeting Report.

    Science.gov (United States)

    Hurle, Belen; Citrin, Toby; Jenkins, Jean F; Kaphingst, Kimberly A; Lamb, Neil; Roseman, Jo Ellen; Bonham, Vence L

    2013-08-01

    Genomic discoveries will increasingly advance the science of medicine. Limited genomic literacy may adversely impact the public's understanding and use of the power of genetics and genomics in health care and public health. In November 2011, a meeting was held by the National Human Genome Research Institute to examine the challenge of achieving genomic literacy for the general public, from kindergarten to grade 12 to adult education. The role of the media in disseminating scientific messages and in perpetuating or reducing misconceptions was also discussed. Workshop participants agreed that genomic literacy will be achieved only through active engagement between genomics experts and the varied constituencies that comprise the public. This report summarizes the background, content, and outcomes from this meeting, including recommendations for a research agenda to inform decisions about how to advance genomic literacy in our society.

  2. The Global Invertebrate Genomics Alliance (GIGA): Developing Community Resources to Study Diverse Invertebrate Genomes

    KAUST Repository

    Bracken-Grissom, Heather; Collins, Allen G.; Collins, Timothy; Crandall, Keith; Distel, Daniel; Dunn, Casey; Giribet, Gonzalo; Haddock, Steven; Knowlton, Nancy; Martindale, Mark; Medina, Monica; Messing, Charles; O'Brien, Stephen J.; Paulay, Gustav; Putnam, Nicolas; Ravasi, Timothy; Rouse, Greg W.; Ryan, Joseph F.; Schulze, Anja; Worheide, Gert; Adamska, Maja; Bailly, Xavier; Breinholt, Jesse; Browne, William E.; Diaz, M. Christina; Evans, Nathaniel; Flot, Jean-Francois; Fogarty, Nicole; Johnston, Matthew; Kamel, Bishoy; Kawahara, Akito Y.; Laberge, Tammy; Lavrov, Dennis; Michonneau, Francois; Moroz, Leonid L.; Oakley, Todd; Osborne, Karen; Pomponi, Shirley A.; Rhodes, Adelaide; Rodriguez-Lanetty, Mauricio; Santos, Scott R.; Satoh, Nori; Thacker, Robert W.; Van de Peer, Yves; Voolstra, Christian R.; Welch, David Mark; Winston, Judith; Zhou, Xin

    2013-01-01

    Over 95% of all metazoan (animal) species comprise the invertebrates, but very few genomes from these organisms have been sequenced. We have, therefore, formed a Global Invertebrate Genomics Alliance (GIGA). Our intent is to build a collaborative

  3. The life cycle of a genome project: perspectives and guidelines inspired by insect genome projects.

    Science.gov (United States)

    Papanicolaou, Alexie

    2016-01-01

    Many research programs on non-model species biology have been empowered by genomics. In turn, genomics is underpinned by a reference sequence and ancillary information created by so-called "genome projects". The most reliable genome projects are the ones created as part of an active research program and designed to address specific questions, but their life extends past publication. In this opinion paper I outline four key insights that have facilitated maintaining genomic communities: the key role of computational capability, the iterative process of building genomic resources, the value of community participation, and the importance of manual curation. Taken together, these ideas can and do ensure the longevity of genome projects, and the growing non-model species community can use them to focus a discussion with regard to its future genomic infrastructure.

  4. The genome portal of the Department of Energy Joint Genome Institute: 2014 updates

    Energy Technology Data Exchange (ETDEWEB)

    Nordberg, Henrik [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Cantor, Michael [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Dusheyko, Serge [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Hua, Susan [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Poliakov, Alexander [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Shabalov, Igor [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Smirnova, Tatyana [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Grigoriev, Igor V. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Dubchak, Inna [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States)

    2013-11-12

    The U.S. Department of Energy (DOE) Joint Genome Institute (JGI), a national user facility, serves the diverse scientific community by providing integrated high-throughput sequencing and computational analysis to enable system-based scientific approaches in support of DOE missions related to clean energy generation and environmental characterization. The JGI Genome Portal (http://genome.jgi.doe.gov) provides unified access to all JGI genomic databases and analytical tools. The JGI maintains extensive data management systems and specialized analytical capabilities to manage and interpret complex genomic data. A user can search, download and explore multiple data sets available for all DOE JGI sequencing projects including their status, assemblies and annotations of sequenced genomes. In this paper, we describe major updates of the Genome Portal in the past 2 years with a specific emphasis on efficient handling of the rapidly growing amount of diverse genomic data accumulated in JGI.

  5. [Genomic selection and its application].

    Science.gov (United States)

    Li, Heng-De; Bao, Zhen-Min; Sun, Xiao-Wen

    2011-12-01

    Selective breeding is very important in agricultural production, and breeding value estimation is the core of selective breeding. With the development of genetic markers, especially high-throughput genotyping technology, it has become possible to estimate breeding value at the genome level, i.e. genomic selection (GS). In this review, the methods of GS are categorized into two groups: one predicts the genomic estimated breeding value (GEBV) based on allele effects, such as least squares, random regression - best linear unbiased prediction (RR-BLUP), Bayes, and principal component analysis; the other predicts GEBV with a genetic relationship matrix, which is constructed from high-throughput genetic markers and then used to predict GEBV through a linear mixed model, i.e. GBLUP. The basic principles of these methods are also introduced according to these two classifications. Factors affecting GS accuracy include marker type and density, haplotype length, the size of the reference population, the extent of association between markers and QTL, and so on. Among the methods of GS, Bayes and GBLUP are usually more accurate than the others, and least squares is the least accurate. GBLUP is time-efficient and can combine pedigree with genotypic information; hence it is superior to the other methods. Although progress has been made in GS, there are still some challenges, for example, united breeding, long-term genetic gain with GS, and distinguishing markers that contribute to the traits from those that do not. GS has been applied in animal and plant breeding practice and also has the potential to predict genetic predisposition in humans and to study evolutionary dynamics. GS, which is more precise than the traditional method, is a breakthrough in measuring genetic relationship. Therefore, GS will be a revolutionary event in the history of animal and plant breeding.

  6. Genomic dysregulation in gastric tumors.

    Science.gov (United States)

    Janjigian, Yelena Y; Kelsen, David P

    2013-03-01

    Gastric cancer is among the most common human malignancies and the second leading cause of cancer-related death. The different epidemiologic and histopathologic subtypes of gastric cancer are associated with different genomic patterns. Data suggest that the gene expression patterns of proximal, distal intestinal-type, and diffuse/signet cell gastric cancers are well separated. This review summarizes the genetic and epigenetic changes thought to drive gastric cancer and the emerging paradigm of gastric cancer as three unique disease subtypes. Copyright © 2012 Wiley Periodicals, Inc.

  7. Quality Assessment of Domesticated Animal Genome Assemblies

    DEFF Research Database (Denmark)

    Seemann, Stefan E; Anthon, Christian; Palasca, Oana

    2015-01-01

    affected by the lack of genomic sequence. Herein, we quantify the quality of the genome assemblies of 20 domesticated animals and related species by assessing a range of measurable parameters, and we show that there is a positive correlation between the fraction of mappable reads from RNAseq data...... domesticated animal genomes still need to be sequenced deeper in order to produce high-quality assemblies. In the meanwhile, ironically, the extent to which RNAseq and other next-generation data is produced frequently far exceeds that of the genomic sequence. Furthermore, basic comparative analysis is often...

  8. Privacy Challenges of Genomic Big Data.

    Science.gov (United States)

    Shen, Hong; Ma, Jian

    2017-01-01

    With the rapid advancement of high-throughput DNA sequencing technologies, genomics has become a big data discipline in which large-scale genetic information on human individuals can be obtained efficiently at low cost. However, such a massive amount of personal genomic data creates tremendous challenges for privacy, especially given the emergence of the direct-to-consumer (DTC) industry that provides genetic testing services. Here we review recent developments in genomic big data and their implications for privacy. We also discuss the current dilemmas and future challenges of genomic privacy.

  9. Genome technologies and personalized dental medicine.

    Science.gov (United States)

    Eng, G; Chen, A; Vess, T; Ginsburg, G S

    2012-04-01

    The addition of genomic information to our understanding of oral disease is driving important changes in oral health care. It is anticipated that genome-derived information will promote a deeper understanding of disease etiology and permit earlier diagnosis, allowing for preventative measures prior to disease onset rather than treatment that attempts to repair the diseased state. Advances in genome technologies have fueled expectations for this proactive healthcare approach. Application of genomic testing is expanding and has already begun to find its way into the practice of clinical dentistry. To take full advantage of the information and technologies currently available, it is vital that dental care providers, consumers, and policymakers be aware of genomic approaches to understanding of oral diseases and the application of genomic testing to disease diagnosis and treatment. Ethical, legal, clinical, and educational initiatives are also required to responsibly incorporate genomic information into the practice of dentistry. This article provides an overview of the application of genomic technologies to oral health care and introduces issues that require consideration if we are to realize the full potential of genomics to enable the practice of personalized dental medicine. © 2011 John Wiley & Sons A/S.

  10. Microbial species delineation using whole genome sequences.

    Science.gov (United States)

    Varghese, Neha J; Mukherjee, Supratim; Ivanova, Natalia; Konstantinidis, Konstantinos T; Mavrommatis, Kostas; Kyrpides, Nikos C; Pati, Amrita

    2015-08-18

    Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Ecology and genomics of Bacillus subtilis.

    Science.gov (United States)

    Earl, Ashlee M; Losick, Richard; Kolter, Roberto

    2008-06-01

    Bacillus subtilis is a remarkably diverse bacterial species that is capable of growth within many environments. Recent microarray-based comparative genomic analyses have revealed that members of this species also exhibit considerable genomic diversity. The identification of strain-specific genes might explain how B. subtilis has become so broadly adapted. The goal of identifying ecologically adaptive genes could soon be realized with the imminent release of several new B. subtilis genome sequences. As we embark upon this exciting new era of B. subtilis comparative genomics we review what is currently known about the ecology and evolution of this species.

  12. Environmental Medicine Genome Bank (EMGB): Current Composition

    National Research Council Canada - National Science Library

    Sonna, Larry

    2000-01-01

    The USARIEM Environmental Medicine Genome Bank (EMGB) project is an ongoing effort to identify and characterize genes relevant to environmental injuries and illnesses and to human physical performance...

  13. DEFINING THE CHEMICAL SPACE OF PUBLIC GENOMIC ...

    Science.gov (United States)

    The current project aims to chemically index the genomics content of public genomic databases to make these data accessible in relation to other publicly available, chemically-indexed toxicological information. By defining the chemical space of public genomic data, it is possible to identify classes of chemicals on which to develop methodologies for the integration of chemogenomic data into predictive toxicology. The chemical space of public genomic data will be presented as well as the methodologies and tools developed to identify this chemical space.

  14. Genomics for paediatricians: promises and pitfalls.

    Science.gov (United States)

    Hammond, Carrie Louise; Willoughby, Josh Matthew; Parker, Michael James

    2018-03-24

    In recent years, there have been significant advances in genetic technologies, evolving the field of genomics from genetics. This has huge diagnostic potential, as genomic testing increasingly becomes part of mainstream medicine. However, there are numerous potential pitfalls in the interpretation of genomic data. It is therefore essential that we educate clinicians more widely about the appropriate interpretation and utilisation of genomic testing. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  15. A Genomics Approach to Tumor Genome Analysis

    National Research Council Canada - National Science Library

    Collins, Colin

    2002-01-01

    Genomes of solid tumors are often highly rearranged and these rearrangements promote cancer progression through disruption of genes mediating immortality, survival, metastasis, and resistance to therapy...

  16. Initiating genomic selection in tetraploid potato

    DEFF Research Database (Denmark)

    Sverrisdóttir, Elsa; Janss, Luc; Byrne, Stephen

    Breeding for more space and resource efficient crops is important to feed the world’s increasing population. Potatoes produce approximately twice the amount of calories per hectare compared to cereals. The traditional “mate and phenotype” breeding approach is costly and time-consuming; however......, the completion of the genome sequence of potato has enabled the application of genomics-assisted breeding technologies. Genomic selection using genome-wide molecular markers is becoming increasingly applicable to crops as the genotyping costs continue to reduce and it is thus an attractive breeding alternative...... selection, can be obtained with good prediction accuracies in tetraploid potato....

  17. Radiation-induced instability of human genome

    International Nuclear Information System (INIS)

    Ryabchenko, N.N.; Demina, Eh.A.

    2014-01-01

    A brief review is dedicated to the phenomenon of radiation-induced genomic instability where the increased level of genomic changes in the offspring of irradiated cells is characteristic. Particular attention is paid to the problems of genomic instability induced by the low-dose radiation, role of the bystander effect in formation of radiation-induced instability, and its relationship with individual radiosensitivity. We believe that in accordance with the paradigm of modern radiobiology the increased human individual radiosensitivity can be formed due to the genome instability onset and is a significant risk factor for radiation-induced cancer

  18. The infinite sites model of genome evolution.

    Science.gov (United States)

    Ma, Jian; Ratan, Aakrosh; Raney, Brian J; Suh, Bernard B; Miller, Webb; Haussler, David

    2008-09-23

    We formalize the problem of recovering the evolutionary history of a set of genomes that are related to an unseen common ancestor genome by operations of speciation, deletion, insertion, duplication, and rearrangement of segments of bases. The problem is examined in the limit as the number of bases in each genome goes to infinity. In this limit, the chromosomes are represented by continuous circles or line segments. For such an infinite-sites model, we present a polynomial-time algorithm to find the most parsimonious evolutionary history of any set of related present-day genomes.

  19. Evolution of the Largest Mammalian Genome.

    Science.gov (United States)

    Evans, Ben J; Upham, Nathan S; Golding, Goeffrey B; Ojeda, Ricardo A; Ojeda, Agustina A

    2017-06-01

    The genome of the red vizcacha rat (Rodentia, Octodontidae, Tympanoctomys barrerae) is the largest of all mammals, and about double the size of their close relative, the mountain vizcacha rat Octomys mimax, even though the lineages that gave rise to these species diverged from each other only about 5 Ma. The mechanism for this rapid genome expansion is controversial, and hypothesized to be a consequence of whole genome duplication or accumulation of repetitive elements. To test these alternative but nonexclusive hypotheses, we gathered and evaluated evidence from whole transcriptome and whole genome sequences of T. barrerae and O. mimax. We recovered support for genome expansion due to accumulation of a diverse assemblage of repetitive elements, which represent about one half and one fifth of the genomes of T. barrerae and O. mimax, respectively, but we found no strong signal of whole genome duplication. In both species, repetitive sequences were rare in transcribed regions as compared with the rest of the genome, and mostly had no close match to annotated repetitive sequences from other rodents. These findings raise new questions about the genomic dynamics of these repetitive elements, their connection to widespread chromosomal fissions that occurred in the T. barrerae ancestor, and their fitness effects, including during the evolution of hypersaline dietary tolerance in T. barrerae. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Deleterious mutation accumulation in organelle genomes.

    Science.gov (United States)

    Lynch, M; Blanchard, J L

    1998-01-01

    It is well established on theoretical grounds that the accumulation of mildly deleterious mutations in nonrecombining genomes is a major extinction risk in obligately asexual populations. Sexual populations can also incur mutational deterioration in genomic regions that experience little or no recombination, i.e., autosomal regions near centromeres, Y chromosomes, and organelle genomes. Our results suggest, for a wide array of genes (transfer RNAs, ribosomal RNAs, and proteins) in a diverse collection of species (animals, plants, and fungi), an almost universal increase in the fixation probabilities of mildly deleterious mutations arising in mitochondrial and chloroplast genomes relative to those arising in the recombining nuclear genome. This enhanced width of the selective sieve in organelle genomes does not appear to be a consequence of relaxed selection, but can be explained by the decline in the efficiency of selection that results from the reduction of effective population size induced by uniparental inheritance. Because of the very low mutation rates of organelle genomes (on the order of 10^-4 per genome per year), the reduction in fitness resulting from mutation accumulation in such genomes is a very long-term process, not likely to imperil many species on time scales of less than a million years, but perhaps playing some role in phylogenetic lineage sorting on time scales of 10 to 100 million years.

  1. Goodbye genome paper, hello genome report: the increasing popularity of 'genome announcements' and their impact on science.

    Science.gov (United States)

    Smith, David Roy

    2017-05-01

    Next-generation sequencing technologies have revolutionized genomics and altered the scientific publication landscape. Life-science journals abound with genome papers (peer-reviewed descriptions of newly sequenced chromosomes). Although they once filled the pages of Nature and Science, genome papers are now mostly relegated to journals with low impact factors. Some have forecast the death of the genome paper and argued that they are using up valuable resources and not advancing science. However, the publication rate of genome papers is on the rise. This increase is largely because some journals have created a new category of manuscript called genome reports, which are short, fast-tracked papers describing a chromosome sequence(s), its GenBank accession number and little else. In 2015, for example, more than 2000 genome reports were published, and 2016 is poised to bring even more. Here, I highlight the growing popularity of genome reports and discuss their merits, drawbacks and impact on science and the academic publication infrastructure. Genome reports can be excellent assets for the research community, but they are also being used as quick and easy routes to a publication, and in some instances they are not peer reviewed. One of the best arguments for genome reports is that they are a citable, user-generated genomic resource providing essential methodological and biological information, which may not be present in the sequence database. But they are expensive and time-consuming avenues for achieving such a goal. © The Author 2016. Published by Oxford University Press.

  2. Population Genomics of Infectious and Integrated Wolbachia pipientis Genomes in Drosophila ananassae

    Science.gov (United States)

    Choi, Jae Young; Bubnell, Jaclyn E.; Aquadro, Charles F.

    2015-01-01

    Coevolution between Drosophila and its endosymbiont Wolbachia pipientis has many intriguing aspects. For example, Drosophila ananassae hosts two forms of W. pipientis genomes: One being the infectious bacterial genome and the other integrated into the host nuclear genome. Here, we characterize the infectious and integrated genomes of W. pipientis infecting D. ananassae (wAna), by genome sequencing 15 strains of D. ananassae that have either the infectious or integrated wAna genomes. Results indicate evolutionarily stable maternal transmission for the infectious wAna genome suggesting a relatively long-term coevolution with its host. In contrast, the integrated wAna genome showed pseudogene-like characteristics accumulating many variants that are predicted to have deleterious effects if present in an infectious bacterial genome. Phylogenomic analysis of sequence variation together with genotyping by polymerase chain reaction of large structural variations indicated several wAna variants among the eight infectious wAna genomes. In contrast, only a single wAna variant was found among the seven integrated wAna genomes examined in lines from Africa, south Asia, and south Pacific islands suggesting that the integration occurred once from a single infectious wAna genome and then spread geographically. Further analysis revealed that for all D. ananassae we examined with the integrated wAna genomes, the majority of the integrated wAna genomic regions is represented in at least two copies suggesting a double integration or single integration followed by an integrated genome duplication. The possible evolutionary mechanism underlying the widespread geographical presence of the duplicate integration of the wAna genome is an intriguing question remaining to be answered. PMID:26254486

  3. Genomic landscapes of Chinese hamster ovary cell lines as revealed by the Cricetulus griseus draft genome

    DEFF Research Database (Denmark)

    Lewis, Nathan E; Liu, Xin; Li, Yuxiang

    2013-01-01

    stymied by the lack of a unifying genomic resource for CHO cells. Here we report a 2.4-Gb draft genome sequence of a female Chinese hamster, Cricetulus griseus, harboring 24,044 genes. We also resequenced and analyzed the genomes of six CHO cell lines from the CHO-K1, DG44 and CHO-S lineages...

  4. GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.

    Science.gov (United States)

    Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi

    2013-04-10

    Sequencing of microbial genomes is important because microbes carry antibiotic and pathogenetic activities. However, even with the help of new assembly software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenetic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for genomes that are still being sequenced. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembling tool, a functional annotation pipeline, and a high-performance GI predicting module, in a support vector machine (SVM)-based method called genomic island genomic profile scanning (GI-GPS). The draft genomes of ongoing genome projects, as contigs or scaffolds, can be submitted to our Web server, and it provides the functional annotation and highly probable GI-predicting results. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information, including possible GIs, coding/non-coding sequences and functional analysis, from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.

  5. Spaces of genomics : exploring the innovation journey of genomics in research on common disease

    NARCIS (Netherlands)

    Bitsch, L.

    2013-01-01

    Genomics was introduced with big promises and expectations of its future contribution to our society. Medical genomics was introduced as that which would lay the foundation for a revolution in our management of common diseases. Genomics would lead the way towards a future of personalised medicine.

  6. Tolerance of Whole-Genome Doubling Propagates Chromosomal Instability and Accelerates Cancer Genome Evolution

    DEFF Research Database (Denmark)

    Dewhurst, Sally M.; McGranahan, Nicholas; Burrell, Rebecca A.

    2014-01-01

    The contribution of whole-genome doubling to chromosomal instability (CIN) and tumor evolution is unclear. We use long-term culture of isogenic tetraploid cells from a stable diploid colon cancer progenitor to investigate how a genome-doubling event affects genome stability over time. Rare cells...

  7. A Trichosporonales genome tree based on 27 haploid and three evolutionarily conserved 'natural' hybrid genomes.

    Science.gov (United States)

    Takashima, Masako; Sriswasdi, Sira; Manabe, Ri-Ichiroh; Ohkuma, Moriya; Sugita, Takashi; Iwasaki, Wataru

    2018-01-01

    To construct a backbone tree consisting of basidiomycetous yeasts, draft genome sequences from 25 species of Trichosporonales (Tremellomycetes, Basidiomycota) were generated. In addition to the hybrid genomes of Trichosporon coremiiforme and Trichosporon ovoides that we described previously, we identified an interspecies hybrid genome in Cutaneotrichosporon mucoides (formerly Trichosporon mucoides). This hybrid genome had a gene retention rate of ~55%, and its closest haploid relative was Cutaneotrichosporon dermatis. After constructing the C. mucoides subgenomes, we generated a phylogenetic tree using genome data from the 27 haploid species and the subgenome data from the three hybrid genome species. It was a high-quality tree with 100% bootstrap support for all of the branches. The genome-based tree provided superior resolution compared with previous multi-gene analyses. Although our backbone tree does not include all Trichosporonales genera (e.g. Cryptotrichosporon), it will be valuable for future analyses of genome data. Interest in interspecies hybrid fungal genomes has recently increased because they may provide a basis for new technologies. The three Trichosporonales hybrid genomes described in this study are different from well-characterized hybrid genomes (e.g. those of Saccharomyces pastorianus and Saccharomyces bayanus) because these hybridization events probably occurred in the distant evolutionary past. Hence, they will be useful for studying genome stability following hybridization and speciation events. Copyright © 2017 John Wiley & Sons, Ltd.

  8. Rhipicephalus (Boophilus) microplus strain Deutsch, whole genome shotgun sequencing project first submission of genome sequence

    Science.gov (United States)

    The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence difficult. Cot filtration/selection techniques were used to reduce the repetitive fraction of the tick genome and enrich for the fraction of DNA with gene-containing regions. The Cot-selected ...

  9. Genomic analyses of the Chlamydia trachomatis core genome show an association between chromosomal genome, plasmid type and disease

    NARCIS (Netherlands)

    Versteeg, Bart; Bruisten, Sylvia M.; Pannekoek, Yvonne; Jolley, Keith A.; Maiden, Martin C. J.; van der Ende, Arie; Harrison, Odile B.

    2018-01-01

    Background: Chlamydia trachomatis (Ct) plasmid has been shown to encode genes essential for infection. We evaluated the population structure of Ct using whole-genome sequence data (WGS). In particular, the relationship between the Ct genome, plasmid and disease was investigated. Results: WGS data

  10. Genomic analysis of Fusarium verticillioides.

    Science.gov (United States)

    Brown, D W; Butchko, R A E; Proctor, R H

    2008-09-01

    Fusarium verticillioides (teleomorph Gibberella moniliformis) can be either an endophyte of maize, causing no visible disease, or a pathogen causing disease of ears, stalks, roots and seedlings. At any stage, this fungus can synthesize fumonisins, a family of mycotoxins structurally similar to the sphingolipid sphinganine. Ingestion of fumonisin-contaminated maize has been associated with a number of animal diseases, including cancer in rodents, and exposure has been correlated with human oesophageal cancer in some regions of the world. Some evidence also suggests that fumonisins are a risk factor for neural tube defects. A primary goal of the authors' laboratory is to eliminate fumonisin contamination of maize and maize products. Understanding how and why these toxins are made and the F. verticillioides-maize disease process will allow one to develop novel strategies to limit tissue destruction (rot) and fumonisin production. To meet this goal, genomic sequence data, expressed sequence tags (ESTs) and microarrays are being used to identify F. verticillioides genes involved in the biosynthesis of toxins and plant pathogenesis. This paper describes the current status of F. verticillioides genomic resources and three approaches being used to mine microarray data from a wild-type strain cultured in liquid fumonisin production medium for 12, 24, 48, 72, 96 and 120 h. Taken together, these approaches demonstrate the power of microarray technology to provide information on different biological processes.

  11. DNABIT Compress - Genome compression algorithm.

    Science.gov (United States)

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-22

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm for DNA sequences, "DNABIT Compress", based on a novel scheme that assigns binary bits to smaller segments of DNA bases in order to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. Significantly better compression results show that the "DNABIT Compress" algorithm is the best among the compared compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves on the running time of all previous DNA compression programs. Assigning binary bits (a unique BIT CODE) to fragments of DNA sequence (exact repeats, reverse repeats) is also a unique concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a compression ratio as low as 1.58 bits/base, whereas the existing best methods could not achieve a ratio of less than 1.72 bits/base.
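
    For context on the bits-per-base figures above, the following is a minimal sketch of the fixed 2-bit baseline that such methods aim to beat. It is not the DNABIT Compress scheme, whose bit codes are not given in the abstract; the code table and the function names `pack` and `unpack` are assumptions made for illustration, and only the four unambiguous bases are handled.

```python
# Hypothetical fixed code table: two bits per base.
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASE = "ACGT"


def pack(seq):
    """Pack a DNA string (A/C/G/T only) into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack(data, n_bases):
    """Recover the first n_bases bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASE[(byte >> shift) & 0b11])
    return "".join(bases[:n_bases])


packed = pack("ACGTAC")
restored = unpack(packed, 6)
```

    A fixed code of this kind costs exactly 2 bits per base before padding, so a reported 1.58 bits/base implies that repeats or other redundancy are being exploited on top of it.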

  12. Functional Insights from Structural Genomics

    Energy Technology Data Exchange (ETDEWEB)

    Forouhar, F.; Kuzin, A.; Seetharaman, J.; Lee, I.; Zhou, W.; Abashidze, M.; Chen, Y.; Montelione, G.; Tong, L.; et al

    2007-01-01

    Structural genomics efforts have produced structural information, either directly or by modeling, for thousands of proteins over the past few years. While many of these proteins have known functions, a large percentage of them have not been characterized at the functional level. The structural information has provided valuable functional insights on some of these proteins, through careful structural analyses, serendipity, and structure-guided functional screening. Some of the success stories based on structures solved at the Northeast Structural Genomics Consortium (NESG) are reported here. These include a novel methyl salicylate esterase with an important role in plant innate immunity, a novel RNA methyltransferase (H. influenzae yggJ (HI0303)), a novel spermidine/spermine N-acetyltransferase (B. subtilis PaiA), a novel methyltransferase or AdoMet binding protein (A. fulgidus AF_0241), an ATP:cob(I)alamin adenosyltransferase (B. subtilis YvqK), a novel carboxysome pore (E. coli EutN), a proline racemase homolog with a disrupted active site (B. melitensis BME11586), an FMN-dependent enzyme (S. pneumoniae SP_1951), and a 12-stranded β-barrel with a novel fold (V. parahaemolyticus VPA1032).

  13. Observing copepods through a genomic lens

    Science.gov (United States)

    2011-01-01

    Background: Copepods outnumber every other multicellular animal group. They are critical components of the world's freshwater and marine ecosystems, sensitive indicators of local and global climate change, key ecosystem service providers, parasites and predators of economically important aquatic animals and potential vectors of waterborne disease. Copepods sustain the world fisheries that nourish and support human populations. Although genomic tools have transformed many areas of biological and biomedical research, their power to elucidate aspects of the biology, behavior and ecology of copepods has only recently begun to be exploited. Discussion: The extraordinary biological and ecological diversity of the subclass Copepoda provides both unique advantages for addressing key problems in aquatic systems and formidable challenges for developing a focused genomics strategy. This article provides an overview of genomic studies of copepods and discusses strategies for using genomics tools to address key questions at levels extending from individuals to ecosystems. Genomics can, for instance, help to decipher patterns of genome evolution such as those that occur during transitions from free living to symbiotic and parasitic lifestyles and can assist in the identification of genetic mechanisms and accompanying physiological changes associated with adaptation to new or physiologically challenging environments. The adaptive significance of the diversity in genome size and unique mechanisms of genome reorganization during development could similarly be explored. Genome-wide and EST studies of parasitic copepods of salmon and large EST studies of selected free-living copepods have demonstrated the potential utility of modern genomics approaches for the study of copepods and have generated resources such as EST libraries, shotgun genome sequences, BAC libraries, genome maps and inbred lines that will be invaluable in assisting further efforts to provide genomics tools for

  14. Observing copepods through a genomic lens

    Directory of Open Access Journals (Sweden)

    Johnson Stewart C

    2011-09-01

    Background: Copepods outnumber every other multicellular animal group. They are critical components of the world's freshwater and marine ecosystems, sensitive indicators of local and global climate change, key ecosystem service providers, parasites and predators of economically important aquatic animals and potential vectors of waterborne disease. Copepods sustain the world fisheries that nourish and support human populations. Although genomic tools have transformed many areas of biological and biomedical research, their power to elucidate aspects of the biology, behavior and ecology of copepods has only recently begun to be exploited. Discussion: The extraordinary biological and ecological diversity of the subclass Copepoda provides both unique advantages for addressing key problems in aquatic systems and formidable challenges for developing a focused genomics strategy. This article provides an overview of genomic studies of copepods and discusses strategies for using genomics tools to address key questions at levels extending from individuals to ecosystems. Genomics can, for instance, help to decipher patterns of genome evolution such as those that occur during transitions from free living to symbiotic and parasitic lifestyles and can assist in the identification of genetic mechanisms and accompanying physiological changes associated with adaptation to new or physiologically challenging environments. The adaptive significance of the diversity in genome size and unique mechanisms of genome reorganization during development could similarly be explored. Genome-wide and EST studies of parasitic copepods of salmon and large EST studies of selected free-living copepods have demonstrated the potential utility of modern genomics approaches for the study of copepods and have generated resources such as EST libraries, shotgun genome sequences, BAC libraries, genome maps and inbred lines that will be invaluable in assisting further efforts to

  15. Aligning the unalignable: bacteriophage whole genome alignments.

    Science.gov (United States)

    Bérard, Sèverine; Chateau, Annie; Pompidor, Nicolas; Guertin, Paul; Bergeron, Anne; Swenson, Krister M

    2016-01-13

    In recent years, many studies focused on the description and comparison of large sets of related bacteriophage genomes. Due to the peculiar mosaic structure of these genomes, few informative approaches for comparing whole genomes exist: dot plots diagrams give a mostly qualitative assessment of the similarity/dissimilarity between two or more genomes, and clustering techniques are used to classify genomes. Multiple alignments are conspicuously absent from this scene. Indeed, whole genome aligners interpret lack of similarity between sequences as an indication of rearrangements, insertions, or losses. This behavior makes them ill-prepared to align bacteriophage genomes, where even closely related strains can accomplish the same biological function with highly dissimilar sequences. In this paper, we propose a multiple alignment strategy that exploits functional collinearity shared by related strains of bacteriophages, and uses partial orders to capture mosaicism of sets of genomes. As classical alignments do, the computed alignments can be used to predict that genes have the same biological function, even in the absence of detectable similarity. The Alpha aligner implements these ideas in visual interactive displays, and is used to compute several examples of alignments of Staphylococcus aureus and Mycobacterium bacteriophages, involving up to 29 genomes. Using these datasets, we prove that Alpha alignments are at least as good as those computed by standard aligners. Comparison with the progressive Mauve aligner - which implements a partial order strategy, but whose alignments are linearized - shows a greatly improved interactive graphic display, while avoiding misalignments. Multiple alignments of whole bacteriophage genomes work, and will become an important conceptual and visual tool in comparative genomics of sets of related strains. 
    A Python implementation of Alpha, along with installation instructions for Ubuntu and OSX, is available on Bitbucket (https://bitbucket.org/thekswenson/alpha).

  16. Dynamics of genome rearrangement in bacterial populations.

    Directory of Open Access Journals (Sweden)

    Aaron E Darling

    2008-07-01

    Genome structure variation has profound impacts on phenotype in organisms ranging from microbes to humans, yet little is known about how natural selection acts on genome arrangement. Pathogenic bacteria such as Yersinia pestis, which causes bubonic and pneumonic plague, often exhibit a high degree of genomic rearrangement. The recent availability of several Yersinia genomes offers an unprecedented opportunity to study the evolution of genome structure and arrangement. We introduce a set of statistical methods to study patterns of rearrangement in circular chromosomes and apply them to the Yersinia. We constructed a multiple alignment of eight Yersinia genomes using Mauve software to identify 78 conserved segments that are internally free from genome rearrangement. Based on the alignment, we applied Bayesian statistical methods to infer the phylogenetic inversion history of Yersinia. The sampling of genome arrangement reconstructions contains seven parsimonious tree topologies, each having different histories of 79 inversions. Topologies with a greater number of inversions also exist, but were sampled less frequently. The inversion phylogenies agree with results suggested by SNP patterns. We then analyzed reconstructed inversion histories to identify patterns of rearrangement. We confirm an over-representation of "symmetric inversions", that is, inversions with endpoints that are equally distant from the origin of chromosomal replication. Ancestral genome arrangements demonstrate moderate preference for replichore balance in Yersinia. We found that all inversions are shorter than expected under a neutral model, whereas inversions acting within a single replichore are much shorter than expected. We also found evidence for a canonical configuration of the origin and terminus of replication. Finally, breakpoint reuse analysis reveals that inversions with endpoints proximal to the origin of DNA replication are nearly three times more frequent. Our findings
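
    The notion of a symmetric inversion used in this abstract can be made concrete with a small sketch. It is an illustration only, not the authors' statistical method: it assumes a circular chromosome of known length, assumes the two replichores meet at the point opposite the origin, and uses a hypothetical `tolerance` parameter (a fraction of chromosome length) to decide when two distances count as equal.

```python
def signed_offset(pos, origin, length):
    """Offset of pos from origin on a circle, in the range (-length/2, length/2]."""
    d = (pos - origin) % length
    return d - length if d > length / 2 else d


def is_symmetric_inversion(start, end, origin, length, tolerance=0.05):
    """True if the endpoints flank the origin at roughly equal distances.

    tolerance is a hypothetical cutoff, expressed as a fraction of the
    chromosome length.
    """
    a = signed_offset(start, origin, length)
    b = signed_offset(end, origin, length)
    on_opposite_sides = a * b < 0              # one endpoint on each replichore
    similar_distance = abs(abs(a) - abs(b)) <= tolerance * length
    return on_opposite_sides and similar_distance


# Toy chromosome of 1000 positions with the origin at position 0.
spans_origin = is_symmetric_inversion(900, 110, origin=0, length=1000)
within_replichore = is_symmetric_inversion(100, 400, origin=0, length=1000)
```

    Under these assumptions an inversion confined to one replichore can never be symmetric, which matches the abstract's separate treatment of inversions acting within a single replichore.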

  17. Simultaneous gene finding in multiple genomes.

    Science.gov (United States)

    König, Stefanie; Romoth, Lars W; Gerischer, Lizzy; Stanke, Mario

    2016-11-15

    As the tree of life is populated with sequenced genomes ever more densely, the new challenge is the accurate and consistent annotation of entire clades of genomes. We address this problem with a new approach to comparative gene finding that takes a multiple genome alignment of closely related species and simultaneously predicts the location and structure of protein-coding genes in all input genomes, thereby exploiting negative selection and sequence conservation. The model prefers potential gene structures in the different genomes that are in agreement with each other, or, if not, where the exon gains and losses are plausible given the species tree. We formulate the multi-species gene finding problem as a binary labeling problem on a graph. The resulting optimization problem is NP hard, but can be efficiently approximated using a subgradient-based dual decomposition approach. The proposed method was tested on whole-genome alignments of 12 vertebrate and 12 Drosophila species. The accuracy was evaluated for human, mouse and Drosophila melanogaster and compared to competing methods. Results suggest that our method is well-suited for annotation of (a large number of) genomes of closely related species within a clade, in particular, when RNA-Seq data are available for many of the genomes. The transfer of existing annotations from one genome to another via the genome alignment is more accurate than previous approaches that are based on protein-spliced alignments, when the genomes are at close to medium distances. The method is implemented in C++ as part of Augustus and available open source at http://bioinf.uni-greifswald.de/augustus/. Contact: stefaniekoenig@ymail.com or mario.stanke@uni-greifswald.de. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. The whole genome sequences and experimentally phased haplotypes of over 100 personal genomes.

    Science.gov (United States)

    Mao, Qing; Ciotlos, Serban; Zhang, Rebecca Yu; Ball, Madeleine P; Chin, Robert; Carnevali, Paolo; Barua, Nina; Nguyen, Staci; Agarwal, Misha R; Clegg, Tom; Connelly, Abram; Vandewege, Ward; Zaranek, Alexander Wait; Estep, Preston W; Church, George M; Drmanac, Radoje; Peters, Brock A

    2016-10-11

    Since the completion of the Human Genome Project in 2003, it is estimated that more than 200,000 individual whole human genomes have been sequenced, a stunning accomplishment in such a short period of time. However, most of these were sequenced without experimental haplotype data and are therefore missing an important aspect of genome biology. In addition, much of the genomic data is not available to the public and lacks phenotypic information. As part of the Personal Genome Project, blood samples from 184 participants were collected and processed using Complete Genomics' Long Fragment Read technology. Here, we present the experimental whole genome haplotyping and sequencing of these samples to an average read coverage depth of 100X. This is approximately three-fold higher than the read coverage applied to most whole human genome assemblies and ensures the highest quality results. Currently, 114 genomes from this dataset are freely available in the GigaDB repository and are associated with rich phenotypic data; the remaining 70 should be added in the near future as they are approved through the PGP data release process. For reproducibility analyses, 20 genomes were sequenced at least twice using independent LFR barcoded libraries. Seven genomes were also sequenced using Complete Genomics' standard non-barcoded library process. In addition, we report 2.6 million high-quality, rare variants not previously identified in the Single Nucleotide Polymorphisms database or the 1000 Genomes Project Phase 3 data. These genomes represent a unique source of haplotype and phenotype data for the scientific community and should help to expand our understanding of human genome evolution and function.

  19. Genome-wide identification of direct HBx genomic targets

    KAUST Repository

    Guerrieri, Francesca

    2017-02-17

    Background: The Hepatitis B Virus (HBV) HBx regulatory protein is required for HBV replication and involved in HBV-related carcinogenesis. HBx interacts with chromatin modifying enzymes and transcription factors to modulate histone post-translational modifications and to regulate viral cccDNA transcription and cellular gene expression. Aiming to identify genes and non-coding RNAs (ncRNAs) directly targeted by HBx, we performed chromatin immunoprecipitation sequencing (ChIP-Seq) to analyse HBV recruitment on host cell chromatin in cells replicating HBV. Results: ChIP-Seq high-throughput sequencing of HBx-bound fragments was used to obtain a high-resolution, unbiased mapping of HBx binding sites across the genome in HBV-replicating cells. Protein-coding genes and ncRNAs involved in cell metabolism, chromatin dynamics and cancer were enriched among HBx targets, together with genes/ncRNAs known to modulate HBV replication. The direct transcriptional activation of genes/miRNAs that potentiate endocytosis (Ras-related in brain (RAB) GTPase family) and autophagy (autophagy related (ATG) genes, beclin-1, miR-33a) and the transcriptional repression of microRNAs (miR-138, miR-224, miR-576, miR-596) that directly target the HBV pgRNA and would inhibit HBV replication contribute to the HBx-mediated increase of HBV replication. Conclusions: Our ChIP-Seq analysis of HBx genome-wide chromatin recruitment defined the repertoire of genes and ncRNAs directly targeted by HBx and led to the identification of new mechanisms by which HBx positively regulates cccDNA transcription and HBV replication.

  20. Dramatic improvement in genome assembly achieved using doubled-haploid genomes.

    Science.gov (United States)

    Zhang, Hong; Tan, Engkong; Suzuki, Yutaka; Hirose, Yusuke; Kinoshita, Shigeharu; Okano, Hideyuki; Kudoh, Jun; Shimizu, Atsushi; Saito, Kazuyoshi; Watabe, Shugo; Asakawa, Shuichi

    2014-10-27

    Improvement in de novo assembly of large genomes is still to be desired. Here, we improved draft genome sequence quality by employing doubled-haploid individuals. We sequenced wildtype and doubled-haploid Takifugu rubripes genomes, under the same conditions, using the Illumina platform and assembled contigs with SOAPdenovo2. We observed 5.4-fold and 2.6-fold improvement in the sizes of the N50 contig and scaffold of doubled-haploid individuals, respectively, compared to the wildtype, indicating that the use of a doubled-haploid genome aids in accurate genome analysis.
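
    The N50 statistic cited in the record above can be illustrated with a minimal sketch. It assumes the common definition of N50: the largest length L such that contigs (or scaffolds) of length at least L together cover at least half of the total assembly length. The function names and the toy length lists below are hypothetical and are not taken from the study; they only show how a fold improvement between two assemblies would be computed.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Assumed definition: walk the lengths from longest to shortest and
    return the length at which the running sum first reaches at least
    half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional arithmetic.
        if 2 * running >= total:
            return length


def fold_improvement(baseline_lengths, improved_lengths):
    """Ratio of the improved assembly's N50 to the baseline's N50."""
    return n50(improved_lengths) / n50(baseline_lengths)


if __name__ == "__main__":
    # Toy lengths for illustration only (not data from the paper).
    wildtype = [2, 3, 4, 5, 6]
    doubled_haploid = [10, 20, 30]
    print("wildtype N50:", n50(wildtype))
    print("doubled-haploid N50:", n50(doubled_haploid))
    print("fold improvement:", fold_improvement(wildtype, doubled_haploid))
```

    Because N50 depends on the total assembly length, a fold improvement of this kind is most meaningful when, as in the record above, both assemblies come from the same species and were sequenced and assembled under the same conditions.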