WorldWideScience

Sample records for human genome analysis

  1. Big Data Analysis of Human Genome Variations

    KAUST Repository

    Gojobori, Takashi

    2016-01-25

    Since the human genome draft sequence was in public for the first time in 2000, genomic analyses have been intensively extended to the population level. The following three international projects are good examples for large-scale studies of human genome variations: 1) HapMap Data (1,417 individuals) (http://hapmap.ncbi.nlm.nih.gov/downloads/genotypes/2010-08_phaseII+III/forward/), 2) HGDP (Human Genome Diversity Project) Data (940 individuals) (http://www.hagsc.org/hgdp/files.html), 3) 1000 genomes Data (2,504 individuals) http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/ If we can integrate all three data into a single volume of data, we should be able to conduct a more detailed analysis of human genome variations for a total number of 4,861 individuals (= 1,417+940+2,504 individuals). In fact, we successfully integrated these three data sets by use of information on the reference human genome sequence, and we conducted the big data analysis. In particular, we constructed a phylogenetic tree of about 5,000 human individuals at the genome level. As a result, we were able to identify clusters of ethnic groups, with detectable admixture, that were not possible by an analysis of each of the three data sets. Here, we report the outcome of this kind of big data analyses and discuss evolutionary significance of human genomic variations. Note that the present study was conducted in collaboration with Katsuhiko Mineta and Kosuke Goto at KAUST.

  2. Big Data Analysis of Human Genome Variations

    KAUST Repository

    Gojobori, Takashi

    2016-01-01

    Since the human genome draft sequence was in public for the first time in 2000, genomic analyses have been intensively extended to the population level. The following three international projects are good examples for large-scale studies of human

  3. Virtual Northern analysis of the human genome.

    Directory of Open Access Journals (Sweden)

    Evan H Hurowitz

    2007-05-01

    Full Text Available We applied the Virtual Northern technique to human brain mRNA to systematically measure human mRNA transcript lengths on a genome-wide scale.We used separation by gel electrophoresis followed by hybridization to cDNA microarrays to measure 8,774 mRNA transcript lengths representing at least 6,238 genes at high (>90% confidence. By comparing these transcript lengths to the Refseq and H-Invitational full-length cDNA databases, we found that nearly half of our measurements appeared to represent novel transcript variants. Comparison of length measurements determined by hybridization to different cDNAs derived from the same gene identified clones that potentially correspond to alternative transcript variants. We observed a close linear relationship between ORF and mRNA lengths in human mRNAs, identical in form to the relationship we had previously identified in yeast. Some functional classes of protein are encoded by mRNAs whose untranslated regions (UTRs tend to be longer or shorter than average; these functional classes were similar in both human and yeast.Human transcript diversity is extensive and largely unannotated. Our length dataset can be used as a new criterion for judging the completeness of cDNAs and annotating mRNA sequences. Similar relationships between the lengths of the UTRs in human and yeast mRNAs and the functions of the proteins they encode suggest that UTR sequences serve an important regulatory role among eukaryotes.

  4. Virtual Northern analysis of the human genome.

    Science.gov (United States)

    Hurowitz, Evan H; Drori, Iddo; Stodden, Victoria C; Donoho, David L; Brown, Patrick O

    2007-05-23

    We applied the Virtual Northern technique to human brain mRNA to systematically measure human mRNA transcript lengths on a genome-wide scale. We used separation by gel electrophoresis followed by hybridization to cDNA microarrays to measure 8,774 mRNA transcript lengths representing at least 6,238 genes at high (>90%) confidence. By comparing these transcript lengths to the Refseq and H-Invitational full-length cDNA databases, we found that nearly half of our measurements appeared to represent novel transcript variants. Comparison of length measurements determined by hybridization to different cDNAs derived from the same gene identified clones that potentially correspond to alternative transcript variants. We observed a close linear relationship between ORF and mRNA lengths in human mRNAs, identical in form to the relationship we had previously identified in yeast. Some functional classes of protein are encoded by mRNAs whose untranslated regions (UTRs) tend to be longer or shorter than average; these functional classes were similar in both human and yeast. Human transcript diversity is extensive and largely unannotated. Our length dataset can be used as a new criterion for judging the completeness of cDNAs and annotating mRNA sequences. Similar relationships between the lengths of the UTRs in human and yeast mRNAs and the functions of the proteins they encode suggest that UTR sequences serve an important regulatory role among eukaryotes.

  5. Human · mouse genome analysis and radiation biology. Proceedings

    International Nuclear Information System (INIS)

    Hori, Tada-aki

    1994-03-01

    This issue is the collection of the papers presented at the 25th NIRS symposium on Human, Mouse Genome Analysis and Radiation Biology. The 14 of the presented papers are indexed individually. (J.P.N.)

  6. Sequencing and analysis of an Irish human genome.

    LENUS (Irish Health Repository)

    Tong, Pin

    2010-01-01

    Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impact on further populations, we extend the above studies through the generation of 11-fold coverage of the first Irish human genome sequence.

  7. Genome-wide linkage analysis for human longevity

    DEFF Research Database (Denmark)

    Beekman, Marian; Blanché, Hélène; Perola, Markus

    2013-01-01

    Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian...

  8. Genome-wide binding and transcriptome analysis of human farnesoid X receptor in primary human hepatocytes.

    Directory of Open Access Journals (Sweden)

    Le Zhan

    Full Text Available Farnesoid X receptor (FXR, NR1H4 is a ligand-activated transcription factor, belonging to the nuclear receptor superfamily. FXR is highly expressed in the liver and is essential in regulating bile acid homeostasis. FXR deficiency is implicated in numerous liver diseases and mice with modulation of FXR have been used as animal models to study liver physiology and pathology. We have reported genome-wide binding of FXR in mice by chromatin immunoprecipitation - deep sequencing (ChIP-seq, with results indicating that FXR may be involved in regulating diverse pathways in liver. However, limited information exists for the functions of human FXR and the suitability of using murine models to study human FXR functions.In the current study, we performed ChIP-seq in primary human hepatocytes (PHHs treated with a synthetic FXR agonist, GW4064 or DMSO control. In parallel, RNA deep sequencing (RNA-seq and RNA microarray were performed for GW4064 or control treated PHHs and wild type mouse livers, respectively.ChIP-seq showed similar profiles of genome-wide FXR binding in humans and mice in terms of motif analysis and pathway prediction. However, RNA-seq and microarray showed more different transcriptome profiles between PHHs and mouse livers upon GW4064 treatment.In summary, we have established genome-wide human FXR binding and transcriptome profiles. These results will aid in determining the human FXR functions, as well as judging to what level the mouse models could be used to study human FXR functions.

  9. Genome-Wide Prediction and Analysis of 3D-Domain Swapped Proteins in the Human Genome from Sequence Information.

    Science.gov (United States)

    Upadhyay, Atul Kumar; Sowdhamini, Ramanathan

    2016-01-01

    3D-domain swapping is one of the mechanisms of protein oligomerization and the proteins exhibiting this phenomenon have many biological functions. These proteins, which undergo domain swapping, have acquired much attention owing to their involvement in human diseases, such as conformational diseases, amyloidosis, serpinopathies, proteionopathies etc. Early realisation of proteins in the whole human genome that retain tendency to domain swap will enable many aspects of disease control management. Predictive models were developed by using machine learning approaches with an average accuracy of 78% (85.6% of sensitivity, 87.5% of specificity and an MCC value of 0.72) to predict putative domain swapping in protein sequences. These models were applied to many complete genomes with special emphasis on the human genome. Nearly 44% of the protein sequences in the human genome were predicted positive for domain swapping. Enrichment analysis was performed on the positively predicted sequences from human genome for their domain distribution, disease association and functional importance based on Gene Ontology (GO). Enrichment analysis was also performed to infer a better understanding of the functional importance of these sequences. Finally, we developed hinge region prediction, in the given putative domain swapped sequence, by using important physicochemical properties of amino acids.

  10. Comparative genomic analysis of human fungal pathogens causing paracoccidioidomycosis.

    Directory of Open Access Journals (Sweden)

    Christopher A Desjardins

    2011-10-01

    Full Text Available Paracoccidioides is a fungal pathogen and the cause of paracoccidioidomycosis, a health-threatening human systemic mycosis endemic to Latin America. Infection by Paracoccidioides, a dimorphic fungus in the order Onygenales, is coupled with a thermally regulated transition from a soil-dwelling filamentous form to a yeast-like pathogenic form. To better understand the genetic basis of growth and pathogenicity in Paracoccidioides, we sequenced the genomes of two strains of Paracoccidioides brasiliensis (Pb03 and Pb18 and one strain of Paracoccidioides lutzii (Pb01. These genomes range in size from 29.1 Mb to 32.9 Mb and encode 7,610 to 8,130 genes. To enable genetic studies, we mapped 94% of the P. brasiliensis Pb18 assembly onto five chromosomes. We characterized gene family content across Onygenales and related fungi, and within Paracoccidioides we found expansions of the fungal-specific kinase family FunK1. Additionally, the Onygenales have lost many genes involved in carbohydrate metabolism and fewer genes involved in protein metabolism, resulting in a higher ratio of proteases to carbohydrate active enzymes in the Onygenales than their relatives. To determine if gene content correlated with growth on different substrates, we screened the non-pathogenic onygenale Uncinocarpus reesii, which has orthologs for 91% of Paracoccidioides metabolic genes, for growth on 190 carbon sources. U. reesii showed growth on a limited range of carbohydrates, primarily basic plant sugars and cell wall components; this suggests that Onygenales, including dimorphic fungi, can degrade cellulosic plant material in the soil. In addition, U. reesii grew on gelatin and a wide range of dipeptides and amino acids, indicating a preference for proteinaceous growth substrates over carbohydrates, which may enable these fungi to also degrade animal biomass. These capabilities for degrading plant and animal substrates suggest a duality in lifestyle that could enable pathogenic

  11. Comparative analysis of genome maintenance genes in naked mole rat, mouse, and human.

    Science.gov (United States)

    MacRae, Sheila L; Zhang, Quanwei; Lemetre, Christophe; Seim, Inge; Calder, Robert B; Hoeijmakers, Jan; Suh, Yousin; Gladyshev, Vadim N; Seluanov, Andrei; Gorbunova, Vera; Vijg, Jan; Zhang, Zhengdong D

    2015-04-01

    Genome maintenance (GM) is an essential defense system against aging and cancer, as both are characterized by increased genome instability. Here, we compared the copy number variation and mutation rate of 518 GM-associated genes in the naked mole rat (NMR), mouse, and human genomes. GM genes appeared to be strongly conserved, with copy number variation in only four genes. Interestingly, we found NMR to have a higher copy number of CEBPG, a regulator of DNA repair, and TINF2, a protector of telomere integrity. NMR, as well as human, was also found to have a lower rate of germline nucleotide substitution than the mouse. Together, the data suggest that the long-lived NMR, as well as human, has more robust GM than mouse and identifies new targets for the analysis of the exceptional longevity of the NMR. © 2015 The Authors. Aging Cell published by the Anatomical Society and John Wiley & Sons Ltd.

  12. The Revolution in Viral Genomics as Exemplified by the Bioinformatic Analysis of Human Adenoviruses

    Directory of Open Access Journals (Sweden)

    Sarah Torres

    2010-06-01

    Full Text Available Over the past 30 years, genomic and bioinformatic analysis of human adenoviruses has been achieved using a variety of DNA sequencing methods; initially with the use of restriction enzymes and more currently with the use of the GS FLX pyrosequencing technology. Following the conception of DNA sequencing in the 1970s, analysis of adenoviruses has evolved from 100 base pair mRNA fragments to entire genomes. Comparative genomics of adenoviruses made its debut in 1984 when nucleotides and amino acids of coding sequences within the hexon genes of two human adenoviruses (HAdV, HAdV–C2 and HAdV–C5, were compared and analyzed. It was determined that there were three different zones (1-393, 394-1410, 1411-2910 within the hexon gene, of which HAdV–C2 and HAdV–C5 shared zones 1 and 3 with 95% and 89.5% nucleotide identity, respectively. In 1992, HAdV-C5 became the first adenovirus genome to be fully sequenced using the Sanger method. Over the next seven years, whole genome analysis and characterization was completed using bioinformatic tools such as blastn, tblastx, ClustalV and FASTA, in order to determine key proteins in species HAdV-A through HAdV-F. The bioinformatic revolution was initiated with the introduction of a novel species, HAdV-G, that was typed and named by the use of whole genome sequencing and phylogenetics as opposed to traditional serology. HAdV bioinformatics will continue to advance as the latest sequencing technology enables scientists to add to and expand the resource databases. As a result of these advancements, how novel HAdVs are typed has changed. Bioinformatic analysis has become the revolutionary tool that has significantly accelerated the in-depth study of HAdV microevolution through comparative genomics.

  13. In silico pattern-based analysis of the human cytomegalovirus genome.

    Science.gov (United States)

    Rigoutsos, Isidore; Novotny, Jiri; Huynh, Tien; Chin-Bow, Stephen T; Parida, Laxmi; Platt, Daniel; Coleman, David; Shenk, Thomas

    2003-04-01

    More than 200 open reading frames (ORFs) from the human cytomegalovirus genome have been reported as potentially coding for proteins. We have used two pattern-based in silico approaches to analyze this set of putative viral genes. With the help of an objective annotation method that is based on the Bio-Dictionary, a comprehensive collection of amino acid patterns that describes the currently known natural sequence space of proteins, we have reannotated all of the previously reported putative genes of the human cytomegalovirus. Also, with the help of MUSCA, a pattern-based multiple sequence alignment algorithm, we have reexamined the original human cytomegalovirus gene family definitions. Our analysis of the genome shows that many of the coded proteins comprise amino acid combinations that are unique to either the human cytomegalovirus or the larger group of herpesviruses. We have confirmed that a surprisingly large portion of the analyzed ORFs encode membrane proteins, and we have discovered a significant number of previously uncharacterized proteins that are predicted to be G-protein-coupled receptor homologues. The analysis also indicates that many of the encoded proteins undergo posttranslational modifications such as hydroxylation, phosphorylation, and glycosylation. ORFs encoding proteins with similar functional behavior appear in neighboring regions of the human cytomegalovirus genome. All of the results of the present study can be found and interactively explored online (http://cbcsrv.watson.ibm.com/virus/).

  14. The human genome project

    International Nuclear Information System (INIS)

    Worton, R.

    1996-01-01

    The Human Genome Project is a massive international research project, costing 3 to 5 billion dollars and expected to take 15 years, which will identify the all the genes in the human genome - i.e. the complete sequence of bases in human DNA. The prize will be the ability to identify genes causing or predisposing to disease, and in some cases the development of gene therapy, but this new knowledge will raise important ethical issues

  15. Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Block, S. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Cornwall, J. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Dally, W. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Dyson, F. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Fortson, N. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Joyce, G. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Kimble, H. J. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Lewis, N. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Max, C. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Prince, T. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Schwitters, R. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Weinberger, P. [The MITRE Corporation, McLean, VA (US). JASON Program Office; Woodin, W. H. [The MITRE Corporation, McLean, VA (US). JASON Program Office

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  16. Human genome I

    International Nuclear Information System (INIS)

    Anon.

    1989-01-01

    An international conference, Human Genome I, was held Oct. 2-4, 1989 in San Diego, Calif. Selected speakers discussed: Current Status of the Genome Project; Technique Innovations; Interesting regions; Applications; and Organization - Different Views of Current and Future Science and Procedures. Posters, consisting of 119 presentations, were displayed during the sessions. 119 were indexed for inclusion to the Energy Data Base

  17. Complete genome sequence analysis of novel human bocavirus reveals genetic recombination between human bocavirus 2 and human bocavirus 4.

    Science.gov (United States)

    Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat

    2013-07-01

    Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of VP1 gene, an unusual strain of HBoV (CMH-S011-11), was initially identified as HBoV4. The complete genome sequence of CMH-S011-11 was performed and analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of complete genome sequence revealed that the coding sequence starting from NS1, NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted to the initial genotyping result of the partial VP1 region in the previous study. In addition, comparison of NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, nucleotide sequence of VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that this CMH-S011-11 was grouped within HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes

    DEFF Research Database (Denmark)

    Parker, Brian John; Moltke, Ida; Roth, Adam

    2011-01-01

    a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein...

  19. Comparative analysis of genome maintenance genes in naked mole rat, mouse, and human

    NARCIS (Netherlands)

    S.L. Macrae (Sheila L.); Q. Zhang (Quanwei); C. Lemetre (Christophe); I. Seim (Inge); R.B. Calder (Robert B.); J.H.J. Hoeijmakers (Jan); Y. Suh (Yousin); V.N. Gladyshev (Vadim N.); A. Seluanov (Andrei); V. Gorbunova (Vera); J. Vijg (Jan); Z.D. Zhang (Zhengdong D.)

    2015-01-01

    textabstractGenome maintenance (GM) is an essential defense system against aging and cancer, as both are characterized by increased genome instability. Here, we compared the copy number variation and mutation rate of 518 GM-associated genes in the naked mole rat (NMR), mouse, and human genomes. GM

  20. Human social genomics.

    Directory of Open Access Journals (Sweden)

    Steven W Cole

    2014-08-01

    Full Text Available A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural "social signal transduction" pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving.

  1. Chromosome microdissection and cloning in human genome and genetic disease analysis

    International Nuclear Information System (INIS)

    Kao, Faten; Yu, Jingwei

    1991-01-01

    A procedure has been described for microdissection and microcloning of human chromosomal DNA sequences in which universal amplification of the dissected fragments by Mbo I linker adaptor and polymerase chain reaction is used. A very large library comprising 700,000 recombinant plasmid microclones from 30 dissected chromosomes of human chromosome 21 was constructed. Colony hybridization showed that 42% of the clones contained repetitive sequences and 58% contained single or low-copy sequences. The insert sizes generated by complete Mbo I cleavage ranged from 50 to 1,100 base pairs with a mean of 416 base pairs. Southern blot analysis of microclones from the library confirmed their human origin and chromosome 21 specificity. Some of these clones have also been regionally mapped to specific sites of chromosome 21 by using a regional mapping panel of cell hybrids. This chromosome microtechnology can generate large numbers of microclones with unique sequences from defined chromosomal regions and can be used for processes such as (i) isolating corresponding yeast artificial chromosome clones with large inserts, (ii) screening various cDNA libraries for isolating expressed sequences, and (iii) constructing region-specific libraries of the entire human genome. The studies described here demonstrate the power of this technology for high-resolution genome analysis and explicate their use in an efficient search for disease-associated genes localized to specific chromosomal regions

  2. Whole genome analysis of Klebsiella pneumoniae T2-1-1 from human oral cavity

    Directory of Open Access Journals (Sweden)

    Kok-Gan Chan

    2016-03-01

    Full Text Available Klebsiella pneumoniae T2-1-1 was isolated from the human tongue debris and subjected to whole genome sequencing on HiSeq platform and annotated on RAST. The nucleotide sequence of this genome was deposited into DDBJ/EMBL/GenBank under the accession JAQL00000000. Keywords: Human tongue surface, Oral cavity, Oral bacteria, Virulence

  3. Sensitive and reliable detection of genomic imbalances in human neuroblastomas using comparative genomic hybridisation analysis

    NARCIS (Netherlands)

    van Gele, M.; van Roy, N.; Jauch, A.; Laureys, G.; Benoit, Y.; Schelfhout, V.; de Potter, C. R.; Brock, P.; Uyttebroeck, A.; Sciot, R.; Schuuring, E.; Versteeg, R.; Speleman, F.

    1997-01-01

    Deletions of the short arm of chromosome 1, extra copies of chromosome 17q and MYCN amplification are the most frequently encountered genetic changes in neuroblastomas. Standard techniques for detection of one or more of these genetic changes are karyotyping, FISH analysis and LOH analysis by

  4. Human Genome Program

    Energy Technology Data Exchange (ETDEWEB)

    1993-01-01

    The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.

  5. Decoding the human genome

    CERN Multimedia

    CERN. Geneva. Audiovisual Unit; Antonerakis, S E

    2002-01-01

    Decoding the Human genome is a very up-to-date topic, raising several questions besides purely scientific, in view of the two competing teams (public and private), the ethics of using the results, and the fact that the project went apparently faster and easier than expected. The lecture series will address the following chapters: Scientific basis and challenges. Ethical and social aspects of genomics.

  6. Human Germline Genome Editing

    OpenAIRE

    Ormond, Kelly E.; Mortlock, Douglas P.; Scholes, Derek T.; Bombard, Yvonne; Brody, Lawrence C.; Faucett, W. Andrew; Garrison, Nanibaa’ A.; Hercher, Laura; Isasi, Rosario; Middleton, Anna; Musunuru, Kiran; Shriner, Daniel; Virani, Alice; Young, Caroline E.

    2017-01-01

    With CRISPR/Cas9 and other genome-editing technologies, successful somatic and germline genome editing are becoming feasible. To respond, an American Society of Human Genetics (ASHG) workgroup developed this position statement, which was approved by the ASHG Board in March 2017. The workgroup included representatives from the UK Association of Genetic Nurses and Counsellors, Canadian Association of Genetic Counsellors, International Genetic Epidemiology Society, and US National Society of Gen...

  7. The complete genome sequence and analysis of the human pathogen Campylobacter lari

    DEFF Research Database (Denmark)

    Miller, WG; Wang, G; Binnewies, Tim Terence

    2008-01-01

    Campylobacter lari is a member of the epsilon subdivision of the Proteobacteria and is part of the thermotolerant Campylobacter group, a clade that includes the human pathogen C. jejuni. Here we present the complete genome sequence of the human clinical isolate, C. lari RM2100. The genome of strain...... RM2100 is approximately 1.53 Mb and includes the 46 kb megaplasmid pCL2100. Also present within the strain RM2100 genome is a 36 kb putative prophage, termed CLIE1, which is similar to CJIE4, a putative prophage present within the C. jejuni RM1221 genome. Nearly all (90%) of the gene content...... in strain RM2100 is similar to genes present in the genomes of other characterized thermotolerant campylobacters. However, several genes involved in amino acid biosynthesis and energy metabolism, identified previously in other Campylobacter genomes, are absent from the C. lari RM2100 genome. Therefore, C...

  8. Phenotypic and Genomic Analysis of Hypervirulent Human-associated Bordetella bronchiseptica

    Directory of Open Access Journals (Sweden)

    Ahuja Umesh

    2012-08-01

    Full Text Available Abstract Background B. bronchiseptica infections are usually associated with wild or domesticated animals, but infrequently with humans. A recent phylogenetic analysis distinguished two distinct B. bronchiseptica subpopulations, designated complexes I and IV. Complex IV isolates appear to have a bias for infecting humans; however, little is known regarding their epidemiology, virulence properties, or comparative genomics. Results Here we report a characterization of the virulence of human-associated complex IV B. bronchiseptica strains. In in vitro cytotoxicity assays, complex IV strains showed increased cytotoxicity in comparison to a panel of complex I strains. Some complex IV isolates were remarkably cytotoxic, resulting in LDH release levels in A549 cells that were 10- to 20-fold greater than complex I strains. In vivo, a subset of complex IV strains was found to be hypervirulent, with an increased ability to cause lethal pulmonary infections in mice. Hypercytotoxicity in vitro and hypervirulence in vivo were both dependent on the activity of the bsc T3SS and the BteA effector. To clarify differences between lineages, representative complex IV isolates were sequenced and their genomes were compared to complex I isolates. Although our analysis showed there were no genomic sequences that can be considered unique to complex IV strains, there were several loci that were predominantly found in complex IV isolates. Conclusion Our observations reveal a T3SS-dependent hypervirulence phenotype in human-associated complex IV isolates, highlighting the need for further studies on the epidemiology and evolutionary dynamics of this B. bronchiseptica lineage.

  9. Genomic organization, transcript variants and comparative analysis of the human nucleoporin 155 (NUP155) gene

    DEFF Research Database (Denmark)

    Zhang, X.; Yang, J.; Yu, J.

    2002-01-01

    Nucleoporin 155 (Nup155) is a major component of the nuclear pore complex (NPC) involved in cellular nucleo-cytoplasmic transport. We have acquired the complete sequence and interpreted the genomic organization of the Nup155 orthologos from human (Homo sapiens) and pufferfish (Fugu rubripes), which...... complementary to RNAs of the Nup155 orthologs from Fugu and mouse. Comparative analysis of the Nup155 orthologs in many species, including H. sapiens, Mus musculus, Rattus norvegicus, F. rubripes, Arabidopsis thaliana, Drosophila melanogaster, and Saccharomyces cerevisiae, has revealed two paralogs in S...

  10. Annotating individual human genomes.

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A; Topol, Eric J; Schork, Nicholas J

    2011-10-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. Copyright © 2011 Elsevier Inc. All rights reserved.

  11. ANNOTATING INDIVIDUAL HUMAN GENOMES*

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A.; Topol, Eric J.; Schork, Nicholas J.

    2014-01-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely to amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. PMID:21839162

  12. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.

    Science.gov (United States)

    Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou

    2011-11-01

    Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.

  13. Genome-wide analysis of the human Alu Yb-lineage

    Directory of Open Access Journals (Sweden)

    Carter Anthony B

    2004-03-01

    Full Text Available Abstract The Alu Yb-lineage is a 'young' primarily human-specific group of short interspersed element (SINE subfamilies that have integrated throughout the human genome. In this study, we have computationally screened the draft sequence of the human genome for Alu Yb-lineage subfamily members present on autosomal chromosomes. A total of 1,733 Yb Alu subfamily members have integrated into human autosomes. The average ages of Yb-lineage subfamilies, Yb7, Yb8 and Yb9, are estimated as 4.81, 2.39 and 2.32 million years, respectively. In order to determine the contribution of the Alu Yb-lineage to human genomic diversity, 1,202 loci were analysed using polymerase chain reaction (PCR-based assays, which amplify the genomic regions containing individual Yb-lineage subfamily members. Approximately 20 per cent of the Yb-lineage Alu elements are polymorphic for insertion presence/absence in the human genome. Fewer than 0.5 per cent of the Yb loci also demonstrate insertions at orthologous positions in non-human primate genomes. Genomic sequencing of these unusual loci demonstrates that each of the orthologous loci from non-human primate genomes contains older Y, Sg and Sx Alu family members that have been altered, through various mechanisms, into Yb8 sequences. These data suggest that Alu Yb-lineage subfamily members are largely restricted to the human genome. The high copy number, level of insertion polymorphism and estimated age indicate that members of the Alu Yb elements will be useful in a wide range of genetic analyses.

  14. Human Germline Genome Editing.

    Science.gov (United States)

    Ormond, Kelly E; Mortlock, Douglas P; Scholes, Derek T; Bombard, Yvonne; Brody, Lawrence C; Faucett, W Andrew; Garrison, Nanibaa' A; Hercher, Laura; Isasi, Rosario; Middleton, Anna; Musunuru, Kiran; Shriner, Daniel; Virani, Alice; Young, Caroline E

    2017-08-03

    With CRISPR/Cas9 and other genome-editing technologies, successful somatic and germline genome editing are becoming feasible. To respond, an American Society of Human Genetics (ASHG) workgroup developed this position statement, which was approved by the ASHG Board in March 2017. The workgroup included representatives from the UK Association of Genetic Nurses and Counsellors, Canadian Association of Genetic Counsellors, International Genetic Epidemiology Society, and US National Society of Genetic Counselors. These groups, as well as the American Society for Reproductive Medicine, Asia Pacific Society of Human Genetics, British Society for Genetic Medicine, Human Genetics Society of Australasia, Professional Society of Genetic Counselors in Asia, and Southern African Society for Human Genetics, endorsed the final statement. The statement includes the following positions. (1) At this time, given the nature and number of unanswered scientific, ethical, and policy questions, it is inappropriate to perform germline gene editing that culminates in human pregnancy. (2) Currently, there is no reason to prohibit in vitro germline genome editing on human embryos and gametes, with appropriate oversight and consent from donors, to facilitate research on the possible future clinical applications of gene editing. There should be no prohibition on making public funds available to support this research. (3) Future clinical application of human germline genome editing should not proceed unless, at a minimum, there is (a) a compelling medical rationale, (b) an evidence base that supports its clinical use, (c) an ethical justification, and (d) a transparent public process to solicit and incorporate stakeholder input. Copyright © 2017 American Society of Human Genetics. All rights reserved.

  15. Genome-wide RNA-seq analysis of human and mouse platelet transcriptomes

    Science.gov (United States)

    Rowley, Jesse W.; Oler, Andrew J.; Tolley, Neal D.; Hunter, Benjamin N.; Low, Elizabeth N.; Nix, David A.; Yost, Christian C.; Zimmerman, Guy A.

    2011-01-01

    Inbred mice are a useful tool for studying the in vivo functions of platelets. Nonetheless, the mRNA signature of mouse platelets is not known. Here, we use paired-end next-generation RNA sequencing (RNA-seq) to characterize the polyadenylated transcriptomes of human and mouse platelets. We report that RNA-seq provides unprecedented resolution of mRNAs that are expressed across the entire human and mouse genomes. Transcript expression and abundance are often conserved between the 2 species. Several mRNAs, however, are differentially expressed in human and mouse platelets. Moreover, previously described functional disparities between mouse and human platelets are reflected in differences at the transcript level, including protease activated receptor-1, protease activated receptor-3, platelet activating factor receptor, and factor V. This suggests that RNA-seq is a useful tool for predicting differences in platelet function between mice and humans. Our next-generation sequencing analysis provides new insights into the human and murine platelet transcriptomes. The sequencing dataset will be useful in the design of mouse models of hemostasis and a catalyst for discovery of new functions of platelets. Access to the dataset is found in the “Introduction.” PMID:21596849

  16. Analysis of The Cancer Genome Atlas sequencing data reveals novel properties of the human papillomavirus 16 genome in head and neck squamous cell carcinoma.

    Science.gov (United States)

    Nulton, Tara J; Olex, Amy L; Dozmorov, Mikhail; Morgan, Iain M; Windle, Brad

    2017-03-14

    Human papillomavirus (HPV) DNA is detected in up to 80% of oropharyngeal carcinomas (OPC) and this HPV positive disease has reached epidemic proportions. To increase our understanding of the disease, we investigated the status of the HPV16 genome in HPV-positive head and neck cancers (HNC). Raw RNA-Seq and Whole Genome Sequence data from The Cancer Genome Atlas HNC samples were analyzed to gain a full understanding of the HPV genome status for these tumors. Several remarkable and novel observations were made following this analysis. Firstly, there are three main HPV genome states in these tumors that are split relatively evenly: An episomal only state, an integrated state, and a state in which the viral genome exists as a hybrid episome with human DNA. Secondly, none of the tumors expressed high levels of E6; E6*I is the dominant variant expressed in all tumors. The most striking conclusion from this study is that around three quarters of HPV16 positive HNC contain episomal versions of the viral genome that are likely replicating in an E1-E2 dependent manner. The clinical and therapeutic implications of these observations are discussed.

  17. The genomic analysis of lactic acidosis and acidosis response in human cancers.

    Directory of Open Access Journals (Sweden)

    Julia Ling-Yu Chen

    2008-12-01

    Full Text Available The tumor microenvironment has a significant impact on tumor development. Two important determinants in this environment are hypoxia and lactic acidosis. Although lactic acidosis has long been recognized as an important factor in cancer, relatively little is known about how cells respond to lactic acidosis and how that response relates to cancer phenotypes. We develop genome-scale gene expression studies to dissect transcriptional responses of primary human mammary epithelial cells to lactic acidosis and hypoxia in vitro and to explore how they are linked to clinical tumor phenotypes in vivo. The resulting experimental signatures of responses to lactic acidosis and hypoxia are evaluated in a heterogeneous set of breast cancer datasets. A strong lactic acidosis response signature identifies a subgroup of low-risk breast cancer patients having distinct metabolic profiles suggestive of a preference for aerobic respiration. The association of lactic acidosis response with good survival outcomes may relate to the role of lactic acidosis in directing energy generation toward aerobic respiration and utilization of other energy sources via inhibition of glycolysis. This "inhibition of glycolysis" phenotype in tumors is likely caused by the repression of glycolysis gene expression and Akt inhibition. Our study presents a genomic evaluation of the prognostic information of a lactic acidosis response independent of the hypoxic response. Our results identify causal roles of lactic acidosis in metabolic reprogramming, and the direct functional consequence of lactic acidosis pathway activity on cellular responses and tumor development. The study also demonstrates the utility of genomic analysis that maps expression-based findings from in vitro experiments to human samples to assess links to in vivo clinical phenotypes.

  18. Investigation of inversion polymorphisms in the human genome using principal components analysis.

    Science.gov (United States)

    Ma, Jianzhong; Amos, Christopher I

    2012-01-01

    Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct "populations" of inversion homozygotes of different orientations and their 1:1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases.

  19. PCR-SSCP analysis and its application to human genome study

    International Nuclear Information System (INIS)

    Hayashi, Kenshi

    1994-01-01

    A large amount of DNA sequence data are now available owing to the development of the human genome project. These data are deposited in public databases, e.g. DDBJ, GebBank and EMBL, and freely accessible to scientific community. One of the major advantages of having these databases is that we can now detect sequence differences between individuals in a large scale. Using the sequence informations, we can design primer sequences, amplify various target regions of the sample DNA's by PCR and detect abnormal sequence changes from reference, or normal sequences. Detecting sequence changes, or mutations, are essential part of searching genes responsible for hereditary diseases and also DNA diagnosis of hereditary diseases or cancer. We can also measure mutation frequency of the human genome by knowing its variability. Our group has developed and been improving a method, PCR-SSCP analysis, as an extremely rapid and easy technique for detection of sequence differences between sample DNA's. Knowing the sensitivity (percentage detection of mutations) of this technique is important in evaluating usefulness of it for the purposes stated above. Considerable number of experiences on PCR-SSCP analysis of fragments shorter than 300 b.p. are accumulating. We summarize here the sensitivity of PCR-SSCP analysis for various sequence context of this size range examined in various electrophoretic conditions conducted in many laboratories. Data on mutation detection by this technique for longer fragments are limited. We also present oue effort for defining electrophoretic conditions of PCR-SSCP analysis when examining longer (350 to 600 b.p.) fragments. (author)

  20. Genome-wide comparative analysis reveals human-mouse regulatory landscape and evolution.

    Science.gov (United States)

    Denas, Olgert; Sandstrom, Richard; Cheng, Yong; Beal, Kathryn; Herrero, Javier; Hardison, Ross C; Taylor, James

    2015-02-14

    Because species-specific gene expression is driven by species-specific regulation, understanding the relationship between sequence and function of the regulatory regions in different species will help elucidate how differences among species arise. Despite active experimental and computational research, relationships among sequence, conservation, and function are still poorly understood. We compared transcription factor occupied segments (TFos) for 116 human and 35 mouse TFs in 546 human and 125 mouse cell types and tissues from the Human and the Mouse ENCODE projects. We based the map between human and mouse TFos on a one-to-one nucleotide cross-species mapper, bnMapper, that utilizes whole genome alignments (WGA). Our analysis shows that TFos are under evolutionary constraint, but a substantial portion (25.1% of mouse and 25.85% of human on average) of the TFos does not have a homologous sequence on the other species; this portion varies among cell types and TFs. Furthermore, 47.67% and 57.01% of the homologous TFos sequence shows binding activity on the other species for human and mouse respectively. However, 79.87% and 69.22% is repurposed such that it binds the same TF in different cells or different TFs in the same cells. Remarkably, within the set of repurposed TFos, the corresponding genome regions in the other species are preferred locations of novel TFos. These events suggest exaptation of some functional regulatory sequences into new function. Despite TFos repurposing, we did not find substantial changes in their predicted target genes, suggesting that CRMs buffer evolutionary events allowing little or no change in the TFos - target gene associations. Thus, the small portion of TFos with strictly conserved occupancy underestimates the degree of conservation of regulatory interactions. We mapped regulatory sequences from an extensive number of TFs and cell types between human and mouse using WGA. A comparative analysis of this correspondence unveiled the

  1. Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections.

    Science.gov (United States)

    van der Weele, Pascal; Meijer, Chris J L M; King, Audrey J

    2017-10-01

    Human papillomavirus (HPV) is a strongly conserved DNA virus, high-risk types of which can cause cervical cancer in persistent infections. The most common type found in HPV-attributable cancer is HPV16, which can be subdivided into four lineages (A to D) with different carcinogenic properties. Studies have shown HPV16 sequence diversity in different geographical areas, but only limited information is available regarding HPV16 diversity within a population, especially at the whole-genome level. We analyzed HPV16 major variant diversity and conservation in persistent infections and performed a single nucleotide polymorphism (SNP) comparison between persistent and clearing infections. Materials were obtained in the Netherlands from a cohort study with longitudinal follow-up for up to 3 years. Our analysis shows a remarkably large variant diversity in the population. Whole-genome sequences were obtained for 57 persistent and 59 clearing HPV16 infections, resulting in 109 unique variants. Interestingly, persistent infections were completely conserved through time. One reinfection event was identified where the initial and follow-up samples clustered differently. Non-A1/A2 variants seemed to clear preferentially ( P = 0.02). Our analysis shows that population-wide HPV16 sequence diversity is very large. In persistent infections, the HPV16 sequence was fully conserved. Sequencing can identify HPV16 reinfections, although occurrence is rare. SNP comparison identified no strongly acting effect of the viral genome affecting HPV16 infection clearance or persistence in up to 3 years of follow-up. These findings suggest the progression of an early HPV16 infection could be host related. IMPORTANCE Human papillomavirus 16 (HPV16) is the predominant type found in cervical cancer. Progression of initial infection to cervical cancer has been linked to sequence properties; however, knowledge of variants circulating in European populations, especially with longitudinal follow-up, is

  2. Genomic localization, sequence analysis, and transcription of the putative human cytomegalovirus DNA polymerase gene

    International Nuclear Information System (INIS)

    Heilbronn, T.; Jahn, G.; Buerkle, A.; Freese, U.K.; Fleckenstein, B.; Zur Hausen, H.

    1987-01-01

    The human cytomegalovirus (HCMV)-induced DNA polymerase has been well characterized biochemically and functionally, but its genomic location has not yet been assigned. To identify the coding sequence, cross-hybridization with the herpes simplex virus type 1 (HSV-1) polymerase gene was used, as suggested by the close similarity of the herpes group virus-induced DNA polymerases to the HCMV DNA polymerase. A cosmid and plasmid library of the entire HCMV genome was screened with the BamHI Q fragment of HSF-1 at different stringency conditions. One PstI-HincII restriction fragment of 850 base pairs mapping within the EcoRI M fragment of HCMV cross-hybridized at T/sub m/ - 25/degrees/C. Sequence analysis revealed one open reading frame spanning the entire sequence. The amino acid sequence showed a highly conserved domain of 133 amino acids shared with the HSV and putative Esptein-Barr virus polymerase sequences. This domain maps within the C-terminal part of the HSV polymerase gene, which has been suggested to contain part of the catalytic center of the enzyme. Transcription analysis revealed one 5.4-kilobase early transcript in the sense orientation with respect to the open reading frame identified. This transcript appears to code for the 140-kilodalton HCMV polymerase protein

  3. Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project.

    Science.gov (United States)

    Konkel, Miriam K; Walker, Jerilyn A; Hotard, Ashley B; Ranck, Megan C; Fontenot, Catherine C; Storer, Jessica; Stewart, Chip; Marth, Gabor T; Batzer, Mark A

    2015-08-29

    The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  4. National Human Genome Research Institute

    Science.gov (United States)

    ... Care Genomic Medicine Working Group New Horizons and Research Patient Management Policy and Ethics Issues Quick Links for Patient Care Education All About the Human Genome Project Fact Sheets Genetic Education Resources for ...

  5. Comparative Genomics Analysis of Streptococcus Isolates from the Human Small Intestine Reveals their Adaptation to a Highly Dynamic Ecosystem

    Science.gov (United States)

    Van den Bogert, Bartholomeus; Boekhorst, Jos; Herrmann, Ruth; Smid, Eddy J.; Zoetendal, Erwin G.; Kleerebezem, Michiel

    2013-01-01

    The human small-intestinal microbiota is characterised by relatively large and dynamic Streptococcus populations. In this study, genome sequences of small-intestinal streptococci from S. mitis, S. bovis, and S. salivarius species-groups were determined and compared with those from 58 Streptococcus strains in public databases. The Streptococcus pangenome consists of 12,403 orthologous groups of which 574 are shared among all sequenced streptococci and are defined as the Streptococcus core genome. Genome mining of the small-intestinal streptococci focused on functions playing an important role in the interaction of these streptococci in the small-intestinal ecosystem, including natural competence and nutrient-transport and metabolism. Analysis of the small-intestinal Streptococcus genomes predicts a high capacity to synthesize amino acids and various vitamins as well as substantial divergence in their carbohydrate transport and metabolic capacities, which is in agreement with observed physiological differences between these Streptococcus strains. Gene-specific PCR-strategies enabled evaluation of conservation of Streptococcus populations in intestinal samples from different human individuals, revealing that the S. salivarius strains were frequently detected in the small-intestine microbiota, supporting the representative value of the genomes provided in this study. Finally, the Streptococcus genomes allow prediction of the effect of dietary substances on Streptococcus population dynamics in the human small-intestine. PMID:24386196

  6. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

    DEFF Research Database (Denmark)

    Birney, Ewan; Stamatoyannopoulos, John A; Dutta, Anindya

    2007-01-01

    We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses...

  7. Large scale analysis of small repeats via mining of the human genome

    NARCIS (Netherlands)

    van den Berg, I.; Bosnacki, D.; Hilbers, P.A.J.

    2009-01-01

    Small repetitive sequences, called tandem repeats, are abundant throughout the human genome, both in coding and in non-coding regions. Their role is still mostly unknown, but at least 20 of those repetitive sequences have been related to neurodegenerative disorders. The mutational process that is

  8. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

    Science.gov (United States)

    Birney, Ewan; Stamatoyannopoulos, John A; Dutta, Anindya; Guigó, Roderic; Gingeras, Thomas R; Margulies, Elliott H; Weng, Zhiping; Snyder, Michael; Dermitzakis, Emmanouil T; Thurman, Robert E; Kuehn, Michael S; Taylor, Christopher M; Neph, Shane; Koch, Christoph M; Asthana, Saurabh; Malhotra, Ankit; Adzhubei, Ivan; Greenbaum, Jason A; Andrews, Robert M; Flicek, Paul; Boyle, Patrick J; Cao, Hua; Carter, Nigel P; Clelland, Gayle K; Davis, Sean; Day, Nathan; Dhami, Pawandeep; Dillon, Shane C; Dorschner, Michael O; Fiegler, Heike; Giresi, Paul G; Goldy, Jeff; Hawrylycz, Michael; Haydock, Andrew; Humbert, Richard; James, Keith D; Johnson, Brett E; Johnson, Ericka M; Frum, Tristan T; Rosenzweig, Elizabeth R; Karnani, Neerja; Lee, Kirsten; Lefebvre, Gregory C; Navas, Patrick A; Neri, Fidencio; Parker, Stephen C J; Sabo, Peter J; Sandstrom, Richard; Shafer, Anthony; Vetrie, David; Weaver, Molly; Wilcox, Sarah; Yu, Man; Collins, Francis S; Dekker, Job; Lieb, Jason D; Tullius, Thomas D; Crawford, Gregory E; Sunyaev, Shamil; Noble, William S; Dunham, Ian; Denoeud, France; Reymond, Alexandre; Kapranov, Philipp; Rozowsky, Joel; Zheng, Deyou; Castelo, Robert; Frankish, Adam; Harrow, Jennifer; Ghosh, Srinka; Sandelin, Albin; Hofacker, Ivo L; Baertsch, Robert; Keefe, Damian; Dike, Sujit; Cheng, Jill; Hirsch, Heather A; Sekinger, Edward A; Lagarde, Julien; Abril, Josep F; Shahab, Atif; Flamm, Christoph; Fried, Claudia; Hackermüller, Jörg; Hertel, Jana; Lindemeyer, Manja; Missal, Kristin; Tanzer, Andrea; Washietl, Stefan; Korbel, Jan; Emanuelsson, Olof; Pedersen, Jakob S; Holroyd, Nancy; Taylor, Ruth; Swarbreck, David; Matthews, Nicholas; Dickson, Mark C; Thomas, Daryl J; Weirauch, Matthew T; Gilbert, James; Drenkow, Jorg; Bell, Ian; Zhao, XiaoDong; Srinivasan, K G; Sung, Wing-Kin; Ooi, Hong Sain; Chiu, Kuo Ping; Foissac, Sylvain; Alioto, Tyler; Brent, Michael; Pachter, Lior; Tress, Michael L; Valencia, Alfonso; Choo, Siew Woh; Choo, Chiou Yu; Ucla, Catherine; Manzano, Caroline; Wyss, Carine; Cheung, Evelyn; Clark, Taane G; Brown, James B; Ganesh, Madhavan; Patel, Sandeep; Tammana, Hari; Chrast, Jacqueline; Henrichsen, Charlotte N; Kai, Chikatoshi; Kawai, Jun; Nagalakshmi, Ugrappa; Wu, Jiaqian; Lian, Zheng; Lian, Jin; Newburger, Peter; Zhang, Xueqing; Bickel, Peter; Mattick, John S; Carninci, Piero; Hayashizaki, Yoshihide; Weissman, Sherman; Hubbard, Tim; Myers, Richard M; Rogers, Jane; Stadler, Peter F; Lowe, Todd M; Wei, Chia-Lin; Ruan, Yijun; Struhl, Kevin; Gerstein, Mark; Antonarakis, Stylianos E; Fu, Yutao; Green, Eric D; Karaöz, Ulaş; Siepel, Adam; Taylor, James; Liefer, Laura A; Wetterstrand, Kris A; Good, Peter J; Feingold, Elise A; Guyer, Mark S; Cooper, Gregory M; Asimenos, George; Dewey, Colin N; Hou, Minmei; Nikolaev, Sergey; Montoya-Burgos, Juan I; Löytynoja, Ari; Whelan, Simon; Pardi, Fabio; Massingham, Tim; Huang, Haiyan; Zhang, Nancy R; Holmes, Ian; Mullikin, James C; Ureta-Vidal, Abel; Paten, Benedict; Seringhaus, Michael; Church, Deanna; Rosenbloom, Kate; Kent, W James; Stone, Eric A; Batzoglou, Serafim; Goldman, Nick; Hardison, Ross C; Haussler, David; Miller, Webb; Sidow, Arend; Trinklein, Nathan D; Zhang, Zhengdong D; Barrera, Leah; Stuart, Rhona; King, David C; Ameur, Adam; Enroth, Stefan; Bieda, Mark C; Kim, Jonghwan; Bhinge, Akshay A; Jiang, Nan; Liu, Jun; Yao, Fei; Vega, Vinsensius B; Lee, Charlie W H; Ng, Patrick; Shahab, Atif; Yang, Annie; Moqtaderi, Zarmik; Zhu, Zhou; Xu, Xiaoqin; Squazzo, Sharon; Oberley, Matthew J; Inman, David; Singer, Michael A; Richmond, Todd A; Munn, Kyle J; Rada-Iglesias, Alvaro; Wallerman, Ola; Komorowski, Jan; Fowler, Joanna C; Couttet, Phillippe; Bruce, Alexander W; Dovey, Oliver M; Ellis, Peter D; Langford, Cordelia F; Nix, David A; Euskirchen, Ghia; Hartman, Stephen; Urban, Alexander E; Kraus, Peter; Van Calcar, Sara; Heintzman, Nate; Kim, Tae Hoon; Wang, Kun; Qu, Chunxu; Hon, Gary; Luna, Rosa; Glass, Christopher K; Rosenfeld, M Geoff; Aldred, Shelley Force; Cooper, Sara J; Halees, Anason; Lin, Jane M; Shulha, Hennady P; Zhang, Xiaoling; Xu, Mousheng; Haidar, Jaafar N S; Yu, Yong; Ruan, Yijun; Iyer, Vishwanath R; Green, Roland D; Wadelius, Claes; Farnham, Peggy J; Ren, Bing; Harte, Rachel A; Hinrichs, Angie S; Trumbower, Heather; Clawson, Hiram; Hillman-Jackson, Jennifer; Zweig, Ann S; Smith, Kayla; Thakkapallayil, Archana; Barber, Galt; Kuhn, Robert M; Karolchik, Donna; Armengol, Lluis; Bird, Christine P; de Bakker, Paul I W; Kern, Andrew D; Lopez-Bigas, Nuria; Martin, Joel D; Stranger, Barbara E; Woodroffe, Abigail; Davydov, Eugene; Dimas, Antigone; Eyras, Eduardo; Hallgrímsdóttir, Ingileif B; Huppert, Julian; Zody, Michael C; Abecasis, Gonçalo R; Estivill, Xavier; Bouffard, Gerard G; Guan, Xiaobin; Hansen, Nancy F; Idol, Jacquelyn R; Maduro, Valerie V B; Maskeri, Baishali; McDowell, Jennifer C; Park, Morgan; Thomas, Pamela J; Young, Alice C; Blakesley, Robert W; Muzny, Donna M; Sodergren, Erica; Wheeler, David A; Worley, Kim C; Jiang, Huaiyang; Weinstock, George M; Gibbs, Richard A; Graves, Tina; Fulton, Robert; Mardis, Elaine R; Wilson, Richard K; Clamp, Michele; Cuff, James; Gnerre, Sante; Jaffe, David B; Chang, Jean L; Lindblad-Toh, Kerstin; Lander, Eric S; Koriabine, Maxim; Nefedov, Mikhail; Osoegawa, Kazutoyo; Yoshinaga, Yuko; Zhu, Baoli; de Jong, Pieter J

    2007-06-14

    We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

  9. Whole genome analysis of selected human and animal rotaviruses identified in Uganda from 2012 to 2014 reveals complex genome reassortment events between human, bovine, caprine and porcine strains.

    Science.gov (United States)

    Bwogi, Josephine; Jere, Khuzwayo C; Karamagi, Charles; Byarugaba, Denis K; Namuwulya, Prossy; Baliraine, Frederick N; Desselberger, Ulrich; Iturriza-Gomara, Miren

    2017-01-01

    Rotaviruses of species A (RVA) are a common cause of diarrhoea in children and the young of various other mammals and birds worldwide. To investigate possible interspecies transmission of RVAs, whole genomes of 18 human and 6 domestic animal RVA strains identified in Uganda between 2012 and 2014 were sequenced using the Illumina HiSeq platform. The backbone of the human RVA strains had either a Wa- or a DS-1-like genetic constellation. One human strain was a Wa-like mono-reassortant containing a DS-1-like VP2 gene of possible animal origin. All eleven genes of one bovine RVA strain were closely related to those of human RVAs. One caprine strain had a mixed genotype backbone, suggesting that it emerged from multiple reassortment events involving different host species. The porcine RVA strains had mixed genotype backbones with possible multiple reassortant events with strains of human and bovine origin.Overall, whole genome characterisation of rotaviruses found in domestic animals in Uganda strongly suggested the presence of human-to animal RVA transmission, with concomitant circulation of multi-reassortant strains potentially derived from complex interspecies transmission events. However, whole genome data from the human RVA strains causing moderate and severe diarrhoea in under-fives in Uganda indicated that they were primarily transmitted from person-to-person.

  10. Human Genome Education Program

    Energy Technology Data Exchange (ETDEWEB)

    Richard Myers; Lane Conn

    2000-05-01

    The funds from the DOE Human Genome Program, for the project period 2/1/96 through 1/31/98, have provided major support for the curriculum development and field testing efforts for two high school level instructional units: Unit 1, ''Exploring Genetic Conditions: Genes, Culture and Choices''; and Unit 2, ''DNA Snapshots: Peaking at Your DNA''. In the original proposal, they requested DOE support for the partial salary and benefits of a Field Test Coordinator position to: (1) complete the field testing and revision of two high school curriculum units, and (2) initiate the education of teachers using these units. During the project period of this two-year DOE grant, a part-time Field-Test Coordinator was hired (Ms. Geraldine Horsma) and significant progress has been made in both of the original proposal objectives. Field testing for Unit 1 has occurred in over 12 schools (local and non-local sites with diverse student populations). Field testing for Unit 2 has occurred in over 15 schools (local and non-local sites) and will continue in 12-15 schools during the 96-97 school year. For both curricula, field-test sites and site teachers were selected for their interest in genetics education and in hands-on science education. Many of the site teachers had no previous experience with HGEP or the unit under development. Both of these first-year biology curriculum units, which contain genetics, biotechnology, societal, ethical and cultural issues related to HGP, are being implemented in many local and non-local schools (SF Bay Area, Southern California, Nebraska, Hawaii, and Texas) and in programs for teachers. These units will reach over 10,000 students in the SF Bay Area and continues to receive support from local corporate and private philanthropic organizations. Although HGEP unit development is nearing completion for both units, data is still being gathered and analyzed on unit effectiveness and student learning. The final field

  11. Genome-wide analysis identifies 12 loci influencing human reproductive behavior

    Science.gov (United States)

    Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; van der Laan, Sander W.; Perry, John R. B.; Kong, Augustine; Ahluwalia, Tarunveer; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S. Fleur W.; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; van der Most, Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Panagiota, Kalafati Ioanna; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathy; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tonjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Daniela, Toniolo; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M.; de Geus, Eco JC.; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Felicita, Sala Cinzia; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J.F.; de Haan, Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hypponen, Elina; Jacobsson, Bo; Jaddoe, Vincent W. V.; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L.R.; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Michela, Traglia; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda WJH; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Thurik, A. Roy; Timpson, Nicholas J.; Tönjes, Anke; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C.

    2017-01-01

    The genetic architecture of human reproductive behavior – age at first birth (AFB) and number of children ever born (NEB) – has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified and the underlying mechanisms of AFB and NEB are poorly understood. We report the largest genome-wide association study to date of both sexes including 251,151 individuals for AFB and 343,072 for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study, and four additional loci in a gene-based effort. These loci harbor genes that are likely to play a role – either directly or by affecting non-local gene expression – in human reproduction and infertility, thereby increasing our understanding of these complex traits. PMID:27798627

  12. Human Genome Sequencing in Health and Disease

    Science.gov (United States)

    Gonzaga-Jauregui, Claudia; Lupski, James R.; Gibbs, Richard A.

    2013-01-01

    Following the “finished,” euchromatic, haploid human reference genome sequence, the rapid development of novel, faster, and cheaper sequencing technologies is making possible the era of personalized human genomics. Personal diploid human genome sequences have been generated, and each has contributed to our better understanding of variation in the human genome. We have consequently begun to appreciate the vastness of individual genetic variation from single nucleotide to structural variants. Translation of genome-scale variation into medically useful information is, however, in its infancy. This review summarizes the initial steps undertaken in clinical implementation of personal genome information, and describes the application of whole-genome and exome sequencing to identify the cause of genetic diseases and to suggest adjuvant therapies. Better analysis tools and a deeper understanding of the biology of our genome are necessary in order to decipher, interpret, and optimize clinical utility of what the variation in the human genome can teach us. Personal genome sequencing may eventually become an instrument of common medical practice, providing information that assists in the formulation of a differential diagnosis. We outline herein some of the remaining challenges. PMID:22248320

  13. Analysis of human HPRT- deletion mutants by the microarray-CGH (comparative genomic hybridization)

    International Nuclear Information System (INIS)

    Kodaira, M.; Sasaki, K.; Tagawa, H.; Omine, H.; Kushiro, J.; Takahashi, N.; Katayama, H.

    2003-01-01

    We are trying to evaluate genetic effects of radiation on human using mutation frequency as an indicator. For the efficient detection of mutations, it is important to understand the mechanism and the characteristics of radiation-induced mutations. We have started the analysis of hypoxanthine-guanine phosphoribosyl transferase (HPRT) mutants induced by X-ray in order to clarify the deletion size and the mutation-distribution. We analyzed 39 human X-ray induced HPRT-deletion mutants by using the microarray-CGH. The array for this analysis contains 57 BAC clones covering as much as possible of the 4Mb of the 5' side and 10Mb of the 3' side of the HPRT gene based on the NCBI genome database. DNA from parent strain and each HPRT-mutant strain are labeled with Cy5 and Cy3 respectively, and were mixed and hybridized on the array. Fluorescent intensity ratio of the obtained spots was analyzed using software we developed to identify clones corresponding to the deletion region. The deletion in these strains ranged up to 3.5 Mb on the 5' side and 6 Mb on the 3' side of the HPRT gene. Deletions in 13 strains ended around BAC clones located at about 3 Mb on the 5' side. On the 3' side, deletions extended up to the specific clones located at 1.5 Mb in 11 strains. The mutations seem to be complex on the 3' end of deletion; some accompanied duplications with deletions and others could not be explained by one mutation event. We need to confirm these results, taking into account the experimental reproducibility and the accuracy of the published genetic map. The results of the research using the microarray-CGH help us to search the regions where deletions are easily induced and to identify the factors affecting the range of deletions

  14. Safety analysis of a Russian phage cocktail: From MetaGenomic analysis to oral application in healthy human subjects

    Energy Technology Data Exchange (ETDEWEB)

    McCallin, Shawna, E-mail: semccallin@yahoo.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Alam Sarker, Shafiqul, E-mail: sasarker@icddrb.org [International Centre for Diarrhoeal Diseases Research, Bangladesh (icddr,b), 68 Shaheed Tajuddin Ahmed Sharani, Mohakhali, Dhaka 1212 (Bangladesh); Barretto, Caroline, E-mail: Caroline.Barretto@rdls.nestle.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Sultana, Shamima, E-mail: shamima@icddrb.org [International Centre for Diarrhoeal Diseases Research, Bangladesh (icddr,b), 68 Shaheed Tajuddin Ahmed Sharani, Mohakhali, Dhaka 1212 (Bangladesh); Berger, Bernard, E-mail: bernard.berger@rdls.nestle.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Huq, Sayeda, E-mail: sayeeda@mail.icddrb.org [International Centre for Diarrhoeal Diseases Research, Bangladesh (icddr,b), 68 Shaheed Tajuddin Ahmed Sharani, Mohakhali, Dhaka 1212 (Bangladesh); Krause, Lutz, E-mail: ltz.krause@gmail.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Bibiloni, Rodrigo, E-mail: Rodrigo.Bibiloni@agresearch.co.nz [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Schmitt, Bertrand, E-mail: bertrand.schmitt@rdls.nestle.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Reuteler, Gloria, E-mail: gloria.reuteler@rdls.nestle.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland); Brüssow, Harald, E-mail: harald.bruessow@rdls.nestle.com [Nestlé Research Centre, Nestec Ltd, Vers-chez-les-Blanc, CH-1000 Lausanne 26 (Switzerland)

    2013-09-01

    Phage therapy has a long tradition in Eastern Europe, where preparations are comprised of complex phage cocktails whose compositions have not been described. We investigated the composition of a phage cocktail from the Russian pharmaceutical company Microgen targeting Escherichia coli/Proteus infections. Electron microscopy identified six phage types, with numerically T7-like phages dominating over T4-like phages. A metagenomic approach using taxonomical classification, reference mapping and de novo assembly identified 18 distinct phage types, including 7 genera of Podoviridae, 2 established and 2 proposed genera of Myoviridae, and 2 genera of Siphoviridae. De novo assembly yielded 7 contigs greater than 30 kb, including a 147-kb Myovirus genome and a 42-kb genome of a potentially new phage. Bioinformatic analysis did not reveal undesired genes and a small human volunteer trial did not associate adverse effects with oral phage exposure. - Highlights: • We analyzed the composition of a commercial Russian phage cocktail. • The cocktail consists of at least 10 different phage genera. • No undesired genes were detected. • No adverse effects were seen upon oral application in a small human clinical trial.

  15. Safety analysis of a Russian phage cocktail: From MetaGenomic analysis to oral application in healthy human subjects

    International Nuclear Information System (INIS)

    McCallin, Shawna; Alam Sarker, Shafiqul; Barretto, Caroline; Sultana, Shamima; Berger, Bernard; Huq, Sayeda; Krause, Lutz; Bibiloni, Rodrigo; Schmitt, Bertrand; Reuteler, Gloria; Brüssow, Harald

    2013-01-01

    Phage therapy has a long tradition in Eastern Europe, where preparations are comprised of complex phage cocktails whose compositions have not been described. We investigated the composition of a phage cocktail from the Russian pharmaceutical company Microgen targeting Escherichia coli/Proteus infections. Electron microscopy identified six phage types, with numerically T7-like phages dominating over T4-like phages. A metagenomic approach using taxonomical classification, reference mapping and de novo assembly identified 18 distinct phage types, including 7 genera of Podoviridae, 2 established and 2 proposed genera of Myoviridae, and 2 genera of Siphoviridae. De novo assembly yielded 7 contigs greater than 30 kb, including a 147-kb Myovirus genome and a 42-kb genome of a potentially new phage. Bioinformatic analysis did not reveal undesired genes and a small human volunteer trial did not associate adverse effects with oral phage exposure. - Highlights: • We analyzed the composition of a commercial Russian phage cocktail. • The cocktail consists of at least 10 different phage genera. • No undesired genes were detected. • No adverse effects were seen upon oral application in a small human clinical trial

  16. Quantitation of Murine Stroma and Selective Purification of the Human Tumor Component of Patient-Derived Xenografts for Genomic Analysis.

    Directory of Open Access Journals (Sweden)

    Valentina E Schneeberger

    Full Text Available Patient-derived xenograft (PDX mouse models are increasingly used for preclinical therapeutic testing of human cancer. A limitation in molecular and genetic characterization of PDX tumors is the presence of integral murine stroma. This is particularly problematic for genomic sequencing of PDX models. Rapid and dependable approaches for quantitating stromal content and purifying the malignant human component of these tumors are needed. We used a recently developed technique exploiting species-specific polymerase chain reaction (PCR amplicon length (ssPAL differences to define the fractional composition of murine and human DNA, which was proportional to the fractional composition of cells in a series of lung cancer PDX lines. We compared four methods of human cancer cell isolation: fluorescence-activated cell sorting (FACS, an immunomagnetic mouse cell depletion (MCD approach, and two distinct EpCAM-based immunomagnetic positive selection methods. We further analyzed DNA extracted from the resulting enriched human cancer cells by targeted sequencing using a clinically validated multi-gene panel. Stromal content varied widely among tumors of similar histology, but appeared stable over multiple serial tumor passages of an individual model. FACS and MCD were superior to either positive selection approach, especially in cases of high stromal content, and consistently allowed high quality human-specific genomic profiling. ssPAL is a dependable approach to quantitation of murine stromal content, and MCD is a simple, efficient, and high yield approach to human cancer cell isolation for genomic analysis of PDX tumors.

  17. Genome-to-genome analysis highlights the impact of the human innate and adaptive immune systems on the hepatitis C virus

    Science.gov (United States)

    Ip, Camilla; Magri, Andrea; Von Delft, Annette; Bonsall, David; Chaturvedi, Nimisha; Bartha, Istvan; Smith, David; Nicholson, George; McVean, Gilean; Trebes, Amy; Piazza, Paolo; Fellay, Jacques; Cooke, Graham; Foster, Graham R; Hudson, Emma; McLauchlan, John; Simmonds, Peter; Bowden, Rory; Klenerman, Paul; Barnes, Eleanor; Spencer, Chris C. A.

    2018-01-01

    Outcomes of hepatitis C virus (HCV) infection and treatment depend on viral and host genetic factors. We use human genome-wide genotyping arrays and new whole-genome HCV viral sequencing technologies to perform a systematic genome-to-genome study of 542 individuals chronically infected with HCV, predominately genotype 3. We show that both HLA alleles and interferon lambda innate immune system genes drive viral genome polymorphism, and that IFNL4 genotypes determine HCV viral load through a mechanism that is dependent on a specific polymorphism in the HCV polyprotein. We highlight the interplay between innate immune responses and the viral genome in HCV control. PMID:28394351

  18. In silico analysis of human metabolism: Reconstruction, contextualization and application of genome-scale models

    DEFF Research Database (Denmark)

    Geng, Jun; Nielsen, Jens

    2017-01-01

    The arising prevalence of metabolic diseases calls for a holistic approach for analysis of the underlying nature of abnormalities in cellular functions. Through mathematic representation and topological analysis of cellular metabolism, GEnome scale metabolic Models (GEMs) provide a promising fram...... that correctly describe interactions between cells or tissues, and we therefore discuss how GEMs can be integrated with blood circulation models. Finally, we end the review with proposing some possible future research directions....

  19. Analysis of cis-elements that facilitate extrachromosomal persistence of human papillomavirus genomes

    International Nuclear Information System (INIS)

    Pittayakhajonwut, Daraporn; Angeletti, Peter C.

    2008-01-01

    Human papillomaviruses (HPVs) are maintained latently in dividing epithelial cells as nuclear plasmids. Two virally encoded proteins, E1, a helicase, and E2, a transcription factor, are important players in replication and stable plasmid maintenance in host cells. Recent experiments in yeast have demonstrated that viral genomes retain replication and maintenance function independently of E1 and E2 [Angeletti, P.C., Kim, K., Fernandes, F.J., and Lambert, P.F. (2002). Stable replication of papillomavirus genomes in Saccharomyces cerevisiae. J. Virol. 76(7), 3350-8; Kim, K., Angeletti, P.C., Hassebroek, E.C., and Lambert, P.F. (2005). Identification of cis-acting elements that mediate the replication and maintenance of human papillomavirus type 16 genomes in Saccharomyces cerevisiae. J. Virol. 79(10), 5933-42]. Flow cytometry studies of EGFP-reporter vectors containing subgenomic HPV fragments with or without a human ARS (hARS), revealed that six fragments located in E6-E7, E1-E2, L1, and L2 regions showed a capacity for plasmid stabilization in the absence of E1 and E2 proteins. Interestingly, four fragments within E7, the 3' end of L2, and the 5' end of L1 exhibited stability in plasmids that lacked an hARS, indicating that they possess both replication and maintenance functions. Two fragments lying in E1-E2 and the 3' region of L1 were stable only in the presence of hARS, that they contained only maintenance function. Mutational analyses of HPV16-GFP reporter constructs provided evidence that genomes lacking E1 and E2 could replicate to an extent similar to wild type HPV16. Together these results support the concept that cellular factors influence HPV replication and maintenance, independently, and perhaps in conjunction with E1 and E2, suggesting a role in the persistent phase of the viral lifecycle

  20. Insights from Human/Mouse genome comparisons

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.

    2003-03-30

    Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.

  1. Comparative genome analysis of Megasphaera sp. reveals niche specialization and its potential role in the human gut.

    Directory of Open Access Journals (Sweden)

    Sudarshan Anand Shetty

    Full Text Available With increasing number of novel bacteria being isolated from the human gut ecosystem, there is a greater need to study their role in the gut ecosystem and their effect on the host health. In the present study, we carried out in silico genome-wide analysis of two novel Megasphaera sp. isolates NM10 (DSM25563 and BL7 (DSM25562, isolated from feces of two healthy individuals and validated the key features by in vitro studies. The analysis revealed the general metabolic potential, adaptive features and the potential effects of these isolates on the host. The comparative genome analysis of the two human gut isolates NM10 and BL7 with ruminal isolate Megasphaera elsdenii (DSM20460 highlighted the differential adaptive features for their survival in human gut. The key findings include features like bile resistance, presence of various sensory and regulatory systems, stress response systems, membrane transporters and resistance to antibiotics. Comparison of the "glycobiome" based on the genomes of the ruminal isolate with the human gut isolates NM10 and BL revealed the presence of diverse and unique sets of Carbohydrate-Active enzymes (CAZymes amongst these isolates, with a higher collection of CAZymes in the human gut isolates. This could be attributed to the difference in host diet and thereby the environment, consequently suggesting host specific adaptation in these isolates. In silico analysis of metabolic potential predicted the ability of these isolates to produce important metabolites like short chain fatty acids (butyrate, acetate, formate, and caproate, vitamins and essential amino acids, which was further validated by in vitro experiments. The ability of these isolates to produce important metabolites advocates for a potential healthy influence on the host. Further in vivo studies including transcriptomic and proteomic analysis will be required for better understanding the role and impact of these Megasphaera sp. isolates NM10 and BL7 on the

  2. Genome engineering in human cells.

    Science.gov (United States)

    Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

    2014-01-01

    Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 mega base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.

  3. Full-Genome Analysis of Avian Influenza A(H5N1) Virus from a Human, North America, 2013

    Science.gov (United States)

    Pabbaraju, Kanti; Tellier, Raymond; Wong, Sallene; Li, Yan; Bastien, Nathalie; Tang, Julian W.; Drews, Steven J.; Jang, Yunho; Davis, C. Todd; Tipples, Graham A.

    2014-01-01

    Full-genome analysis was conducted on the first isolate of a highly pathogenic avian influenza A(H5N1) virus from a human in North America. The virus has a hemagglutinin gene of clade 2.3.2.1c and is a reassortant with an H9N2 subtype lineage polymerase basic 2 gene. No mutations conferring resistance to adamantanes or neuraminidase inhibitors were found. PMID:24755439

  4. Ubiquitous polygenicity of human complex traits: genome-wide analysis of 49 traits in Koreans.

    Directory of Open Access Journals (Sweden)

    Jian Yang

    Full Text Available Recent studies in population of European ancestry have shown that 30% ~ 50% of heritability for human complex traits such as height and body mass index, and common diseases such as schizophrenia and rheumatoid arthritis, can be captured by common SNPs and that genetic variation attributed to chromosomes are in proportion to their length. Using genome-wide estimation and partitioning approaches, we analysed 49 human quantitative traits, many of which are relevant to human diseases, in 7,170 unrelated Korean individuals genotyped on 326,262 SNPs. For 43 of the 49 traits, we estimated a nominally significant (P<0.05 proportion of variance explained by all SNPs on the Affymetrix 5.0 genotyping array ([Formula: see text]. On average across 47 of the 49 traits for which the estimate of h(G(2 is non-zero, common SNPs explain approximately one-third (range of 7.8% to 76.8% of narrow sense heritability. The estimate of h(G(2 is highly correlated with the proportion of SNPs with association P<0.031 (r(2 = 0.92. Longer genomic segments tend to explain more phenotypic variation, with a correlation of 0.78 between the estimate of variance explained by individual chromosomes and their physical length, and 1% of the genome explains approximately 1% of the genetic variance. Despite the fact that there are a few SNPs with large effects for some traits, these results suggest that polygenicity is ubiquitous for most human complex traits and that a substantial proportion of the "missing heritability" is captured by common SNPs.

  5. Genome-wide meta-analysis associates HLA-DQA1/DRB1 and LPA and lifestyle factors with human longevity

    NARCIS (Netherlands)

    Joshi, Peter K; Pirastu, Nicola; Kentistou, Katherine A; Fischer, Krista; Hofer, Edith; Schraut, Katharina E; Clark, David W; Nutile, Teresa; Barnes, Catriona L K; Timmers, Paul R H J; Shen, Xia; Gandin, Ilaria; McDaid, Aaron F; Hansen, Thomas Folkmann; Gordon, Scott D; Giulianini, Franco; Boutin, Thibaud S; Abdellaoui, Abdel; Zhao, Wei; Medina-Gomez, Carolina; Bartz, Traci M; Trompet, Stella; Lange, Leslie A; Raffield, Laura; van der Spek, Ashley; Galesloot, Tessel E; Proitsi, Petroula; Yanek, Lisa R; Bielak, Lawrence F; Payton, Antony; Murgia, Federico; Concas, Maria Pina; Biino, Ginevra; Tajuddin, Salman M; Seppälä, Ilkka; Amin, Najaf; Boerwinkle, Eric; Børglum, Anders D; Campbell, Archie; Demerath, Ellen W; Demuth, Ilja; Faul, Jessica D; Ford, Ian; Gialluisi, Alessandro; Gögele, Martin; Graff, MariaElisa; Hingorani, Aroon; Hottenga, Jouke-Jan; Hougaard, David M; Hurme, Mikko A; Ikram, M Arfan; Jylhä, Marja; Kuh, Diana; Ligthart, Lannie; Lill, Christina M; Lindenberger, Ulman; Lumley, Thomas; Mägi, Reedik; Marques-Vidal, Pedro; Medland, Sarah E; Milani, Lili; Nagy, Reka; Ollier, William E R; Peyser, Patricia A; Pramstaller, Peter P; Ridker, Paul M; Rivadeneira, Fernando; Ruggiero, Daniela; Saba, Yasaman; Schmidt, Reinhold; Schmidt, Helena; Slagboom, P Eline; Smith, Blair H; Smith, Jennifer A; Sotoodehnia, Nona; Steinhagen-Thiessen, Elisabeth; van Rooij, Frank J A; Verbeek, André L; Vermeulen, Sita H; Vollenweider, Peter; Wang, Yunpeng; Werge, Thomas; Whitfield, John B; Zonderman, Alan B; Lehtimäki, Terho; Evans, Michele K; Pirastu, Mario; Fuchsberger, Christian; Bertram, Lars; Pendleton, Neil; Kardia, Sharon L R; Ciullo, Marina; Becker, Diane M; Wong, Andrew; Psaty, Bruce M; van Duijn, Cornelia M; Wilson, James G; Jukema, J Wouter; Kiemeney, Lambertus; Uitterlinden, André G; Franceschini, Nora; North, Kari E; Weir, David R; Metspalu, Andres; Boomsma, Dorret I; Hayward, Caroline; Chasman, Daniel; Martin, Nicholas G; Sattar, Naveed; Campbell, Harry; Esko, Tōnu; Kutalik, Zoltán; Wilson, James F

    2017-01-01

    Genomic analysis of longevity offers the potential to illuminate the biology of human aging. Here, using genome-wide association meta-analysis of 606,059 parents' survival, we discover two regions associated with longevity (HLA-DQA1/DRB1 and LPA). We also validate previous suggestions that APOE,

  6. Genome-wide meta-analysis associates HLA-DQA1/DRB1 and LPA and lifestyle factors with human longevity

    NARCIS (Netherlands)

    P.K. Joshi (Peter); N. Pirastu (Nicola); Kentistou, K.A. (Katherine A.); K. Fischer (Krista); E. Hofer (Edith); Schraut, K.E. (Katharina E.); Clark, D.W. (David W.); Nutile, T. (Teresa); Barnes, C.L.K. (Catriona L. K.); Timmers, P.R.H.J. (Paul R. H. J.); Shen, X. (Xia); I. Gandin (Ilaria); McDaid, A.F. (Aaron F.); Hansen, T.F. (Thomas Folkmann); S.D. Gordon (Scott D.); F. Giulianini (Franco); T. Boutin (Thibaud); A. Abdellaoui (Abdel); W. Zhao (Wei); M.C. Medina-Gomez (Carolina); T.M. Bartz (Traci M.); S. Trompet (Stella); L.A. Lange (Leslie); Raffield, L. (Laura); A. van der Spek (Ashley); T.E. Galesloot (Tessel); Proitsi, P. (Petroula); L.R. Yanek (Lisa); L.F. Bielak (Lawrence F.); A. Payton (Antony); D. Murgia (Daniela); M.P. Concas (Maria Pina); G. Biino (Ginevra); Tajuddin, S.M. (Salman M.); I. Seppälä (Ilkka); Amin, N. (Najaf); Boerwinkle, E. (Eric); Børglum, A.D. (Anders D.); A. Campbell (Archie); E.W. Demerath (Ellen); I. Demuth (Ilja); J.D. Faul (Jessica D.); I. Ford (Ian); Gialluisi, A. (Alessandro); M. Gögele (Martin); M.J. Graff (Maud J.L.); A. Hingorani (Aroon); J.J. Hottenga (Jouke Jan); D.M. Hougaard (David); Hurme, M.A. (Mikko A.); M.K. Ikram (Kamran); Jylhä, M. (Marja); Kuh, D. (Diana); L. Ligthart (Lannie); C.M. Lill (Christina); U. Lindenberger (Ulman); T. Lumley (Thomas); R. Mägi (Reedik); P. Marques-Vidal (Pedro); S.E. Medland (Sarah Elizabeth); L. Milani (Lili); Nagy, R. (Reka); W.E.R. Ollier (William); P.A. Peyser (Patricia A.); P.P. Pramstaller (Peter Paul); P.M. Ridker (Paul); Rivadeneira, F. (Fernando); D. Ruggiero; Y. Saba (Yasaman); R. Schmidt (Reinhold); H. Schmidt (Helena); P.E. Slagboom (Eline); B.H. Smith; J.A. Smith (Jennifer A); N. Sotoodehnia (Nona); E. Steinhagen-Thiessen (Elisabeth); F.J.A. van Rooij (Frank); A.L.M. Verbeek; S.H.H.M. Vermeulen (Sita); P. Vollenweider (Peter); Wang, Y. (Yunpeng); T.M. Werge (Thomas); J.B. Whitfield (John B.); A.B. Zonderman; T. Lehtimäki (Terho); M. Evans (Michele); M. Pirastu (Mario); C. Fuchsberger (Christian); L. Bertram (Lars); N. Pendleton (Neil); Kardia, S.L.R. (Sharon L. R.); Ciullo, M. (Marina); D.M. Becker (Diane); Wong, A. (Andrew); B.M. Psaty (Bruce M.); C.M. van Duijn (Cornelia); J.F. Wilson (James); J.W. Jukema (Jan Wouter); L.A.L.M. Kiemeney (Bart); A.G. Uitterlinden (André); N. Franceschini (Nora); K.E. North (Kari); Weir, D.R. (David R.); Metspalu, A. (Andres); D.I. Boomsma (Dorret); C. Hayward (Caroline); D.I. Chasman (Daniel); Martin, N.G. (Nicholas G.); N. Sattar (Naveed); H. Campbell (Harry); T. Esko (Tõnu); Z. Kutalik (Zoltán); J.F. Wilson (James)

    2017-01-01

    textabstractGenomic analysis of longevity offers the potential to illuminate the biology of human aging. Here, using genome-wide association meta-analysis of 606,059 parents' survival, we discover two regions associated with longevity (HLA-DQA1/DRB1 and LPA). We also validate previous suggestions

  7. Genome-wide meta-analysis associates HLA-DQA1/DRB1 and LPA and lifestyle factors with human longevity

    DEFF Research Database (Denmark)

    Joshi, Peter K; Pirastu, Nicola; Kentistou, Katherine A

    2017-01-01

    Genomic analysis of longevity offers the potential to illuminate the biology of human aging. Here, using genome-wide association meta-analysis of 606,059 parents' survival, we discover two regions associated with longevity (HLA-DQA1/DRB1 and LPA). We also validate previous suggestions that APOE, ...

  8. Sequence analysis of the whole genomes of five African human G9 rotavirus strains.

    Science.gov (United States)

    Nyaga, Martin M; Jere, Khuzwayo C; Peenze, Ina; Mlera, Luwanika; van Dijk, Alberdina A; Seheri, Mapaseka L; Mphahlele, M Jeffrey

    2013-06-01

    The G9 rotaviruses are amongst the most common global rotavirus strains causing severe childhood diarrhoea. However, the whole genomes of only a few G9 rotaviruses have been fully sequenced and characterised of which only one G9P[6] and one G9P[8] are from Africa. We determined the consensus sequence of the whole genomes of five African human group A G9 rotavirus strains, four G9P[8] strains and one G9P[6] strain collected in Cameroon (central Africa), Kenya (eastern Africa), South Africa and Zimbabwe (southern Africa) in 1999, 2009 and 2010. Strain RVA/Human-wt/ZWE/MRC-DPRU1723/2009/G9P[8] from Zimbabwe, RVA/Human-wt/ZAF/MRC-DPRU4677/2010/G9P[8] from South Africa, RVA/Human-wt/CMR/1424/2009/G9P[8] from Cameroon and RVA/Human-wt/KEN/MRC-DPRU2427/2010/G9P[8] from Kenya were on a Wa-like genetic backbone and were genotyped as G9-P[8]-I1-R1-C1-M1-A1-N1-T1-E1-H1. Strain RVA/Human-wt/ZAF/MRC-DPRU9317/1999/G9P[6] from South Africa was genotyped as G9-P[6]-I2-R2-C2-M2-A2-N1-T2-E2-H2. Rotavirus A strain MRC-DPRU9317 is the second G9 strain to be reported on a DS-1-like genetic backbone, the other being RVA/Human-wt/ZAF/GR10924/1999/G9P[6]. MRC-DPRU9317 was found to be a reassortant between DS-1-like (I2, R2, C2, M2, A2, T2, E2 and H2) and Wa-like (N1) genome segments. All the genome segments of the five strains grouped strictly according to their genotype Wa- or DS-1-like clusters. Within their respective genotypes, the genome segments of the three G9 study strains from southern Africa clustered most closely with rotaviruses from the same geographical origin and with those with the same G and P types. The highest nucleotide identity of genome segments of the study strains from eastern and central Africa regions on a Wa-like backbone was not limited to rotaviruses with G9P[8] genotypes only, they were also closely related to G12P[6], G8P[8], G1P[8] and G11P[25] rotaviruses, indicating a close inter-genotype relationship between the G9 and other rotavirus genotypes

  9. Human genome. 1993 Program report

    Energy Technology Data Exchange (ETDEWEB)

    1994-03-01

    The purpose of this report is to update the Human Genome 1991-92 Program Report and provide new information on the DOE genome program to researchers, program managers, other government agencies, and the interested public. This FY 1993 supplement includes abstracts of 60 new or renewed projects and listings of 112 continuing and 28 completed projects. These two reports, taken together, present the most complete published view of the DOE Human Genome Program through FY 1993. Research is progressing rapidly toward 15-year goals of mapping and sequencing the DNA of each of the 24 different human chromosomes.

  10. Methylation-Sensitive Amplification Length Polymorphism (MS-AFLP) Microarrays for Epigenetic Analysis of Human Genomes.

    Science.gov (United States)

    Alonso, Sergio; Suzuki, Koichi; Yamamoto, Fumiichiro; Perucho, Manuel

    2018-01-01

    Somatic, and in a minor scale also germ line, epigenetic aberrations are fundamental to carcinogenesis, cancer progression, and tumor phenotype. DNA methylation is the most extensively studied and arguably the best understood epigenetic mechanisms that become altered in cancer. Both somatic loss of methylation (hypomethylation) and gain of methylation (hypermethylation) are found in the genome of malignant cells. In general, the cancer cell epigenome is globally hypomethylated, while some regions-typically gene-associated CpG islands-become hypermethylated. Given the profound impact that DNA methylation exerts on the transcriptional profile and genomic stability of cancer cells, its characterization is essential to fully understand the complexity of cancer biology, improve tumor classification, and ultimately advance cancer patient management and treatment. A plethora of methods have been devised to analyze and quantify DNA methylation alterations. Several of the early-developed methods relied on the use of methylation-sensitive restriction enzymes, whose activity depends on the methylation status of their recognition sequences. Among these techniques, methylation-sensitive amplification length polymorphism (MS-AFLP) was developed in the early 2000s, and successfully adapted from its original gel electrophoresis fingerprinting format to a microarray format that notably increased its throughput and allowed the quantification of the methylation changes. This array-based platform interrogates over 9500 independent loci putatively amplified by the MS-AFLP technique, corresponding to the NotI sites mapped throughout the human genome.

  11. Detailed analysis of inversions predicted between two human genomes: errors, real polymorphisms, and their origin and population distribution.

    Science.gov (United States)

    Vicente-Salvador, David; Puig, Marta; Gayà-Vidal, Magdalena; Pacheco, Sarai; Giner-Delgado, Carla; Noguera, Isaac; Izquierdo, David; Martínez-Fundichely, Alexander; Ruiz-Herrera, Aurora; Estivill, Xavier; Aguado, Cristina; Lucas-Lledó, José Ignacio; Cáceres, Mario

    2017-02-01

    The growing catalogue of structural variants in humans often overlooks inversions as one of the most difficult types of variation to study, even though they affect phenotypic traits in diverse organisms. Here, we have analysed in detail 90 inversions predicted from the comparison of two independently assembled human genomes: the reference genome (NCBI36/HG18) and HuRef. Surprisingly, we found that two thirds of these predictions (62) represent errors either in assembly comparison or in one of the assemblies, including 27 misassembled regions in HG18. Next, we validated 22 of the remaining 28 potential polymorphic inversions using different PCR techniques and characterized their breakpoints and ancestral state. In addition, we determined experimentally the derived allele frequency in Europeans for 17 inversions (DAF = 0.01-0.80), as well as the distribution in 14 worldwide populations for 12 of them based on the 1000 Genomes Project data. Among the validated inversions, nine have inverted repeats (IRs) at their breakpoints, and two show nucleotide variation patterns consistent with a recurrent origin. Conversely, inversions without IRs have a unique origin and almost all of them show deletions or insertions at the breakpoints in the derived allele mediated by microhomology sequences, which highlights the importance of mechanisms like FoSTeS/MMBIR in the generation of complex rearrangements in the human genome. Finally, we found several inversions located within genes and at least one candidate to be positively selected in Africa. Thus, our study emphasizes the importance of careful analysis and validation of large-scale genomic predictions to extract reliable biological conclusions. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  12. The characterization of twenty sequenced human genomes.

    Directory of Open Access Journals (Sweden)

    Kimberly Pelak

    2010-09-01

    Full Text Available We present the analysis of twenty human genomes to evaluate the prospects for identifying rare functional variants that contribute to a phenotype of interest. We sequenced at high coverage ten "case" genomes from individuals with severe hemophilia A and ten "control" genomes. We summarize the number of genetic variants emerging from a study of this magnitude, and provide a proof of concept for the identification of rare and highly-penetrant functional variants by confirming that the cause of hemophilia A is easily recognizable in this data set. We also show that the number of novel single nucleotide variants (SNVs discovered per genome seems to stabilize at about 144,000 new variants per genome, after the first 15 individuals have been sequenced. Finally, we find that, on average, each genome carries 165 homozygous protein-truncating or stop loss variants in genes representing a diverse set of pathways.

  13. Meta-analysis of human genome-microbiome association studies: the MiBioGen consortium initiative.

    Science.gov (United States)

    Wang, Jun; Kurilshikov, Alexander; Radjabzadeh, Djawad; Turpin, Williams; Croitoru, Kenneth; Bonder, Marc Jan; Jackson, Matthew A; Medina-Gomez, Carolina; Frost, Fabian; Homuth, Georg; Rühlemann, Malte; Hughes, David; Kim, Han-Na; Spector, Tim D; Bell, Jordana T; Steves, Claire J; Timpson, Nicolas; Franke, Andre; Wijmenga, Cisca; Meyer, Katie; Kacprowski, Tim; Franke, Lude; Paterson, Andrew D; Raes, Jeroen; Kraaij, Robert; Zhernakova, Alexandra

    2018-06-08

    In recent years, human microbiota, especially gut microbiota, have emerged as an important yet complex trait influencing human metabolism, immunology, and diseases. Many studies are investigating the forces underlying the observed variation, including the human genetic variants that shape human microbiota. Several preliminary genome-wide association studies (GWAS) have been completed, but more are necessary to achieve a fuller picture. Here, we announce the MiBioGen consortium initiative, which has assembled 18 population-level cohorts and some 19,000 participants. Its aim is to generate new knowledge for the rapidly developing field of microbiota research. Each cohort has surveyed the gut microbiome via 16S rRNA sequencing and genotyped their participants with full-genome SNP arrays. We have standardized the analytical pipelines for both the microbiota phenotypes and genotypes, and all the data have been processed using identical approaches. Our analysis of microbiome composition shows that we can reduce the potential artifacts introduced by technical differences in generating microbiota data. We are now in the process of benchmarking the association tests and performing meta-analyses of genome-wide associations. All pipeline and summary statistics results will be shared using public data repositories. We present the largest consortium to date devoted to microbiota-GWAS. We have adapted our analytical pipelines to suit multi-cohort analyses and expect to gain insight into host-microbiota cross-talk at the genome-wide level. And, as an open consortium, we invite more cohorts to join us (by contacting one of the corresponding authors) and to follow the analytical pipeline we have developed.

  14. Comprehensive analysis of genome-wide DNA methylation across human polycystic ovary syndrome ovary granulosa cell.

    Science.gov (United States)

    Xu, Jiawei; Bao, Xiao; Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu

    2016-05-10

    Polycystic ovary syndrome (PCOS) affects approximately 7% of the reproductive-age women. A growing body of evidence indicated that epigenetic mechanisms contributed to the development of PCOS. The role of DNA modification in human PCOS ovary granulosa cell is still unknown in PCOS progression. Global DNA methylation and hydroxymethylation were detected between PCOS' and controls' granulosa cell. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylaiton. Selected genes expressions were analyzed between PCOS' and controls' granulosa cell. Our results showed that the granulosa cell global DNA methylation of PCOS patients was significant higher than the controls'. The global DNA hydroxymethylation showed low level and no statistical difference between PCOS and control. 6936 differentially methylated CpG sites were identified between control and PCOS-obesity. 12245 differential methylated CpG sites were detected between control and PCOS-nonobesity group. 5202 methylated CpG sites were significantly differential between PCOS-obesity and PCOS-nonobesity group. Our results showed that DNA methylation not hydroxymethylation altered genome-wide in PCOS granulosa cell. The different methylation genes were enriched in development protein, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulation in PCOS-obesity patients with no significance different between control and PCOS-nonobesity patients, which may be activated by lower DNA methylaiton. Global and genome-wide DNA methylation alteration may contribute to different genes expression and PCOS clinical pathology.

  15. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data.

    Science.gov (United States)

    Lu, Qiongshi; Hu, Yiming; Sun, Jiehuan; Cheng, Yuwei; Cheung, Kei-Hoi; Zhao, Hongyu

    2015-05-27

    Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.

  16. In-depth comparative analysis of malaria parasite genomes reveals protein-coding genes linked to human disease in Plasmodium falciparum genome.

    Science.gov (United States)

    Liu, Xuewu; Wang, Yuanyuan; Liang, Jiao; Wang, Luojun; Qin, Na; Zhao, Ya; Zhao, Gang

    2018-05-02

    Plasmodium falciparum is the most virulent malaria parasite capable of parasitizing human erythrocytes. The identification of genes related to this capability can enhance our understanding of the molecular mechanisms underlying human malaria and lead to the development of new therapeutic strategies for malaria control. With the availability of several malaria parasite genome sequences, performing computational analysis is now a practical strategy to identify genes contributing to this disease. Here, we developed and used a virtual genome method to assign 33,314 genes from three human malaria parasites, namely, P. falciparum, P. knowlesi and P. vivax, and three rodent malaria parasites, namely, P. berghei, P. chabaudi and P. yoelii, to 4605 clusters. Each cluster consisted of genes whose protein sequences were significantly similar and was considered as a virtual gene. Comparing the enriched values of all clusters in human malaria parasites with those in rodent malaria parasites revealed 115 P. falciparum genes putatively responsible for parasitizing human erythrocytes. These genes are mainly located in the chromosome internal regions and participate in many biological processes, including membrane protein trafficking and thiamine biosynthesis. Meanwhile, 289 P. berghei genes were included in the rodent parasite-enriched clusters. Most are located in subtelomeric regions and encode erythrocyte surface proteins. Comparing cluster values in P. falciparum with those in P. vivax and P. knowlesi revealed 493 candidate genes linked to virulence. Some of them encode proteins present on the erythrocyte surface and participate in cytoadhesion, virulence factor trafficking, or erythrocyte invasion, but many genes with unknown function were also identified. Cerebral malaria is characterized by accumulation of infected erythrocytes at trophozoite stage in brain microvascular. To discover cerebral malaria-related genes, fast Fourier transformation (FFT) was introduced to extract

  17. CMS: a web-based system for visualization and analysis of genome-wide methylation data of human cancers.

    Science.gov (United States)

    Gu, Fei; Doderer, Mark S; Huang, Yi-Wen; Roa, Juan C; Goodfellow, Paul J; Kizer, E Lynette; Huang, Tim H M; Chen, Yidong

    2013-01-01

    DNA methylation of promoter CpG islands is associated with gene suppression, and its unique genome-wide profiles have been linked to tumor progression. Coupled with high-throughput sequencing technologies, it can now efficiently determine genome-wide methylation profiles in cancer cells. Also, experimental and computational technologies make it possible to find the functional relationship between cancer-specific methylation patterns and their clinicopathological parameters. Cancer methylome system (CMS) is a web-based database application designed for the visualization, comparison and statistical analysis of human cancer-specific DNA methylation. Methylation intensities were obtained from MBDCap-sequencing, pre-processed and stored in the database. 191 patient samples (169 tumor and 22 normal specimen) and 41 breast cancer cell-lines are deposited in the database, comprising about 6.6 billion uniquely mapped sequence reads. This provides comprehensive and genome-wide epigenetic portraits of human breast cancer and endometrial cancer to date. Two views are proposed for users to better understand methylation structure at the genomic level or systemic methylation alteration at the gene level. In addition, a variety of annotation tracks are provided to cover genomic information. CMS includes important analytic functions for interpretation of methylation data, such as the detection of differentially methylated regions, statistical calculation of global methylation intensities, multiple gene sets of biologically significant categories, interactivity with UCSC via custom-track data. We also present examples of discoveries utilizing the framework. CMS provides visualization and analytic functions for cancer methylome datasets. A comprehensive collection of datasets, a variety of embedded analytic functions and extensive applications with biological and translational significance make this system powerful and unique in cancer methylation research. CMS is freely accessible

  18. Genomic hypomethylation in the human germline associates with selective structural mutability in the human genome.

    Directory of Open Access Journals (Sweden)

    Jian Li

    Full Text Available The hotspots of structural polymorphisms and structural mutability in the human genome remain to be explained mechanistically. We examine associations of structural mutability with germline DNA methylation and with non-allelic homologous recombination (NAHR mediated by low-copy repeats (LCRs. Combined evidence from four human sperm methylome maps, human genome evolution, structural polymorphisms in the human population, and previous genomic and disease studies consistently points to a strong association of germline hypomethylation and genomic instability. Specifically, methylation deserts, the ~1% fraction of the human genome with the lowest methylation in the germline, show a tenfold enrichment for structural rearrangements that occurred in the human genome since the branching of chimpanzee and are highly enriched for fast-evolving loci that regulate tissue-specific gene expression. Analysis of copy number variants (CNVs from 400 human samples identified using a custom-designed array comparative genomic hybridization (aCGH chip, combined with publicly available structural variation data, indicates that association of structural mutability with germline hypomethylation is comparable in magnitude to the association of structural mutability with LCR-mediated NAHR. Moreover, rare CNVs occurring in the genomes of individuals diagnosed with schizophrenia, bipolar disorder, and developmental delay and de novo CNVs occurring in those diagnosed with autism are significantly more concentrated within hypomethylated regions. These findings suggest a new connection between the epigenome, selective mutability, evolution, and human disease.

  19. Generation of a genomic tiling array of the human Major Histocompatibility Complex (MHC and its application for DNA methylation analysis

    Directory of Open Access Journals (Sweden)

    Ottaviani Diego

    2008-05-01

    Full Text Available Abstract Background The major histocompatibility complex (MHC is essential for human immunity and is highly associated with common diseases, including cancer. While the genetics of the MHC has been studied intensively for many decades, very little is known about the epigenetics of this most polymorphic and disease-associated region of the genome. Methods To facilitate comprehensive epigenetic analyses of this region, we have generated a genomic tiling array of 2 Kb resolution covering the entire 4 Mb MHC region. The array has been designed to be compatible with chromatin immunoprecipitation (ChIP, methylated DNA immunoprecipitation (MeDIP, array comparative genomic hybridization (aCGH and expression profiling, including of non-coding RNAs. The array comprises 7832 features, consisting of two replicates of both forward and reverse strands of MHC amplicons and appropriate controls. Results Using MeDIP, we demonstrate the application of the MHC array for DNA methylation profiling and the identification of tissue-specific differentially methylated regions (tDMRs. Based on the analysis of two tissues and two cell types, we identified 90 tDMRs within the MHC and describe their characterisation. Conclusion A tiling array covering the MHC region was developed and validated. Its successful application for DNA methylation profiling indicates that this array represents a useful tool for molecular analyses of the MHC in the context of medical genomics.

  20. Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus.

    Science.gov (United States)

    Ansari, M Azim; Pedergnana, Vincent; L C Ip, Camilla; Magri, Andrea; Von Delft, Annette; Bonsall, David; Chaturvedi, Nimisha; Bartha, Istvan; Smith, David; Nicholson, George; McVean, Gilean; Trebes, Amy; Piazza, Paolo; Fellay, Jacques; Cooke, Graham; Foster, Graham R; Hudson, Emma; McLauchlan, John; Simmonds, Peter; Bowden, Rory; Klenerman, Paul; Barnes, Eleanor; Spencer, Chris C A

    2017-05-01

    Outcomes of hepatitis C virus (HCV) infection and treatment depend on viral and host genetic factors. Here we use human genome-wide genotyping arrays and new whole-genome HCV viral sequencing technologies to perform a systematic genome-to-genome study of 542 individuals who were chronically infected with HCV, predominantly genotype 3. We show that both alleles of genes encoding human leukocyte antigen molecules and genes encoding components of the interferon lambda innate immune system drive viral polymorphism. Additionally, we show that IFNL4 genotypes determine HCV viral load through a mechanism dependent on a specific amino acid residue in the HCV NS5A protein. These findings highlight the interplay between the innate immune system and the viral genome in HCV control.

  1. Human Genome Research: Decoding DNA

    Science.gov (United States)

    dropdown arrow Site Map A-Z Index Menu Synopsis Human Genome Research: Decoding DNA Resources with of the DNA double helix during April 2003. James D. Watson, Francis Crick, and Maurice Wilkins were company Celera announced the completion of a "working draft" reference DNA sequence of the human

  2. Characterization and genome analysis of novel bacteriophages infecting the opportunistic human pathogens Klebsiella oxytoca and K. pneumoniae.

    Science.gov (United States)

    Park, Eun-Ah; Kim, You-Tae; Cho, Jae-Hyun; Ryu, Sangryeol; Lee, Ju-Hoon

    2017-04-01

    Klebsiella is a genus of well-known opportunistic human pathogens that are associated with diabetes mellitus and chronic pulmonary obstruction; however, this pathogen is often resistant to multiple drugs. To control this pathogen, two Klebsiella-infecting phages, K. oxytoca phage PKO111 and K. pneumoniae phage PKP126, were isolated from a sewage sample. Analysis of their host range revealed that they infect K. pneumoniae and K. oxytoca, suggesting host specificity for members of the genus Klebsiella. Stability tests confirmed that the phages are stable under various temperature (4 to 60 °C) and pH (3 to 11) conditions. A challenge assay showed that PKO111 and PKP126 inhibit growth of their host strains by 2 log and 4 log, respectively. Complete genome sequencing of the phages revealed that their genome sizes are quite different (168,758 bp for PKO111 and 50,934 bp for PKP126). Their genome annotation results showed that they have no human virulence-related genes, an important safety consideration. In addition, no lysogen-formation gene cluster was detected in either phage genome, suggesting that they are both virulent phages in their bacterial hosts. Based on these results, PKO111 and PKP126 may be good candidates for development of biocontrol agents against members of the genus Klebsiella for therapeutic purposes. A comparative analysis of tail-associated gene clusters of PKO111 and PKP126 revealed relatively low homology, suggesting that they might differ in the way they recognize and infect their specific hosts.

  3. Comprehensive Analysis of Genome Rearrangements in Eight Human Malignant Tumor Tissues.

    Directory of Open Access Journals (Sweden)

    Stefanie Marczok

    Full Text Available Carcinogenesis is a complex multifactorial, multistage process, but the precise mechanisms are not well understood. In this study, we performed a genome-wide analysis of the copy number variation (CNV, breakpoint region (BPR and fragile sites in 2,737 tumor samples from eight tumor entities and in 432 normal samples. CNV detection and BPR identification revealed that BPRs tended to accumulate in specific genomic regions in tumor samples whereas being dispersed genome-wide in the normal samples. Hotspots were observed, at which segments with similar alteration in copy number were overlapped along with BPRs adjacently clustered. Evaluation of BPR occurrence frequency showed that at least one was detected in about and more than 15% of samples for each tumor entity while BPRs were maximal in 12% of the normal samples. 127 of 2,716 tumor-relevant BPRs (termed 'common BPRs' exhibited also a noticeable occurrence frequency in the normal samples. Colocalization assessment identified 20,077 CNV-affecting genes and 169 of these being known tumor-related genes. The most noteworthy genes are KIAA0513 important for immunologic, synaptic and apoptotic signal pathways, intergenic non-coding RNA RP11-115C21.2 possibly acting as oncogene or tumor suppressor by changing the structure of chromatin, and ADAM32 likely importance in cancer cell proliferation and progression by ectodomain-shedding of diverse growth factors, and the well-known tumor suppressor gene p53. The BPR distributions indicate that CNV mutations are likely non-random in tumor genomes. The marked recurrence of BPRs at specific regions supports common progression mechanisms in tumors. The presence of hotspots together with common BPRs, despite its small group size, imply a relation between fragile sites and cancer-gene alteration. Our data further suggest that both protein-coding and non-coding genes possessing a range of biological functions might play a causative or functional role in tumor

  4. HGVA: the Human Genome Variation Archive.

    Science.gov (United States)

    Lopez, Javier; Coll, Jacobo; Haimel, Matthias; Kandasamy, Swaathi; Tarraga, Joaquin; Furio-Tari, Pedro; Bari, Wasim; Bleda, Marta; Rueda, Antonio; Gräf, Stefan; Rendon, Augusto; Dopazo, Joaquin; Medina, Ignacio

    2017-07-03

    High-profile genomic variation projects like the 1000 Genomes project or the Exome Aggregation Consortium, are generating a wealth of human genomic variation knowledge which can be used as an essential reference for identifying disease-causing genotypes. However, accessing these data, contrasting the various studies and integrating those data in downstream analyses remains cumbersome. The Human Genome Variation Archive (HGVA) tackles these challenges and facilitates access to genomic data for key reference projects in a clean, fast and integrated fashion. HGVA provides an efficient and intuitive web-interface for easy data mining, a comprehensive RESTful API and client libraries in Python, Java and JavaScript for fast programmatic access to its knowledge base. HGVA calculates population frequencies for these projects and enriches their data with variant annotation provided by CellBase, a rich and fast annotation solution. HGVA serves as a proof-of-concept of the genome analysis developments being carried out by the University of Cambridge together with UK's 100 000 genomes project and the National Institute for Health Research BioResource Rare-Diseases, in particular, deploying open-source for Computational Biology (OpenCB) software platform for storing and analyzing massive genomic datasets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Papillomavirus genomes in human cervical carcinoma: Analysis of their integration and transcriptional activity

    International Nuclear Information System (INIS)

    Matulic, M.; Soric, J.

    1994-01-01

    Eighty-four biopsies derived from cervical tissues were analyzed for the presence of human papillomavirus (HPV) DNA types 6, 16 and 18 using Southern blot hybridization. HPV 6 was found in none of the cervical biopsies, and HPV types 16 and 18 were found in 44% of them. The rate of HPV 16/18 positive samples increased proportionally to the severity of the lesion. In normal tissue there were no positive samples, in mild and moderate dysplasia HPV 16/18 was present in 20% and in severe dysplasia and invasive carcinomas in 37 and 50%, respectively. In biopsies from 13 cases with squamous cell carcinoma of the uterine cervix and CIN III lesions HPV 16 was integrated within the host genome. It was concluded that the virus could be integrated at variable, presumably randomly selected chromosomal loci and with different number of copies. Transcription of HPV 16 and 18 was detected in one cervical cancer in HeLa cells, respectively. These results imply that HPV types 16 and 18 play an etiological role in the carcinogenesis of human cervical epithelial cells. (author)

  6. Whole Genome Characterization, Phylogenetic and Genome Signature Analysis of Human Pandemic H1N1 Virus in Thailand, 2009–2012

    Science.gov (United States)

    Makkoch, Jarika; Suwannakarn, Kamol; Payungporn, Sunchai; Prachayangprecha, Slinporn; Cheiocharnsin, Thaweesak; Linsuwanon, Piyada; Theamboonlers, Apiradee; Poovorawan, Yong

    2012-01-01

    Background Three waves of human pandemic influenza occurred in Thailand in 2009–2012. The genome signature features and evolution of pH1N1 need to be characterized to elucidate the aspects responsible for the multiple waves of pandemic. Methodology/Findings Forty whole genome sequences and 584 partial sequences of pH1N1 circulating in Thailand, divided into 1st, 2nd and 3rd wave and post-pandemic were characterized and 77 genome signatures were analyzed. Phylogenetic trees of concatenated whole genome and HA gene sequences were constructed calculating substitution rate and dN/dS of each gene. Phylogenetic analysis showed a distinct pattern of pH1N1 circulation in Thailand, with the first two isolates from May, 2009 belonging to clade 5 while clades 5, 6 and 7 co-circulated during the first wave of pH1N1 pandemic in Thailand. Clade 8 predominated during the second wave and different proportions of the pH1N1 viruses circulating during the third wave and post pandemic period belonged to clades 8, 11.1 and 11.2. The mutation analysis of pH1N1 revealed many adaptive mutations which have become the signature of each clade and may be responsible for the multiple pandemic waves in Thailand, especially with regard to clades 11.1 and 11.2 as evidenced with V731I, G154D of PB1 gene, PA I330V, HA A214T S160G and S202T. The substitution rate of pH1N1 in Thailand ranged from 2.53×10−3±0.02 (M2 genes) to 5.27×10−3±0.03 per site per year (NA gene). Conclusions All results suggested that this virus is still adaptive, maybe to evade the host's immune response and tends to remain in the human host although the dN/dS were under purifying selection in all 8 genes. Due to the gradual evolution of pH1N1 in Thailand, continuous monitoring is essential for evaluation and surveillance to be prepared for and able to control future influenza activities. PMID:23251479

  7. De novo assembly of a haplotype-resolved human genome.

    Science.gov (United States)

    Cao, Hongzhi; Wu, Honglong; Luo, Ruibang; Huang, Shujia; Sun, Yuhui; Tong, Xin; Xie, Yinlong; Liu, Binghang; Yang, Hailong; Zheng, Hancheng; Li, Jian; Li, Bo; Wang, Yu; Yang, Fang; Sun, Peng; Liu, Siyang; Gao, Peng; Huang, Haodong; Sun, Jing; Chen, Dan; He, Guangzhu; Huang, Weihua; Huang, Zheng; Li, Yue; Tellier, Laurent C A M; Liu, Xiao; Feng, Qiang; Xu, Xun; Zhang, Xiuqing; Bolund, Lars; Krogh, Anders; Kristiansen, Karsten; Drmanac, Radoje; Drmanac, Snezana; Nielsen, Rasmus; Li, Songgang; Wang, Jian; Yang, Huanming; Li, Yingrui; Wong, Gane Ka-Shu; Wang, Jun

    2015-06-01

    The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.

  8. Analysing human genomes at different scales

    DEFF Research Database (Denmark)

    Liu, Siyang

    The thriving of the Next-Generation sequencing (NGS) technologies in the past decade has dramatically revolutionized the field of human genetics. We are experiencing a wave of several large-scale whole genome sequencing studies of humans in the world. Those studies vary greatly regarding cohort...... will be reflected by the analysis of real data. This thesis covers studies in two human genome sequencing projects that distinctly differ in terms of studied population, sample size and sequencing depth. In the first project, we sequenced 150 Danish individuals from 50 trio families to 78x coverage....... The sophisticated experimental design enables high-quality de novo assembly of the genomes and provides a good opportunity for mapping the structural variations in the human population. We developed the AsmVar approach to discover, genotype and characterize the structural variations from the assemblies. Our...

  9. Initial genomics of the human nucleolus.

    Directory of Open Access Journals (Sweden)

    Attila Németh

    2010-03-01

    Full Text Available We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD-localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD-specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture.

  10. Initial Genomics of the Human Nucleolus

    Science.gov (United States)

    Németh, Attila; Conesa, Ana; Santoyo-Lopez, Javier; Medina, Ignacio; Montaner, David; Péterfia, Bálint; Solovei, Irina; Cremer, Thomas; Dopazo, Joaquin; Längst, Gernot

    2010-01-01

    We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs) in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD–localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD–specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture. PMID:20361057

  11. Complete genome sequence analysis of Nocardia brasiliensis HUJEG-1 reveals a saprobic lifestyle and the genes needed for human pathogenesis.

    Science.gov (United States)

    Vera-Cabrera, Lucio; Ortiz-Lopez, Rocio; Elizondo-Gonzalez, Ramiro; Ocampo-Candiani, Jorge

    2013-01-01

    Nocardia brasiliensis is an important etiologic agent of mycetoma. These bacteria live as a saprobe in soil or organic material and enter the tissue via minor trauma. Mycetoma is characterized by tumefaction and the production of fistula and abscesses, with no spontaneous cure. By using mass sequencing, we determined the complete genomic nucleotide sequence of the bacteria. According to our data, the genome is a circular chromosome 9,436,348-bp long with 68% G+C content that encodes 8,414 proteins. We observed orthologs for virulence factors, a higher number of genes involved in lipid biosynthesis and catabolism, and gene clusters for the synthesis of bioactive compounds, such as antibiotics, terpenes, and polyketides. An in silico analysis of the sequence supports the conclusion that the bacteria acquired diverse genes by horizontal transfer from other soil bacteria, even from eukaryotic organisms. The genome composition reflects the evolution of bacteria via the acquisition of a large amount of DNA, which allows it to survive in new ecological niches, including humans.

  12. Complete genome sequence analysis of Nocardia brasiliensis HUJEG-1 reveals a saprobic lifestyle and the genes needed for human pathogenesis.

    Directory of Open Access Journals (Sweden)

    Lucio Vera-Cabrera

    Full Text Available Nocardia brasiliensis is an important etiologic agent of mycetoma. These bacteria live as a saprobe in soil or organic material and enter the tissue via minor trauma. Mycetoma is characterized by tumefaction and the production of fistula and abscesses, with no spontaneous cure. By using mass sequencing, we determined the complete genomic nucleotide sequence of the bacteria. According to our data, the genome is a circular chromosome 9,436,348-bp long with 68% G+C content that encodes 8,414 proteins. We observed orthologs for virulence factors, a higher number of genes involved in lipid biosynthesis and catabolism, and gene clusters for the synthesis of bioactive compounds, such as antibiotics, terpenes, and polyketides. An in silico analysis of the sequence supports the conclusion that the bacteria acquired diverse genes by horizontal transfer from other soil bacteria, even from eukaryotic organisms. The genome composition reflects the evolution of bacteria via the acquisition of a large amount of DNA, which allows it to survive in new ecological niches, including humans.

  13. Genome Editing in Human Pluripotent Stem Cells.

    Science.gov (United States)

    Carlson-Stevermer, Jared; Saha, Krishanu

    2017-01-01

    Genome editing in human pluripotent stem cells (hPSCs) enables the generation of reporter lines and knockout cell lines. Zinc finger nucleases, transcription activator-like effector nucleases (TALENs), and CRISPR/Cas9 technology have recently increased the efficiency of proper gene editing by creating double strand breaks (DSB) at defined sequences in the human genome. These systems typically use plasmids to transiently transcribe nucleases within the cell. Here, we describe the process for preparing hPSCs for transient expression of nucleases via electroporation and subsequent analysis to create genetically modified stem cell lines.

  14. Comparative genomic analysis of pathogenic and probiotic Enterococcus faecalis isolates, and their transcriptional responses to growth in human urine.

    Directory of Open Access Journals (Sweden)

    Heidi C Vebø

    Full Text Available Urinary tract infection (UTI is the most common infection caused by enterococci, and Enterococcus faecalis accounts for the majority of enterococcal infections. Although a number of virulence related traits have been established, no comprehensive genomic or transcriptomic studies have been conducted to investigate how to distinguish pathogenic from non-pathogenic E. faecalis in their ability to cause UTI. In order to identify potential genetic traits or gene regulatory features that distinguish pathogenic from non-pathogenic E. faecalis with respect to UTI, we have performed comparative genomic analysis, and investigated growth capacity and transcriptome profiling in human urine in vitro. Six strains of different origins were cultivated and all grew readily in human urine. The three strains chosen for transcriptional analysis showed an overall similar response with respect to energy and nitrogen metabolism, stress mechanism, cell envelope modifications, and trace metal acquisition. Our results suggest that citrate and aspartate are significant for growth of E. faecalis in human urine, and manganese appear to be a limiting factor. The majority of virulence factors were either not differentially regulated or down-regulated. Notably, a significant up-regulation of genes involved in biofilm formation was observed. Strains from different origins have similar capacity to grow in human urine. The overall similar transcriptional responses between the two pathogenic and the probiotic strain suggest that the pathogenic potential of a certain E. faecalis strain may to a great extent be determined by presence of fitness and virulence factors, rather than the level of expression of such traits.

  15. Human genomics projects and precision medicine.

    Science.gov (United States)

    Carrasco-Ramiro, F; Peiró-Pastor, R; Aguado, B

    2017-09-01

    The completion of the Human Genome Project (HGP) in 2001 opened the floodgates to a deeper understanding of medicine. There are dozens of HGP-like projects which involve from a few tens to several million genomes currently in progress, which vary from having specialized goals or a more general approach. However, data generation, storage, management and analysis in public and private cloud computing platforms have raised concerns about privacy and security. The knowledge gained from further research has changed the field of genomics and is now slowly permeating into clinical medicine. The new precision (personalized) medicine, where genome sequencing and data analysis are essential components, allows tailored diagnosis and treatment according to the information from the patient's own genome and specific environmental factors. P4 (predictive, preventive, personalized and participatory) medicine is introducing new concepts, challenges and opportunities. This review summarizes current sequencing technologies, concentrates on ongoing human genomics projects, and provides some examples in which precision medicine has already demonstrated clinical impact in diagnosis and/or treatment.

  16. Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA

    International Nuclear Information System (INIS)

    Indik, Z.; Yeh, H.; Ornstein-goldstein, N.; Sheppard, P.; Anderson, N.; Rosenbloom, J.C.; Peltonen, L.; Rosenbloom, J.

    1987-01-01

    Poly(A) + RNA, isolated from a single 7-mo fetal human aorta, was used to synthesize cDNA by the RNase H method, and the cDNA was inserted into λgt10. Recombinant phage containing elastin sequences were identified by hybridization with cloned, exon-containing fragments of the human elastin gene. Three clones containing inserts of 3.3, 2.7, and 2.3 kilobases were selected for further analysis. Three overlapping clones containing 17.8 kilobases of the human elastin gene were also isolated from genomic libraries. Complete sequence analysis of the six clones demonstrated that: (i) the cDNA encompassed the entire translated portion of the mRNA encoding 786 amino acids, including several unusual hydrophilic amino acid sequences not previously identified in porcine tropoelastin, (ii) exons encoding either hydrophobic or crosslinking domains in the protein alternated in the gene, and (iii) a great abundance of Alu repetitive sequences occurred throughout the introns. The data also indicated substantial alternative splicing of the mRNA. These results suggest the potential for significant variation in the precise molecular structure of the elastic fiber in the human population

  17. Whole-Genome Expression Analysis of Human Mesenchymal Stromal Cells Exposed to Ultrasmooth Tantalum vs. Titanium Oxide Surfaces

    DEFF Research Database (Denmark)

    Stiehler, C.; Bunger, C.; Overall, R. W.

    2013-01-01

    to titanium (Ti) surface. The aim of this study was to extend the previous investigation of biocompatibility by monitoring temporal gene expression of MSCs on topographically comparable smooth Ta and Ti surfaces using whole-genome gene expression analysis. Total RNA samples from telomerase-immortalized human...... MSCs cultivated on plain sputter-coated surfaces of Ta or Ti for 1, 2, 4, and 8 days were hybridized to n = 16 U133 Plus 2.0 arrays (Affymetrix(A (R))). Functional annotation, cluster and pathway analyses were performed. The vast majority of genes were differentially regulated after 4 days...... of cultivation and genes upregulated by MSCs exposed to Ta and Ti were predominantly related to the processes of differentiation and transcription, respectively. Functional annotation analysis of the 1,000 temporally most significantly regulated genes suggests earlier cellular differentiation on Ta compared...

  18. A combined analysis of genome-wide expression profiling of bipolar disorder in human prefrontal cortex.

    Science.gov (United States)

    Wang, Jinglu; Qu, Susu; Wang, Weixiao; Guo, Liyuan; Zhang, Kunlin; Chang, Suhua; Wang, Jing

    2016-11-01

    Numbers of gene expression profiling studies of bipolar disorder have been published. Besides different array chips and tissues, variety of the data processes in different cohorts aggravated the inconsistency of results of these genome-wide gene expression profiling studies. By searching the gene expression databases, we obtained six data sets for prefrontal cortex (PFC) of bipolar disorder with raw data and combinable platforms. We used standardized pre-processing and quality control procedures to analyze each data set separately and then combined them into a large gene expression matrix with 101 bipolar disorder subjects and 106 controls. A standard linear mixed-effects model was used to calculate the differentially expressed genes (DEGs). Multiple levels of sensitivity analyses and cross validation with genetic data were conducted. Functional and network analyses were carried out on basis of the DEGs. In the result, we identified 198 unique differentially expressed genes in the PFC of bipolar disorder and control. Among them, 115 DEGs were robust to at least three leave-one-out tests or different pre-processing methods; 51 DEGs were validated with genetic association signals. Pathway enrichment analysis showed these DEGs were related with regulation of neurological system, cell death and apoptosis, and several basic binding processes. Protein-protein interaction network further identified one key hub gene. We have contributed the most comprehensive integrated analysis of bipolar disorder expression profiling studies in PFC to date. The DEGs, especially those with multiple validations, may denote a common signature of bipolar disorder and contribute to the pathogenesis of disease. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Genomics and the human genome project: implications for psychiatry

    OpenAIRE

    Kelsoe, J R

    2004-01-01

    In the past decade the Human Genome Project has made extraordinary strides in understanding of fundamental human genetics. The complete human genetic sequence has been determined, and the chromosomal location of almost all human genes identified. Presently, a large international consortium, the HapMap Project, is working to identify a large portion of genetic variation in different human populations and the structure and relationship of these variants to each other. The Human Genome Project h...

  20. Chromosome region-specific libraries for human genome analysis. Final progress report, 1 March 1991--28 February 1994

    Energy Technology Data Exchange (ETDEWEB)

    Kao, F.T.

    1994-04-01

    The objectives of this grant proposal include (1) development of a chromosome microdissection and PCR-mediated microcloning technology, (2) application of this microtechnology to the construction of region-specific libraries for human genome analysis. During this grant period, the authors have successfully developed this microtechnology and have applied it to the construction of microdissection libraries for the following chromosome regions: a whole chromosome 21 (21E), 2 region-specific libraries for the long arm of chromosome 2, 2q35-q37 (2Q1) and 2q33-q35 (2Q2), and 4 region-specific libraries for the entire short arm of chromosome 2, 2p23-p25 (2P1), 2p21-p23 (2P2), 2p14-p16 (wP3) and 2p11-p13 (2P4). In addition, 20--40 unique sequence microclones have been isolated and characterized for genomic studies. These region-specific libraries and the single-copy microclones from the library have been used as valuable resources for (1) isolating microsatellite probes in linkage analysis to further refine the disease locus; (2) isolating corresponding clones with large inserts, e.g. YAC, BAC, P1, cosmid and phage, to facilitate construction of contigs for high resolution physical mapping; and (3) isolating region-specific cDNA clones for use as candidate genes. These libraries are being deposited in the American Type Culture Collection (ATCC) for general distribution.

  1. Identifying cis-mediators for trans-eQTLs across many human tissues using genomic mediation analysis.

    Science.gov (United States)

    Yang, Fan; Wang, Jiebiao; Pierce, Brandon L; Chen, Lin S

    2017-11-01

    The impact of inherited genetic variation on gene expression in humans is well-established. The majority of known expression quantitative trait loci (eQTLs) impact expression of local genes ( cis -eQTLs). More research is needed to identify effects of genetic variation on distant genes ( trans -eQTLs) and understand their biological mechanisms. One common trans -eQTLs mechanism is "mediation" by a local ( cis ) transcript. Thus, mediation analysis can be applied to genome-wide SNP and expression data in order to identify transcripts that are " cis -mediators" of trans -eQTLs, including those " cis -hubs" involved in regulation of many trans -genes. Identifying such mediators helps us understand regulatory networks and suggests biological mechanisms underlying trans -eQTLs, both of which are relevant for understanding susceptibility to complex diseases. The multitissue expression data from the Genotype-Tissue Expression (GTEx) program provides a unique opportunity to study cis -mediation across human tissue types. However, the presence of complex hidden confounding effects in biological systems can make mediation analyses challenging and prone to confounding bias, particularly when conducted among diverse samples. To address this problem, we propose a new method: Genomic Mediation analysis with Adaptive Confounding adjustment (GMAC). It enables the search of a very large pool of variables, and adaptively selects potential confounding variables for each mediation test. Analyses of simulated data and GTEx data demonstrate that the adaptive selection of confounders by GMAC improves the power and precision of mediation analysis. Application of GMAC to GTEx data provides new insights into the observed patterns of cis -hubs and trans -eQTL regulation across tissue types. © 2017 Yang et al.; Published by Cold Spring Harbor Laboratory Press.

  2. HGVA: the Human Genome Variation Archive

    OpenAIRE

    Lopez, Javier; Coll, Jacobo; Haimel, Matthias; Kandasamy, Swaathi; Tarraga, Joaquin; Furio-Tari, Pedro; Bari, Wasim; Bleda, Marta; Rueda, Antonio; Gr?f, Stefan; Rendon, Augusto; Dopazo, Joaquin; Medina, Ignacio

    2017-01-01

    Abstract High-profile genomic variation projects like the 1000 Genomes project or the Exome Aggregation Consortium, are generating a wealth of human genomic variation knowledge which can be used as an essential reference for identifying disease-causing genotypes. However, accessing these data, contrasting the various studies and integrating those data in downstream analyses remains cumbersome. The Human Genome Variation Archive (HGVA) tackles these challenges and facilitates access to genomic...

  3. Genome-wide meta-analysis associates HLA-DQA1/DRB1 and LPA and lifestyle factors with human longevity

    DEFF Research Database (Denmark)

    Joshi, Peter K; Pirastu, Nicola; Kentistou, Katherine A

    2017-01-01

    Genomic analysis of longevity offers the potential to illuminate the biology of human aging. Here, using genome-wide association meta-analysis of 606,059 parents' survival, we discover two regions associated with longevity (HLA-DQA1/DRB1 and LPA). We also validate previous suggestions that APOE...... that an increase of one body mass index unit reduces lifespan by 7 months while 1 year of education adds 11 months to expected lifespan.Variability in human longevity is genetically influenced. Using genetic data of parental lifespan, the authors identify associations at HLA-DQA/DRB1 and LPA and find that genetic...

  4. Genomic analysis of Andamanese provides insights into ancient human migration into Asia and adaptation

    NARCIS (Netherlands)

    Mondal, M.; Casals, F.; Xu, T.; Dall'Olio, G.M.; Pybus, M.; Netea, M.G.; Comas, D.; Laayouni, H.; Li, Q.; Majumder, P.P.; Bertranpetit, J.

    2016-01-01

    To shed light on the peopling of South Asia and the origins of the morphological adaptations found there, we analyzed whole-genome sequences from 10 Andamanese individuals and compared them with sequences for 60 individuals from mainland Indian populations with different ethnic histories and with

  5. "Orphan" retrogenes in the human genome.

    Science.gov (United States)

    Ciomborowska, Joanna; Rosikiewicz, Wojciech; Szklarczyk, Damian; Makałowski, Wojciech; Makałowska, Izabela

    2013-02-01

    Gene duplicates generated via retroposition were long thought to be pseudogenized and consequently decayed. However, a significant number of these genes escaped their evolutionary destiny and evolved into functional genes. Despite multiple studies, the number of functional retrogenes in human and other genomes remains unclear. We performed a comparative analysis of human, chicken, and worm genomes to identify "orphan" retrogenes, that is, retrogenes that have replaced their progenitors. We located 25 such candidates in the human genome. All of these genes were previously known, and the majority has been intensively studied. Despite this, they have never been recognized as retrogenes. Analysis revealed that the phenomenon of replacing parental genes with their retrocopies has been taking place over the entire span of animal evolution. This process was often species specific and contributed to interspecies differences. Surprisingly, these retrogenes, which should evolve in a more relaxed mode, are subject to a very strong purifying selection, which is, on average, two and a half times stronger than other human genes. Also, for retrogenes, they do not show a typical overall tendency for a testis-specific expression. Notably, seven of them are associated with human diseases. Recognizing them as "orphan" retrocopies, which have different regulatory machinery than their parents, is important for any disease studies in model organisms, especially when discoveries made in one species are transferred to humans.

  6. Recurrent DNA inversion rearrangements in the human genome

    DEFF Research Database (Denmark)

    Flores, Margarita; Morales, Lucía; Gonzaga-Jauregui, Claudia

    2007-01-01

    Several lines of evidence suggest that reiterated sequences in the human genome are targets for nonallelic homologous recombination (NAHR), which facilitates genomic rearrangements. We have used a PCR-based approach to identify breakpoint regions of rearranged structures in the human genome...... to human genomic variation is discussed........ In particular, we have identified intrachromosomal identical repeats that are located in reverse orientation, which may lead to chromosomal inversions. A bioinformatic workflow pathway to select appropriate regions for analysis was developed. Three such regions overlapping with known human genes, located...

  7. Genome-wide analysis of PDX1 target genes in human pancreatic progenitors

    Directory of Open Access Journals (Sweden)

    Xianming Wang

    2018-03-01

    Full Text Available Objective: Homozygous loss-of-function mutations in the gene coding for the homeobox transcription factor (TF PDX1 leads to pancreatic agenesis, whereas heterozygous mutations can cause Maturity-Onset Diabetes of the Young 4 (MODY4. Although the function of Pdx1 is well studied in pre-clinical models during insulin-producing β-cell development and homeostasis, it remains elusive how this TF controls human pancreas development by regulating a downstream transcriptional program. Also, comparative studies of PDX1 binding patterns in pancreatic progenitors and adult β-cells have not been conducted so far. Furthermore, many studies reported the association between single nucleotide polymorphisms (SNPs and T2DM, and it has been shown that islet enhancers are enriched in T2DM-associated SNPs. Whether regions, harboring T2DM-associated SNPs are PDX1 bound and active at the pancreatic progenitor stage has not been reported so far. Methods: In this study, we have generated a novel induced pluripotent stem cell (iPSC line that efficiently differentiates into human pancreatic progenitors (PPs. Furthermore, PDX1 and H3K27ac chromatin immunoprecipitation sequencing (ChIP-seq was used to identify PDX1 transcriptional targets and active enhancer and promoter regions. To address potential differences in the function of PDX1 during development and adulthood, we compared PDX1 binding profiles from PPs and adult islets. Moreover, combining ChIP-seq and GWAS meta-analysis data we identified T2DM-associated SNPs in PDX1 binding sites and active chromatin regions. Results: ChIP-seq for PDX1 revealed a total of 8088 PDX1-bound regions that map to 5664 genes in iPSC-derived PPs. The PDX1 target regions include important pancreatic TFs, such as PDX1 itself, RFX6, HNF1B, and MEIS1, which were activated during the differentiation process as revealed by the active chromatin mark H3K27ac and mRNA expression profiling, suggesting that auto-regulatory feedback regulation

  8. The Human Genome Diversity Project

    Energy Technology Data Exchange (ETDEWEB)

    Cavalli-Sforza, L. [Stanford Univ., CA (United States)

    1994-12-31

    The Human Genome Diversity Project (HGD Project) is an international anthropology project that seeks to study the genetic richness of the entire human species. This kind of genetic information can add a unique thread to the tapestry knowledge of humanity. Culture, environment, history, and other factors are often more important, but humanity`s genetic heritage, when analyzed with recent technology, brings another type of evidence for understanding species` past and present. The Project will deepen the understanding of this genetic richness and show both humanity`s diversity and its deep and underlying unity. The HGD Project is still largely in its planning stages, seeking the best ways to reach its goals. The continuing discussions of the Project, throughout the world, should improve the plans for the Project and their implementation. The Project is as global as humanity itself; its implementation will require the kinds of partnerships among different nations and cultures that make the involvement of UNESCO and other international organizations particularly appropriate. The author will briefly discuss the Project`s history, describe the Project, set out the core principles of the Project, and demonstrate how the Project will help combat the scourge of racism.

  9. Sex genes for genomic analysis in human brain: internal controls for comparison of probe level data extraction.

    Directory of Open Access Journals (Sweden)

    Ellis Steven P

    2003-09-01

    Full Text Available Abstract Background Genomic studies of complex tissues pose unique analytical challenges for assessment of data quality, performance of statistical methods used for data extraction, and detection of differentially expressed genes. Ideally, to assess the accuracy of gene expression analysis methods, one needs a set of genes which are known to be differentially expressed in the samples and which can be used as a "gold standard". We introduce the idea of using sex-chromosome genes as an alternative to spiked-in control genes or simulations for assessment of microarray data and analysis methods. Results Expression of sex-chromosome genes were used as true internal biological controls to compare alternate probe-level data extraction algorithms (Microarray Suite 5.0 [MAS5.0], Model Based Expression Index [MBEI] and Robust Multi-array Average [RMA], to assess microarray data quality and to establish some statistical guidelines for analyzing large-scale gene expression. These approaches were implemented on a large new dataset of human brain samples. RMA-generated gene expression values were markedly less variable and more reliable than MAS5.0 and MBEI-derived values. A statistical technique controlling the false discovery rate was applied to adjust for multiple testing, as an alternative to the Bonferroni method, and showed no evidence of false negative results. Fourteen probesets, representing nine Y- and two X-chromosome linked genes, displayed significant sex differences in brain prefrontal cortex gene expression. Conclusion In this study, we have demonstrated the use of sex genes as true biological internal controls for genomic analysis of complex tissues, and suggested analytical guidelines for testing alternate oligonucleotide microarray data extraction protocols and for adjusting multiple statistical analysis of differentially expressed genes. Our results also provided evidence for sex differences in gene expression in the brain prefrontal cortex

  10. Genome-wide analysis identifies 12 loci influencing human reproductive behavior

    DEFF Research Database (Denmark)

    Barban, Nicola; Jansen, Rick; de Vlaming, Ronald

    2016-01-01

    The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the under......The genetic architecture of human reproductive behavior-age at first birth (AFB) and number of children ever born (NEB)-has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified...

  11. All about the Human Genome Project (HGP)

    Science.gov (United States)

    ... Care Genomic Medicine Working Group New Horizons and Research Patient Management Policy and Ethics Issues Quick Links for Patient Care Education All About the Human Genome Project Fact Sheets Genetic Education Resources for ...

  12. From human monocytes to genome-wide binding sites--a protocol for small amounts of blood: monocyte isolation/ChIP-protocol/library amplification/genome wide computational data analysis.

    Directory of Open Access Journals (Sweden)

    Sebastian Weiterer

    Full Text Available Chromatin immunoprecipitation in combination with a genome-wide analysis via high-throughput sequencing is the state of the art method to gain genome-wide representation of histone modification or transcription factor binding profiles. However, chromatin immunoprecipitation analysis in the context of human experimental samples is limited, especially in the case of blood cells. The typically extremely low yields of precipitated DNA are usually not compatible with library amplification for next generation sequencing. We developed a highly reproducible protocol to present a guideline from the first step of isolating monocytes from a blood sample to analyse the distribution of histone modifications in a genome-wide manner.The protocol describes the whole work flow from isolating monocytes from human blood samples followed by a high-sensitivity and small-scale chromatin immunoprecipitation assay with guidance for generating libraries compatible with next generation sequencing from small amounts of immunoprecipitated DNA.

  13. Genome-wide analysis identifies 12 loci influencing human reproductive behavior

    NARCIS (Netherlands)

    Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J; Tropf, Felix C; Shen, Xia; Wilson, James F; Chasman, Daniel I; Nolte, Ilja M; Tragante, Vinicius; van der Laan, Sander W; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J; Gieger, Christian; Gunderson, Erica P; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F; McMahon, George; Meddens, S Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A; Monnereau, Claire; van der Most, Peter J; Myhre, Ronny; Nalls, Mike A; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B; Rich-Edwards, Janet; Rietveld, Cornelius A; Robino, Antonietta; Rose, Lynda M; Rueedi, Rico; Ryan, Kathleen A; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I; Buring, Julie E; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M; de Geus, Eco J C; Eriksson, Johan G; Evans, Denis A; Faul, Jessica D; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; de Haan, Hugoline G; Haerting, Johannes; Harris, Tamara B; Heath, Andrew C; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia M; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; McQuillan, Ruth; Medland, Sarah E; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M; Ring, Susan M; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D; Starr, John M; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tung, Joyce Y; Uitterlinden, André G; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G; Wang, Jie Jin; Wareham, Nicholas J; Weir, David R; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F; Zondervan, Krina T; Stefansson, Kari; Krueger, Robert F; Lee, James J; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C

    2016-01-01

    The genetic architecture of human reproductive behavior age at first birth (AFB) and number of children ever born (NEB) has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the

  14. Genome-wide analysis identifies 12 loci influencing human reproductive behavior

    NARCIS (Netherlands)

    Barban, Nicola; Jansen, Rick; De Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; Van Der Laan, Sander W.; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S.; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E.; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S. Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; Van Der Most, Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathleen A.; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; Van Duijn, Cornelia M.; De Geus, Eco J C; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; De Haan, Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G.; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G.; Lai, Sandra; Lehtimäki, Terho; Liewald, David C.; Lindgren, Cecilia M.; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrikke; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; De Mutsert, Renée; Nohr, Ellen A.; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Roy Thurik, A.; Timpson, Nicholas J.; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; Den Hoed, Marcel; Snieder, Harold; Mills, Melinda C.

    2016-01-01

    The genetic architecture of human reproductive behavior - age at first birth (AFB) and number of children ever born (NEB) - has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the

  15. Genome-wide analysis identifies 12 loci influencing human reproductive behavior

    NARCIS (Netherlands)

    Barban, Nicola; Jansen, Rick; Vlaming, de Ronald; Vaez, Ahmad; Mandemakers, Jornt J.; Tropf, Felix C.; Shen, Xia; Wilson, James F.; Chasman, Daniel I.; Nolte, Ilja M.; Tragante, Vinicius; Laan, van der Sander W.; Perry, John R.B.; Kong, Augustine; Ahluwalia, Tarunveer S.; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F.; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E.; Turman, Constance; Concas, Maria Pina; Cordell, Heather J.; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F.; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J.; Gieger, Christian; Gunderson, Erica P.; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K.; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A.; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F.; McMahon, George; Meddens, S.F.W.; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A.; Monnereau, Claire; Most, van der Peter J.; Myhre, Ronny; Nalls, Mike A.; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B.; Rich-Edwards, Janet; Rietveld, Cornelius A.; Robino, Antonietta; Rose, Lynda M.; Rueedi, Rico; Ryan, Kathleen A.; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A.; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V.; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I.; Buring, Julie E.; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R.; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J.; Dedoussis, George; Deloukas, Panos; Duijn, van Cornelia M.; Geus, de Eco J.C.; Eriksson, Johan G.; Evans, Denis A.; Faul, Jessica D.; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J.F.; Haan, de Hugoline G.; Haerting, Johannes; Harris, Tamara B.; Heath, Andrew C.; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G.; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W.V.; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L.R.; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; Bianca, la Martina; Lachance, Genevieve; Iacono, William G.; Lai, Sandra; Lehtimäki, Terho; Liewald, David C.; Lindgren, Cecilia M.; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K.E.; Martin, Nicholas G.; McGue, Matt; McQuillan, Ruth; Medland, Sarah E.; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W.; Mook-Kanamori, Dennis; Mutsert, de Renée; Nohr, Ellen A.; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K.; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W.J.H.; Perola, Markus; Peyser, Patricia A.; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J.; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M.; Ring, Susan M.; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D.; Starr, John M.; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A.; Thorsteinsdottir, Unnur; Thurik, A.R.; Timpson, Nicholas J.; Tung, Joyce Y.; Uitterlinden, André G.; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G.; Wang, Jie Jin; Wareham, Nicholas J.; Weir, David R.; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F.; Zondervan, Krina T.; Stefansson, Kari; Krueger, Robert F.; Lee, James J.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.; Hoed, den Marcel; Snieder, Harold; Mills, Melinda C.

    2016-01-01

    The genetic architecture of human reproductive behavior—age at first birth (AFB) and number of children ever born (NEB)—has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the

  16. Three genome-wide association studies and a linkage analysis identify HERC2 as a human iris color gene

    NARCIS (Netherlands)

    Kayser, Manfred; Liu, Fan; Janssens, A. Cecile J. W.; Rivadeneira, Fernando; Lao, Oscar; van Duijn, Kate; Vermeulen, Mark; Arp, Pascal; Jhamai, Mila M.; van Ijcken, Wilfred F. J.; den Dunnen, Johan T.; Heath, Simon; Zelenika, Diana; Despriet, Dominiek D. G.; Klaver, Caroline C. W.; Vingerling, Johannes R.; de Jong, Paulus T. V. M.; Hofman, Albert; Aulchenko, Yurii S.; Uitterlinden, Andre G.; Oostra, Ben A.; van Duijn, Cornelia M.

    2008-01-01

    Human iris color was one of the first traits for which Mendelian segregation was established. To date, the genetics of iris color is still not fully understood and is of interest, particularly in view of forensic applications. In three independent genome-wide association (GWA) studies of a total of

  17. Comparative Genomics Analysis of Streptococcus Isolates from the Human Small Intestine Reveals their Adaptation to a Highly Dynamic Ecosystem

    NARCIS (Netherlands)

    Bogert, van den B.; Boekhorst, te J.; Herrmann, R.; Smid, E.J.; Zoetendal, E.G.; Kleerebezem, M.

    2013-01-01

    The human small-intestinal microbiota is characterised by relatively large and dynamic Streptococcus populations. In this study, genome sequences of small-intestinal streptococci from S. mitis, S. bovis, and S. salivarius species-groups were determined and compared with those from 58 Streptococcus

  18. Genome-wide analysis reveals loci encoding anti-macrophage factors in the human pathogen Burkholderia pseudomallei K96243.

    Directory of Open Access Journals (Sweden)

    Andrea J Dowling

    2010-12-01

    Full Text Available Burkholderia pseudomallei is an important human pathogen whose infection biology is still poorly understood. The bacterium is endemic to tropical regions, including South East Asia and Northern Australia, where it causes melioidosis, a serious disease associated with both high mortality and antibiotic resistance. B. pseudomallei is a Gram-negative facultative intracellular pathogen that is able to replicate in macrophages. However despite the critical nature of its interaction with macrophages, few anti-macrophage factors have been characterized to date. Here we perform a genome-wide gain of function screen of B. pseudomallei strain K96243 to identify loci encoding factors with anti-macrophage activity. We identify a total of 113 such loci scattered across both chromosomes, with positive gene clusters encoding transporters and secretion systems, enzymes/toxins, secondary metabolite, biofilm, adhesion and signal response related factors. Further phenotypic analysis of four of these regions shows that the encoded factors cause striking cellular phenotypes relevant to infection biology, including apoptosis, formation of actin 'tails' and multi-nucleation within treated macrophages. The detailed analysis of the remaining host of loci will facilitate genetic dissection of the interaction of this important pathogen with host macrophages and thus further elucidate this critical part of its infection cycle.

  19. Human Contamination in Public Genome Assemblies.

    Science.gov (United States)

    Kryukov, Kirill; Imanishi, Tadashi

    2016-01-01

    Contamination in genome assembly can lead to wrong or confusing results when using such genome as reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that with high confidence originate as contamination from human DNA. Majority of contaminating human sequences were present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes should be revised to remove contaminated sequence, and that new assemblies should be thoroughly checked for presence of human DNA before submitting them to public databases.

  20. The bonobo genome compared with the chimpanzee and human genomes

    Science.gov (United States)

    Prüfer, Kay; Munch, Kasper; Hellmann, Ines; Akagi, Keiko; Miller, Jason R.; Walenz, Brian; Koren, Sergey; Sutton, Granger; Kodira, Chinnappa; Winer, Roger; Knight, James R.; Mullikin, James C.; Meader, Stephen J.; Ponting, Chris P.; Lunter, Gerton; Higashino, Saneyuki; Hobolth, Asger; Dutheil, Julien; Karakoç, Emre; Alkan, Can; Sajjadian, Saba; Catacchio, Claudia Rita; Ventura, Mario; Marques-Bonet, Tomas; Eichler, Evan E.; André, Claudine; Atencia, Rebeca; Mugisha, Lawrence; Junhold, Jörg; Patterson, Nick; Siebauer, Michael; Good, Jeffrey M.; Fischer, Anne; Ptak, Susan E.; Lachmann, Michael; Symer, David E.; Mailund, Thomas; Schierup, Mikkel H.; Andrés, Aida M.; Kelso, Janet; Pääbo, Svante

    2012-01-01

    Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours1–4, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other. PMID:22722832

  1. An overview of the human genome project

    Energy Technology Data Exchange (ETDEWEB)

    Batzer, M.A.

    1994-01-01

    The human genome project is one of the most ambitious scientific projects to date, with the ultimate goal being a nucleotide sequence for all four billion bases of human DNA. In the process of determining the nucleotide sequence for each base, the location, function, and regulatory regions from the estimated 100,000 human genes will be identified. The genome project itself relies upon maps of the human genetic code derived from several different levels of resolution. Genetic linkage analysis provides a low resolution genome map. The information for genetic linkage maps is derived from the analysis of chromosome specific markers such as Sequence Tagged Sites (STSs), Variable Number of Tandem Repeats (VNTRs) or other polymorphic (highly informative) loci in a number of different-families. Using this information the location of an unknown disease gene can be limited to a region comprised of one million base pairs of DNA or less. After this point, one must construct or have access to a physical map of the region of interest. Physical mapping involves the construction of an ordered overlapping (contiguous) set of recombinant DNA clones. These clones may be derived from a number of different vectors including cosmids, Bacterial Artificial Chromosomes (BACs), P1 derived Artificial Chromosomes (PACs), somatic cell hybrids, or Yeast Artificial Chromosomes (YACs). The ultimate goal for physical mapping is to establish a completely overlapping (contiguous) set of clones for the entire genome. After a gene or region of interest has been localized using physical mapping the nucleotide sequence is determined. The overlap between genetic mapping, physical mapping and DNA sequencing has proven to be a powerful tool for the isolation of disease genes through positional cloning.

  2. ChIP on SNP-chip for genome-wide analysis of human histone H4 hyperacetylation

    Directory of Open Access Journals (Sweden)

    Porter Christopher J

    2007-09-01

    Full Text Available Abstract Background SNP microarrays are designed to genotype Single Nucleotide Polymorphisms (SNPs. These microarrays report hybridization of DNA fragments and therefore can be used for the purpose of detecting genomic fragments. Results Here, we demonstrate that a SNP microarray can be effectively used in this way to perform chromatin immunoprecipitation (ChIP on chip as an alternative to tiling microarrays. We illustrate this novel application by mapping whole genome histone H4 hyperacetylation in human myoblasts and myotubes. We detect clusters of hyperacetylated histone H4, often spanning across up to 300 kilobases of genomic sequence. Using complementary genome-wide analyses of gene expression by DNA microarray we demonstrate that these clusters of hyperacetylated histone H4 tend to be associated with expressed genes. Conclusion The use of a SNP array for a ChIP-on-chip application (ChIP on SNP-chip will be of great value to laboratories whose interest is the determination of general rules regarding the relationship of specific chromatin modifications to transcriptional status throughout the genome and to examine the asymmetric modification of chromatin at heterozygous loci.

  3. Disturbance of gene expression in primary human hepatocytes by hepatotoxic pyrrolizidine alkaloids: A whole genome transcriptome analysis.

    Science.gov (United States)

    Luckert, Claudia; Hessel, Stefanie; Lenze, Dido; Lampen, Alfonso

    2015-10-01

    1,2-unsaturated pyrrolizidine alkaloids (PA) are plant metabolites predominantly occurring in the plant families Asteraceae and Boraginaceae. Acute and chronic PA poisoning causes severe hepatotoxicity. So far, the molecular mechanisms of PA toxicity are not well understood. To analyze its mode of action, primary human hepatocytes were exposed to a non-cytotoxic dose of 100 μM of four structurally different PA: echimidine, heliotrine, senecionine, senkirkine. Changes in mRNA expression were analyzed by a whole genome microarray. Employing cut-off values with a |fold change| of 2 and a q-value of 0.01, data analysis revealed numerous changes in gene expression. In total, 4556, 1806, 3406 and 8623 genes were regulated by echimidine, heliotrine, senecione and senkirkine, respectively. 1304 genes were identified as commonly regulated. PA affected pathways related to cell cycle regulation, cell death and cancer development. The transcription factors TP53, MYC, NFκB and NUPR1 were predicted to be activated upon PA treatment. Furthermore, gene expression data showed a considerable interference with lipid metabolism and bile acid flow. The associated transcription factors FXR, LXR, SREBF1/2, and PPARα/γ/δ were predicted to be inhibited. In conclusion, though structurally different, all four PA significantly regulated a great number of genes in common. This proposes similar molecular mechanisms, although the extent seems to differ between the analyzed PA as reflected by the potential hepatotoxicity and individual PA structure. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Genome-wide analysis of pain-, nerve- and neurotrophin -related gene expression in the degenerating human annulus

    Science.gov (United States)

    2012-01-01

    Background In spite of its high clinical relevance, the relationship between disc degeneration and low back pain is still not well understood. Recent studies have shown that genome-wide gene expression studies utilizing ontology searches provide an efficient and valuable methodology for identification of clinically relevant genes. Here we use this approach in analysis of pain-, nerve-, and neurotrophin-related gene expression patterns in specimens of human disc tissue. Control, non-herniated clinical, and herniated clinical specimens of human annulus tissue were studied following Institutional Review Board approval. Results Analyses were performed on more generated (Thompson grade IV and V) discs vs. less degenerated discs (grades I-III), on surgically operated discs vs. control discs, and on herniated vs. control discs. Analyses of more degenerated vs. less degenerated discs identified significant upregulation of well-recognized pain-related genes (bradykinin receptor B1, calcitonin gene-related peptide and catechol-0-methyltransferase). Nerve growth factor was significantly upregulated in surgical vs. control and in herniated vs. control discs. All three analyses also found significant changes in numerous proinflammatory cytokine- and chemokine-related genes. Nerve, neurotrophin and pain-ontology searches identified many matrix, signaling and functional genes which have known importance in the disc. Immunohistochemistry was utilized to confirm the presence of calcitonin gene-related peptide, catechol-0-methyltransferase and bradykinin receptor B1 at the protein level in the human annulus. Conclusions Findings point to the utility of microarray analyses in identification of pain-, neurotrophin and nerve-related genes in the disc, and point to the importance of future work exploring functional interactions between nerve and disc cells in vitro and in vivo. Nerve, pain and neurotrophin ontology searches identified numerous changes in proinflammatory cytokines and

  5. The missing indels: an estimate of indel variation in a human genome and analysis of factors that impede detection

    Science.gov (United States)

    Jiang, Yue; Turinsky, Andrei L.; Brudno, Michael

    2015-01-01

    With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them. PMID:26130710

  6. Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data

    Directory of Open Access Journals (Sweden)

    Teng Shaolei

    2013-01-01

    Full Text Available Abstract Background Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. The accurate predictions of tissue-specific gene targets could provide useful information for biomarker development and drug target identification. Results In this study, we have developed a machine learning approach for predicting the human tissue-specific genes using microarray expression data. The lists of known tissue-specific genes for different tissues were collected from UniProt database, and the expression data retrieved from the previously compiled dataset according to the lists were used for input vector encoding. Random Forests (RFs and Support Vector Machines (SVMs were used to construct accurate classifiers. The RF classifiers were found to outperform SVM models for tissue-specific gene prediction. The results suggest that the candidate genes for brain or liver specific expression can provide valuable information for further experimental studies. Our approach was also applied for identifying tissue-selective gene targets for different types of tissues. Conclusions A machine learning approach has been developed for accurately identifying the candidate genes for tissue specific/selective expression. The approach provides an efficient way to select some interesting genes for developing new biomedical markers and improve our knowledge of tissue-specific expression.

  7. Genome content analysis yields new insights into the relationship between the human malaria parasite Plasmodium falciparum and its anopheline vectors.

    Science.gov (United States)

    Oppenheim, Sara J; Rosenfeld, Jeffrey A; DeSalle, Rob

    2017-02-27

    The persistent and growing gap between the availability of sequenced genomes and the ability to assign functions to sequenced genes led us to explore ways to maximize the information content of automated annotation for studies of anopheline mosquitos. Specifically, we use genome content analysis of a large number of previously sequenced anopheline mosquitos to follow the loss and gain of protein families over the evolutionary history of this group. The importance of this endeavor lies in the potential for comparative genomic studies between Anopheles and closely related non-vector species to reveal ancestral genome content dynamics involved in vector competence. In addition, comparisons within Anopheles could identify genome content changes responsible for variation in the vectorial capacity of this family of important parasite vectors. The competence and capacity of P. falciparum vectors do not appear to be phylogenetically constrained within the Anophelinae. Instead, using ancestral reconstruction methods, we suggest that a previously unexamined component of vector biology, anopheline nucleotide metabolism, may contribute to the unique status of anophelines as P. falciparum vectors. While the fitness effects of nucleotide co-option by P. falciparum parasites on their anopheline hosts are not yet known, our results suggest that anopheline genome content may be responding to selection pressure from P. falciparum. Whether this response is defensive, in an attempt to redress improper nucleotide balance resulting from P. falciparum infection, or perhaps symbiotic, resulting from an as-yet-unknown mutualism between anophelines and P. falciparum, is an open question that deserves further study. Clearly, there is a wealth of functional information to be gained from detailed manual genome annotation, yet the rapid increase in the number of available sequences means that most researchers will not have the time or resources to manually annotate all the sequence data they

  8. Whole genome sequencing and evolutionary analysis of human respiratory syncytial virus A and B from Milwaukee, WI 1998-2010.

    Directory of Open Access Journals (Sweden)

    Cecilia Rebuffo-Scheer

    Full Text Available BACKGROUND: Respiratory Syncytial Virus (RSV is the leading cause of lower respiratory-tract infections in infants and young children worldwide. Despite this, only six complete genome sequences of original strains have been previously published, the most recent of which dates back 35 and 26 years for RSV group A and group B respectively. METHODOLOGY/PRINCIPAL FINDINGS: We present a semi-automated sequencing method allowing for the sequencing of four RSV whole genomes simultaneously. We were able to sequence the complete coding sequences of 13 RSV A and 4 RSV B strains from Milwaukee collected from 1998-2010. Another 12 RSV A and 5 RSV B strains sequenced in this study cover the majority of the genome. All RSV A and RSV B sequences were analyzed by neighbor-joining, maximum parsimony and Bayesian phylogeny methods. Genetic diversity was high among RSV A viruses in Milwaukee including the circulation of multiple genotypes (GA1, GA2, GA5, GA7 with GA2 persisting throughout the 13 years of the study. However, RSV B genomes showed little variation with all belonging to the BA genotype. For RSV A, the same evolutionary patterns and clades were seen consistently across the whole genome including all intergenic, coding, and non-coding regions sequences. CONCLUSIONS/SIGNIFICANCE: The sequencing strategy presented in this work allows for RSV A and B genomes to be sequenced simultaneously in two working days and with a low cost. We have significantly increased the amount of genomic data that is available for both RSV A and B, providing the basic molecular characteristics of RSV strains circulating in Milwaukee over the last 13 years. This information can be used for comparative analysis with strains circulating in other communities around the world which should also help with the development of new strategies for control of RSV, specifically vaccine development and improvement of RSV diagnostics.

  9. Analysis of the genomic response of human prostate cancer cells to histone deacetylase inhibitors.

    Science.gov (United States)

    Kortenhorst, Madeleine S Q; Wissing, Michel D; Rodríguez, Ronald; Kachhap, Sushant K; Jans, Judith J M; Van der Groep, Petra; Verheul, Henk M W; Gupta, Anuj; Aiyetan, Paul O; van der Wall, Elsken; Carducci, Michael A; Van Diest, Paul J; Marchionni, Luigi

    2013-09-01

    Histone deacetylases (HDACs) have emerged as important targets for cancer treatment. HDAC-inhibitors (HDACis) are well tolerated in patients and have been approved for the treatment of patients with cutaneous T-cell lymphoma (CTCL). To improve the clinical benefit of HDACis in solid tumors, combination strategies with HDACis could be employed. In this study, we applied Analysis of Functional Annotation (AFA) to provide a comprehensive list of genes and pathways affected upon HDACi-treatment in prostate cancer cells. This approach provides an unbiased and objective approach to high throughput data mining. By performing AFA on gene expression data from prostate cancer cell lines DU-145 (an HDACi-sensitive cell line) and PC3 (a relatively HDACi-resistant cell line) treated with HDACis valproic acid or vorinostat, we identified biological processes that are affected by HDACis and are therefore potential treatment targets for combination therapy. Our analysis revealed that HDAC-inhibition resulted among others in upregulation of major histocompatibility complex (MHC) genes and deregulation of the mitotic spindle checkpoint by downregulation of genes involved in mitosis. These findings were confirmed by AFA on publicly available data sets from HDACi-treated prostate cancer cells. In total, we analyzed 375 microarrays with HDACi treated and non-treated (control) prostate cancer cells. All results from this extensive analysis are provided as an online research source (available at the journal's website and at http://luigimarchionni.org/HDACIs.html). By publishing this data, we aim to enhance our understanding of the cellular changes after HDAC-inhibition, and to identify novel potential combination strategies with HDACis for the treatment of prostate cancer patients.

  10. Whole genome transcription profiling of Anaplasma phagocytophilum in human and tick host cells by tiling array analysis

    Directory of Open Access Journals (Sweden)

    Chavez Adela

    2008-07-01

    Full Text Available Abstract Background Anaplasma phagocytophilum (Ap is an obligate intracellular bacterium and the agent of human granulocytic anaplasmosis, an emerging tick-borne disease. Ap alternately infects ticks and mammals and a variety of cell types within each. Understanding the biology behind such versatile cellular parasitism may be derived through the use of tiling microarrays to establish high resolution, genome-wide transcription profiles of the organism as it infects cell lines representative of its life cycle (tick; ISE6 and pathogenesis (human; HL-60 and HMEC-1. Results Detailed, host cell specific transcriptional behavior was revealed. There was extensive differential Ap gene transcription between the tick (ISE6 and the human (HL-60 and HMEC-1 cell lines, with far fewer differentially transcribed genes between the human cell lines, and all disproportionately represented by membrane or surface proteins. There were Ap genes exclusively transcribed in each cell line, apparent human- and tick-specific operons and paralogs, and anti-sense transcripts that suggest novel expression regulation processes. Seven virB2 paralogs (of the bacterial type IV secretion system showed human or tick cell dependent transcription. Previously unrecognized genes and coding sequences were identified, as were the expressed p44/msp2 (major surface proteins paralogs (of 114 total, through elevated signal produced to the unique hypervariable region of each – 2/114 in HL-60, 3/114 in HMEC-1, and none in ISE6. Conclusion Using these methods, whole genome transcription profiles can likely be generated for Ap, as well as other obligate intracellular organisms, in any host cells and for all stages of the cell infection process. Visual representation of comprehensive transcription data alongside an annotated map of the genome renders complex transcription into discernable patterns.

  11. Comparative Genome Analysis of Enterobacter cloacae

    Science.gov (United States)

    Liu, Wing-Yee; Wong, Chi-Fat; Chung, Karl Ming-Kar; Jiang, Jing-Wei; Leung, Frederick Chi-Ching

    2013-01-01

    The Enterobacter cloacae species includes an extremely diverse group of bacteria that are associated with plants, soil and humans. Publication of the complete genome sequence of the plant growth-promoting endophytic E. cloacae subsp. cloacae ENHKU01 provided an opportunity to perform the first comparative genome analysis between strains of this dynamic species. Examination of the pan-genome of E. cloacae showed that the conserved core genome retains the general physiological and survival genes of the species, while genomic factors in plasmids and variable regions determine the virulence of the human pathogenic E. cloacae strain; additionally, the diversity of fimbriae contributes to variation in colonization and host determination of different E. cloacae strains. Comparative genome analysis further illustrated that E. cloacae strains possess multiple mechanisms for antagonistic action against other microorganisms, which involve the production of siderophores and various antimicrobial compounds, such as bacteriocins, chitinases and antibiotic resistance proteins. The presence of Type VI secretion systems is expected to provide further fitness advantages for E. cloacae in microbial competition, thus allowing it to survive in different environments. Competition assays were performed to support our observations in genomic analysis, where E. cloacae subsp. cloacae ENHKU01 demonstrated antagonistic activities against a wide range of plant pathogenic fungal and bacterial species. PMID:24069314

  12. Building the sequence map of the human pan-genome

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Li, Yingrui; Zheng, Hancheng

    2010-01-01

    analysis of predicted genes indicated that the novel sequences contain potentially functional coding regions. We estimate that a complete human pan-genome would contain approximately 19-40 Mb of novel sequence not present in the extant reference genome. The extensive amount of novel sequence contributing...

  13. Tempo and mode of genomic mutations unveil human evolutionary history.

    Science.gov (United States)

    Hara, Yuichiro

    2015-01-01

    Mutations that have occurred in human genomes provide insight into various aspects of evolutionary history such as speciation events and degrees of natural selection. Comparing genome sequences between human and great apes or among humans is a feasible approach for inferring human evolutionary history. Recent advances in high-throughput or so-called 'next-generation' DNA sequencing technologies have enabled the sequencing of thousands of individual human genomes, as well as a variety of reference genomes of hominids, many of which are publicly available. These sequence data can help to unveil the detailed demographic history of the lineage leading to humans as well as the explosion of modern human population size in the last several thousand years. In addition, high-throughput sequencing illustrates the tempo and mode of de novo mutations, which are producing human genetic variation at this moment. Pedigree-based human genome sequencing has shown that mutation rates vary significantly across the human genome. These studies have also provided an improved timescale of human evolution, because the mutation rate estimated from pedigree analysis is half that estimated from traditional analyses based on molecular phylogeny. Because of the dramatic reduction in sequencing cost, sequencing on-demand samples designed for specific studies is now also becoming popular. To produce data of sufficient quality to meet the requirements of the study, it is necessary to set an explicit sequencing plan that includes the choice of sample collection methods, sequencing platforms, and number of sequence reads.

  14. Construction of a genomic library of the human cytomegalovirus genome and analysis of late transcription of its inverted internal repeat region

    International Nuclear Information System (INIS)

    Silva, K.F.S.T.

    1989-01-01

    The investigations described in this dissertation were designed to determine the transcriptionally active DNA sequences of IIR region and to identify the viral mRNA transcribed from the transcriptionally most active DNA sequences of that region during late phase of HCMV Towne infection. Preliminary transcriptional studies which included the hybridization of a southern blot of XbaI digested entire HCMV genome to 32 P-labelled late phase infected cell A + RNA, indicated that late viral transcripts homologous to XbaI Q fragment of IIR region were very highly abundant while XbaI Q fragment showed a very low transcriptional activity. To facilitate further analysis of late transcription of IIR region, the entire DNA sequences of IIR region were molecularly cloned as U, S, and H BamHI fragments in pACYC-184 plasmid vector. In addition, to be used in future studies on other regions of the genome, except for y and c' smaller fragments the entire 240 kb HCMV genome was cloned as BamHI fragments in the same vector. Furthermore, the U, S, and H BamHI fragments were mapped with six other restriction enzymes in order to use that mapping data in subsequent transcriptional analysis of the IIR region. Further localization of transcriptionally active DNA sequences within IIR region was achieved by hybridization of southern blots of restricted U, S, and H BamHI fragments with 3' 32 P-labelled infected cell late A + RNA. The 1.5 kb EcooRI subfragments of S BamHI fragment and the adjoining 0.72 kb XhoI subfragment of H BamHI fragment revealed the highest level of transcription, although the remainder of the S fragment was also transcribed at a substantial level. The U fragment and the remainder of the H fragment was transcribed at a very low level

  15. Explaining human uniqueness: genome interactions with environment, behaviour and culture.

    Science.gov (United States)

    Varki, Ajit; Geschwind, Daniel H; Eichler, Evan E

    2008-10-01

    What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, 'anthropogeny' (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any 'genes versus environment' dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture - perhaps relaxing allowable thresholds for large-scale genomic diversity.

  16. Pan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets

    DEFF Research Database (Denmark)

    Ali, Amjad; Naz, Anam; Soares, Siomar C.

    2015-01-01

    -genome approach; the predicted conserved gene families (1,193) constitute similar to 77% of the average H. pylori genome and 45% of the global gene repertoire of the species. Reverse vaccinology strategies have been adopted to identify and narrow down the potential core-immunogenic candidates. Total of 28 nonhost....... Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are presented in this paper. Phylogenetic analyses have revealed close relationships among geographically diverse strains of H. pylori. The conservation among these genomes was further analyzed by pan...

  17. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute's genomic medicine portfolio.

    Science.gov (United States)

    Manolio, Teri A

    2016-10-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual's genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of "Genomic Medicine Meetings," under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and difficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI's genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. Published by Elsevier Ireland Ltd.

  18. Origins of the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Cook-Deegan, Robert

    1993-07-01

    The human genome project was borne of technology, grew into a science bureaucracy in the US and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information is embedded in the sequence of CNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The coal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.

  19. Origins of the Human Genome Project

    Science.gov (United States)

    Cook-Deegan, Robert (Affiliation: Institute of Medicine, National Academy of Sciences)

    1993-07-01

    The human genome project was borne of technology, grew into a science bureaucracy in the United States and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information is embedded in the sequence of CNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The coal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.

  20. Transposable element activity, genome regulation and human health.

    Science.gov (United States)

    Wang, Lu; Jordan, I King

    2018-03-02

    A convergence of novel genome analysis technologies is enabling population genomic studies of human transposable elements (TEs). Population surveys of human genome sequences have uncovered thousands of individual TE insertions that segregate as common genetic variants, i.e. TE polymorphisms. These recent TE insertions provide an important source of naturally occurring human genetic variation. Investigators are beginning to leverage population genomic data sets to execute genome-scale association studies for assessing the phenotypic impact of human TE polymorphisms. For example, the expression quantitative trait loci (eQTL) analytical paradigm has recently been used to uncover hundreds of associations between human TE insertion variants and gene expression levels. These include population-specific gene regulatory effects as well as coordinated changes to gene regulatory networks. In addition, analyses of linkage disequilibrium patterns with previously characterized genome-wide association study (GWAS) trait variants have uncovered TE insertion polymorphisms that are likely causal variants for a variety of common complex diseases. Gene regulatory mechanisms that underlie specific disease phenotypes have been proposed for a number of these trait associated TE polymorphisms. These new population genomic approaches hold great promise for understanding how ongoing TE activity contributes to functionally relevant genetic variation within and between human populations. Copyright © 2018 Elsevier Ltd. All rights reserved.

  1. Comparative genomics of emerging human ehrlichiosis agents.

    Directory of Open Access Journals (Sweden)

    Julie C Dunning Hotopp

    2006-02-01

    Full Text Available Anaplasma (formerly Ehrlichia phagocytophilum, Ehrlichia chaffeensis, and Neorickettsia (formerly Ehrlichia sennetsu are intracellular vector-borne pathogens that cause human ehrlichiosis, an emerging infectious disease. We present the complete genome sequences of these organisms along with comparisons to other organisms in the Rickettsiales order. Ehrlichia spp. and Anaplasma spp. display a unique large expansion of immunodominant outer membrane proteins facilitating antigenic variation. All Rickettsiales have a diminished ability to synthesize amino acids compared to their closest free-living relatives. Unlike members of the Rickettsiaceae family, these pathogenic Anaplasmataceae are capable of making all major vitamins, cofactors, and nucleotides, which could confer a beneficial role in the invertebrate vector or the vertebrate host. Further analysis identified proteins potentially involved in vacuole confinement of the Anaplasmataceae, a life cycle involving a hematophagous vector, vertebrate pathogenesis, human pathogenesis, and lack of transovarial transmission. These discoveries provide significant insights into the biology of these obligate intracellular pathogens.

  2. Radiation-induced instability of human genome

    International Nuclear Information System (INIS)

    Ryabchenko, N.N.; Demina, Eh.A.

    2014-01-01

    A brief review is dedicated to the phenomenon of radiation-induced genomic instability where the increased level of genomic changes in the offspring of irradiated cells is characteristic. Particular attention is paid to the problems of genomic instability induced by the low-dose radiation, role of the bystander effect in formation of radiation-induced instability, and its relationship with individual radiosensitivity. We believe that in accordance with the paradigm of modern radiobiology the increased human individual radiosensitivity can be formed due to the genome instability onset and is a significant risk factor for radiation-induced cancer

  3. Unexplored therapeutic opportunities in the human genome

    DEFF Research Database (Denmark)

    Oprea, Tudor I; Bologa, Cristian G; Brunak, Søren

    2018-01-01

    A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially d...... as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development....

  4. Body maps on the human genome.

    Science.gov (United States)

    Cherniak, Christopher; Rodriguez-Esteban, Raul

    2013-12-20

    Chromosomes have territories, or preferred locales, in the cell nucleus. When these sites are taken into account, some large-scale structure of the human genome emerges. The synoptic picture is that genes highly expressed in particular topologically compact tissues are not randomly distributed on the genome. Rather, such tissue-specific genes tend to map somatotopically onto the complete chromosome set. They seem to form a "genome homunculus": a multi-dimensional, genome-wide body representation extending across chromosome territories of the entire spermcell nucleus. The antero-posterior axis of the body significantly corresponds to the head-tail axis of the nucleus, and the dorso-ventral body axis to the central-peripheral nucleus axis. This large-scale genomic structure includes thousands of genes. One rationale for a homuncular genome structure would be to minimize connection costs in genetic networks. Somatotopic maps in cerebral cortex have been reported for over a century.

  5. Genomic characterisation of Leptospira inadai serogroup Lyme isolated from captured rat in Brazil and comparative analysis with human reference strain

    Science.gov (United States)

    Moreno, Luisa Z; Miraglia, Fabiana; Loureiro, Ana P; Kremer, Frederico S; Eslabao, Marcus R; Dellagostin, Odir A; Lilenbaum, Walter; Vasconcellos, Silvio A; Heinemann, Marcos B; Moreno, Andrea M

    2018-01-01

    Leptospira inadai is classified as a species of the Leptospira intermediate group that has been poorly studied due to its apparent insignificance to human and animal health. Nevertheless, over the last two decades the species has been described in human cases in India and in carrier animals in Ecuador. Here, we present the first identification and genomic characterisation of L. inadai serogroup Lyme isolated from captured rodent in Brazil. Even though the M34/99 strain was not pathogenic for hamsters, it was able to establish renal colonisation. The M34/99 strain presented high similarity with L. inadai serogroup Lyme human reference indicating that animal strain could also infect humans, although it does not represent high risk of severe disease. An extrachromosomal sequence was also identified in M34/99 strain and presented high identity with previously described L. inadai phage LinZ_10, suggesting that phage-like extrachromosomal sequence may be another feature of this understudied species. PMID:29538491

  6. Genomic characterisation of Leptospira inadai serogroup Lyme isolated from captured rat in Brazil and comparative analysis with human reference strain.

    Science.gov (United States)

    Moreno, Luisa Z; Miraglia, Fabiana; Loureiro, Ana P; Kremer, Frederico S; Eslabao, Marcus R; Dellagostin, Odir A; Lilenbaum, Walter; Vasconcellos, Silvio A; Heinemann, Marcos B; Moreno, Andrea M

    2018-03-12

    Leptospira inadai is classified as a species of the Leptospira intermediate group that has been poorly studied due to its apparent insignificance to human and animal health. Nevertheless, over the last two decades the species has been described in human cases in India and in carrier animals in Ecuador. Here, we present the first identification and genomic characterisation of L. inadai serogroup Lyme isolated from captured rodent in Brazil. Even though the M34/99 strain was not pathogenic for hamsters, it was able to establish renal colonisation. The M34/99 strain presented high similarity with L. inadai serogroup Lyme human reference indicating that animal strain could also infect humans, although it does not represent high risk of severe disease. An extrachromosomal sequence was also identified in M34/99 strain and presented high identity with previously described L. inadai phage LinZ_10, suggesting that phage-like extrachromosomal sequence may be another feature of this understudied species.

  7. Genomic characterisation of Leptospira inadai serogroup Lyme isolated from captured rat in Brazil and comparative analysis with human reference strain

    Directory of Open Access Journals (Sweden)

    Luisa Z Moreno

    2018-03-01

    Full Text Available Leptospira inadai is classified as a species of the Leptospira intermediate group that has been poorly studied due to its apparent insignificance to human and animal health. Nevertheless, over the last two decades the species has been described in human cases in India and in carrier animals in Ecuador. Here, we present the first identification and genomic characterisation of L. inadai serogroup Lyme isolated from captured rodent in Brazil. Even though the M34/99 strain was not pathogenic for hamsters, it was able to establish renal colonisation. The M34/99 strain presented high similarity with L. inadai serogroup Lyme human reference indicating that animal strain could also infect humans, although it does not represent high risk of severe disease. An extrachromosomal sequence was also identified in M34/99 strain and presented high identity with previously described L. inadai phage LinZ_10, suggesting that phage-like extrachromosomal sequence may be another feature of this understudied species.

  8. Origins of the Human Genome Project.

    Science.gov (United States)

    Watson, J D; Cook-Deegan, R M

    1991-01-01

    The Human Genome Project has become a reality. Building on a debate that dates back to 1985, several genome projects are now in full stride around the world, and more are likely to form in the next several years. Italy began its genome program in 1987, and the United Kingdom and U.S.S.R. in 1988. The European communities mounted several genome projects on yeast, bacteria, Drosophila, and Arabidospis thaliana (a rapidly growing plant with a small genome) in 1988, and in 1990 commenced a new 2-year program on the human genome. In the United States, we have completed the first year of operation of the National Center for Human Genome Research at the National Institutes of Health (NIH), now the largest single funding source for genome research in the world. There have been dedicated budgets focused on genome-scale research at NIH, the U.S. Department of Energy, and the Howard Hughes Medical Institute for several years, and results are beginning to accumulate. There were three annual meetings on genome mapping and sequencing at Cold Spring Harbor, New York, in the spring of 1988, 1989, and 1990; the talks have shifted from a discussion about how to approach problems to presenting results from experiments already performed. We have finally begun to work rather than merely talk. The purpose of genome projects is to assemble data on the structure of DNA in human chromosomes and those of other organisms. A second goal is to develop new technologies to perform mapping and sequencing. There have been impressive technical advances in the past 5 years since the debate about the human genome project began. We are on the verge of beginning pilot projects to test several approaches to sequencing long stretches of DNA, using both automation and manual methods. Ordered sets of yeast artificial chromosome and cosmid clones have been assembled to span more than 2 million base pairs of several human chromosomes, and a region of 10 million base pairs has been assembled for

  9. Comparative Genome Analysis and Genome Evolution

    NARCIS (Netherlands)

    Snel, Berend

    2002-01-01

    This thesis described a collection of bioinformatic analyses on complete genome sequence data. We have studied the evolution of gene content and find that vertical inheritance dominates over horizontal gene trasnfer, even to the extent that we can use the gene content to make genome phylogenies.

  10. Comparative genomic analysis reveals a novel mitochondrial isoform of human rTS protein and unusual phylogenetic distribution of the rTS gene

    Science.gov (United States)

    Liang, Ping; Nair, Jayakumar R; Song, Lei; McGuire, John J; Dolnick, Bruce J

    2005-01-01

    Background The rTS gene (ENOSF1), first identified in Homo sapiens as a gene complementary to the thymidylate synthase (TYMS) mRNA, is known to encode two protein isoforms, rTSα and rTSβ. The rTSβ isoform appears to be an enzyme responsible for the synthesis of signaling molecules involved in the down-regulation of thymidylate synthase, but the exact cellular functions of rTS genes are largely unknown. Results Through comparative genomic sequence analysis, we predicted the existence of a novel protein isoform, rTS, which has a 27 residue longer N-terminus by virtue of utilizing an alternative start codon located upstream of the start codon in rTSβ. We observed that a similar extended N-terminus could be predicted in all rTS genes for which genomic sequences are available and the extended regions are conserved from bacteria to human. Therefore, we reasoned that the protein with the extended N-terminus might represent an ancestral form of the rTS protein. Sequence analysis strongly predicts a mitochondrial signal sequence in the extended N-terminal of human rTSγ, which is absent in rTSβ. We confirmed the existence of rTS in human mitochondria experimentally by demonstrating the presence of both rTSγ and rTSβ proteins in mitochondria isolated by subcellular fractionation. In addition, our comprehensive analysis of rTS orthologous sequences reveals an unusual phylogenetic distribution of this gene, which suggests the occurrence of one or more horizontal gene transfer events. Conclusion The presence of two rTS isoforms in mitochondria suggests that the rTS signaling pathway may be active within mitochondria. Our report also presents an example of identifying novel protein isoforms and for improving gene annotation through comparative genomic analysis. PMID:16162288

  11. Comparative genomic analysis reveals a novel mitochondrial isoform of human rTS protein and unusual phylogenetic distribution of the rTS gene

    Directory of Open Access Journals (Sweden)

    McGuire John J

    2005-09-01

    Full Text Available Abstract Background The rTS gene (ENOSF1, first identified in Homo sapiens as a gene complementary to the thymidylate synthase (TYMS mRNA, is known to encode two protein isoforms, rTSα and rTSβ. The rTSβ isoform appears to be an enzyme responsible for the synthesis of signaling molecules involved in the down-regulation of thymidylate synthase, but the exact cellular functions of rTS genes are largely unknown. Results Through comparative genomic sequence analysis, we predicted the existence of a novel protein isoform, rTS, which has a 27 residue longer N-terminus by virtue of utilizing an alternative start codon located upstream of the start codon in rTSβ. We observed that a similar extended N-terminus could be predicted in all rTS genes for which genomic sequences are available and the extended regions are conserved from bacteria to human. Therefore, we reasoned that the protein with the extended N-terminus might represent an ancestral form of the rTS protein. Sequence analysis strongly predicts a mitochondrial signal sequence in the extended N-terminal of human rTSγ, which is absent in rTSβ. We confirmed the existence of rTS in human mitochondria experimentally by demonstrating the presence of both rTSγ and rTSβ proteins in mitochondria isolated by subcellular fractionation. In addition, our comprehensive analysis of rTS orthologous sequences reveals an unusual phylogenetic distribution of this gene, which suggests the occurrence of one or more horizontal gene transfer events. Conclusion The presence of two rTS isoforms in mitochondria suggests that the rTS signaling pathway may be active within mitochondria. Our report also presents an example of identifying novel protein isoforms and for improving gene annotation through comparative genomic analysis.

  12. A decade of human genome project conclusion: Scientific diffusion about our genome knowledge.

    Science.gov (United States)

    Moraes, Fernanda; Góes, Andréa

    2016-05-06

    The Human Genome Project (HGP) was initiated in 1990 and completed in 2003. It aimed to sequence the whole human genome. Although it represented an advance in understanding the human genome and its complexity, many questions remained unanswered. Other projects were launched in order to unravel the mysteries of our genome, including the ENCyclopedia of DNA Elements (ENCODE). This review aims to analyze the evolution of scientific knowledge related to both the HGP and ENCODE projects. Data were retrieved from scientific articles published in 1990-2014, a period comprising the development and the 10 years following the HGP completion. The fact that only 20,000 genes are protein and RNA-coding is one of the most striking HGP results. A new concept about the organization of genome arose. The ENCODE project was initiated in 2003 and targeted to map the functional elements of the human genome. This project revealed that the human genome is pervasively transcribed. Therefore, it was determined that a large part of the non-protein coding regions are functional. Finally, a more sophisticated view of chromatin structure emerged. The mechanistic functioning of the genome has been redrafted, revealing a much more complex picture. Besides, a gene-centric conception of the organism has to be reviewed. A number of criticisms have emerged against the ENCODE project approaches, raising the question of whether non-conserved but biochemically active regions are truly functional. Thus, HGP and ENCODE projects accomplished a great map of the human genome, but the data generated still requires further in depth analysis. © 2016 by The International Union of Biochemistry and Molecular Biology, 44:215-223, 2016. © 2016 The International Union of Biochemistry and Molecular Biology.

  13. Human-specific HERV-K insertion causes genomic variations in the human genome.

    Directory of Open Access Journals (Sweden)

    Wonseok Shin

    Full Text Available Human endogenous retroviruses (HERV sequences account for about 8% of the human genome. Through comparative genomics and literature mining, we identified a total of 29 human-specific HERV-K insertions. We characterized them focusing on their structure and flanking sequence. The results showed that four of the human-specific HERV-K insertions deleted human genomic sequences via non-classical insertion mechanisms. Interestingly, two of the human-specific HERV-K insertion loci contained two HERV-K internals and three LTR elements, a pattern which could be explained by LTR-LTR ectopic recombination or template switching. In addition, we conducted a polymorphic test and observed that twelve out of the 29 elements are polymorphic in the human population. In conclusion, human-specific HERV-K elements have inserted into human genome since the divergence of human and chimpanzee, causing human genomic changes. Thus, we believe that human-specific HERV-K activity has contributed to the genomic divergence between humans and chimpanzees, as well as within the human population.

  14. Beyond the human genome: Microbes, methaphors and what it means to be human in an interconnected post-genomic world

    NARCIS (Netherlands)

    Nerlich, B.; Hellsten, I.R.

    2009-01-01

    Four years after the completion of the Human Genome Project, the US National Institutes for Health launched the Human Microbiome Project on 19 December 2007. Using metaphor analysis, this article investigates reporting in English-language newspapers on advances in microbiomics from 2003 onwards,

  15. Justice and the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Murphy, T.F.; Lappe, M. (eds.)

    1992-01-01

    Most of the essays gathered in this volume were first presented at a conference, Justice and the Human Genome, in Chicago in early November, 1991. The goal of the, conference was to consider questions of justice as they are and will be raised by the Human Genome Project. To achieve its goal of identifying and elucidating the challenges of justice inherent in genomic research and its social applications the conference drew together in one forum members from academia, medicine, and industry with interests divergent as rate-setting for insurance, the care of newborns, and the history of ethics. The essays in this volume address a number of theoretical and practical concerns relative to the meaning of genomic research.

  16. Justice and the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Murphy, T.F.; Lappe, M. [eds.

    1992-12-31

    Most of the essays gathered in this volume were first presented at a conference, Justice and the Human Genome, in Chicago in early November, 1991. The goal of the, conference was to consider questions of justice as they are and will be raised by the Human Genome Project. To achieve its goal of identifying and elucidating the challenges of justice inherent in genomic research and its social applications the conference drew together in one forum members from academia, medicine, and industry with interests divergent as rate-setting for insurance, the care of newborns, and the history of ethics. The essays in this volume address a number of theoretical and practical concerns relative to the meaning of genomic research.

  17. Inversion variants in human and primate genomes.

    Science.gov (United States)

    Catacchio, Claudia Rita; Maggiolini, Flavia Angela Maria; D'Addabbo, Pietro; Bitonto, Miriana; Capozzi, Oronzo; Signorile, Martina Lepore; Miroballo, Mattia; Archidiacono, Nicoletta; Eichler, Evan E; Ventura, Mario; Antonacci, Francesca

    2018-05-18

    For many years, inversions have been proposed to be a direct driving force in speciation since they suppress recombination when heterozygous. Inversions are the most common large-scale differences among humans and great apes. Nevertheless, they represent large events easily distinguishable by classical cytogenetics, whose resolution, however, is limited. Here, we performed a genome-wide comparison between human, great ape, and macaque genomes using the net alignments for the most recent releases of genome assemblies. We identified a total of 156 putative inversions, between 103 kb and 91 Mb, corresponding to 136 human loci. Combining literature, sequence, and experimental analyses, we analyzed 109 of these loci and found 67 regions inverted in one or multiple primates, including 28 newly identified inversions. These events overlap with 81 human genes at their breakpoints, and seven correspond to sites of recurrent rearrangements associated with human disease. This work doubles the number of validated primate inversions larger than 100 kb, beyond what was previously documented. We identified 74 sites of errors, where the sequence has been assembled in the wrong orientation, in the reference genomes analyzed. Our data serve two purposes: First, we generated a map of evolutionary inversions in these genomes representing a resource for interrogating differences among these species at a functional level; second, we provide a list of misassembled regions in these primate genomes, involving over 300 Mb of DNA and 1978 human genes. Accurately annotating these regions in the genome references has immediate applications for evolutionary and biomedical studies on primates. © 2018 Catacchio et al.; Published by Cold Spring Harbor Laboratory Press.

  18. Genomic analysis of influenza A virus from captive wild boars in Brazil reveals a human-like H1N2 influenza virus.

    Science.gov (United States)

    Biondo, Natalha; Schaefer, Rejane; Gava, Danielle; Cantão, Mauricio E; Silveira, Simone; Mores, Marcos A Z; Ciacci-Zanella, Janice R; Barcellos, David E S N

    2014-01-10

    Influenza is a viral disease that affects human and several animal species. In Brazil, H1N1, H3N2 and 2009 pandemic H1N1 A(H1N1)pdm09 influenza A viruses (IAV) circulate in domestic swine herds. Wild boars are also susceptible to IAV infection but in Brazil until this moment there are no reports of IAV infection in wild boars or in captive wild boars populations. Herein the occurrence of IAV in captive wild boars with the presence of lung consolidation lesions during slaughter was investigated. Lung samples were screened by RT-PCR for IAV detection. IAV positive samples were further analyzed by quantitative real-time PCR (qRRT-PCR), virus isolation, genomic sequencing, histopathology and immunohistochemistry (IHC). Eleven out of 60 lungs (18.3%) were positive for IAV by RT-PCR and seven out of the eleven were also positive for A(H1N1)pdm09 by qRRT-PCR. Chronic diffuse bronchopneumonia was observed in all samples and IHC analysis was negative for influenza A antigen. Full genes segments of H1N2 IAV were sequenced using Illumina's genome analyzer platform (MiSeq). The genomic analysis revealed that the HA and NA genes clustered with IAVs of the human lineage and the six internal genes were derived from the H1N1pdm09 IAV. This is the first report of a reassortant human-like H1N2 influenza virus infection in captive wild boars in Brazil and indicates the need to monitor IAV evolution in Suidae populations. Copyright © 2013 Elsevier B.V. All rights reserved.

  19. Human genome project: revolutionizing biology through leveraging technology

    Science.gov (United States)

    Dahl, Carol A.; Strausberg, Robert L.

    1996-04-01

    The Human Genome Project (HGP) is an international project to develop genetic, physical, and sequence-based maps of the human genome. Since the inception of the HGP it has been clear that substantially improved technology would be required to meet the scientific goals, particularly in order to acquire the complete sequence of the human genome, and that these technologies coupled with the information forthcoming from the project would have a dramatic effect on the way biomedical research is performed in the future. In this paper, we discuss the state-of-the-art for genomic DNA sequencing, technological challenges that remain, and the potential technological paths that could yield substantially improved genomic sequencing technology. The impact of the technology developed from the HGP is broad-reaching and a discussion of other research and medical applications that are leveraging HGP-derived DNA analysis technologies is included. The multidisciplinary approach to the development of new technologies that has been successful for the HGP provides a paradigm for facilitating new genomic approaches toward understanding the biological role of functional elements and systems within the cell, including those encoded within genomic DNA and their molecular products.

  20. Genomic features of human limb specific enhancers.

    Science.gov (United States)

    Ali, Shahid; Amina, Bibi; Anwar, Saneela; Minhas, Rashid; Parveen, Nazia; Nawaz, Uzma; Azam, Syed Sikandar; Abbasi, Amir Ali

    2016-10-01

    To elucidate important cellular and molecular interactions that regulate patterning and skeletal development, vertebrate limbs served as a model organ. A growing body of evidence from detailed studies on a subset of limb regulators like the HOXD cluster or SHH, reveals the importance of enhancers in limb related developmental and disease processes. Exploiting the recent genome-wide availability of functionally confirmed enhancer dataset, this study establishes regulatory interactions for dozens of human limb developmental genes. From these data, it appears that the long-range regulatory interactions are fairly common during limb development. This observation highlights the significance of chromosomal breaks/translocations in human limb deformities. Transcriptional factor (TF) analysis predicts that the differentiation of early nascent limb-bud into future territories entail distinct TF interaction networks. Conclusively, an important motivation for annotating the human limb specific regulatory networks is to pave way for the systematic exploration of their role in disease and evolution. Copyright © 2016. Published by Elsevier Inc.

  1. Mathematical Analysis of Genomic Evolution

    Directory of Open Access Journals (Sweden)

    Cedric Green

    2011-01-01

    Full Text Available Changes in nucleotide sequences, or mutations, accumulate from generation to generation in the genomes of all living organisms. The mutations can be advantageous, deleterious, or neutral. The goal of this project is to determine the amount of advantageous mutations it takes to get human (Homo sapiens DNA from the DNA of genetically distinct organisms. We do this by collecting the genomic data of such organisms, and estimating the amount of mutations it takes to transform yeast (Saccharomyces cerevisiae DNA to the DNA of a human. We calculate the typical number of mutations occurring annually through the organism's average life span and the average mutation rate. This allows us to determine the total number of mutations as well as the probability of advantageous mutations. Not surprisingly, this probability proves to be fairly small. A more precise estimate can be determined by accounting for the differences in the chromosomal structure and phenomena like horizontal gene transfer.

  2. Human Genome Editing and Ethical Considerations.

    Science.gov (United States)

    Krishan, Kewal; Kanchan, Tanuj; Singh, Bahadur

    2016-04-01

    Editing human germline genes may act as boon in some genetic and other disorders. Recent editing of the genome of the human embryo with the CRISPR/Cas9 editing tool generated a debate amongst top scientists of the world for the ethical considerations regarding its effect on the future generations. It needs to be seen as to what transformation human gene editing brings to humankind in the times to come.

  3. Development and application of Human Genome Epidemiology

    Science.gov (United States)

    Xu, Jingwen

    2017-12-01

    Epidemiology is a science that studies distribution of diseases and health in population and its influencing factors, it also studies how to prevent and cure disease and promote health strategies and measures. Epidemiology has developed rapidly in recent years and it is an intercross subject with various other disciplines to form a series of branch disciplines such as Genetic epidemiology, molecular epidemiology, drug epidemiology and tumor epidemiology. With the implementation and completion of Human Genome Project (HGP), Human Genome Epidemiology (HuGE) has emerged at this historic moment. In this review, the development of Human Genome Epidemiology, research content, the construction and structure of relevant network, research standards, as well as the existing results and problems are briefly outlined.

  4. A periodic pattern of SNPs in the human genome

    DEFF Research Database (Denmark)

    Madsen, Bo Eskerod; Villesen, Palle; Wiuf, Carsten

    2007-01-01

    By surveying a filtered, high-quality set of SNPs in the human genome, we have found that SNPs positioned 1, 2, 4, 6, or 8 bp apart are more frequent than SNPs positioned 3, 5, 7, or 9 bp apart. The observed pattern is not restricted to genomic regions that are known to cause sequencing...... periodic DNA. Our results suggest that not all SNPs in the human genome are created by independent single nucleotide mutations, and that care should be taken in analysis of SNPs from periodic DNA. The latter may have important consequences for SNP and association studies....... or alignment errors, for example, transposable elements (SINE, LINE, and LTR), tandem repeats, and large duplicated regions. However, we found that the pattern is almost entirely confined to what we define as "periodic DNA." Periodic DNA is a genomic region with a high degree of periodicity in nucleotide usage...

  5. Genomic divergences among cattle, dog and human estimated from large-scale alignments of genomic sequences

    Directory of Open Access Journals (Sweden)

    Shade Larry L

    2006-06-01

    Full Text Available Abstract Background Approximately 11 Mb of finished high quality genomic sequences were sampled from cattle, dog and human to estimate genomic divergences and their regional variation among these lineages. Results Optimal three-way multi-species global sequence alignments for 84 cattle clones or loci (each >50 kb of genomic sequence were constructed using the human and dog genome assemblies as references. Genomic divergences and substitution rates were examined for each clone and for various sequence classes under different functional constraints. Analysis of these alignments revealed that the overall genomic divergences are relatively constant (0.32–0.37 change/site for pairwise comparisons among cattle, dog and human; however substitution rates vary across genomic regions and among different sequence classes. A neutral mutation rate (2.0–2.2 × 10(-9 change/site/year was derived from ancestral repetitive sequences, whereas the substitution rate in coding sequences (1.1 × 10(-9 change/site/year was approximately half of the overall rate (1.9–2.0 × 10(-9 change/site/year. Relative rate tests also indicated that cattle have a significantly faster rate of substitution as compared to dog and that this difference is about 6%. Conclusion This analysis provides a large-scale and unbiased assessment of genomic divergences and regional variation of substitution rates among cattle, dog and human. It is expected that these data will serve as a baseline for future mammalian molecular evolution studies.

  6. A Genome-Wide mQTL Analysis in Human Adipose Tissue Identifies Genetic Variants Associated with DNA Methylation, Gene Expression and Metabolic Traits

    DEFF Research Database (Denmark)

    Volkov, Petr; Olsson, Anders H; Gillberg, Linn

    2016-01-01

    Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men, w...... and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dys)metabolic traits associated with the development of obesity and diabetes.......Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men......, where 592,794 single nucleotide polymorphisms (SNPs) were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs) in cis and 5...

  7. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.

    Science.gov (United States)

    Kidd, Jeffrey M; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F; Peckham, Heather E; Omberg, Larsson; Bormann Chung, Christina A; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G; Russell, Archie; Reynolds, Andy; Clark, Andrew G; Reese, Martin G; Lincoln, Stephen E; Butte, Atul J; De La Vega, Francisco M; Bustamante, Carlos D

    2012-10-05

    Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas-70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago. Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  8. Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina.

    Directory of Open Access Journals (Sweden)

    Volker U Schwartze

    2014-08-01

    Full Text Available Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD, (ii despite the relatively high incidence of introns, alternative splicing (AS is not frequently observed for the generation of paralogs and in response to stress, (iii the content of repetitive elements is strikingly low (<5%, (iv L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1-4 copies usually found in other fungi. More findings are: (i lower content of tRNAs, but unique codons in L. corymbifera, (ii Over 25% of the proteins are apparently specific for L. corymbifera. (iii L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors in comparison to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae.

  9. Gene expansion shapes genome architecture in the human pathogen Lichtheimia corymbifera: an evolutionary genomics analysis in the ancient terrestrial mucorales (Mucoromycotina).

    Science.gov (United States)

    Schwartze, Volker U; Winter, Sascha; Shelest, Ekaterina; Marcet-Houben, Marina; Horn, Fabian; Wehner, Stefanie; Linde, Jörg; Valiante, Vito; Sammeth, Michael; Riege, Konstantin; Nowrousian, Minou; Kaerger, Kerstin; Jacobsen, Ilse D; Marz, Manja; Brakhage, Axel A; Gabaldón, Toni; Böcker, Sebastian; Voigt, Kerstin

    2014-08-01

    Lichtheimia species are the second most important cause of mucormycosis in Europe. To provide broader insights into the molecular basis of the pathogenicity-associated traits of the basal Mucorales, we report the full genome sequence of L. corymbifera and compared it to the genome of Rhizopus oryzae, the most common cause of mucormycosis worldwide. The genome assembly encompasses 33.6 MB and 12,379 protein-coding genes. This study reveals four major differences of the L. corymbifera genome to R. oryzae: (i) the presence of an highly elevated number of gene duplications which are unlike R. oryzae not due to whole genome duplication (WGD), (ii) despite the relatively high incidence of introns, alternative splicing (AS) is not frequently observed for the generation of paralogs and in response to stress, (iii) the content of repetitive elements is strikingly low (<5%), (iv) L. corymbifera is typically haploid. Novel virulence factors were identified which may be involved in the regulation of the adaptation to iron-limitation, e.g. LCor01340.1 encoding a putative siderophore transporter and LCor00410.1 involved in the siderophore metabolism. Genes encoding the transcription factors LCor08192.1 and LCor01236.1, which are similar to GATA type regulators and to calcineurin regulated CRZ1, respectively, indicating an involvement of the calcineurin pathway in the adaption to iron limitation. Genes encoding MADS-box transcription factors are elevated up to 11 copies compared to the 1-4 copies usually found in other fungi. More findings are: (i) lower content of tRNAs, but unique codons in L. corymbifera, (ii) Over 25% of the proteins are apparently specific for L. corymbifera. (iii) L. corymbifera contains only 2/3 of the proteases (known to be essential virulence factors) in comparison to R. oryzae. On the other hand, the number of secreted proteases, however, is roughly twice as high as in R. oryzae.

  10. IMG: the integrated microbial genomes database and comparative analysis system

    Science.gov (United States)

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Grechkin, Yuri; Ratner, Anna; Jacob, Biju; Huang, Jinghua; Williams, Peter; Huntemann, Marcel; Anderson, Iain; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2012-01-01

    The Integrated Microbial Genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG integrates publicly available draft and complete genomes from all three domains of life with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. IMG's data content and analytical capabilities have been continuously extended through regular updates since its first release in March 2005. IMG is available at http://img.jgi.doe.gov. Companion IMG systems provide support for expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er), teaching courses and training in microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu) and analysis of genomes related to the Human Microbiome Project (IMG/HMP: http://www.hmpdacc-resources.org/img_hmp). PMID:22194640

  11. The integrated microbial genome resource of analysis.

    Science.gov (United States)

    Checcucci, Alice; Mengoni, Alessio

    2015-01-01

    Integrated Microbial Genomes and Metagenomes (IMG) is a biocomputational system that allows to provide information and support for annotation and comparative analysis of microbial genomes and metagenomes. IMG has been developed by the US Department of Energy (DOE)-Joint Genome Institute (JGI). IMG platform contains both draft and complete genomes, sequenced by Joint Genome Institute and other public and available genomes. Genomes of strains belonging to Archaea, Bacteria, and Eukarya domains are present as well as those of viruses and plasmids. Here, we provide some essential features of IMG system and case study for pangenome analysis.

  12. Implications of the Human Genome Project

    Energy Technology Data Exchange (ETDEWEB)

    Kitcher, P.

    1998-11-01

    The Human Genome Project (HGP), launched in 1991, aims to map and sequence the human genome by 2006. During the fifteen-year life of the project, it is projected that $3 billion in federal funds will be allocated to it. The ultimate aims of spending this money are to analyze the structure of human DNA, to identify all human genes, to recognize the functions of those genes, and to prepare for the biology and medicine of the twenty-first century. The following summary examines some of the implications of the program, concentrating on its scientific import and on the ethical and social problems that it raises. Its aim is to expose principles that might be used in applying the information which the HGP will generate. There is no attempt here to translate the principles into detailed proposals for legislation. Arguments and discussion can be found in the full report, but, like this summary, that report does not contain any legislative proposals.

  13. The Human Genome Project and Biology Education.

    Science.gov (United States)

    McInerney, Joseph D.

    1996-01-01

    Highlights the importance of the Human Genome Project in educating the public about genetics. Discusses four challenges that science educators must address: teaching for conceptual understanding, the nature of science, the personal and social impact of science and technology, and the principles of technology. Contains 45 references. (JRH)

  14. Attitudes towards the Human Genome Project.

    Science.gov (United States)

    Shahroudi, Julie; Shaw, Geraldine

    Attitudes concerning the Human Genome Project were reported by faculty (N=40) and students (N=66) from a liberal arts college. Positive attitudes toward the project involved privacy, insurance and health, economic purposes, reproductive purposes, genetic counseling, religion and overall opinions. Negative attitudes were expressed regarding…

  15. Unexplored therapeutic opportunities in the human genome.

    Science.gov (United States)

    Oprea, Tudor I; Bologa, Cristian G; Brunak, Søren; Campbell, Allen; Gan, Gregory N; Gaulton, Anna; Gomez, Shawn M; Guha, Rajarshi; Hersey, Anne; Holmes, Jayme; Jadhav, Ajit; Jensen, Lars Juhl; Johnson, Gary L; Karlson, Anneli; Leach, Andrew R; Ma'ayan, Avi; Malovannaya, Anna; Mani, Subramani; Mathias, Stephen L; McManus, Michael T; Meehan, Terrence F; von Mering, Christian; Muthas, Daniel; Nguyen, Dac-Trung; Overington, John P; Papadatos, George; Qin, Jun; Reich, Christian; Roth, Bryan L; Schürer, Stephan C; Simeonov, Anton; Sklar, Larry A; Southall, Noel; Tomita, Susumu; Tudose, Ilinca; Ursu, Oleg; Vidovic, Dušica; Waller, Anna; Westergaard, David; Yang, Jeremy J; Zahoránszky-Köhalmi, Gergely

    2018-05-01

    A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.

  16. Viral symbiosis and the holobiontic nature of the human genome.

    Science.gov (United States)

    Ryan, Francis Patrick

    2016-01-01

    The human genome is a holobiontic union of the mammalian nuclear genome, the mitochondrial genome and large numbers of endogenized retroviral genomes. This article defines and explores this symbiogenetic pattern of evolution, looking at the implications for human genetics, epigenetics, embryogenesis, physiology and the pathogenesis of inborn errors of metabolism and many other diseases. © 2016 APMIS. Published by John Wiley & Sons Ltd.

  17. Genome-wide ChIP-seq analysis of human TOP2B occupancy in MCF7 breast cancer epithelial cells

    Directory of Open Access Journals (Sweden)

    Catriona M. Manville

    2015-11-01

    Full Text Available We report the whole genome ChIP seq for human TOP2B from MCF7 cells. Using three different peak calling methods, regions of binding were identified in the presence or absence of the nuclear hormone estradiol, as TOP2B has been reported to play a role in ligand-induced transcription. TOP2B peaks were found across the whole genome, 50% of the peaks fell either within a gene or within 5 kb of a transcription start site. TOP2B peaks coincident with gene promoters were less frequently associated with epigenetic features marking active promoters in estradiol treated than in untreated cells. Significantly enriched transcription factor motifs within the DNA sequences underlying the peaks were identified. These included SP1, KLF4, TFAP2A, MYF, REST, CTCF, ESR1 and ESR2. Gene ontology analysis of genes associated with TOP2B peaks found neuronal development terms including axonogenesis and axon guidance were significantly enriched. In the absence of functional TOP2B there are errors in axon guidance in the zebrafish eye. Specific heparin sulphate structures are involved in retinal axon targeting. The glycosaminoglycan biosynthesis–heparin sulphate/heparin pathway is significantly enriched in the TOP2B gene ontology analysis, suggesting changes in this pathway in the absence of TOP2B may cause the axon guidance faults.

  18. Genomic uracil and human disease

    DEFF Research Database (Denmark)

    Hagen, Lars; Pena Diaz, Javier; Kavli, Bodil

    2006-01-01

    Uracil is present in small amounts in DNA due to spontaneous deamination of cytosine and incorporation of dUMP during replication. While deamination generates mutagenic U:G mismatches, incorporated dUMP results in U:A pairs that are not directly mutagenic, but may be cytotoxic. In most cells, mut...... retroviral infections. Ung(-/-) mice have a similar phenotype and develop B-cell lymphomas late in life. However, there is no evidence indicating that UNG deficiency causes lymphomas in humans....

  19. Research for genetic instability of human genome

    Energy Technology Data Exchange (ETDEWEB)

    Hori, T.; Takahashi, E.; Tsuji, H.; Yamauchi, M. (National Inst. of Radiological Sciences, Chiba (Japan)); Murata, M.

    1992-01-01

    In the present review paper, the potential relevance of chromosomal fragile sites to carcinogenesis and mutagenesis is discussed based on our own and other's studies. Recent evidence indicate that fragile sites may act as predisposition factors involved in chromosomal instability of the human genome and that the sites may be preferential targets for various DNA damaging agents including ionizing radiation. It is also demonstrated that some critical genomic rearrangements at the fragile sites may contribute towards oncogenesis and that individuals carrying heritable form of fragile site may be at the risk. Although clinical significance of autosomal fragile sites has been a matter of discussion, a fragile site of the X chromosome is known to be associated with an X-linked genetic diseases, called fragile X syndrome. Molecular events leading to the fragile X syndrome have recently been elucidated. The fragile X genotype can be characterized by an increased amount of p(CCG)n repeat DNA sequence in the FMR-1 gene and the repeated sequences are shown to be unstable in both meiosis and mitosis. These repeats might exhibit higher mutation rate than is generally seen in the human genome. Further studies on the fragile sites in molecular biology and radiation biology will yield relevant data to the molecular mechanisms of genetic instability of the human genome as well as to better assessment of genetic effect of ionizing radiation. (author).

  20. Helminth Genomics: The Implications for Human Health

    Science.gov (United States)

    Brindley, Paul J.; Mitreva, Makedonka; Ghedin, Elodie; Lustigman, Sara

    2009-01-01

    More than two billion people (one-third of humanity) are infected with parasitic roundworms or flatworms, collectively known as helminth parasites. These infections cause diseases that are responsible for enormous levels of morbidity and mortality, delays in the physical development of children, loss of productivity among the workforce, and maintenance of poverty. Genomes of the major helminth species that affect humans, and many others of agricultural and veterinary significance, are now the subject of intensive genome sequencing and annotation. Draft genome sequences of the filarial worm Brugia malayi and two of the human schistosomes, Schistosoma japonicum and S. mansoni, are now available, among others. These genome data will provide the basis for a comprehensive understanding of the molecular mechanisms involved in helminth nutrition and metabolism, host-dependent development and maturation, immune evasion, and evolution. They are likely also to predict new potential vaccine candidates and drug targets. In this review, we present an overview of these efforts and emphasize the potential impact and importance of these new findings. PMID:19855829

  1. Research for genetic instability of human genome

    International Nuclear Information System (INIS)

    Hori, T.; Takahashi, E.; Tsuji, H.; Yamauchi, M.; Murata, M.

    1992-01-01

    In the present review paper, the potential relevance of chromosomal fragile sites to carcinogenesis and mutagenesis is discussed based on our own and other's studies. Recent evidence indicate that fragile sites may act as predisposition factors involved in chromosomal instability of the human genome and that the sites may be preferential targets for various DNA damaging agents including ionizing radiation. It is also demonstrated that some critical genomic rearrangements at the fragile sites may contribute towards oncogenesis and that individuals carrying heritable form of fragile site may be at the risk. Although clinical significance of autosomal fragile sites has been a matter of discussion, a fragile site of the X chromosome is known to be associated with an X-linked genetic diseases, called fragile X syndrome. Molecular events leading to the fragile X syndrome have recently been elucidated. The fragile X genotype can be characterized by an increased amount of p(CCG)n repeat DNA sequence in the FMR-1 gene and the repeated sequences are shown to be unstable in both meiosis and mitosis. These repeats might exhibit higher mutation rate than is generally seen in the human genome. Further studies on the fragile sites in molecular biology and radiation biology will yield relevant data to the molecular mechanisms of genetic instability of the human genome as well as to better assessment of genetic effect of ionizing radiation. (author)

  2. Comparative genomics analysis of Streptococcus agalactiae reveals that isolates from cultured tilapia in China are closely related to the human strain A909.

    Science.gov (United States)

    Liu, Guangjin; Zhang, Wei; Lu, Chengping

    2013-11-11

    Streptococcus agalactiae, also referred to as Group B Streptococcus (GBS), is a frequent resident of the rectovaginal tract in humans, and a major cause of neonatal infection. In addition, S. agalactiae is a known fish pathogen, which compromises food safety and represents a zoonotic hazard. The complete genome sequence of the piscine S. agalactiae isolate GD201008-001 was compared with 14 other piscine, human and bovine strains to explore their virulence determinants, evolutionary relationships and the genetic basis of host tropism in S. agalactiae. The pan-genome of S. agalactiae is open and its size increases with the addition of newly sequenced genomes. The core genes shared by all isolates account for 50 ~ 70% of any single genome. The Chinese piscine isolates GD201008-001 and ZQ0910 are phylogenetically distinct from the Latin American piscine isolates SA20-06 and STIR-CD-17, but are closely related to the human strain A909, in the context of the clustered regularly interspaced short palindromic repeats (CRISPRs), prophage, virulence-associated genes and phylogenetic relationships. We identified a unique 10 kb gene locus in Chinese piscine strains. Isolates from cultured tilapia in China have a close genomic relationship with the human strain A909. Our findings provide insight into the pathogenesis and host-associated genome content of piscine S. agalactiae isolated in China.

  3. About human genome Acerca del genoma humano

    Directory of Open Access Journals (Sweden)

    Mojica Tobias

    2000-12-01

    Full Text Available The sequence ofthe human genome, an undertaking ofadvanced countries, is nearly complete. In fact The Human Genome Project has around 85% ofthe genome sequenced 4 times on the average, with an accuracy of roughly 1 in 1000 nucleotides. Celera Genomics, on the other hand, has 99% of the sequence of one person, with an accuracy of slightly less than 1 in 100. The Human Genome project trives to produce a physical map for public consumption following a step by step strategy, in which the researcher sequences short DNA fragments belonging to Iarger fragments of known relative
    position. Celera Genomics wants to have very rapidly a physical map which can be quickly used to develop genetic tests and drugs, which can be later sold. We feel that the sequence ofthe human genome is something, which will widen the gap between advanced and backward countries.En este artículo se revisan los eventos, alrededor del secuenciamiento del genoma humano, que han llevado a tanta excitación en los medios noticiosos y académicos en meses recientes. Se explican las estrategias que han llevado a que tengamos dos borradores diferentes pero complementarios, la estrategia llevada a cabo con el dinero
    de los contribuyentes que consiste en establecer el orden de fragmentos grandes de DNA antes de ser secuenciados y la estrategia llevada a cabo con dineros aportados por la industria privada, con la intención de explotar gananciosamente el conocimiento derivado del genoma humano. El genoma humano a mediados del año 2000 es
    un borrador incompleto que cubre aliededor del 85% de la secuencia con una precisión de un error en 1000 y el 99% de la secuencia con una precisión menor de 1 en 100 nucleótidos, También se discuten algunas de las posibles avenidas

  4. Continued colonization of the human genome by mitochondrial DNA.

    Directory of Open Access Journals (Sweden)

    Miria Ricchetti

    2004-09-01

    Full Text Available Integration of mitochondrial DNA fragments into nuclear chromosomes (giving rise to nuclear DNA sequences of mitochondrial origin, or NUMTs is an ongoing process that shapes nuclear genomes. In yeast this process depends on double-strand-break repair. Since NUMTs lack amplification and specific integration mechanisms, they represent the prototype of exogenous insertions in the nucleus. From sequence analysis of the genome of Homo sapiens, followed by sampling humans from different ethnic backgrounds, and chimpanzees, we have identified 27 NUMTs that are specific to humans and must have colonized human chromosomes in the last 4-6 million years. Thus, we measured the fixation rate of NUMTs in the human genome. Six such NUMTs show insertion polymorphism and provide a useful set of DNA markers for human population genetics. We also found that during recent human evolution, Chromosomes 18 and Y have been more susceptible to colonization by NUMTs. Surprisingly, 23 out of 27 human-specific NUMTs are inserted in known or predicted genes, mainly in introns. Some individuals carry a NUMT insertion in a tumor-suppressor gene and in a putative angiogenesis inhibitor. Therefore in humans, but not in yeast, NUMT integrations preferentially target coding or regulatory sequences. This is indeed the case for novel insertions associated with human diseases and those driven by environmental insults. We thus propose a mutagenic phenomenon that may be responsible for a variety of genetic diseases in humans and suggest that genetic or environmental factors that increase the frequency of chromosome breaks provide the impetus for the continued colonization of the human genome by mitochondrial DNA.

  5. Computational and serologic analysis of novel and known viruses in species human adenovirus D in which serology and genomics do not correlate.

    Directory of Open Access Journals (Sweden)

    Elizabeth B Liu

    Full Text Available In November of 2007 a human adenovirus (HAdV was isolated from a bronchoalveolar lavage (BAL sample recovered from a biopsy of an AIDS patient who presented with fever, cough, tachycardia, and expiratory wheezes. To better understand the isolated virus, the genome was sequenced and analyzed using bioinformatic and phylogenomic analysis. The results suggest that this novel virus, which is provisionally named HAdV-D59, may have been created from multiple recombination events. Specifically, the penton, hexon, and fiber genes have high nucleotide identity to HAdV-D19C, HAdV-D25, and HAdV-D56, respectively. Serological results demonstrated that HAdV-D59 has a neutralization profile that is similar yet not identical to that of HAdV-D25. Furthermore, we observed a two-fold difference between the ability of HAdV-D15 and HAdV-D25 to be neutralized by reciprocal antiserum indicating that the two hexon proteins may be more similar in epitopic conformation than previously assumed. In contrast, hexon loops 1 and 2 of HAdV-D15 and HAdV-D25 share 79.13 and 92.56 percent nucleotide identity, respectively. These data suggest that serology and genomics do not always correlate.

  6. Whole genome sequencing analysis of Salmonella enterica serovar Weltevreden isolated from human stool and contaminated food samples collected from the Southern coastal area of China.

    Science.gov (United States)

    Li, Baisheng; Yang, Xingfen; Tan, Hailing; Ke, Bixia; He, Dongmei; Wang, Haiyan; Chen, Qiuxia; Ke, Changwen; Zhang, Yonghui

    2018-02-02

    Salmonella enterica serovar Weltevreden is the most common non-typhoid Salmonella found in South and Southeast Asia. It causes zoonoses worldwide through the consumption of contaminated foods and seafood, and is considered as an important food-borne pathogen in China, especially in the Southern coastal area. We compared the whole genomes of 44 S. Weltevreden strains isolated from human stool and contaminated food samples from Southern Coastal China, in order to investigate their phylogenetic relationships and establish their genetic relatedness to known international strains. ResFinder analysis of the draft genomes of isolated strains detected antimicrobial resistance (AMR) genes in only eight isolates, equivalent to minimum inhibitory concentration assay, and only a few isolates showed resistance to tetracycline, ciprofloxacin or ampicillin. In silico MLST analysis revealed that 43 out of 44 S. Weltevreden strains belonged to sequence type 365 (CC205), the most common sequence type of the serovars. Phylogenetic analysis of the 44 domestic and 26 international isolates suggested that the population of S. Weltevreden could be segregated into six phylogenetic clusters. Cluster I included two strains from food and strains of the "Island Cluster", indicating potential inter-transmission between different countries and regions through foods. The predominant S. Weltevreden isolates obtained from the samples from Southern coastal China were found to be phylogenetically related to strains from Southern East Asia, and formed clusters II-VI. The study has demonstrated that WGS-based analysis may be used to improve our understanding of the epidemiology of this bacterium as part of a food-borne disease surveillance program. The methods used are also more widely applicable to other geographical regions and areas and could therefore be useful for improving our understanding of the international spread of S. Weltevreden on a global scale. Copyright © 2017. Published by Elsevier

  7. Insights into Modern Human Prehistory Using Ancient Genomes.

    Science.gov (United States)

    Yang, Melinda A; Fu, Qiaomei

    2018-03-01

    The genetic relationship of past modern humans to today's populations and each other was largely unknown until recently, when advances in ancient DNA sequencing allowed for unprecedented analysis of the genomes of these early people. These ancient genomes reveal new insights into human prehistory not always observed studying present-day populations, including greater details on the genetic diversity, population structure, and gene flow that characterized past human populations, particularly in early Eurasia, as well as increased insight on the relationship between archaic and modern humans. Here, we review genetic studies on ∼45000- to 7500-year-old individuals associated with mainly preagricultural cultures found in Eurasia, the Americas, and Africa. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Genome-wide meta-analysis points to CTC1 and ZNF676 as genes regulating telomere homeostasis in humans

    DEFF Research Database (Denmark)

    Mangino, Massimo; Hwang, Shih-Jen; Spector, Timothy D

    2012-01-01

    (-8)). We also identified two novel genomic regions associated with LTL variation that map near a conserved telomere maintenance complex component 1 (CTC1; rs3027234, P = 3.6 × 10(-8)) on chromosome17p13.1 and zinc finger protein 676 (ZNF676; rs412658, P = 3.3 × 10(-8)) on 19p12. The minor allele of rs...

  9. Annotating the human genome with Disease Ontology

    Science.gov (United States)

    Osborne, John D; Flatow, Jared; Holko, Michelle; Lin, Simon M; Kibbe, Warren A; Zhu, Lihua (Julie); Danila, Maria I; Feng, Gang; Chisholm, Rex L

    2009-01-01

    Background The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Results We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. Conclusion The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. PMID:19594883

  10. Analysis of complete genome sequences of G9P[19] rotavirus strains from human and piglet with diarrhea provides evidence for whole-genome interspecies transmission of nonreassorted porcine rotavirus.

    Science.gov (United States)

    Yodmeeklin, Arpaporn; Khamrin, Pattara; Chuchaona, Watchaporn; Kumthip, Kattareeya; Kongkaew, Aphisek; Vachirachewin, Ratchaya; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat

    2017-01-01

    Whole genomes of G9P[19] human (RVA/Human-wt/THA/CMH-S070-13/2013/G9P[19]) and porcine (RVA/Pig-wt/THA/CMP-015-12/2012/G9P[19]) rotaviruses concurrently detected in the same geographical area in northern Thailand were sequenced and analyzed for their genetic relationships using bioinformatic tools. The complete genome sequence of human rotavirus RVA/Human-wt/THA/CMH-S070-13/2013/G9P[19] was most closely related to those of porcine rotavirus RVA/Pig-wt/THA/CMP-015-12/2012/G9P[19] and to those of porcine-like human and porcine rotaviruses reference strains than to those of human rotavirus reference strains. The genotype constellation of G9P[19] detected in human and piglet were identical and displayed as the G9-P[19]-I5-R1-C1-M1-A8-N1-T1-E1-H1 genotypes with the nucleotide sequence identities of VP7, VP4, VP6, VP1, VP2, VP3, NSP1, NSP2, NSP3, NSP4, and NSP5 at 99.0%, 99.5%, 93.2%, 97.7%, 97.7%, 85.6%, 89.5%, 93.2%, 92.9%, 94.0%, and 98.1%, respectively. The findings indicate that human rotavirus strain RVA/Human-wt/THA/CMH-S070-13/2013/G9P[19] containing the genome segments of porcine genetic backbone is most likely a human rotavirus of porcine origin. Our data provide an evidence of interspecies transmission and whole-genome transmission of nonreassorted G9P[19] porcine RVA to human occurring in nature in northern Thailand. Copyright © 2016. Published by Elsevier B.V.

  11. Genomic analysis of Fusarium verticillioides.

    Science.gov (United States)

    Brown, D W; Butchko, R A E; Proctor, R H

    2008-09-01

    Fusarium verticillioides (teleomorph Gibberella moniliformis) can be either an endophyte of maize, causing no visible disease, or a pathogen-causing disease of ears, stalks, roots and seedlings. At any stage, this fungus can synthesize fumonisins, a family of mycotoxins structurally similar to the sphingolipid sphinganine. Ingestion of fumonisin-contaminated maize has been associated with a number of animal diseases, including cancer in rodents, and exposure has been correlated with human oesophageal cancer in some regions of the world, and some evidence suggests that fumonisins are a risk factor for neural tube defects. A primary goal of the authors' laboratory is to eliminate fumonisin contamination of maize and maize products. Understanding how and why these toxins are made and the F. verticillioides-maize disease process will allow one to develop novel strategies to limit tissue destruction (rot) and fumonisin production. To meet this goal, genomic sequence data, expressed sequence tags (ESTs) and microarrays are being used to identify F. verticillioides genes involved in the biosynthesis of toxins and plant pathogenesis. This paper describes the current status of F. verticillioides genomic resources and three approaches being used to mine microarray data from a wild-type strain cultured in liquid fumonisin production medium for 12, 24, 48, 72, 96 and 120h. Taken together, these approaches demonstrate the power of microarray technology to provide information on different biological processes.

  12. Identification of endogenous retroviral reading frames in the human genome

    Directory of Open Access Journals (Sweden)

    Wiuf Carsten

    2004-10-01

    Full Text Available Abstract Background Human endogenous retroviruses (HERVs comprise a large class of repetitive retroelements. Most HERVs are ancient and invaded our genome at least 25 million years ago, except for the evolutionary young HERV-K group. The far majority of the encoded genes are degenerate due to mutational decay and only a few non-HERV-K loci are known to retain intact reading frames. Additional intact HERV genes may exist, since retroviral reading frames have not been systematically annotated on a genome-wide scale. Results By clustering of hits from multiple BLAST searches using known retroviral sequences we have mapped 1.1% of the human genome as retrovirus related. The coding potential of all identified HERV regions were analyzed by annotating viral open reading frames (vORFs and we report 7836 loci as verified by protein homology criteria. Among 59 intact or almost-intact viral polyproteins scattered around the human genome we have found 29 envelope genes including two novel gammaretroviral types. One encodes a protein similar to a recently discovered zebrafish retrovirus (ZFERV while another shows partial, C-terminal, homology to Syncytin (HERV-W/FRD. Conclusions This compilation of HERV sequences and their coding potential provide a useful tool for pursuing functional analysis such as RNA expression profiling and effects of viral proteins, which may, in turn, reveal a role for HERVs in human health and disease. All data are publicly available through a database at http://www.retrosearch.dk.

  13. Evolutionary forces shaping genomic islands of population differentiation in humans

    Directory of Open Access Journals (Sweden)

    Hofer Tamara

    2012-03-01

    Full Text Available Abstract Background Levels of differentiation among populations depend both on demographic and selective factors: genetic drift and local adaptation increase population differentiation, which is eroded by gene flow and balancing selection. We describe here the genomic distribution and the properties of genomic regions with unusually high and low levels of population differentiation in humans to assess the influence of selective and neutral processes on human genetic structure. Methods Individual SNPs of the Human Genome Diversity Panel (HGDP showing significantly high or low levels of population differentiation were detected under a hierarchical-island model (HIM. A Hidden Markov Model allowed us to detect genomic regions or islands of high or low population differentiation. Results Under the HIM, only 1.5% of all SNPs are significant at the 1% level, but their genomic spatial distribution is significantly non-random. We find evidence that local adaptation shaped high-differentiation islands, as they are enriched for non-synonymous SNPs and overlap with previously identified candidate regions for positive selection. Moreover there is a negative relationship between the size of islands and recombination rate, which is stronger for islands overlapping with genes. Gene ontology analysis supports the role of diet as a major selective pressure in those highly differentiated islands. Low-differentiation islands are also enriched for non-synonymous SNPs, and contain an overly high proportion of genes belonging to the 'Oncogenesis' biological process. Conclusions Even though selection seems to be acting in shaping islands of high population differentiation, neutral demographic processes might have promoted the appearance of some genomic islands since i as much as 20% of islands are in non-genic regions ii these non-genic islands are on average two times shorter than genic islands, suggesting a more rapid erosion by recombination, and iii most loci are

  14. Analysis of nuclear and organellar genomes of Plasmodium knowlesi in humans reveals ancient population structure and recent recombination among host-specific subpopulations

    KAUST Repository

    Diez Benavente, Ernest

    2017-09-18

    The macaque parasite Plasmodium knowlesi is a significant concern in Malaysia where cases of human infection are increasing. Parasites infecting humans originate from genetically distinct subpopulations associated with the long-tailed (Macaca fascicularis (Mf)) or pig-tailed macaques (Macaca nemestrina (Mn)). We used a new high-quality reference genome to re-evaluate previously described subpopulations among human and macaque isolates from Malaysian-Borneo and Peninsular-Malaysia. Nuclear genomes were dimorphic, as expected, but new evidence of chromosomal-segment exchanges between subpopulations was found. A large segment on chromosome 8 originating from the Mn subpopulation and containing genes encoding proteins expressed in mosquito-borne parasite stages, was found in Mf genotypes. By contrast, non-recombining organelle genomes partitioned into 3 deeply branched lineages, unlinked with nuclear genomic dimorphism. Subpopulations which diverged in isolation have re-connected, possibly due to deforestation and disruption of wild macaque habitats. The resulting genomic mosaics reveal traits selected by host-vector-parasite interactions in a setting of ecological transition.

  15. Analysis of nuclear and organellar genomes of Plasmodium knowlesi in humans reveals ancient population structure and recent recombination among host-specific subpopulations

    KAUST Repository

    Diez Benavente, Ernest; Florez de Sessions, Paola; Moon, Robert W.; Holder, Anthony A.; Blackman, Michael J.; Roper, Cally; Drakeley, Christopher J.; Pain, Arnab; Sutherland, Colin J.; Hibberd, Martin L.; Campino, Susana; Clark, Taane G.

    2017-01-01

    The macaque parasite Plasmodium knowlesi is a significant concern in Malaysia where cases of human infection are increasing. Parasites infecting humans originate from genetically distinct subpopulations associated with the long-tailed (Macaca fascicularis (Mf)) or pig-tailed macaques (Macaca nemestrina (Mn)). We used a new high-quality reference genome to re-evaluate previously described subpopulations among human and macaque isolates from Malaysian-Borneo and Peninsular-Malaysia. Nuclear genomes were dimorphic, as expected, but new evidence of chromosomal-segment exchanges between subpopulations was found. A large segment on chromosome 8 originating from the Mn subpopulation and containing genes encoding proteins expressed in mosquito-borne parasite stages, was found in Mf genotypes. By contrast, non-recombining organelle genomes partitioned into 3 deeply branched lineages, unlinked with nuclear genomic dimorphism. Subpopulations which diverged in isolation have re-connected, possibly due to deforestation and disruption of wild macaque habitats. The resulting genomic mosaics reveal traits selected by host-vector-parasite interactions in a setting of ecological transition.

  16. Human genome-microbiome interaction: metagenomics frontiers for the aetiopathology of autoimmune diseases.

    Science.gov (United States)

    Gundogdu, Aycan; Nalbantoglu, Ufuk

    2017-04-01

    A short while ago, the human genome and microbiome were analysed simultaneously for the first time as a multi-omic approach. The analyses of heterogeneous population cohorts showed that microbiome components were associated with human genome variations. In-depth analysis of these results reveals that the majority of those relationships are between immune pathways and autoimmune disease-associated microbiome components. Thus, it can be hypothesized that autoimmunity may be associated with homeostatic disequilibrium of the human-microbiome interactome. Further analysis of human genome-human microbiome relationships in disease contexts with tailored systems biology approaches may yield insights into disease pathogenesis and prognosis.

  17. Genomic variation landscape of the human gut microbiome

    DEFF Research Database (Denmark)

    Schloissnig, Siegfried; Arumugam, Manimozhiyan; Sunagawa, Shinichi

    2013-01-01

    Whereas large-scale efforts have rapidly advanced the understanding and practical impact of human genomic variation, the practical impact of variation is largely unexplored in the human microbiome. We therefore developed a framework for metagenomic variation analysis and applied it to 252 faecal...... polymorphism rates of 0.11 was more variable between gut microbial species than across human hosts. Subjects sampled at varying time intervals exhibited individuality and temporal stability of SNP variation patterns, despite considerable composition changes of their gut microbiota. This indicates...

  18. Comparative Genomic Analysis of Soybean Flowering Genes

    Science.gov (United States)

    Jung, Chol-Hee; Wong, Chui E.; Singh, Mohan B.; Bhalla, Prem L.

    2012-01-01

    Flowering is an important agronomic trait that determines crop yield. Soybean is a major oilseed legume crop used for human and animal feed. Legumes have unique vegetative and floral complexities. Our understanding of the molecular basis of flower initiation and development in legumes is limited. Here, we address this by using a computational approach to examine flowering regulatory genes in the soybean genome in comparison to the most studied model plant, Arabidopsis. For this comparison, a genome-wide analysis of orthologue groups was performed, followed by an in silico gene expression analysis of the identified soybean flowering genes. Phylogenetic analyses of the gene families highlighted the evolutionary relationships among these candidates. Our study identified key flowering genes in soybean and indicates that the vernalisation and the ambient-temperature pathways seem to be the most variant in soybean. A comparison of the orthologue groups containing flowering genes indicated that, on average, each Arabidopsis flowering gene has 2-3 orthologous copies in soybean. Our analysis highlighted that the CDF3, VRN1, SVP, AP3 and PIF3 genes are paralogue-rich genes in soybean. Furthermore, the genome mapping of the soybean flowering genes showed that these genes are scattered randomly across the genome. A paralogue comparison indicated that the soybean genes comprising the largest orthologue group are clustered in a 1.4 Mb region on chromosome 16 of soybean. Furthermore, a comparison with the undomesticated soybean (Glycine soja) revealed that there are hundreds of SNPs that are associated with putative soybean flowering genes and that there are structural variants that may affect the genes of the light-signalling and ambient-temperature pathways in soybean. Our study provides a framework for the soybean flowering pathway and insights into the relationship and evolution of flowering genes between a short-day soybean and the long-day plant, Arabidopsis. PMID:22679494

  19. Report on the Human Genome Initiative

    Energy Technology Data Exchange (ETDEWEB)

    Tinoco, I.; Cahill, G.; Cantor, C.; Caskey, T.; Dulbecco, R.; Engelhardt, D. L.; Hood, L.; Lerman, L. S.; Mendelsohn, M. L.; Sinsheimer, R. L.; Smith, T.; Soll, D.; Stormo, G.; White, R. L.

    1987-04-01

    The report urges DOE and the Nation to commit to a large. multi-year. multidisciplinary. technological undertaking to order and sequence the human genome. This effort will first require significant innovation in general capability to manipulate DNA. major new analytical methods for ordering and sequencing. theoretical developments in computer science and mathematical biology, and great expansions in our ability to store and manipulate the information and to interface it with other large and diverse genetic databases. The actual ordering and sequencing involves the coordinated processing of some 3 billion bases from a reference human genome. Science is poised on the rudimentary edge of being able to read and understand human genes. A concerted. broadly based. scientific effort to provide new methods of sufficient power and scale should transform this activity from an inefficient one-gene-at-a-time. single laboratory effort into a coordinated. worldwide. comprehensive reading of "the book of man". The effort will be extraordinary in scope and magnitude. but so will be the benefit to biological understanding. new technology and the diagnosis and treatment of human disease.

  20. Whole genome analysis of a livestock-associated methicillin-resistant Staphylococcus aureus ST398 isolate from a case of human endocarditis

    Directory of Open Access Journals (Sweden)

    van Strijp Jos AG

    2010-06-01

    Full Text Available Abstract Background Recently, a new livestock-associated methicillin-resistant Staphylococcus aureus (MRSA Sequence Type 398 (ST398 isolate has emerged worldwide. Although there have been reports of invasive disease in humans, MRSA ST398 colonization is much more common in livestock and demonstrates especially high prevalence rates in pigs and calves. The aim of this study was to compare the genome sequence of an ST398 MRSA isolate with other S. aureus genomes in order to identify genetic traits that may explain the success of this particular lineage. Therefore, we determined the whole genome sequence of S0385, an MRSA ST398 isolate from a human case of endocarditis. Results The entire genome sequence of S0385 demonstrated considerable accessory genome content differences relative to other S. aureus genomes. Several mobile genetic elements that confer antibiotic resistance were identified, including a novel composite of an type V (5C2&5 Staphylococcal Chromosome Cassette mec (SCCmec with distinct joining (J regions. The presence of multiple integrative conjugative elements combined with the absence of a type I restriction and modification system on one of the two νSa islands, could enhance horizontal gene transfer in this strain. The ST398 MRSA isolate carries a unique pathogenicity island which encodes homologues of two excreted virulence factors; staphylococcal complement inhibitor (SCIN and von Willebrand factor-binding protein (vWbp. However, several virulence factors such as enterotoxins and phage encoded toxins, including Panton-Valentine leukocidin (PVL, were not identified in this isolate. Conclusions Until now MRSA ST398 isolates did not cause frequent invasive disease in humans, which may be due to the absence of several common virulence factors. However, the proposed enhanced ability of these isolates to acquire mobile elements may lead to the rapid acquisition of determinants which contribute to virulence in human infections.

  1. GenPlay Multi-Genome, a tool to compare and analyze multiple human genomes in a graphical interface.

    Science.gov (United States)

    Lajugie, Julien; Fourel, Nicolas; Bouhassira, Eric E

    2015-01-01

    Parallel visualization of multiple individual human genomes is a complex endeavor that is rapidly gaining importance with the increasing number of personal, phased and cancer genomes that are being generated. It requires the display of variants such as SNPs, indels and structural variants that are unique to specific genomes and the introduction of multiple overlapping gaps in the reference sequence. Here, we describe GenPlay Multi-Genome, an application specifically written to visualize and analyze multiple human genomes in parallel. GenPlay Multi-Genome is ideally suited for the comparison of allele-specific expression and functional genomic data obtained from multiple phased genomes in a graphical interface with access to multiple-track operation. It also allows the analysis of data that have been aligned to custom genomes rather than to a standard reference and can be used as a variant calling format file browser and as a tool to compare different genome assembly, such as hg19 and hg38. GenPlay is available under the GNU public license (GPL-3) from http://genplay.einstein.yu.edu. The source code is available at https://github.com/JulienLajugie/GenPlay. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  2. The Human Genome Project (HGP): dividends and challenges: a ...

    African Journals Online (AJOL)

    The Human Genome Project (HGP): dividends and challenges: a review. ... Genomic studies have given profound insights into the genetic organization of ... with it will be an essential part of modern medicine and biology for years to come.

  3. Analysis of cis and trans Requirements for DNA Replication at the Right-End Hairpin of the Human Bocavirus 1 Genome.

    Science.gov (United States)

    Shen, Weiran; Deng, Xuefeng; Zou, Wei; Engelhardt, John F; Yan, Ziying; Qiu, Jianming

    2016-09-01

    Parvoviruses are single-stranded DNA viruses that use the palindromic structures at the ends of the viral genome for their replication. The mechanism of parvovirus replication has been studied mostly in the dependoparvovirus adeno-associated virus 2 (AAV2) and the protoparvovirus minute virus of mice (MVM). Here, we used human bocavirus 1 (HBoV1) to understand the replication mechanism of bocaparvovirus. HBoV1 is pathogenic to humans, causing acute respiratory tract infections, especially in young children under 2 years old. By using the duplex replicative form of the HBoV1 genome in human embryonic kidney 293 (HEK293) cells, we identified the HBoV1 minimal replication origin at the right-end hairpin (OriR). Mutagenesis analyses confirmed the putative NS1 binding and nicking sites within the OriR. Of note, unlike the large nonstructural protein (Rep78/68 or NS1) of other parvoviruses, HBoV1 NS1 did not specifically bind OriR in vitro, indicating that other viral and cellular components or the oligomerization of NS1 is required for NS1 binding to the OriR. In vivo studies demonstrated that residues responsible for NS1 binding and nicking are within the origin-binding domain. Further analysis identified that the small nonstructural protein NP1 is required for HBoV1 DNA replication at OriR. NP1 and other viral nonstructural proteins (NS1 to NS4) colocalized within the viral DNA replication centers in both OriR-transfected cells and virus-infected cells, highlighting a direct involvement of NP1 in viral DNA replication at OriR. Overall, our study revealed the characteristics of HBoV1 DNA replication at OriR, suggesting novel characteristics of autonomous parvovirus DNA replication. Human bocavirus 1 (HBoV1) causes acute respiratory tract infections in young children. The duplex HBoV1 genome replicates in HEK293 cells and produces progeny virions that are infectious in well-differentiated airway epithelial cells. A recombinant AAV2 vector pseudotyped with an HBoV1

  4. Whole genome sequencing and bioinformatics analysis of two Egyptian genomes.

    Science.gov (United States)

    ElHefnawi, Mahmoud; Jeon, Sungwon; Bhak, Youngjune; ElFiky, Asmaa; Horaiz, Ahmed; Jun, JeHoon; Kim, Hyunho; Bhak, Jong

    2018-05-15

    We report two Egyptian male genomes (EGP1 and EGP2) sequenced at ~ 30× sequencing depths. EGP1 had 4.7 million variants, where 198,877 were novel variants while EGP2 had 209,109 novel variants out of 4.8 million variants. The mitochondrial haplogroup of the two individuals were identified to be H7b1 and L2a1c, respectively. We also identified the Y haplogroup of EGP1 (R1b) and EGP2 (J1a2a1a2 > P58 > FGC11). EGP1 had a mutation in the NADH gene of the mitochondrial genome ND4 (m.11778 G > A) that causes Leber's hereditary optic neuropathy. Some SNPs shared by the two genomes were associated with an increased level of cholesterol and triglycerides, probably related with Egyptians obesity. Comparison of these genomes with African and Western-Asian genomes can provide insights on Egyptian ancestry and genetic history. This resource can be used to further understand genomic diversity and functional classification of variants as well as human migration and evolution across Africa and Western-Asia. Copyright © 2017. Published by Elsevier B.V.

  5. Meta-analysis of genome-wide association studies identifies 8 novel loci involved in shape variation of human head hair.

    Science.gov (United States)

    Liu, Fan; Chen, Yan; Zhu, Gu; Hysi, Pirro G; Wu, Sijie; Adhikari, Kaustubh; Breslin, Krystal; Pospiech, Ewelina; Hamer, Merel A; Peng, Fuduan; Muralidharan, Charanya; Acuna-Alonzo, Victor; Canizales-Quinteros, Samuel; Bedoya, Gabriel; Gallo, Carla; Poletti, Giovanni; Rothhammer, Francisco; Bortolini, Maria Catira; Gonzalez-Jose, Rolando; Zeng, Changqing; Xu, Shuhua; Jin, Li; Uitterlinden, André G; Ikram, M Arfan; van Duijn, Cornelia M; Nijsten, Tamar; Walsh, Susan; Branicki, Wojciech; Wang, Sijia; Ruiz-Linares, Andrés; Spector, Timothy D; Martin, Nicholas G; Medland, Sarah E; Kayser, Manfred

    2018-02-01

    Shape variation of human head hair shows striking variation within and between human populations, while its genetic basis is far from being understood. We performed a series of genome-wide association studies (GWASs) and replication studies in a total of 28 964 subjects from 9 cohorts from multiple geographic origins. A meta-analysis of three European GWASs identified 8 novel loci (1p36.23 ERRFI1/SLC45A1, 1p36.22 PEX14, 1p36.13 PADI3, 2p13.3 TGFA, 11p14.1 LGR4, 12q13.13 HOXC13, 17q21.2 KRTAP, and 20q13.33 PTK6), and confirmed 4 previously known ones (1q21.3 TCHH/TCHHL1/LCE3E, 2q35 WNT10A, 4q21.21 FRAS1, and 10p14 LINC00708/GATA3), all showing genome-wide significant association with hair shape (P 5e-8). All except one (1p36.22 PEX14) were replicated with nominal significance in at least one of the 6 additional cohorts of European, Native American and East Asian origins. Three additional previously known genes (EDAR, OFCC1, and PRSS53) were confirmed at the nominal significance level. A multivariable regression model revealed that 14 SNPs from different genes significantly and independently contribute to hair shape variation, reaching a cross-validated AUC value of 0.66 (95% CI: 0.62-0.70) and an AUC value of 0.64 in an independent validation cohort, providing an improved accuracy compared with a previous model. Prediction outcomes of 2504 individuals from a multiethnic sample were largely consistent with general knowledge on the global distribution of hair shape variation. Our study thus delivers target genes and DNA variants for future functional studies to further evaluate the molecular basis of hair shape in humans. © The Author(s) 2017. Published by Oxford University Press.

  6. Meta-analysis of genome-wide association studies identifies 8 novel loci involved in shape variation of human head hair

    Science.gov (United States)

    Liu, Fan; Chen, Yan; Zhu, Gu; Hysi, Pirro G; Wu, Sijie; Adhikari, Kaustubh; Breslin, Krystal; Pośpiech, Ewelina; Hamer, Merel A; Peng, Fuduan; Muralidharan, Charanya; Acuna-Alonzo, Victor; Canizales-Quinteros, Samuel; Bedoya, Gabriel; Gallo, Carla; Poletti, Giovanni; Rothhammer, Francisco; Bortolini, Maria Catira; Gonzalez-Jose, Rolando; Zeng, Changqing; Xu, Shuhua; Jin, Li; Uitterlinden, André G; Ikram, M Arfan; van Duijn, Cornelia M; Nijsten, Tamar; Walsh, Susan; Branicki, Wojciech; Wang, Sijia; Ruiz-Linares, Andrés; Spector, Timothy D; Martin, Nicholas G; Medland, Sarah E; Kayser, Manfred

    2018-01-01

    Abstract Shape variation of human head hair shows striking variation within and between human populations, while its genetic basis is far from being understood. We performed a series of genome-wide association studies (GWASs) and replication studies in a total of 28 964 subjects from 9 cohorts from multiple geographic origins. A meta-analysis of three European GWASs identified 8 novel loci (1p36.23 ERRFI1/SLC45A1, 1p36.22 PEX14, 1p36.13 PADI3, 2p13.3 TGFA, 11p14.1 LGR4, 12q13.13 HOXC13, 17q21.2 KRTAP, and 20q13.33 PTK6), and confirmed 4 previously known ones (1q21.3 TCHH/TCHHL1/LCE3E, 2q35 WNT10A, 4q21.21 FRAS1, and 10p14 LINC00708/GATA3), all showing genome-wide significant association with hair shape (P < 5e-8). All except one (1p36.22 PEX14) were replicated with nominal significance in at least one of the 6 additional cohorts of European, Native American and East Asian origins. Three additional previously known genes (EDAR, OFCC1, and PRSS53) were confirmed at the nominal significance level. A multivariable regression model revealed that 14 SNPs from different genes significantly and independently contribute to hair shape variation, reaching a cross-validated AUC value of 0.66 (95% CI: 0.62–0.70) and an AUC value of 0.64 in an independent validation cohort, providing an improved accuracy compared with a previous model. Prediction outcomes of 2504 individuals from a multiethnic sample were largely consistent with general knowledge on the global distribution of hair shape variation. Our study thus delivers target genes and DNA variants for future functional studies to further evaluate the molecular basis of hair shape in humans. PMID:29220522

  7. Genomic sequences of murine gamma B- and gamma C-crystallin-encoding genes: promoter analysis and complete evolutionary pattern of mouse, rat and human gamma-crystallins.

    Science.gov (United States)

    Graw, J; Liebstein, A; Pietrowski, D; Schmitt-John, T; Werner, T

    1993-12-22

    The murine genes, gamma B-cry and gamma C-cry, encoding the gamma B- and gamma C-crystallins, were isolated from a genomic DNA library. The complete nucleotide (nt) sequences of both genes were determined from 661 and 711 bp, respectively, upstream from the first exon to the corresponding polyadenylation sites, comprising more than 2650 and 2890 bp, respectively. The new sequences were compared to the partial cDNA sequences available for the murine gamma B-cry and gamma C-cry, as well as to the corresponding genomic sequences from rat and man, at both the nt and predicted amino acid (aa) sequence levels. In the gamma B-cry promoter region, a canonical CCAAT-box, a TATA-box, putative NF-I and C/EBP sites were detected. An R-repeat is inserted 366 bp upstream from the transcription start point. In contrast, the gamma C-cry promoter does not contain a CCAAT-box, but some other putative binding sites for transcription factors (AP-2, UBP-1, LBP-1) were located by computer analysis. The promoter regions of all six gamma-cry from mouse, rat and human, except human psi gamma F-cry, were analyzed for common sequence elements. A complex sequence element of about 70-80 bp was found in the proximal promoter, which contains a gamma-cry-specific and almost invariant sequence (crygpel) of 14 nt, and ends with the also invariant TATA-box. Within the complex sequence element, a minimum of three further features specific for the gamma A-, gamma B- and gamma D/E/F-cry genes can be defined, at least two of which were recently shown to be functional. In addition to these four sequence elements, a subtype-specific structure of inverted repeats with different-sized spacers can be deduced from the multiple sequence alignment. A phylogenetic analysis based on the promoter region, as well as the complete exon 3 of all gamma-cry from mouse, rat and man, suggests separation of only five gamma-cry subtypes (gamma A-, gamma B-, gamma C-, gamma D- and gamma E/F-cry) prior to species separation.

  8. GENOME ANALYSIS OF BURKHOLDERIA CEPACIA AC1100

    Science.gov (United States)

    Burkholderia cepacia is an important organism in bioremediation of environmental pollutants and it is also of increasing interest as a human pathogen. The genomic organization of B. cepacia is being studied in order to better understand its unusual adaptive capacity and genome pl...

  9. Genomic landscape of human diversity across Madagascar

    Science.gov (United States)

    Pierron, Denis; Heiske, Margit; Razafindrazaka, Harilanto; Rakoto, Ignace; Rabetokotany, Nelly; Ravololomanga, Bodo; Rakotozafy, Lucien M.-A.; Rakotomalala, Mireille Mialy; Razafiarivony, Michel; Rasoarifetra, Bako; Raharijesy, Miakabola Andriamampianina; Razafindralambo, Lolona; Ramilisonina; Fanony, Fulgence; Lejamble, Sendra; Thomas, Olivier; Mohamed Abdallah, Ahmed; Rocher, Christophe; Arachiche, Amal; Tonaso, Laure; Pereda-loth, Veronica; Schiavinato, Stéphanie; Brucato, Nicolas; Ricaut, Francois-Xavier; Kusuma, Pradiptajati; Sudoyo, Herawati; Ni, Shengyu; Boland, Anne; Deleuze, Jean-Francois; Beaujard, Philippe; Grange, Philippe; Adelaar, Sander; Stoneking, Mark; Rakotoarisoa, Jean-Aimé; Radimilahy, Chantal; Letellier, Thierry

    2017-01-01

    Although situated ∼400 km from the east coast of Africa, Madagascar exhibits cultural, linguistic, and genetic traits from both Southeast Asia and Eastern Africa. The settlement history remains contentious; we therefore used a grid-based approach to sample at high resolution the genomic diversity (including maternal lineages, paternal lineages, and genome-wide data) across 257 villages and 2,704 Malagasy individuals. We find a common Bantu and Austronesian descent for all Malagasy individuals with a limited paternal contribution from Europe and the Middle East. Admixture and demographic growth happened recently, suggesting a rapid settlement of Madagascar during the last millennium. However, the distribution of African and Asian ancestry across the island reveals that the admixture was sex biased and happened heterogeneously across Madagascar, suggesting independent colonization of Madagascar from Africa and Asia rather than settlement by an already admixed population. In addition, there are geographic influences on the present genomic diversity, independent of the admixture, showing that a few centuries is sufficient to produce detectable genetic structure in human populations. PMID:28716916

  10. Genome-wide associations of gene expression variation in humans.

    Directory of Open Access Journals (Sweden)

    Barbara E Stranger

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  11. Genome-Wide Associations of Gene Expression Variation in Humans.

    Directory of Open Access Journals (Sweden)

    2005-12-01

    Full Text Available The exploration of quantitative variation in human populations has become one of the major priorities for medical genetics. The successful identification of variants that contribute to complex traits is highly dependent on reliable assays and genetic maps. We have performed a genome-wide quantitative trait analysis of 630 genes in 60 unrelated Utah residents with ancestry from Northern and Western Europe using the publicly available phase I data of the International HapMap project. The genes are located in regions of the human genome with elevated functional annotation and disease interest including the ENCODE regions spanning 1% of the genome, Chromosome 21 and Chromosome 20q12-13.2. We apply three different methods of multiple test correction, including Bonferroni, false discovery rate, and permutations. For the 374 expressed genes, we find many regions with statistically significant association of single nucleotide polymorphisms (SNPs with expression variation in lymphoblastoid cell lines after correcting for multiple tests. Based on our analyses, the signal proximal (cis- to the genes of interest is more abundant and more stable than distal and trans across statistical methodologies. Our results suggest that regulatory polymorphism is widespread in the human genome and show that the 5-kb (phase I HapMap has sufficient density to enable linkage disequilibrium mapping in humans. Such studies will significantly enhance our ability to annotate the non-coding part of the genome and interpret functional variation. In addition, we demonstrate that the HapMap cell lines themselves may serve as a useful resource for quantitative measurements at the cellular level.

  12. The Human Genome Initiative of the Department of Energy

    Science.gov (United States)

    1988-01-01

    The structural characterization of genes and elucidation of their encoded functions have become a cornerstone of modern health research, biology and biotechnology. A genome program is an organized effort to locate and identify the functions of all the genes of an organism. Beginning with the DOE-sponsored, 1986 human genome workshop at Santa Fe, the value of broadly organized efforts supporting total genome characterization became a subject of intensive study. There is now national recognition that benefits will rapidly accrue from an effective scientific infrastructure for total genome research. In the US genome research is now receiving dedicated funds. Several other nations are implementing genome programs. Supportive infrastructure is being improved through both national and international cooperation. The Human Genome Initiative of the Department of Energy (DOE) is a focused program of Resource and Technology Development, with objectives of speeding and bringing economies to the national human genome effort. This report relates the origins and progress of the Initiative.

  13. Genome-wide survey in African Americans demonstrates potential epistasis of fitness in the human genome.

    Science.gov (United States)

    Wang, Heming; Choi, Yoonha; Tayo, Bamidele; Wang, Xuefeng; Morris, Nathan; Zhang, Xiang; Broeckel, Uli; Hanis, Craig; Kardia, Sharon; Redline, Susan; Cooper, Richard S; Tang, Hua; Zhu, Xiaofeng

    2017-02-01

    The role played by epistasis between alleles at unlinked loci in shaping population fitness has been debated for many years and the existing evidence has been mainly accumulated from model organisms. In model organisms, fitness epistasis can be systematically inferred by detecting nonindependence of genotypic values between loci in a population and confirmed through examining the number of offspring produced in two-locus genotype groups. No systematic study has been conducted to detect epistasis of fitness in humans owing to experimental constraints. In this study, we developed a novel method to detect fitness epistasis by testing the correlation between local ancestries on different chromosomes in an admixed population. We inferred local ancestry across the genome in 16,252 unrelated African Americans and systematically examined the pairwise correlations between the genomic regions on different chromosomes. Our analysis revealed a pair of genomic regions on chromosomes 4 and 6 that show significant local ancestry correlation (P-value = 4.01 × 10 -8 ) that can be potentially attributed to fitness epistasis. However, we also observed substantial local ancestry correlation that cannot be explained by systemic ancestry inference bias. To our knowledge, this study is the first to systematically examine evidence of fitness epistasis across the human genome. © 2016 WILEY PERIODICALS, INC.

  14. The PCNA pseudogenes in the human genome

    Directory of Open Access Journals (Sweden)

    Stoimenov Ivaylo

    2012-02-01

    Full Text Available Abstract Background The proliferating cell nuclear antigen (PCNA is a key protein in the eukaryotic DNA replication and cell proliferation. Following the cloning and characterisation of the human PCNA gene, the question of the existence of pseudogenes in the human genome was raised. Findings In this short communication we summarise the existing information about the PCNA pseudogenes and critically assess their status. Conclusions We propose the existence of at least four valid PCNA pseudogenes, PCNAP1, PCNAP2, LOC392454 and LOC390102. We would like to recommend assignment of a name for LOC392454 as "proliferating cell nuclear antigen pseudogene 3" (alias PCNAP3 and a name for LOC390102 as "proliferating cell nuclear antigen pseudogene 4" (alias PCNAP4. We prompt for more critical evaluation of the existence of a PCNA pseudogene, designated as PCNAP.

  15. A Genome-Wide mQTL Analysis in Human Adipose Tissue Identifies Genetic Variants Associated with DNA Methylation, Gene Expression and Metabolic Traits.

    Directory of Open Access Journals (Sweden)

    Petr Volkov

    Full Text Available Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL analysis in human adipose tissue of 119 men, where 592,794 single nucleotide polymorphisms (SNPs were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs in cis and 5,342 SNP-CpG pairs in trans showing significant associations between genotype and DNA methylation in adipose tissue after correction for multiple testing, where cis is defined as distance less than 500 kb between a SNP and CpG site. These mQTLs include reported obesity, lipid and type 2 diabetes loci, e.g. ADCY3/POMC, APOA5, CETP, FADS2, GCKR, SORT1 and LEPR. Significant mQTLs were overrepresented in intergenic regions meanwhile underrepresented in promoter regions and CpG islands. We further identified 635 SNPs in significant cis-mQTLs associated with expression of 86 genes in adipose tissue including CHRNA5, G6PC2, GPX7, RPL27A, THNSL2 and ZFP57. SNPs in significant mQTLs were also associated with body mass index (BMI, lipid traits and glucose and insulin levels in our study cohort and public available consortia data. Importantly, the Causal Inference Test (CIT demonstrates how genetic variants mediate their effects on metabolic traits (e.g. BMI, cholesterol, high-density lipoprotein (HDL, hemoglobin A1c (HbA1c and homeostatic model assessment of insulin resistance (HOMA-IR via altered DNA methylation in human adipose tissue. This study identifies genome-wide interactions between genetic and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dysmetabolic traits associated with the development of

  16. Survey sequencing and comparative analysis of the elephant shark (Callorhinchus milii genome.

    Directory of Open Access Journals (Sweden)

    Byrappa Venkatesh

    2007-04-01

    Full Text Available Owing to their phylogenetic position, cartilaginous fishes (sharks, rays, skates, and chimaeras provide a critical reference for our understanding of vertebrate genome evolution. The relatively small genome of the elephant shark, Callorhinchus milii, a chimaera, makes it an attractive model cartilaginous fish genome for whole-genome sequencing and comparative analysis. Here, the authors describe survey sequencing (1.4x coverage and comparative analysis of the elephant shark genome, one of the first cartilaginous fish genomes to be sequenced to this depth. Repetitive sequences, represented mainly by a novel family of short interspersed element-like and long interspersed element-like sequences, account for about 28% of the elephant shark genome. Fragments of approximately 15,000 elephant shark genes reveal specific examples of genes that have been lost differentially during the evolution of tetrapod and teleost fish lineages. Interestingly, the degree of conserved synteny and conserved sequences between the human and elephant shark genomes are higher than that between human and teleost fish genomes. Elephant shark contains putative four Hox clusters indicating that, unlike teleost fish genomes, the elephant shark genome has not experienced an additional whole-genome duplication. These findings underscore the importance of the elephant shark as a critical reference vertebrate genome for comparative analysis of the human and other vertebrate genomes. This study also demonstrates that a survey-sequencing approach can be applied productively for comparative analysis of distantly related vertebrate genomes.

  17. A set of BAC clones spanning the human genome.

    NARCIS (Netherlands)

    Krzywinski, M.; Bosdet, I.; Smailus, D.; Chiu, R.; Mathewson, C.; Wye, N.; Barber, S.; Brown-John, M.; Chan, S.; Chand, S.; Cloutier, A.; Girn, N.; Lee, D.; Masson, A.; Mayo, M.; Olson, T.; Pandoh, P.; Prabhu, A.L.; Schoenmakers, E.F.P.M.; Tsai, M.Y.; Albertson, D.; Lam, W.W.; Choy, C.O.; Osoegawa, K.; Zhao, S.; Jong, P.J. de; Schein, J.; Jones, S.; Marra, M.A.

    2004-01-01

    Using the human bacterial artificial chromosome (BAC) fingerprint-based physical map, genome sequence assembly and BAC end sequences, we have generated a fingerprint-validated set of 32 855 BAC clones spanning the human genome. The clone set provides coverage for at least 98% of the human

  18. The Human Genome Project: how do we protect Australians?

    Science.gov (United States)

    Stott Despoja, N

    It is the moon landing of the nineties: the ambitious Human Genome Project--identifying the up to 100,000 genes that make up human DNA and the sequences of the three billion base-pairs that comprise the human genome. However, unlike the moon landing, the effects of the genome project will have a fundamental impact on the way we see ourselves and each other.

  19. Implementing genomics and pharmacogenomics in the clinic: The National Human Genome Research Institute’s genomic medicine portfolio

    Science.gov (United States)

    Manolio, Teri A.

    2016-01-01

    Increasing knowledge about the influence of genetic variation on human health and growing availability of reliable, cost-effective genetic testing have spurred the implementation of genomic medicine in the clinic. As defined by the National Human Genome Research Institute (NHGRI), genomic medicine uses an individual’s genetic information in his or her clinical care, and has begun to be applied effectively in areas such as cancer genomics, pharmacogenomics, and rare and undiagnosed diseases. In 2011 NHGRI published its strategic vision for the future of genomic research, including an ambitious research agenda to facilitate and promote the implementation of genomic medicine. To realize this agenda, NHGRI is consulting and facilitating collaborations with the external research community through a series of “Genomic Medicine Meetings,” under the guidance and leadership of the National Advisory Council on Human Genome Research. These meetings have identified and begun to address significant obstacles to implementation, such as lack of evidence of efficacy, limited availability of genomics expertise and testing, lack of standards, and diffficulties in integrating genomic results into electronic medical records. The six research and dissemination initiatives comprising NHGRI’s genomic research portfolio are designed to speed the evaluation and incorporation, where appropriate, of genomic technologies and findings into routine clinical care. Actual adoption of successful approaches in clinical care will depend upon the willingness, interest, and energy of professional societies, practitioners, patients, and payers to promote their responsible use and share their experiences in doing so. PMID:27612677

  20. Structural analysis of a hepatitis B virus genome integrated into chromosome 17p of a human hepatocellular carcinoma

    International Nuclear Information System (INIS)

    Zhou, Y.Z.; Slagle, B.L.; Donehower, L.A.; van Tuinen, P.; Ledbetter, D.H.; Butel, J.S.

    1988-01-01

    Hepatitis B virus (HBV) is clearly a factor in the development of hepatocellular carcinoma, but its mechanism of action remains obscure. One possibility is that the HBV integration event alters the expression of a nearby growth-regulatory cellular gene. A 9-kilobase (kb) DNA fragment containing an HBV insert plus flanking cellular sequences was cloned from a hepatoma specimen from Shanghai, People's Republic of China. Restriction mapping of the insert revealed a large inverted repeat structure consisting of both viral sequences (encompassing all of the core and pre-S regions and portions of the X and S genes) and at least 3 kb of unique cellular sequences. The virus-cell junction mapped 11 nucleotides from the DRI region, in a position within the HBV X gene and included in the cohesive overlap region. A probe generated from 1.0 kb of the flanking cellular DNA mapped the viral insert to chromosome 17 in the region designated 17p11.2-17p12, which is near the human proto-oncogene p53. Sequence data from a portion of the flanking cellular DNA revealed a stretch of approximately 70 base pairs that showed highly significant homology with a conserved region of a number of functional mammalian DNA, including the human autonomously replicating sequence 1 (ASRI)

  1. Human Rhinovirus B and C Genomes from Rural Coastal Kenya

    NARCIS (Netherlands)

    Agoti, Charles N.; Kiyuka, Patience K.; Kamau, Everlyn; Munywoki, Patrick K.; Bett, Anne; van der Hoek, Lia; Kellam, Paul; Nokes, D. James; Cotten, Matthew

    2016-01-01

    Primer-independent agnostic deep sequencing was used to generate three human rhinovirus (HRV) B genomes and one HRV C genome from samples collected in a household respiratory survey in rural coastal Kenya. The study provides the first rhinovirus genomes from Kenya and will help improve the

  2. Human genetics and genomics a decade after the release of the draft sequence of the human genome

    Science.gov (United States)

    2011-01-01

    Substantial progress has been made in human genetics and genomics research over the past ten years since the publication of the draft sequence of the human genome in 2001. Findings emanating directly from the Human Genome Project, together with those from follow-on studies, have had an enormous impact on our understanding of the architecture and function of the human genome. Major developments have been made in cataloguing genetic variation, the International HapMap Project, and with respect to advances in genotyping technologies. These developments are vital for the emergence of genome-wide association studies in the investigation of complex diseases and traits. In parallel, the advent of high-throughput sequencing technologies has ushered in the 'personal genome sequencing' era for both normal and cancer genomes, and made possible large-scale genome sequencing studies such as the 1000 Genomes Project and the International Cancer Genome Consortium. The high-throughput sequencing and sequence-capture technologies are also providing new opportunities to study Mendelian disorders through exome sequencing and whole-genome sequencing. This paper reviews these major developments in human genetics and genomics over the past decade. PMID:22155605

  3. Whole-genome analysis of human papillomavirus genotypes 52 and 58 isolated from Japanese women with cervical intraepithelial neoplasia and invasive cervical cancer.

    Science.gov (United States)

    Tenjimbayashi, Yuri; Onuki, Mamiko; Hirose, Yusuke; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao

    2017-01-01

    Human papillomavirus genotypes 52 and 58 (HPV52/58) are frequently detected in patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) in East Asian countries including Japan. As with other HPV genotypes, HPV52/58 consist of multiple lineages of genetic variants harboring less than 10% differences between complete genome sequences of the same HPV genotype. However, site variations of nucleotide and amino acid sequences across the viral whole-genome have not been fully examined for HPV52/58. The aim of this study was to investigate genetic variations of HPV52/58 prevalent among Japanese women by analyzing the viral whole-genome sequences. The entire genomic region of HPV52/58 was amplified by long-range PCR with total cellular DNA extracted from cervical exfoliated cells isolated from Japanese patients with CIN or ICC. The amplified DNA was subjected to next generation sequencing to determine the complete viral genome sequences. Phylogenetic analyses were performed with the whole-genome sequences to assign variant lineages/sublineages to the HPV52/58 isolates. The variability in amino acid sequences of viral proteins was assessed by calculating the Shannon entropy scores at individual amino acid positions of HPV proteins. Among 52 isolates of HPV52 (CIN1, n  = 20; CIN2/3, n  = 21; ICC, n  = 11), 50 isolates belonged to lineage B (sublineage B2) and two isolates belonged to lineage A (sublineage A1). Among 48 isolates of HPV58 (CIN1, n  = 21; CIN2/3, n  = 19; ICC, n  = 8), 47 isolates belonged to lineage A (sublineages A1/A2/A3) and one isolate belonged to lineage C. Single nucleotide polymorphisms specific for individual variant lineages were determined throughout the viral genome based on multiple sequence alignments of the Japanese HPV52/58 isolates and reference HPV52/58 genomes. Entropy analyses revealed that the E1 protein was relatively variable among the HPV52 isolates, whereas the E7, E4, and L2 proteins showed

  4. Genomic stability of adipogenic human adenovirus 36.

    Science.gov (United States)

    Nam, J-H; Na, H-N; Atkinson, R L; Dhurandhar, N V

    2014-02-01

    Human adenovirus Ad36 increases adiposity in several animal models, including rodents and non-human primates. Importantly, Ad36 is associated with human obesity, which has prompted research to understand its epidemiology and to develop a vaccine to prevent a subgroup of obesity. For this purpose, understanding the genomic stability of Ad36 in vivo and in vitro infections is critical. Here, we examined whether in vitro cell passaging over a 14-year period introduced any genetic variation in Ad36. We sequenced the whole genome of Ad36-which was plaque purified in 1998 from the original strain obtained from American Type Culture Collection, and passaged approximately 12 times over the past 14 years (Ad36-2012). This DNA sequence was compared with a previously published sequence of Ad36 likely obtained from the same source (Ad36-1988). Compared with Ad36-1988, only two nucleotides were altered in Ad36-2012: a T insertion at nucleotide 1862, which may induce early termination of the E1B viral protein, and a T➝C transition at nucleotide 26 136. Virus with the T insertion (designated Ad36-2012-T6) was mixed with wild-type virus lacking the T insertion (designated Ad36-2012-T5) in the viral stock. The transition at nucleotide 26 136 does not change the encoded amino acid (aspartic acid) in the pVIII viral protein. The rate of genetic variation in Ad36 is ∼2.37 × 10(-6) mutations/nucleotide/passage. Of particular importance, there were no mutations in the E4orf1 gene, the critical gene for producing obesity. This very-low-variation rate should reduce concerns about genetic variability when developing Ad36 vaccines or developing assays for detecting Ad36 infection in populations.

  5. GWAMA: software for genome-wide association meta-analysis

    Directory of Open Access Journals (Sweden)

    Mägi Reedik

    2010-05-01

    Full Text Available Abstract Background Despite the recent success of genome-wide association studies in identifying novel loci contributing effects to complex human traits, such as type 2 diabetes and obesity, much of the genetic component of variation in these phenotypes remains unexplained. One way to improving power to detect further novel loci is through meta-analysis of studies from the same population, increasing the sample size over any individual study. Although statistical software analysis packages incorporate routines for meta-analysis, they are ill equipped to meet the challenges of the scale and complexity of data generated in genome-wide association studies. Results We have developed flexible, open-source software for the meta-analysis of genome-wide association studies. The software incorporates a variety of error trapping facilities, and provides a range of meta-analysis summary statistics. The software is distributed with scripts that allow simple formatting of files containing the results of each association study and generate graphical summaries of genome-wide meta-analysis results. Conclusions The GWAMA (Genome-Wide Association Meta-Analysis software has been developed to perform meta-analysis of summary statistics generated from genome-wide association studies of dichotomous phenotypes or quantitative traits. Software with source files, documentation and example data files are freely available online at http://www.well.ox.ac.uk/GWAMA.

  6. A human genome-wide library of local phylogeny predictions for whole-genome inference problems

    Directory of Open Access Journals (Sweden)

    Schwartz Russell

    2008-08-01

    Full Text Available Abstract Background Many common inference problems in computational genetics depend on inferring aspects of the evolutionary history of a data set given a set of observed modern sequences. Detailed predictions of the full phylogenies are therefore of value in improving our ability to make further inferences about population history and sources of genetic variation. Making phylogenetic predictions on the scale needed for whole-genome analysis is, however, extremely computationally demanding. Results In order to facilitate phylogeny-based predictions on a genomic scale, we develop a library of maximum parsimony phylogenies within local regions spanning all autosomal human chromosomes based on Haplotype Map variation data. We demonstrate the utility of this library for population genetic inferences by examining a tree statistic we call 'imperfection,' which measures the reuse of variant sites within a phylogeny. This statistic is significantly predictive of recombination rate, shows additional regional and population-specific conservation, and allows us to identify outlier genes likely to have experienced unusual amounts of variation in recent human history. Conclusion Recent theoretical advances in algorithms for phylogenetic tree reconstruction have made it possible to perform large-scale inferences of local maximum parsimony phylogenies from single nucleotide polymorphism (SNP data. As results from the imperfection statistic demonstrate, phylogeny predictions encode substantial information useful for detecting genomic features and population history. This data set should serve as a platform for many kinds of inferences one may wish to make about human population history and genetic variation.

  7. Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence.

    Science.gov (United States)

    Sniekers, Suzanne; Stringer, Sven; Watanabe, Kyoko; Jansen, Philip R; Coleman, Jonathan R I; Krapohl, Eva; Taskesen, Erdogan; Hammerschlag, Anke R; Okbay, Aysu; Zabaneh, Delilah; Amin, Najaf; Breen, Gerome; Cesarini, David; Chabris, Christopher F; Iacono, William G; Ikram, M Arfan; Johannesson, Magnus; Koellinger, Philipp; Lee, James J; Magnusson, Patrik K E; McGue, Matt; Miller, Mike B; Ollier, William E R; Payton, Antony; Pendleton, Neil; Plomin, Robert; Rietveld, Cornelius A; Tiemeier, Henning; van Duijn, Cornelia M; Posthuma, Danielle

    2017-07-01

    Intelligence is associated with important economic and health-related life outcomes. Despite intelligence having substantial heritability (0.54) and a confirmed polygenic nature, initial genetic studies were mostly underpowered. Here we report a meta-analysis for intelligence of 78,308 individuals. We identify 336 associated SNPs (METAL P intelligence in childhood (0.45) and adulthood (0.80), we show substantial genetic correlation (r g = 0.89, LD score regression P = 5.4 × 10 -29 ). These findings provide new insight into the genetic architecture of intelligence.

  8. The human genome: Some assembly required. Final report

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1994-12-31

    The Human Genome Project promises to be one of the most rewarding endeavors in modern biology. The cost and the ethical and social implications, however, have made this project the source of considerable debate both in the scientific community and in the public at large. The 1994 Graduate Student Symposium addresses the scientific merits of the project, the technical issues involved in accomplishing the task, as well as the medical and social issues which stem from the wealth of knowledge which the Human Genome Project will help create. To this end, speakers were brought together who represent the diverse areas of expertise characteristic of this multidisciplinary project. The keynote speaker addresses the project`s motivations and goals in the larger context of biological and medical sciences. The first two sessions address relevant technical issues, data collection with a focus on high-throughput sequencing methods and data analysis with an emphasis on identification of coding sequences. The third session explores recent advances in the understanding of genetic diseases and possible routes to treatment. Finally, the last session addresses some of the ethical, social and legal issues which will undoubtedly arise from having a detailed knowledge of the human genome.

  9. 75 FR 8374 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-02-24

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Officer, Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health...

  10. 77 FR 5035 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2012-02-01

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... Officer, Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health...

  11. 78 FR 64222 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2013-10-28

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... Review, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, 301...

  12. 77 FR 20646 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2012-04-05

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research.... Agenda: To review and evaluate grant applications. Place: National Human Genome Research Institute, 5635...

  13. 77 FR 58402 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2012-09-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research...: To review and evaluate grant applications. Place: National Human Genome Research Institute, 5635...

  14. 76 FR 65204 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2011-10-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... constitute a clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome... Review Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane...

  15. 77 FR 12604 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2012-03-01

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. >Name of Committee: National Human Genome Research... review and evaluate contract proposals. Place: National Human Genome Reseach Institute, 5635 Fishers Lane...

  16. 78 FR 55752 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2013-09-11

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research.... Pozzatti, Ph.D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research...

  17. 78 FR 56905 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-09-16

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research....m. Agenda: To review and evaluate grant applications. Place: National Human Genome Research...

  18. 76 FR 28056 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-05-13

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Counselors, National Human Genome Research Institute. The meeting will be closed to the public as indicated... National Human Genome Research Institute, including consideration of personnel qualifications and...

  19. 76 FR 17930 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-03-31

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Review Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane...

  20. 77 FR 59933 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2012-10-01

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research....D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research Institute...

  1. 78 FR 107 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-01-02

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... evaluate grant applications. Place: National Human Genome Research Institute, 3rd Floor Conference Room....D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research Institute...

  2. 76 FR 58023 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-09-19

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial..., Scientific Review Officer, Office of Scientific Review, National Human Genome Research Institute, National...

  3. 77 FR 28888 - National Human Genome Research Institute Notice of Closed Meeting

    Science.gov (United States)

    2012-05-16

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial...: To review and evaluate grant applications. Place: National Human Genome Research Institute, 3635...

  4. 78 FR 70063 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-11-22

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Counselors, National Human Genome Research Institute. The meeting will be closed to the public as indicated... NATIONAL HUMAN GENOME RESEARCH INSTITUTE, including consideration of personnel qualifications and...

  5. 78 FR 9707 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2013-02-11

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076...

  6. 77 FR 71604 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-12-03

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special..., Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health, 5635...

  7. 76 FR 5390 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-01-31

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Place: National Human Genome Research Institute Special Emphasis... Officer, Scientific Review Branch, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076...

  8. 75 FR 13558 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-03-22

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Counselors, National Human Genome Research Institute. The meeting will be closed to the public as indicated... National Human Genome Research Institute, including consideration of personnel qualifications and...

  9. 76 FR 29772 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2011-05-23

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... of Scientific Review, National Human Genome Research Institute, National Institutes of Health...

  10. Functional assessment of human enhancer activities using whole-genome STARR-sequencing.

    Science.gov (United States)

    Liu, Yuwen; Yu, Shan; Dhiman, Vineet K; Brunetti, Tonya; Eckart, Heather; White, Kevin P

    2017-11-20

    Genome-wide quantification of enhancer activity in the human genome has proven to be a challenging problem. Recent efforts have led to the development of powerful tools for enhancer quantification. However, because of genome size and complexity, these tools have yet to be applied to the whole human genome.  In the current study, we use a human prostate cancer cell line, LNCaP as a model to perform whole human genome STARR-seq (WHG-STARR-seq) to reliably obtain an assessment of enhancer activity. This approach builds upon previously developed STARR-seq in the fly genome and CapSTARR-seq techniques in targeted human genomic regions. With an improved library preparation strategy, our approach greatly increases the library complexity per unit of starting material, which makes it feasible and cost-effective to explore the landscape of regulatory activity in the much larger human genome. In addition to our ability to identify active, accessible enhancers located in open chromatin regions, we can also detect sequences with the potential for enhancer activity that are located in inaccessible, closed chromatin regions. When treated with the histone deacetylase inhibitor, Trichostatin A, genes nearby this latter class of enhancers are up-regulated, demonstrating the potential for endogenous functionality of these regulatory elements. WHG-STARR-seq provides an improved approach to current pipelines for analysis of high complexity genomes to gain a better understanding of the intricacies of transcriptional regulation.

  11. Forces shaping the fastest evolving regions in the human genome

    DEFF Research Database (Denmark)

    Pollard, Katherine S; Salama, Sofie R; King, Bryan

    2006-01-01

    Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202...... genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements...... contributed to accelerated evolution of the fastest evolving elements in the human genome....

  12. Genome Architecture and Its Roles in Human Copy Number Variation

    Directory of Open Access Journals (Sweden)

    Lu Chen

    2014-12-01

    Full Text Available Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs, are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.

  13. The Human Genome Project: An Imperative for International Collaboration.

    Science.gov (United States)

    Allende, J. E.

    1989-01-01

    Discussed is the Human Genome Project which aims to decipher the totality of the human genetic information. The historical background, the objectives, international cooperation, ethical discussion, and the role of UNESCO are included. (KR)

  14. Complete Genome Sequence of the Human Gut Symbiont Roseburia hominis

    DEFF Research Database (Denmark)

    Travis, Anthony J.; Kelly, Denise; Flint, Harry J

    2015-01-01

    We report here the complete genome sequence of the human gut symbiont Roseburia hominis A2-183(T) (= DSM 16839(T) = NCIMB 14029(T)), isolated from human feces. The genome is represented by a 3,592,125-bp chromosome with 3,405 coding sequences. A number of potential functions contributing to host...

  15. Child Development and Structural Variation in the Human Genome

    Science.gov (United States)

    Zhang, Ying; Haraksingh, Rajini; Grubert, Fabian; Abyzov, Alexej; Gerstein, Mark; Weissman, Sherman; Urban, Alexander E.

    2013-01-01

    Structural variation of the human genome sequence is the insertion, deletion, or rearrangement of stretches of DNA sequence sized from around 1,000 to millions of base pairs. Over the past few years, structural variation has been shown to be far more common in human genomes than previously thought. Very little is currently known about the effects…

  16. Widespread of horizontal gene transfer in the human genome.

    Science.gov (United States)

    Huang, Wenze; Tsai, Lillian; Li, Yulong; Hua, Nan; Sun, Chen; Wei, Chaochun

    2017-04-04

    A fundamental concept in biology is that heritable material is passed from parents to offspring, a process called vertical gene transfer. An alternative mechanism of gene acquisition is through horizontal gene transfer (HGT), which involves movement of genetic materials between different species. Horizontal gene transfer has been found prevalent in prokaryotes but very rare in eukaryote. In this paper, we investigate horizontal gene transfer in the human genome. From the pair-wise alignments between human genome and 53 vertebrate genomes, 1,467 human genome regions (2.6 M bases) from all chromosomes were found to be more conserved with non-mammals than with most mammals. These human genome regions involve 642 known genes, which are enriched with ion binding. Compared to known horizontal gene transfer regions in the human genome, there were few overlapping regions, which indicated horizontal gene transfer is more common than we expected in the human genome. Horizontal gene transfer impacts hundreds of human genes and this study provided insight into potential mechanisms of HGT in the human genome.

  17. What does it mean to be genomically literate?: National Human Genome Research Institute Meeting Report.

    Science.gov (United States)

    Hurle, Belen; Citrin, Toby; Jenkins, Jean F; Kaphingst, Kimberly A; Lamb, Neil; Roseman, Jo Ellen; Bonham, Vence L

    2013-08-01

    Genomic discoveries will increasingly advance the science of medicine. Limited genomic literacy may adversely impact the public's understanding and use of the power of genetics and genomics in health care and public health. In November 2011, a meeting was held by the National Human Genome Research Institute to examine the challenge of achieving genomic literacy for the general public, from kindergarten to grade 12 to adult education. The role of the media in disseminating scientific messages and in perpetuating or reducing misconceptions was also discussed. Workshop participants agreed that genomic literacy will be achieved only through active engagement between genomics experts and the varied constituencies that comprise the public. This report summarizes the background, content, and outcomes from this meeting, including recommendations for a research agenda to inform decisions about how to advance genomic literacy in our society.

  18. Chromatin structure and evolution in the human genome

    Directory of Open Access Journals (Sweden)

    Dunlop Malcolm G

    2007-05-01

    Full Text Available Abstract Background Evolutionary rates are not constant across the human genome but genes in close proximity have been shown to experience similar levels of divergence and selection. The higher-order organisation of chromosomes has often been invoked to explain such phenomena but previously there has been insufficient data on chromosome structure to investigate this rigorously. Using the results of a recent genome-wide analysis of open and closed human chromatin structures we have investigated the global association between divergence, selection and chromatin structure for the first time. Results In this study we have shown that, paradoxically, synonymous site divergence (dS at non-CpG sites is highest in regions of open chromatin, primarily as a result of an increased number of transitions, while the rates of other traditional measures of mutation (intergenic, intronic and ancient repeat divergence as well as SNP density are highest in closed regions of the genome. Analysis of human-chimpanzee divergence across intron-exon boundaries indicates that although genes in relatively open chromatin generally display little selection at their synonymous sites, those in closed regions show markedly lower divergence at their fourfold degenerate sites than in neighbouring introns and intergenic regions. Exclusion of known Exonic Splice Enhancer hexamers has little affect on the divergence observed at fourfold degenerate sites across chromatin categories; however, we show that closed chromatin is enriched with certain classes of ncRNA genes whose RNA secondary structure may be particularly important. Conclusion We conclude that, overall, non-CpG mutation rates are lowest in open regions of the genome and that regions of the genome with a closed chromatin structure have the highest background mutation rate. This might reflect lower rates of DNA damage or enhanced DNA repair processes in regions of open chromatin. Our results also indicate that dS is a poor

  19. Mycobacterial species as case-study of comparative genome analysis.

    Science.gov (United States)

    Zakham, F; Belayachi, L; Ussery, D; Akrim, M; Benjouad, A; El Aouad, R; Ennaji, M M

    2011-02-08

    The genus Mycobacterium represents more than 120 species including important pathogens of human and cause major public health problems and illnesses. Further, with more than 100 genome sequences from this genus, comparative genome analysis can provide new insights for better understanding the evolutionary events of these species and improving drugs, vaccines, and diagnostics tools for controlling Mycobacterial diseases. In this present study we aim to outline a comparative genome analysis of fourteen Mycobacterial genomes: M. avium subsp. paratuberculosis K—10, M. bovis AF2122/97, M. bovis BCG str. Pasteur 1173P2, M. leprae Br4923, M. marinum M, M. sp. KMS, M. sp. MCS, M. tuberculosis CDC1551, M. tuberculosis F11, M. tuberculosis H37Ra, M. tuberculosis H37Rv, M. tuberculosis KZN 1435 , M. ulcerans Agy99,and M. vanbaalenii PYR—1, For this purpose a comparison has been done based on their length of genomes, GC content, number of genes in different data bases (Genbank, Refseq, and Prodigal). The BLAST matrix of these genomes has been figured to give a lot of information about the similarity between species in a simple scheme. As a result of multiple genome analysis, the pan and core genome have been defined for twelve Mycobacterial species. We have also introduced the genome atlas of the reference strain M. tuberculosis H37Rv which can give a good overview of this genome. And for examining the phylogenetic relationships among these bacteria, a phylogenic tree has been constructed from 16S rRNA gene for tuberculosis and non tuberculosis Mycobacteria to understand the evolutionary events of these species.

  20. Human genome-microbiome interaction: metagenomics frontiers for the aetiopathology of autoimmune diseases

    Science.gov (United States)

    Nalbantoglu, Ufuk

    2017-01-01

    A short while ago, the human genome and microbiome were analysed simultaneously for the first time as a multi-omic approach. The analyses of heterogeneous population cohorts showed that microbiome components were associated with human genome variations. In-depth analysis of these results reveals that the majority of those relationships are between immune pathways and autoimmune disease-associated microbiome components. Thus, it can be hypothesized that autoimmunity may be associated with homeostatic disequilibrium of the human-microbiome interactome. Further analysis of human genome–human microbiome relationships in disease contexts with tailored systems biology approaches may yield insights into disease pathogenesis and prognosis. PMID:28785422

  1. Genome Sequencing and Analysis Conference IV

    Energy Technology Data Exchange (ETDEWEB)

    1993-12-31

    J. Craig Venter and C. Thomas Caskey co-chaired Genome Sequencing and Analysis Conference IV held at Hilton Head, South Carolina from September 26--30, 1992. Venter opened the conference by noting that approximately 400 researchers from 16 nations were present four times as many participants as at Genome Sequencing Conference I in 1989. Venter also introduced the Data Fair, a new component of the conference allowing exchange and on-site computer analysis of unpublished sequence data.

  2. A framework for annotating human genome in disease context.

    Science.gov (United States)

    Xu, Wei; Wang, Huisong; Cheng, Wenqing; Fu, Dong; Xia, Tian; Kibbe, Warren A; Lin, Simon M

    2012-01-01

    Identification of gene-disease association is crucial to understanding disease mechanism. A rapid increase in biomedical literatures, led by advances of genome-scale technologies, poses challenge for manually-curated-based annotation databases to characterize gene-disease associations effectively and timely. We propose an automatic method-The Disease Ontology Annotation Framework (DOAF) to provide a comprehensive annotation of the human genome using the computable Disease Ontology (DO), the NCBO Annotator service and NCBI Gene Reference Into Function (GeneRIF). DOAF can keep the resulting knowledgebase current by periodically executing automatic pipeline to re-annotate the human genome using the latest DO and GeneRIF releases at any frequency such as daily or monthly. Further, DOAF provides a computable and programmable environment which enables large-scale and integrative analysis by working with external analytic software or online service platforms. A user-friendly web interface (doa.nubic.northwestern.edu) is implemented to allow users to efficiently query, download, and view disease annotations and the underlying evidences.

  3. Detection and analysis of ancient segmental duplications in mammalian genomes.

    Science.gov (United States)

    Pu, Lianrong; Lin, Yu; Pevzner, Pavel A

    2018-05-07

    Although segmental duplications (SDs) represent hotbeds for genomic rearrangements and emergence of new genes, there are still no easy-to-use tools for identifying SDs. Moreover, while most previous studies focused on recently emerged SDs, detection of ancient SDs remains an open problem. We developed an SDquest algorithm for SD finding and applied it to analyzing SDs in human, gorilla, and mouse genomes. Our results demonstrate that previous studies missed many SDs in these genomes and show that SDs account for at least 6.05% of the human genome (version hg19), a 17% increase as compared to the previous estimate. Moreover, SDquest classified 6.42% of the latest GRCh38 version of the human genome as SDs, a large increase as compared to previous studies. We thus propose to re-evaluate evolution of SDs based on their accurate representation across multiple genomes. Toward this goal, we analyzed the complex mosaic structure of SDs and decomposed mosaic SDs into elementary SDs, a prerequisite for follow-up evolutionary analysis. We also introduced the concept of the breakpoint graph of mosaic SDs that revealed SD hotspots and suggested that some SDs may have originated from circular extrachromosomal DNA (ecDNA), not unlike ecDNA that contributes to accelerated evolution in cancer. © 2018 Pu et al.; Published by Cold Spring Harbor Laboratory Press.

  4. Data mining and the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Abarbanel, Henry [The MITRE Corporation, McLean, VA (US). JASON Program Office; Callan, Curtis [The MITRE Corporation, McLean, VA (US). JASON Program Office; Dally, William [The MITRE Corporation, McLean, VA (US). JASON Program Office; Dyson, Freeman [The MITRE Corporation, McLean, VA (US). JASON Program Office; Hwa, Terence [The MITRE Corporation, McLean, VA (US). JASON Program Office; Koonin, Steven [The MITRE Corporation, McLean, VA (US). JASON Program Office; Levine, Herbert [The MITRE Corporation, McLean, VA (US). JASON Program Office; Rothaus, Oscar [The MITRE Corporation, McLean, VA (US). JASON Program Office; Schwitters, Roy [The MITRE Corporation, McLean, VA (US). JASON Program Office; Stubbs, Christopher [The MITRE Corporation, McLean, VA (US). JASON Program Office; Weinberger, Peter [The MITRE Corporation, McLean, VA (US). JASON Program Office

    2000-01-07

    As genomics research moves from an era of data acquisition to one of both acquisition and interpretation, new methods are required for organizing and prioritizing the data. These methods would allow an initial level of data analysis to be carried out before committing resources to a particular genetic locus. This JASON study sought to delineate the main problems that must be faced in bioinformatics and to identify information technologies that can help to overcome those problems. While the current influx of data greatly exceeds what biologists have experienced in the past, other scientific disciplines and the commercial sector have been handling much larger datasets for many years. Powerful datamining techniques have been developed in other fields that, with appropriate modification, could be applied to the biological sciences.

  5. Fenton reaction induced cancer in wild type rats recapitulates genomic alterations observed in human cancer.

    Directory of Open Access Journals (Sweden)

    Shinya Akatsuka

    Full Text Available Iron overload has been associated with carcinogenesis in humans. Intraperitoneal administration of ferric nitrilotriacetate initiates a Fenton reaction in renal proximal tubules of rodents that ultimately leads to a high incidence of renal cell carcinoma (RCC after repeated treatments. We performed high-resolution microarray comparative genomic hybridization to identify characteristics in the genomic profiles of this oxidative stress-induced rat RCCs. The results revealed extensive large-scale genomic alterations with a preference for deletions. Deletions and amplifications were numerous and sometimes fragmented, demonstrating that a Fenton reaction is a cause of such genomic alterations in vivo. Frequency plotting indicated that two of the most commonly altered loci corresponded to a Cdkn2a/2b deletion and a Met amplification. Tumor sizes were proportionally associated with Met expression and/or amplification, and clustering analysis confirmed our results. Furthermore, we developed a procedure to compare whole genomic patterns of the copy number alterations among different species based on chromosomal syntenic relationship. Patterns of the rat RCCs showed the strongest similarity to the human RCCs among five types of human cancers, followed by human malignant mesothelioma, an iron overload-associated cancer. Therefore, an iron-dependent Fenton chemical reaction causes large-scale genomic alterations during carcinogenesis, which may result in distinct genomic profiles. Based on the characteristics of extensive genome alterations in human cancer, our results suggest that this chemical reaction may play a major role during human carcinogenesis.

  6. Microbial genome analysis: the COG approach.

    Science.gov (United States)

    Galperin, Michael Y; Kristensen, David M; Makarova, Kira S; Wolf, Yuri I; Koonin, Eugene V

    2017-09-14

    For the past 20 years, the Clusters of Orthologous Genes (COG) database had been a popular tool for microbial genome annotation and comparative genomics. Initially created for the purpose of evolutionary classification of protein families, the COG have been used, apart from straightforward functional annotation of sequenced genomes, for such tasks as (i) unification of genome annotation in groups of related organisms; (ii) identification of missing and/or undetected genes in complete microbial genomes; (iii) analysis of genomic neighborhoods, in many cases allowing prediction of novel functional systems; (iv) analysis of metabolic pathways and prediction of alternative forms of enzymes; (v) comparison of organisms by COG functional categories; and (vi) prioritization of targets for structural and functional characterization. Here we review the principles of the COG approach and discuss its key advantages and drawbacks in microbial genome analysis. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  7. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    Science.gov (United States)

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  8. The polydeoxyadenylate tract of Alu repetitive elements is polymorphic in the human genome

    International Nuclear Information System (INIS)

    Economou, E.P.; Bergen, A.W.; Warren, A.C.; Antonarakis, S.E.

    1990-01-01

    To identify DNA polymorphisms that are abundant in the human genome and are detectable by polymerase chain reaction amplification of genomic DNA, the authors hypothesize that the polydeoxyadenylate tract of the Alu family of repetitive elements is polymorphic among human chromosomes. Analysis of the 3' ends of three specific Alu sequences showed two occurrences, one in the adenosine deaminase gene and other in the β-globin pseudogene, were polymorphic. This novel class of polymorphism, termed AluVpA [Alu variable poly(A)] may represent one of the most useful and informative group of DNA markers in the human genome

  9. MIPS: analysis and annotation of proteins from whole genomes.

    Science.gov (United States)

    Mewes, H W; Amid, C; Arnold, R; Frishman, D; Güldener, U; Mannhaupt, G; Münsterkötter, M; Pagel, P; Strack, N; Stümpflen, V; Warfsmann, J; Ruepp, A

    2004-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian protein-protein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).

  10. The Past, Present, and Future of Human Centromere Genomics

    Directory of Open Access Journals (Sweden)

    Megan E. Aldrup-MacDonald

    2014-01-01

    Full Text Available The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function.

  11. Differential DNA Methylation Analysis without a Reference Genome

    Directory of Open Access Journals (Sweden)

    Johanna Klughammer

    2015-12-01

    Full Text Available Genome-wide DNA methylation mapping uncovers epigenetic changes associated with animal development, environmental adaptation, and species evolution. To address the lack of high-throughput methods for DNA methylation analysis in non-model organisms, we developed an integrated approach for studying DNA methylation differences independent of a reference genome. Experimentally, our method relies on an optimized 96-well protocol for reduced representation bisulfite sequencing (RRBS, which we have validated in nine species (human, mouse, rat, cow, dog, chicken, carp, sea bass, and zebrafish. Bioinformatically, we developed the RefFreeDMA software to deduce ad hoc genomes directly from RRBS reads and to pinpoint differentially methylated regions between samples or groups of individuals (http://RefFreeDMA.computational-epigenetics.org. The identified regions are interpreted using motif enrichment analysis and/or cross-mapping to annotated genomes. We validated our method by reference-free analysis of cell-type-specific DNA methylation in the blood of human, cow, and carp. In summary, we present a cost-effective method for epigenome analysis in ecology and evolution, which enables epigenome-wide association studies in natural populations and species without a reference genome.

  12. Exploratory analysis of genomic segmentations with Segtools

    Directory of Open Access Journals (Sweden)

    Buske Orion J

    2011-10-01

    Full Text Available Abstract Background As genome-wide experiments and annotations become more prevalent, researchers increasingly require tools to help interpret data at this scale. Many functional genomics experiments involve partitioning the genome into labeled segments, such that segments sharing the same label exhibit one or more biochemical or functional traits. For example, a collection of ChlP-seq experiments yields a compendium of peaks, each labeled with one or more associated DNA-binding proteins. Similarly, manually or automatically generated annotations of functional genomic elements, including cis-regulatory modules and protein-coding or RNA genes, can also be summarized as genomic segmentations. Results We present a software toolkit called Segtools that simplifies and automates the exploration of genomic segmentations. The software operates as a series of interacting tools, each of which provides one mode of summarization. These various tools can be pipelined and summarized in a single HTML page. We describe the Segtools toolkit and demonstrate its use in interpreting a collection of human histone modification data sets and Plasmodium falciparum local chromatin structure data sets. Conclusions Segtools provides a convenient, powerful means of interpreting a genomic segmentation.

  13. Rapid detection of structural variation in a human genome using nanochannel-based genome mapping technology

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Hastie, Alex R.; Cao, Dandan

    2014-01-01

    mutations; however, none of the current detection methods are comprehensive, and currently available methodologies are incapable of providing sufficient resolution and unambiguous information across complex regions in the human genome. To address these challenges, we applied a high-throughput, cost......-effective genome mapping technology to comprehensively discover genome-wide SVs and characterize complex regions of the YH genome using long single molecules (>150 kb) in a global fashion. RESULTS: Utilizing nanochannel-based genome mapping technology, we obtained 708 insertions/deletions and 17 inversions larger...... fosmid data. Of the remaining 270 SVs, 260 are insertions and 213 overlap known SVs in the Database of Genomic Variants. Overall, 609 out of 666 (90%) variants were supported by experimental orthogonal methods or historical evidence in public databases. At the same time, genome mapping also provides...

  14. The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081.

    Directory of Open Access Journals (Sweden)

    Nicholas R Thomson

    2006-12-01

    Full Text Available The human enteropathogen, Yersinia enterocolitica, is a significant link in the range of Yersinia pathologies extending from mild gastroenteritis to bubonic plague. Comparison at the genomic level is a key step in our understanding of the genetic basis for this pathogenicity spectrum. Here we report the genome of Y. enterocolitica strain 8081 (serotype 0:8; biotype 1B and extensive microarray data relating to the genetic diversity of the Y. enterocolitica species. Our analysis reveals that the genome of Y. enterocolitica strain 8081 is a patchwork of horizontally acquired genetic loci, including a plasticity zone of 199 kb containing an extraordinarily high density of virulence genes. Microarray analysis has provided insights into species-specific Y. enterocolitica gene functions and the intraspecies differences between the high, low, and nonpathogenic Y. enterocolitica biotypes. Through comparative genome sequence analysis we provide new information on the evolution of the Yersinia. We identify numerous loci that represent ancestral clusters of genes potentially important in enteric survival and pathogenesis, which have been lost or are in the process of being lost, in the other sequenced Yersinia lineages. Our analysis also highlights large metabolic operons in Y. enterocolitica that are absent in the related enteropathogen, Yersinia pseudotuberculosis, indicating major differences in niche and nutrients used within the mammalian gut. These include clusters directing, the production of hydrogenases, tetrathionate respiration, cobalamin synthesis, and propanediol utilisation. Along with ancestral gene clusters, the genome of Y. enterocolitica has revealed species-specific and enteropathogen-specific loci. This has provided important insights into the pathology of this bacterium and, more broadly, into the evolution of the genus. Moreover, wider investigations looking at the patterns of gene loss and gain in the Yersinia have highlighted common

  15. A Secure Alignment Algorithm for Mapping Short Reads to Human Genome.

    Science.gov (United States)

    Zhao, Yongan; Wang, Xiaofeng; Tang, Haixu

    2018-05-09

    The elastic and inexpensive computing resources such as clouds have been recognized as a useful solution to analyzing massive human genomic data (e.g., acquired by using next-generation sequencers) in biomedical researches. However, outsourcing human genome computation to public or commercial clouds was hindered due to privacy concerns: even a small number of human genome sequences contain sufficient information for identifying the donor of the genomic data. This issue cannot be directly addressed by existing security and cryptographic techniques (such as homomorphic encryption), because they are too heavyweight to carry out practical genome computation tasks on massive data. In this article, we present a secure algorithm to accomplish the read mapping, one of the most basic tasks in human genomic data analysis based on a hybrid cloud computing model. Comparing with the existing approaches, our algorithm delegates most computation to the public cloud, while only performing encryption and decryption on the private cloud, and thus makes the maximum use of the computing resource of the public cloud. Furthermore, our algorithm reports similar results as the nonsecure read mapping algorithms, including the alignment between reads and the reference genome, which can be directly used in the downstream analysis such as the inference of genomic variations. We implemented the algorithm in C++ and Python on a hybrid cloud system, in which the public cloud uses an Apache Spark system.

  16. Chromosomal locations of members of a family of novel endogenous human retroviral genomes

    International Nuclear Information System (INIS)

    Horn, T.M.; Huebner, K.; Croce, C.; Callahan, R.

    1986-01-01

    Human cellular DNA contains two distinguishable families of retroviral related sequences. One family shares extensive nucleotide sequence homology with infectious mammalian type C retroviral genomes. The other family contains major regions of homology with the pol genes of infectious type A and B and avian type C and D retroviral genomes. Analysis of the human recombinant clone HLM-2 has shown that the pol gene in the latter family is located within an endogenous proviral genome. The authors show that the proviral genome in HLM-2 and the related recombinant clone HLM-25 are located, respectively, on human chromosomes 1 and 5. Other related proviral genomes are located on chromosomes 7, 8, 11, 14, and 17

  17. Complete genome sequence of a commensal bacterium, Hafnia alvei CBA7124, isolated from human feces.

    Science.gov (United States)

    Song, Hye Seon; Kim, Joon Yong; Kim, Yeon Bee; Jeong, Myeong Seon; Kang, Jisu; Rhee, Jin-Kyu; Kwon, Joseph; Kim, Ju Suk; Choi, Jong-Soon; Choi, Hak-Jong; Nam, Young-Do; Roh, Seong Woon

    2017-01-01

    Members of the genus Hafnia have been isolated from the feces of mammals, birds, reptiles, and fish, as well as from soil, water, sewage, and foods. Hafnia alvei is an opportunistic pathogen that has been implicated in intestinal and extraintestinal infections in humans. However, its pathogenicity is still unclear. In this study, we isolated H. alvei from human feces and performed sequencing as well as comparative genomic analysis to better understand its pathogenicity. The genome of H. alvei CBA7124 comprised a single circular chromosome with 4,585,298 bp and a GC content of 48.8%. The genome contained 25 rRNA genes (9 5S rRNA genes, 8 16S rRNA genes, and 8 23S rRNA genes), 88 tRNA genes, and 4043 protein-coding genes. Using comparative genomic analysis, the genome of this strain was found to have 72 strain-specific singletons. The genome also contained genes for antibiotic and antimicrobial resistance, as well as toxin-antitoxin systems. We revealed the complete genome sequence of the opportunistic gut pathogen, H. alvei CBA7124. We also performed comparative genomic analysis of the sequences in the genome of H. alvei CBA7124, and found that it contained strain-specific singletons, antibiotic resistance genes, and toxin-antitoxin systems. These results could improve our understanding of the pathogenicity and the mechanism behind the antibiotic resistance of H. alvei strains.

  18. Recent and ongoing selection in the human genome

    DEFF Research Database (Denmark)

    Nielsen, Rasmus; Hellmann, Ines; Hubisz, Melissa

    2007-01-01

    The recent availability of genome-scale genotyping data has led to the identification of regions of the human genome that seem to have been targeted by selection. These findings have increased our understanding of the evolutionary forces that affect the human genome, have augmented our knowledge...... of gene function and promise to increase our understanding of the genetic basis of disease. However, inferences of selection are challenged by several confounding factors, especially the complex demographic history of human populations, and concordance between studies is variable. Although such studies...

  19. Localizing recent adaptive evolution in the human genome

    DEFF Research Database (Denmark)

    Williamson, Scott H; Hubisz, Melissa J; Clark, Andrew G

    2007-01-01

    , clusters of olfactory receptors, genes involved in nervous system development and function, immune system genes, and heat shock genes. We also observe consistent evidence of selective sweeps in centromeric regions. In general, we find that recent adaptation is strikingly pervasive in the human genome......-nucleotide polymorphism ascertainment, while also providing fine-scale estimates of the position of the selected site, we analyzed a genomic dataset of 1.2 million human single-nucleotide polymorphisms genotyped in African-American, European-American, and Chinese samples. We identify 101 regions of the human genome...

  20. A genomic atlas of human adrenal and gonad development

    Science.gov (United States)

    del Valle, Ignacio; Buonocore, Federica; Duncan, Andrew J.; Lin, Lin; Barenco, Martino; Parnaik, Rahul; Shah, Sonia; Hubank, Mike; Gerrelli, Dianne; Achermann, John C.

    2017-01-01

    Background: In humans, the adrenal glands and gonads undergo distinct biological events between 6-10 weeks post conception (wpc), such as testis determination, the onset of steroidogenesis and primordial germ cell development. However, relatively little is currently known about the genetic mechanisms underlying these processes. We therefore aimed to generate a detailed genomic atlas of adrenal and gonad development across these critical stages of human embryonic and fetal development. Methods: RNA was extracted from 53 tissue samples between 6-10 wpc (adrenal, testis, ovary and control). Affymetrix array analysis was performed and differential gene expression was analysed using Bioconductor. A mathematical model was constructed to investigate time-series changes across the dataset. Pathway analysis was performed using ClueGo and cellular localisation of novel factors confirmed using immunohistochemistry. Results: Using this approach, we have identified novel components of adrenal development (e.g. ASB4, NPR3) and confirmed the role of SRY as the main human testis-determining gene. By mathematical modelling time-series data we have found new genes up-regulated with SOX9 in the testis (e.g. CITED1), which may represent components of the testis development pathway. We have shown that testicular steroidogenesis has a distinct onset at around 8 wpc and identified potential novel components in adrenal and testicular steroidogenesis (e.g. MGARP, FOXO4, MAP3K15, GRAMD1B, RMND2), as well as testis biomarkers (e.g. SCUBE1). We have also shown that the developing human ovary expresses distinct subsets of genes (e.g. OR10G9, OR4D5), but enrichment for established biological pathways is limited. Conclusion: This genomic atlas is revealing important novel aspects of human development and new candidate genes for adrenal and reproductive disorders. PMID:28459107

  1. Comparative analysis of prophages in Streptococcus mutans genomes

    Science.gov (United States)

    Fu, Tiwei; Fan, Xiangyu; Long, Quanxin; Deng, Wanyan; Song, Jinlin

    2017-01-01

    Prophages have been considered genetic units that have an intimate association with novel phenotypic properties of bacterial hosts, such as pathogenicity and genomic variation. Little is known about the genetic information of prophages in the genome of Streptococcus mutans, a major pathogen of human dental caries. In this study, we identified 35 prophage-like elements in S. mutans genomes and performed a comparative genomic analysis. Comparative genomic and phylogenetic analyses of prophage sequences revealed that the prophages could be classified into three main large clusters: Cluster A, Cluster B, and Cluster C. The S. mutans prophages in each cluster were compared. The genomic sequences of phismuN66-1, phismuNLML9-1, and phismu24-1 all shared similarities with the previously reported S. mutans phages M102, M102AD, and ϕAPCM01. The genomes were organized into seven major gene clusters according to the putative functions of the predicted open reading frames: packaging and structural modules, integrase, host lysis modules, DNA replication/recombination modules, transcriptional regulatory modules, other protein modules, and hypothetical protein modules. Moreover, an integrase gene was only identified in phismuNLML9-1 prophages. PMID:29158986

  2. Comparative genome analysis of trypanotolerance QTL | Nganga ...

    African Journals Online (AJOL)

    Homologous sequences were used in the definition of synteny relationships and subsequent identification of the shared disease response genes. The homologous genes within the human genome were then identified and aligned to the bovine radiation hybrid map in order to identify the mouse/bovine homologous regions.

  3. A Distance Measure for Genome Phylogenetic Analysis

    Science.gov (United States)

    Cao, Minh Duc; Allison, Lloyd; Dix, Trevor

    Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirement. Another class of tree construction methods, namely distance-based methods, require a measure of distances between any two genomes. Some measures such as evolutionary edit distance of gene order and gene content are computational expensive or do not perform well when the gene content of the organisms are similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.

  4. Genome sequences and comparative genomics of two Lactobacillus ruminis strains from the bovine and human intestinal tracts

    LENUS (Irish Health Repository)

    2011-08-30

    Abstract Background The genus Lactobacillus is characterized by an extraordinary degree of phenotypic and genotypic diversity, which recent genomic analyses have further highlighted. However, the choice of species for sequencing has been non-random and unequal in distribution, with only a single representative genome from the L. salivarius clade available to date. Furthermore, there is no data to facilitate a functional genomic analysis of motility in the lactobacilli, a trait that is restricted to the L. salivarius clade. Results The 2.06 Mb genome of the bovine isolate Lactobacillus ruminis ATCC 27782 comprises a single circular chromosome, and has a G+C content of 44.4%. In silico analysis identified 1901 coding sequences, including genes for a pediocin-like bacteriocin, a single large exopolysaccharide-related cluster, two sortase enzymes, two CRISPR loci and numerous IS elements and pseudogenes. A cluster of genes related to a putative pilin was identified, and shown to be transcribed in vitro. A high quality draft assembly of the genome of a second L. ruminis strain, ATCC 25644 isolated from humans, suggested a slightly larger genome of 2.138 Mb, that exhibited a high degree of synteny with the ATCC 27782 genome. In contrast, comparative analysis of L. ruminis and L. salivarius identified a lack of long-range synteny between these closely related species. Comparison of the L. salivarius clade core proteins with those of nine other Lactobacillus species distributed across 4 major phylogenetic groups identified the set of shared proteins, and proteins unique to each group. Conclusions The genome of L. ruminis provides a comparative tool for directing functional analyses of other members of the L. salivarius clade, and it increases understanding of the divergence of this distinct Lactobacillus lineage from other commensal lactobacilli. The genome sequence provides a definitive resource to facilitate investigation of the genetics, biochemistry and host

  5. Genome Editing: A New Approach to Human Therapeutics.

    Science.gov (United States)

    Porteus, Matthew

    2016-01-01

    The ability to manipulate the genome with precise spatial and nucleotide resolution (genome editing) has been a powerful research tool. In the past decade, the tools and expertise for using genome editing in human somatic cells and pluripotent cells have increased to such an extent that the approach is now being developed widely as a strategy to treat human disease. The fundamental process depends on creating a site-specific DNA double-strand break (DSB) in the genome and then allowing the cell's endogenous DSB repair machinery to fix the break such that precise nucleotide changes are made to the DNA sequence. With the development and discovery of several different nuclease platforms and increasing knowledge of the parameters affecting different genome editing outcomes, genome editing frequencies now reach therapeutic relevance for a wide variety of diseases. Moreover, there is a series of complementary approaches to assessing the safety and toxicity of any genome editing process, irrespective of the underlying nuclease used. Finally, the development of genome editing has raised the issue of whether it should be used to engineer the human germline. Although such an approach could clearly prevent the birth of people with devastating and destructive genetic diseases, questions remain about whether human society is morally responsible enough to use this tool.

  6. Integrated analysis of whole genome and transcriptome sequencing reveals diverse transcriptomic aberrations driven by somatic genomic changes in liver cancers.

    Directory of Open Access Journals (Sweden)

    Yuichi Shiraishi

    Full Text Available Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV-related hepatocellular carcinomas (HCCs and their matched controls. Comparison of whole genome sequence (WGS and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3, and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.

  7. The human genome as public: Justifications and implications.

    Science.gov (United States)

    Bayefsky, Michelle J

    2017-03-01

    Since the human genome was decoded, great emphasis has been placed on the unique, personal nature of the genome, along with the benefits that personalized medicine can bring to individuals and the importance of safeguarding genetic privacy. As a result, an equally important aspect of the human genome - its common nature - has been underappreciated and underrepresented in the ethics literature and policy dialogue surrounding genetics and genomics. This article will argue that, just as the personal nature of the genome has been used to reinforce individual rights and justify important privacy protections, so too the common nature of the genome can be employed to support protections of the genome at a population level and policies designed to promote the public's wellbeing. In order for public health officials to have the authority to develop genetics policies for the sake of the public good, the genome must have not only a common, but also a public, dimension. This article contends that DNA carries a public dimension through the use of two conceptual frameworks: the common heritage (CH) framework and the common resource (CR) framework. Both frameworks establish a public interest in the human genome, but the CH framework can be used to justify policies aimed at preserving and protecting the genome, while the CR framework can be employed to justify policies for utilizing the genome for the public benefit. A variety of possible policy implications are discussed, with special attention paid to the use of large-scale genomics databases for public health research. © Published 2016. This article is a U.S. Government work and is in the public domain in the USA.

  8. Predicting Tissue-Specific Enhancers in the Human Genome

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.; Loots, Gabriela G.; Nobrega, Marcelo A.; Ovcharenko, Ivan

    2006-07-01

    Determining how transcriptional regulatory signals areencoded in vertebrate genomes is essential for understanding the originsof multi-cellular complexity; yet the genetic code of vertebrate generegulation remains poorly understood. In an attempt to elucidate thiscode, we synergistically combined genome-wide gene expression profiling,vertebrate genome comparisons, and transcription factor binding siteanalysis to define sequence signatures characteristic of candidatetissue-specific enhancers in the human genome. We applied this strategyto microarray-based gene expression profiles from 79 human tissues andidentified 7,187 candidate enhancers that defined their flanking geneexpression, the majority of which were located outside of knownpromoters. We cross-validated this method for its ability to de novopredict tissue-specific gene expression and confirmed its reliability in57 of the 79 available human tissues, with an average precision inenhancer recognition ranging from 32 percent to 63 percent, and asensitivity of 47 percent. We used the sequence signatures identified bythis approach to assign tissue-specific predictions to ~;328,000human-mouse conserved noncoding elements in the human genome. Byoverlapping these genome-wide predictions with a large in vivo dataset ofenhancers validated in transgenic mice, we confirmed our results with a28 percent sensitivity and 50 percent precision. These results indicatethe power of combining complementary genomic datasets as an initialcomputational foray into the global view of tissue-specific generegulation in vertebrates.

  9. National human genome projects: an update and an agenda.

    Science.gov (United States)

    An, Joon Yong

    2017-01-01

    Population genetic and human genetic studies are being accelerated with genome technology and data sharing. Accordingly, in the past 10 years, several countries have initiated genetic research using genome technology and identified the genetic architecture of the ethnic groups living in the corresponding country or suggested the genetic foundation of a social phenomenon. Genetic research has been conducted from epidemiological studies that previously described the health or disease conditions in defined population. This perspective summarizes national genome projects conducted in the past 10 years and introduces case studies to utilize genomic data in genetic research.

  10. Segmenting the human genome based on states of neutral genetic divergence.

    Science.gov (United States)

    Kuruppumullage Don, Prabhani; Ananda, Guruprasad; Chiaromonte, Francesca; Makova, Kateryna D

    2013-09-03

    Many studies have demonstrated that divergence levels generated by different mutation types vary and covary across the human genome. To improve our still-incomplete understanding of the mechanistic basis of this phenomenon, we analyze several mutation types simultaneously, anchoring their variation to specific regions of the genome. Using hidden Markov models on insertion, deletion, nucleotide substitution, and microsatellite divergence estimates inferred from human-orangutan alignments of neutrally evolving genomic sequences, we segment the human genome into regions corresponding to different divergence states--each uniquely characterized by specific combinations of divergence levels. We then parsed the mutagenic contributions of various biochemical processes associating divergence states with a broad range of genomic landscape features. We find that high divergence states inhabit guanine- and cytosine (GC)-rich, highly recombining subtelomeric regions; low divergence states cover inner parts of autosomes; chromosome X forms its own state with lowest divergence; and a state of elevated microsatellite mutability is interspersed across the genome. These general trends are mirrored in human diversity data from the 1000 Genomes Project, and departures from them highlight the evolutionary history of primate chromosomes. We also find that genes and noncoding functional marks [annotations from the Encyclopedia of DNA Elements (ENCODE)] are concentrated in high divergence states. Our results provide a powerful tool for biomedical data analysis: segmentations can be used to screen personal genome variants--including those associated with cancer and other diseases--and to improve computational predictions of noncoding functional elements.

  11. Human genome and genetic sequencing research and informed consent

    International Nuclear Information System (INIS)

    Iwakawa, Mayumi

    2003-01-01

    On March 29, 2001, the Ethical Guidelines for Human Genome and Genetic Sequencing Research were established. They have intended to serve as ethical guidelines for all human genome and genetic sequencing research practice, for the purpose of upholding respect for human dignity and rights and enforcing use of proper methods in the pursuit of human genome and genetic sequencing research, with the understanding and cooperation of the public. The RadGenomics Project has prepared a research protocol and informed consent document that follow these ethical guidelines. We have endeavored to protect the privacy of individual information, and have established a procedure for examination of research practices by an ethics committee. Here we report our procedure in order to offer this concept to the patients. (authors)

  12. Genetical genomic determinants of alcohol consumption in rats and humans

    Directory of Open Access Journals (Sweden)

    Mangion Jonathan

    2009-10-01

    Full Text Available Abstract Background We have used a genetical genomic approach, in conjunction with phenotypic analysis of alcohol consumption, to identify candidate genes that predispose to varying levels of alcohol intake by HXB/BXH recombinant inbred rat strains. In addition, in two populations of humans, we assessed genetic polymorphisms associated with alcohol consumption using a custom genotyping array for 1,350 single nucleotide polymorphisms (SNPs. Our goal was to ascertain whether our approach, which relies on statistical and informatics techniques, and non-human animal models of alcohol drinking behavior, could inform interpretation of genetic association studies with human populations. Results In the HXB/BXH recombinant inbred (RI rats, correlation analysis of brain gene expression levels with alcohol consumption in a two-bottle choice paradigm, and filtering based on behavioral and gene expression quantitative trait locus (QTL analyses, generated a list of candidate genes. A literature-based, functional analysis of the interactions of the products of these candidate genes defined pathways linked to presynaptic GABA release, activation of dopamine neurons, and postsynaptic GABA receptor trafficking, in brain regions including the hypothalamus, ventral tegmentum and amygdala. The analysis also implicated energy metabolism and caloric intake control as potential influences on alcohol consumption by the recombinant inbred rats. In the human populations, polymorphisms in genes associated with GABA synthesis and GABA receptors, as well as genes related to dopaminergic transmission, were associated with alcohol consumption. Conclusion Our results emphasize the importance of the signaling pathways identified using the non-human animal models, rather than single gene products, in identifying factors responsible for complex traits such as alcohol consumption. The results suggest cross-species similarities in pathways that influence predisposition to consume

  13. The Human Genome Project: big science transforms biology and medicine

    OpenAIRE

    Hood, Leroy; Rowen, Lee

    2013-01-01

    The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called ‘big science’ - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and a...

  14. Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

    Directory of Open Access Journals (Sweden)

    Maggi Giorgio P

    2008-06-01

    Full Text Available Abstract Background The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent on the availability of annotated proteins. Results In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence thus is suitable for the analysis of new or poorly annotated genomes.

  15. Crossed wires: 3D genome misfolding in human disease.

    Science.gov (United States)

    Norton, Heidi K; Phillips-Cremins, Jennifer E

    2017-11-06

    Mammalian genomes are folded into unique topological structures that undergo precise spatiotemporal restructuring during healthy development. Here, we highlight recent advances in our understanding of how the genome folds inside the 3D nucleus and how these folding patterns are miswired during the onset and progression of mammalian disease states. We discuss potential mechanisms underlying the link among genome misfolding, genome dysregulation, and aberrant cellular phenotypes. We also discuss cases in which the endogenous 3D genome configurations in healthy cells might be particularly susceptible to mutation or translocation. Together, these data support an emerging model in which genome folding and misfolding is critically linked to the onset and progression of a broad range of human diseases. © 2017 Norton and Phillips-Cremins.

  16. The Human Genome Project: big science transforms biology and medicine.

    Science.gov (United States)

    Hood, Leroy; Rowen, Lee

    2013-01-01

    The Human Genome Project has transformed biology through its integrated big science approach to deciphering a reference human genome sequence along with the complete sequences of key model organisms. The project exemplifies the power, necessity and success of large, integrated, cross-disciplinary efforts - so-called 'big science' - directed towards complex major objectives. In this article, we discuss the ways in which this ambitious endeavor led to the development of novel technologies and analytical tools, and how it brought the expertise of engineers, computer scientists and mathematicians together with biologists. It established an open approach to data sharing and open-source software, thereby making the data resulting from the project accessible to all. The genome sequences of microbes, plants and animals have revolutionized many fields of science, including microbiology, virology, infectious disease and plant biology. Moreover, deeper knowledge of human sequence variation has begun to alter the practice of medicine. The Human Genome Project has inspired subsequent large-scale data acquisition initiatives such as the International HapMap Project, 1000 Genomes, and The Cancer Genome Atlas, as well as the recently announced Human Brain Project and the emerging Human Proteome Project.

  17. Ancient Human Genome Sequence of an Extinct Palaeo-Eskimo

    DEFF Research Database (Denmark)

    Rasmussen, Morten; Li, Yingrui; Lindgreen, Stinus

    2010-01-01

    We report here the genome sequence of an ancient human. Obtained from approximately 4,000-year-old permafrost-preserved hair, the genome represents a male individual from the first known culture to settle in Greenland. Sequenced to an average depth of 20x, we recover 79% of the diploid genome...... possible phenotypic characteristics of the individual that belonged to a culture whose location has yielded only trace human remains. We compare the high-confidence SNPs to those of contemporary populations to find the populations most closely related to the individual. This provides evidence...

  18. Genome-wide association meta-analysis of human longevity identifies a novel locus conferring survival beyond 90 years of age

    DEFF Research Database (Denmark)

    Deelen, Joris; Beekman, Marian; Uh, Hae-Won

    2014-01-01

    descent (≥ 85 years) and 16 121 younger controls (controls. In addition, we performed a subset analysis in cases aged ≥ 90 years. We observed genome-wide significant association with longevity, as reflected...... by survival to ages beyond 90 years, at a novel locus, rs2149954, on chromosome 5q33.3 (OR = 1.10, P = 1.74 × 10(-8)). We also confirmed association of rs4420638 on chromosome 19q13.32 (OR = 0.72, P = 3.40 × 10(-36)), representing the TOMM40/APOE/APOC1 locus. In a prospective meta-analysis (n = 34 103......, independent of blood pressure. We report on the first GWAS-identified longevity locus on chromosome 5q33.3 influencing survival in the general European population. The minor allele of this locus associates with low blood pressure in middle age, although the contribution of this allele to survival may be less...

  19. Whole genome sequence analysis of Mycobacterium suricattae

    KAUST Repository

    Dippenaar, Anzaan; Parsons, Sven David Charles; Sampson, Samantha Leigh; Van Der Merwe, Ruben Gerhard; Drewe, Julian Ashley; Abdallah, Abdallah; Siame, Kabengele Keith; Gey Van Pittius, Nicolaas Claudius; Van Helden, Paul David; Pain, Arnab; Warren, Robin Mark

    2015-01-01

    Tuberculosis occurs in various mammalian hosts and is caused by a range of different lineages of the Mycobacterium tuberculosis complex (MTBC). A recently described member, Mycobacterium suricattae, causes tuberculosis in meerkats (Suricata suricatta) in Southern Africa and preliminary genetic analysis showed this organism to be closely related to an MTBC pathogen of rock hyraxes (Procavia capensis), the dassie bacillus. Here we make use of whole genome sequencing to describe the evolution of the genome of M. suricattae, including known and novel regions of difference, SNPs and IS6110 insertion sites. We used genome-wide phylogenetic analysis to show that M. suricattae clusters with the chimpanzee bacillus, previously isolated from a chimpanzee (Pan troglodytes) in West Africa. We propose an evolutionary scenario for the Mycobacterium africanum lineage 6 complex, showing the evolutionary relationship of M. africanum and chimpanzee bacillus, and the closely related members M. suricattae, dassie bacillus and Mycobacterium mungi.

  20. Whole genome sequence analysis of Mycobacterium suricattae

    KAUST Repository

    Dippenaar, Anzaan

    2015-10-21

    Tuberculosis occurs in various mammalian hosts and is caused by a range of different lineages of the Mycobacterium tuberculosis complex (MTBC). A recently described member, Mycobacterium suricattae, causes tuberculosis in meerkats (Suricata suricatta) in Southern Africa and preliminary genetic analysis showed this organism to be closely related to an MTBC pathogen of rock hyraxes (Procavia capensis), the dassie bacillus. Here we make use of whole genome sequencing to describe the evolution of the genome of M. suricattae, including known and novel regions of difference, SNPs and IS6110 insertion sites. We used genome-wide phylogenetic analysis to show that M. suricattae clusters with the chimpanzee bacillus, previously isolated from a chimpanzee (Pan troglodytes) in West Africa. We propose an evolutionary scenario for the Mycobacterium africanum lineage 6 complex, showing the evolutionary relationship of M. africanum and chimpanzee bacillus, and the closely related members M. suricattae, dassie bacillus and Mycobacterium mungi.

  1. Genome editing: a robust technology for human stem cells.

    Science.gov (United States)

    Chandrasekaran, Arun Pandian; Song, Minjung; Ramakrishna, Suresh

    2017-09-01

    Human pluripotent stem cells comprise induced pluripotent and embryonic stem cells, which have tremendous potential for biological and therapeutic applications. The development of efficient technologies for the targeted genome alteration of stem cells in disease models is a prerequisite for utilizing stem cells to their full potential. Genome editing of stem cells is possible with the help of synthetic nucleases that facilitate site-specific modification of a gene of interest. Recent advances in genome editing techniques have improved the efficiency and speed of the development of stem cells for human disease models. Zinc finger nucleases, transcription activator-like effector nucleases, and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated system are powerful tools for editing DNA at specific loci. Here, we discuss recent technological advances in genome editing with site-specific nucleases in human stem cells.

  2. The Chlamydia psittaci genome: a comparative analysis of intracellular pathogens.

    Science.gov (United States)

    Voigt, Anja; Schöfl, Gerhard; Saluz, Hans Peter

    2012-01-01

    Chlamydiaceae are a family of obligate intracellular pathogens causing a wide range of diseases in animals and humans, and facing unique evolutionary constraints not encountered by free-living prokaryotes. To investigate genomic aspects of infection, virulence and host preference we have sequenced Chlamydia psittaci, the pathogenic agent of ornithosis. A comparison of the genome of the avian Chlamydia psittaci isolate 6BC with the genomes of other chlamydial species, C. trachomatis, C. muridarum, C. pneumoniae, C. abortus, C. felis and C. caviae, revealed a high level of sequence conservation and synteny across taxa, with the major exception of the human pathogen C. trachomatis. Important differences manifest in the polymorphic membrane protein family specific for the Chlamydiae and in the highly variable chlamydial plasticity zone. We identified a number of psittaci-specific polymorphic membrane proteins of the G family that may be related to differences in host-range and/or virulence as compared to closely related Chlamydiaceae. We calculated non-synonymous to synonymous substitution rate ratios for pairs of orthologous genes to identify putative targets of adaptive evolution and predicted type III secreted effector proteins. This study is the first detailed analysis of the Chlamydia psittaci genome sequence. It provides insights in the genome architecture of C. psittaci and proposes a number of novel candidate genes mostly of yet unknown function that may be important for pathogen-host interactions.

  3. The Chlamydia psittaci genome: a comparative analysis of intracellular pathogens.

    Directory of Open Access Journals (Sweden)

    Anja Voigt

    Full Text Available Chlamydiaceae are a family of obligate intracellular pathogens causing a wide range of diseases in animals and humans, and facing unique evolutionary constraints not encountered by free-living prokaryotes. To investigate genomic aspects of infection, virulence and host preference we have sequenced Chlamydia psittaci, the pathogenic agent of ornithosis.A comparison of the genome of the avian Chlamydia psittaci isolate 6BC with the genomes of other chlamydial species, C. trachomatis, C. muridarum, C. pneumoniae, C. abortus, C. felis and C. caviae, revealed a high level of sequence conservation and synteny across taxa, with the major exception of the human pathogen C. trachomatis. Important differences manifest in the polymorphic membrane protein family specific for the Chlamydiae and in the highly variable chlamydial plasticity zone. We identified a number of psittaci-specific polymorphic membrane proteins of the G family that may be related to differences in host-range and/or virulence as compared to closely related Chlamydiaceae. We calculated non-synonymous to synonymous substitution rate ratios for pairs of orthologous genes to identify putative targets of adaptive evolution and predicted type III secreted effector proteins.This study is the first detailed analysis of the Chlamydia psittaci genome sequence. It provides insights in the genome architecture of C. psittaci and proposes a number of novel candidate genes mostly of yet unknown function that may be important for pathogen-host interactions.

  4. Metagenomic Analysis of the Human Gut Microbiome

    DEFF Research Database (Denmark)

    dos Santos, Marcelo Bertalan Quintanilha

    Understanding the link between the human gut microbiome and human health is one of the biggest scientific challenges in our decade. Because 90% of our cells are bacteria, and the microbial genome contains 200 times more genes than the human genome, the study of the human microbiome has...... the potential to impact many areas of our health. This PhD thesis is the first study to generate a large amount of experimental data on the DNA and RNA of the human gut microbiome. This was made possible by our development of a human gut microbiome array capable of profiling any human gut microbiome. Analysis...... of our results changes the way we link the gut microbiome with diseases. Our results indicate that inflammatory diseases will affect the ecological system of the human gut microbiome, reducing its diversity. Classification analysis of healthy and unhealthy individuals demonstrates that unhealthy...

  5. Defining functional DNA elements in the human genome

    Science.gov (United States)

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

    2014-01-01

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

  6. The human noncoding genome defined by genetic diversity.

    Science.gov (United States)

    di Iulio, Julia; Bartha, Istvan; Wong, Emily H M; Yu, Hung-Chun; Lavrenko, Victor; Yang, Dongchan; Jung, Inkyung; Hicks, Michael A; Shah, Naisha; Kirkness, Ewen F; Fabani, Martin M; Biggs, William H; Ren, Bing; Venter, J Craig; Telenti, Amalio

    2018-03-01

    Understanding the significance of genetic variants in the noncoding genome is emerging as the next challenge in human genomics. We used the power of 11,257 whole-genome sequences and 16,384 heptamers (7-nt motifs) to build a map of sequence constraint for the human species. This build differed substantially from traditional maps of interspecies conservation and identified regulatory elements among the most constrained regions of the genome. Using new Hi-C experimental data, we describe a strong pattern of coordination over 2 Mb where the most constrained regulatory elements associate with the most essential genes. Constrained regions of the noncoding genome are up to 52-fold enriched for known pathogenic variants as compared to unconstrained regions (21-fold when compared to the genome average). This map of sequence constraint across thousands of individuals is an asset to help interpret noncoding elements in the human genome, prioritize variants and reconsider gene units at a larger scale.

  7. Megabase replication domains along the human genome: relation to chromatin structure and genome organisation.

    Science.gov (United States)

    Audit, Benjamin; Zaghloul, Lamia; Baker, Antoine; Arneodo, Alain; Chen, Chun-Long; d'Aubenton-Carafa, Yves; Thermes, Claude

    2013-01-01

    In higher eukaryotes, the absence of specific sequence motifs, marking the origins of replication has been a serious hindrance to the understanding of (i) the mechanisms that regulate the spatio-temporal replication program, and (ii) the links between origins activation, chromatin structure and transcription. In this chapter, we review the partitioning of the human genome into megabased-size replication domains delineated as N-shaped motifs in the strand compositional asymmetry profiles. They collectively span 28.3% of the genome and are bordered by more than 1,000 putative replication origins. We recapitulate the comparison of this partition of the human genome with high-resolution experimental data that confirms that replication domain borders are likely to be preferential replication initiation zones in the germline. In addition, we highlight the specific distribution of experimental and numerical chromatin marks along replication domains. Domain borders correspond to particular open chromatin regions, possibly encoded in the DNA sequence, and around which replication and transcription are highly coordinated. These regions also present a high evolutionary breakpoint density, suggesting that susceptibility to breakage might be linked to local open chromatin fiber state. Altogether, this chapter presents a compartmentalization of the human genome into replication domains that are landmarks of the human genome organization and are likely to play a key role in genome dynamics during evolution and in pathological situations.

  8. Meta-analysis of genome-wide scans for human adult stature identifies novel Loci and associations with measures of skeletal frame size.

    Directory of Open Access Journals (Sweden)

    Nicole Soranzo

    2009-04-01

    Full Text Available Recent genome-wide (GW scans have identified several independent loci affecting human stature, but their contribution through the different skeletal components of height is still poorly understood. We carried out a genome-wide scan in 12,611 participants, followed by replication in an additional 7,187 individuals, and identified 17 genomic regions with GW-significant association with height. Of these, two are entirely novel (rs11809207 in CATSPER4, combined P-value = 6.1x10(-8 and rs910316 in TMED10, P-value = 1.4x10(-7 and two had previously been described with weak statistical support (rs10472828 in NPR3, P-value = 3x10(-7 and rs849141 in JAZF1, P-value = 3.2x10(-11. One locus (rs1182188 at GNA12 identifies the first height eQTL. We also assessed the contribution of height loci to the upper- (trunk and lower-body (hip axis and femur skeletal components of height. We find evidence for several loci associated with trunk length (including rs6570507 in GPR126, P-value = 4x10(-5 and rs6817306 in LCORL, P-value = 4x10(-4, hip axis length (including rs6830062 at LCORL, P-value = 4.8x10(-4 and rs4911494 at UQCC, P-value = 1.9x10(-4, and femur length (including rs710841 at PRKG2, P-value = 2.4x10(-5 and rs10946808 at HIST1H1D, P-value = 6.4x10(-6. Finally, we used conditional analyses to explore a possible differential contribution of the height loci to these different skeletal size measurements. In addition to validating four novel loci controlling adult stature, our study represents the first effort to assess the contribution of genetic loci to three skeletal components of height. Further statistical tests in larger numbers of individuals will be required to verify if the height loci affect height preferentially through these subcomponents of height.

  9. Phylogeny and comparative genome analysis of a Basidiomycete fungi

    Energy Technology Data Exchange (ETDEWEB)

    Riley, Robert W.; Salamov, Asaf; Grigoriev, Igor; Hibbett, David

    2011-03-14

    Fungi of the phylum Basidiomycota, make up some 37percent of the described fungi, and are important from the perspectives of forestry, agriculture, medicine, and bioenergy. This diverse phylum includes the mushrooms, wood rots, plant pathogenic rusts and smuts, and some human pathogens. To better understand these important fungi, we have undertaken a comparative genomic analysis of the Basidiomycetes with available sequenced genomes. We report a phylogeny that sheds light on previously unclear evolutionary relationships among the Basidiomycetes. We also define a `core proteome? based on protein families conserved in all Basidiomycetes. We identify key expansions and contractions in protein families that may be responsible for the degradation of plant biomass such as cellulose, hemicellulose, and lignin. Finally, we speculate as to the genomic changes that drove such expansions and contractions.

  10. The sequence and analysis of a Chinese pig genome

    Directory of Open Access Journals (Sweden)

    Fang Xiaodong

    2012-11-01

    Full Text Available Abstract Background The pig is an economically important food source, amounting to approximately 40% of all meat consumed worldwide. Pigs also serve as an important model organism because of their similarity to humans at the anatomical, physiological and genetic level, making them very useful for studying a variety of human diseases. A pig strain of particular interest is the miniature pig, specifically the Wuzhishan pig (WZSP, as it has been extensively inbred. Its high level of homozygosity offers increased ease for selective breeding for specific traits and a more straightforward understanding of the genetic changes that underlie its biological characteristics. WZSP also serves as a promising means for applications in surgery, tissue engineering, and xenotransplantation. Here, we report the sequencing and analysis of an inbreeding WZSP genome. Results Our results reveal some unique genomic features, including a relatively high level of homozygosity in the diploid genome, an unusual distribution of heterozygosity, an over-representation of tRNA-derived transposable elements, a small amount of porcine endogenous retrovirus, and a lack of type C retroviruses. In addition, we carried out systematic research on gene evolution, together with a detailed investigation of the counterparts of human drug target genes. Conclusion Our results provide the opportunity to more clearly define the genomic character of pig, which could enhance our ability to create more useful pig models.

  11. Comparative genome analysis of Basidiomycete fungi

    Energy Technology Data Exchange (ETDEWEB)

    Riley, Robert; Salamov, Asaf; Henrissat, Bernard; Nagy, Laszlo; Brown, Daren; Held, Benjamin; Baker, Scott; Blanchette, Robert; Boussau, Bastien; Doty, Sharon L.; Fagnan, Kirsten; Floudas, Dimitris; Levasseur, Anthony; Manning, Gerard; Martin, Francis; Morin, Emmanuelle; Otillar, Robert; Pisabarro, Antonio; Walton, Jonathan; Wolfe, Ken; Hibbett, David; Grigoriev, Igor

    2013-08-07

    Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprotrophs including the majority of wood decaying and ectomycorrhizal species. To better understand the genetic diversity of this phylum we compared the genomes of 35 basidiomycetes including 6 newly sequenced genomes. These genomes span extremes of genome size, gene number, and repeat content. Analysis of core genes reveals that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) found in only one organism. Correlations between lifestyle and certain gene families are evident. Phylogenetic patterns of plant biomass-degrading genes in Agaricomycotina suggest a continuum rather than a dichotomy between the white rot and brown rot modes of wood decay. Based on phylogenetically-informed PCA analysis of wood decay genes, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has typical ligninolytic class II fungal peroxidases (PODs). This prediction is supported by growth assays in which both fungi exhibit wood decay with white rot-like characteristics. Based on this, we suggest that the white/brown rot dichotomy may be inadequate to describe the full range of wood decaying fungi. Analysis of the rate of discovery of proteins with no or few homologs suggests the value of continued sequencing of basidiomycete fungi.

  12. The complete nucleotide sequence, genome organization, and origin of human adenovirus type 11

    International Nuclear Information System (INIS)

    Stone, Daniel; Furthmann, Anne; Sandig, Volker; Lieber, Andre

    2003-01-01

    The complete DNA sequence and transcription map of human adenovirus type 11 are reported here. This is the first published sequence for a subgenera B human adenovirus and demonstrates a genome organization highly similar to those of other human adenoviruses. All of the genes from the early, intermediate, and late regions are present in the expected locations of the genome for a human adenovirus. The genome size is 34,794 bp in length and has a GC content of 48.9%. Sequence alignment with genomes of groups A (Ad12), C (Ad5), D (Ad17), E (Simian adenovirus 25), and F (Ad40) revealed homologies of 64, 54, 68, 75, and 52%, respectively. Detailed genomic analysis demonstrated that Ads 11 and 35 are highly conserved in all areas except the hexon hypervariable regions and fiber. Similarly, comparison of Ad11 with subgroup E SAV25 revealed poor homology between fibers but high homology in proteins encoded by all other areas of the genome. We propose an evolutionary model in which functional viruses can be reconstituted following fiber substitution from one serotype to another. According to this model either the Ad11 genome is a derivative of Ad35, from which the fiber was substituted with Ad7, or the Ad35 genome is the product of a fiber substitution from Ad21 into the Ad11 genome. This model also provides a possible explanation for the origin of group E Ads, which are evolutionarily derived from a group C fiber substitution into a group B genome

  13. Human genome and open source: balancing ethics and business.

    Science.gov (United States)

    Marturano, Antonio

    2011-01-01

    The Human Genome Project has been completed thanks to a massive use of computer techniques, as well as the adoption of the open-source business and research model by the scientists involved. This model won over the proprietary model and allowed a quick propagation and feedback of research results among peers. In this paper, the author will analyse some ethical and legal issues emerging by the use of such computer model in the Human Genome property rights. The author will argue that the Open Source is the best business model, as it is able to balance business and human rights perspectives.

  14. From hacking the human genome to editing organs.

    Science.gov (United States)

    Tobita, Takamasa; Guzman-Lepe, Jorge; Collin de l'Hortet, Alexandra

    2015-01-01

    In the recent decades, human genome engineering has been one of the major interesting research subjects, essentially because it raises new possibilities for personalized medicine and biotechnologies. With the development of engineered nucleases such as the Zinc Finger Nucleases (ZFNs), the Transcription activator-like effector nucleases (TALENs) and more recently the Clustered Regularly Interspaced short Palindromic Repeats (CRISPR), the field of human genome edition has evolved very rapidly. Every new genetic tool is broadening the scope of applications on human tissues, even before we can completely master each of these tools. In this review, we will present the recent advances regarding human genome edition tools, we will discuss the numerous implications they have in research and medicine, and we will mention the limits and concerns about such technologies.

  15. The diploid genome sequence of an individual human.

    Directory of Open Access Journals (Sweden)

    Samuel Levy

    2007-09-01

    Full Text Available Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel included 3,213,401 single nucleotide polymorphisms (SNPs, 53,823 block substitutions (2-206 bp, 292,102 heterozygous insertion/deletion events (indels(1-571 bp, 559,473 homozygous indels (1-82,711 bp, 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.

  16. The Dynamic Genome and Transcriptome of the Human Fungal Pathogen Blastomyces and Close Relative Emmonsia

    OpenAIRE

    Muñoz, José F.; Gauthier, Gregory M.; Desjardins, Christopher A.; Gallo, Juan E.; Holder, Jason; Sullivan, Thomas D.; Marty, Amber J.; Carmen, John C.; Chen, Zehua; Ding, Li; Gujja, Sharvari; Magrini, Vincent; Misas, Elizabeth; Mitreva, Makedonka; Priest, Margaret

    2015-01-01

    Three closely related thermally dimorphic pathogens are causal agents of major fungal diseases affecting humans in the Americas: blastomycosis, histoplasmosis and paracoccidioidomycosis. Here we report the genome sequence and analysis of four strains of the etiological agent of blastomycosis, Blastomyces, and two species of the related genus Emmonsia, typically pathogens of small mammals. Compared to related species, Blastomyces genomes are highly expanded, with long, often sharply demarcated...

  17. Learning about human population history from ancient and modern genomes.

    Science.gov (United States)

    Stoneking, Mark; Krause, Johannes

    2011-08-18

    Genome-wide data, both from SNP arrays and from complete genome sequencing, are becoming increasingly abundant and are now even available from extinct hominins. These data are providing new insights into population history; in particular, when combined with model-based analytical approaches, genome-wide data allow direct testing of hypotheses about population history. For example, genome-wide data from both contemporary populations and extinct hominins strongly support a single dispersal of modern humans from Africa, followed by two archaic admixture events: one with Neanderthals somewhere outside Africa and a second with Denisovans that (so far) has only been detected in New Guinea. These new developments promise to reveal new stories about human population history, without having to resort to storytelling.

  18. The Human Genome Project and the social contract: a law policy approach.

    Science.gov (United States)

    Byk, C

    1992-08-01

    For the first time in history, genetics will enable science to completely identify each human as genetically unique. Will this knowledge reinforce the trend for more individual liberties or will it create a 'brave new world'? A law policy approach to the problems raised by the human genome project shows how far our democratic institutions are from being the proper forum to discuss such issues. Because of the fears and anxiety raised in the population, and also because of its wide implications on the everyday life, the human genome analysis more than any other project needs to succeed in setting up such a social assessment.

  19. Primer on molecular genetics. DOE Human Genome Program

    Energy Technology Data Exchange (ETDEWEB)

    1992-04-01

    This report is taken from the April 1992 draft of the DOE Human Genome 1991--1992 Program Report, which is expected to be published in May 1992. The primer is intended to be an introduction to basic principles of molecular genetics pertaining to the genome project. The material contained herein is not final and may be incomplete. Techniques of genetic mapping and DNA sequencing are described.

  20. National human genome projects: an update and an agenda

    OpenAIRE

    An, Joon Yong

    2017-01-01

    Population genetic and human genetic studies are being accelerated with genome technology and data sharing. Accordingly, in the past 10 years, several countries have initiated genetic research using genome technology and identified the genetic architecture of the ethnic groups living in the corresponding country or suggested the genetic foundation of a social phenomenon. Genetic research has been conducted from epidemiological studies that previously described the health or disease conditions...

  1. In silico analysis of the fucosylation-associated genome of the human blood fluke Schistosoma mansoni: cloning and characterization of the fucosyltransferase multigene family.

    Science.gov (United States)

    Peterson, Nathan A; Anderson, Tavis K; Yoshino, Timothy P

    2013-01-01

    Fucosylated glycans of the parasitic flatworm Schistosoma mansoni play key roles in its development and immunobiology. In the present study we used a genome-wide homology-based bioinformatics approach to search for genes that contribute to fucosylated glycan expression in S. mansoni, specifically the α2-, α3-, and α6-fucosyltransferases (FucTs), which transfer L-fucose from a GDP-L-fucose donor to an oligosaccharide acceptor. We identified and in silico characterized several novel schistosome FucT homologs, including six α3-FucTs and six α6-FucTs, as well as two protein O-FucTs that catalyze the unrelated transfer of L-fucose to serine and threonine residues of epidermal growth factor- and thrombospondin-type repeats. No α2-FucTs were observed. Primary sequence analyses identified key conserved FucT motifs as well as characteristic transmembrane domains, consistent with their putative roles as fucosyltransferases. Most genes exhibit alternative splicing, with multiple transcript variants generated. A phylogenetic analysis demonstrated that schistosome α3- and α6-FucTs form monophyletic clades within their respective gene families, suggesting multiple gene duplications following the separation of the schistosome lineage from the main evolutionary tree. Quantitative decreases in steady-state transcript levels of some FucTs during early larval development suggest a possible mechanism for differential expression of fucosylated glycans in schistosomes. This study systematically identifies the complete repertoire of FucT homologs in S. mansoni and provides fundamental information regarding their genomic organization, genetic variation, developmental expression, and evolutionary history.

  2. Saccharomyces genome database informs human biology

    OpenAIRE

    Skrzypek, Marek S; Nash, Robert S; Wong, Edith D; MacPherson, Kevin A; Hellerstedt, Sage T; Engel, Stacia R; Karra, Kalpana; Weng, Shuai; Sheppard, Travis K; Binkley, Gail; Simison, Matt; Miyasato, Stuart R; Cherry, J Michael

    2017-01-01

    Abstract The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is an expertly curated database of literature-derived functional information for the model organism budding yeast, Saccharomyces cerevisiae. SGD constantly strives to synergize new types of experimental data and bioinformatics predictions with existing data, and to organize them into a comprehensive and up-to-date information resource. The primary mission of SGD is to facilitate research into the biology of yeast and...

  3. In the Beginning was the Genome: Genomics and the Bi-textuality of Human Existence.

    Science.gov (United States)

    Zwart, H A E Hub

    2018-04-01

    This paper addresses the cultural impact of genomics and the Human Genome Project (HGP) on human self-understanding. Notably, it addresses the claim made by Francis Collins (director of the HGP) that the genome is the language of God and the claim made by Max Delbrück (founding father of molecular life sciences research) that Aristotle must be credited with having predicted DNA as the soul that organises bio-matter. From a continental philosophical perspective I will argue that human existence results from a dialectical interaction between two types of texts: the language of molecular biology and the language of civilisation; the language of the genome and the language of our socio-cultural, symbolic ambiance. Whereas the former ultimately builds on the alphabets of genes and nucleotides, the latter is informed by primordial texts such as the Bible and the Quran. In applied bioethics deliberations on genomics, science is easily framed as liberating and progressive, religious world-views as conservative and restrictive (Zwart 1993). This paper focusses on the broader cultural ambiance of the debate to discern how the bi-textuality of human existence is currently undergoing a transition, as not only the physiological, but also the normative dimension is being reframed in biomolecular and terabyte terms.

  4. Distinct p53 genomic binding patterns in normal and cancer-derived human cells

    Energy Technology Data Exchange (ETDEWEB)

    Botcheva K.; McCorkle S. R.; McCombie W. R.; Dunn J. J.; Anderson C. W.

    2011-12-15

    We report here genome-wide analysis of the tumor suppressor p53 binding sites in normal human cells. 743 high-confidence ChIP-seq peaks representing putative genomic binding sites were identified in normal IMR90 fibroblasts using a reference chromatin sample. More than 40% were located within 2 kb of a transcription start site (TSS), a distribution similar to that documented for individually studied, functional p53 binding sites and, to date, not observed by previous p53 genome-wide studies. Nearly half of the high-confidence binding sites in the IMR90 cells reside in CpG islands, in marked contrast to sites reported in cancer-derived cells. The distinct genomic features of the IMR90 binding sites do not reflect a distinct preference for specific sequences, since the de novo developed p53 motif based on our study is similar to those reported by genome-wide studies of cancer cells. More likely, the different chromatin landscape in normal, compared with cancer-derived cells, influences p53 binding via modulating availability of the sites. We compared the IMR90 ChIPseq peaks to the recently published IMR90 methylome1 and demonstrated that they are enriched at hypomethylated DNA. Our study represents the first genome-wide, de novo mapping of p53 binding sites in normal human cells and reveals that p53 binding sites reside in distinct genomic landscapes in normal and cancer-derived human cells.

  5. Genomic signatures of diet-related shifts during human origins.

    Science.gov (United States)

    Babbitt, Courtney C; Warner, Lisa R; Fedrigo, Olivier; Wall, Christine E; Wray, Gregory A

    2011-04-07

    There are numerous anthropological analyses concerning the importance of diet during human evolution. Diet is thought to have had a profound influence on the human phenotype, and dietary differences have been hypothesized to contribute to the dramatic morphological changes seen in modern humans as compared with non-human primates. Here, we attempt to integrate the results of new genomic studies within this well-developed anthropological context. We then review the current evidence for adaptation related to diet, both at the level of sequence changes and gene expression. Finally, we propose some ways in which new technologies can help identify specific genomic adaptations that have resulted in metabolic and morphological differences between humans and non-human primates.

  6. Genome-Wide Identification and Analysis of Arabidopsis Sodium Proton Antiporter (NHX and Human Sodium Proton Exchanger (NHE Homologs in Sorghum bicolor

    Directory of Open Access Journals (Sweden)

    P. Hima Kumari

    2018-05-01

    Full Text Available Na+ transporters play an important role during salt stress and development. The present study is aimed at genome-wide identification, in silico analysis of sodium-proton antiporter (NHX and sodium-proton exchanger (NHE-type transporters in Sorghum bicolor and their expression patterns under varied abiotic stress conditions. In Sorghum, seven NHX and nine NHE homologs were identified. Amiloride (a known inhibitor of Na+/H+ exchanger activity binding motif was noticed in both types of the transporters. Chromosome 2 was found to be a hotspot region with five sodium transporters. Phylogenetic analysis inferred six ortholog and three paralog groups. To gain an insight into functional divergence of SbNHX/NHE transporters, real-time gene expression was performed under salt, drought, heat, and cold stresses in embryo, root, stem, and leaf tissues. Expression patterns revealed that both SbNHXs and SbNHEs are responsive either to single or multiple abiotic stresses. The predicted protein–protein interaction networks revealed that only SbNHX7 is involved in the calcineurin B-like proteins (CBL- CBL interacting protein kinases (CIPK pathway. The study provides insights into the functional divergence of SbNHX/NHE transporter genes with tissue specific expressions in Sorghum under different abiotic stress conditions.

  7. Evolution of the NANOG pseudogene family in the human and chimpanzee genomes

    Directory of Open Access Journals (Sweden)

    Maughan Peter J

    2006-02-01

    Full Text Available Abstract Background The NANOG gene is expressed in mammalian embryonic stem cells where it maintains cellular pluripotency. An unusually large family of pseudogenes arose from it with one unprocessed and ten processed pseudogenes in the human genome. This article compares the NANOG gene and its pseudogenes in the human and chimpanzee genomes and derives an evolutionary history of this pseudogene family. Results The NANOG gene and all pseudogenes except NANOGP8 are present at their expected orthologous chromosomal positions in the chimpanzee genome when compared to the human genome, indicating that their origins predate the human-chimpanzee divergence. Analysis of flanking DNA sequences demonstrates that NANOGP8 is absent from the chimpanzee genome. Conclusion Based on the most parsimonious ordering of inferred source-gene mutations, the deduced evolutionary origins for the NANOG pseudogene family in the human and chimpanzee genomes, in order of most ancient to most recent, are NANOGP6, NANOGP5, NANOGP3, NANOGP10, NANOGP2, NANOGP9, NANOGP7, NANOGP1, and NANOGP4. All of these pseudogenes were fixed in the genome of the human-chimpanzee common ancestor. NANOGP8 is the most recent pseudogene and it originated exclusively in the human lineage after the human-chimpanzee divergence. NANOGP1 is apparently an unprocessed pseudogene. Comparison of its sequence to the functional NANOG gene's reading frame suggests that this apparent pseudogene remained functional after duplication and, therefore, was subject to selection-driven conservation of its reading frame, and that it may retain some functionality or that its loss of function may be evolutionarily recent.

  8. Open reading frames associated with cancer in the dark matter of the human genome.

    Science.gov (United States)

    Delgado, Ana Paula; Brandao, Pamela; Chapado, Maria Julia; Hamid, Sheilin; Narayanan, Ramaswamy

    2014-01-01

    The uncharacterized proteins (open reading frames, ORFs) in the human genome offer an opportunity to discover novel targets for cancer. A systematic analysis of the dark matter of the human proteome for druggability and biomarker discovery is crucial to mining the genome. Numerous data mining tools are available to mine these ORFs to develop a comprehensive knowledge base for future target discovery and validation. Using the Genetic Association Database, the ORFs of the human dark matter proteome were screened for evidence of association with neoplasms. The Phenome-Genome Integrator tool was used to establish phenotypic association with disease traits including cancer. Batch analysis of the tools for protein expression analysis, gene ontology and motifs and domains was used to characterize the ORFs. Sixty-two ORFs were identified for neoplasm association. The expression Quantitative Trait Loci (eQTL) analysis identified thirteen ORFs related to cancer traits. Protein expression, motifs and domain analysis and genome-wide association studies verified the relevance of these OncoORFs in diverse tumors. The OncoORFs are also associated with a wide variety of human diseases and disorders. Our results link the OncoORFs to diverse diseases and disorders. This suggests a complex landscape of the uncharacterized proteome in human diseases. These results open the dark matter of the proteome to novel cancer target research. Copyright© 2014, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.

  9. Genomic analysis of murine DNA-dependent protein kinase

    International Nuclear Information System (INIS)

    Fujimori, A.; Abe, M.

    2003-01-01

    Full text: The gene of catalytic subunit of DNA dependent protein kinase is responsible gene for SCID mice. The molecules play a critical role in non-homologous end joining including the V(D)J recombination. Contribution of the molecules to the difference of radiosensitivity and the susceptibility to cancer has been suggested. Here we show the entire nucleotide sequence of approximately 193 kbp and 84 kbp genomic regions encoding the entire DNA-PKcs gene in the mouse and chicken respectively. Retroposon was found in the intron 51 of mouse genomic DNA-PKcs gene but in human and chicken. Comparative analysis of these two species strongly suggested that only two genes, DNA-PKcs and MCM4, exist in the region of both species. Several conserved sequences and cis elements, however, were predicted. Recently, the orthologous region for the human DNA-PKcs locus was completed. The results of further comparative study will be discussed

  10. 78 FR 68856 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-11-15

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Nakamura, Ph.D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research...-402-0838. [[Page 68857

  11. Benchmarking undedicated cloud computing providers for analysis of genomic datasets.

    Science.gov (United States)

    Yazar, Seyhan; Gooden, George E C; Mackey, David A; Hewitt, Alex W

    2014-01-01

    A major bottleneck in biological discovery is now emerging at the computational level. Cloud computing offers a dynamic means whereby small and medium-sized laboratories can rapidly adjust their computational capacity. We benchmarked two established cloud computing services, Amazon Web Services Elastic MapReduce (EMR) on Amazon EC2 instances and Google Compute Engine (GCE), using publicly available genomic datasets (E.coli CC102 strain and a Han Chinese male genome) and a standard bioinformatic pipeline on a Hadoop-based platform. Wall-clock time for complete assembly differed by 52.9% (95% CI: 27.5-78.2) for E.coli and 53.5% (95% CI: 34.4-72.6) for human genome, with GCE being more efficient than EMR. The cost of running this experiment on EMR and GCE differed significantly, with the costs on EMR being 257.3% (95% CI: 211.5-303.1) and 173.9% (95% CI: 134.6-213.1) more expensive for E.coli and human assemblies respectively. Thus, GCE was found to outperform EMR both in terms of cost and wall-clock time. Our findings confirm that cloud computing is an efficient and potentially cost-effective alternative for analysis of large genomic datasets. In addition to releasing our cost-effectiveness comparison, we present available ready-to-use scripts for establishing Hadoop instances with Ganglia monitoring on EC2 or GCE.

  12. Benchmarking undedicated cloud computing providers for analysis of genomic datasets.

    Directory of Open Access Journals (Sweden)

    Seyhan Yazar

    Full Text Available A major bottleneck in biological discovery is now emerging at the computational level. Cloud computing offers a dynamic means whereby small and medium-sized laboratories can rapidly adjust their computational capacity. We benchmarked two established cloud computing services, Amazon Web Services Elastic MapReduce (EMR on Amazon EC2 instances and Google Compute Engine (GCE, using publicly available genomic datasets (E.coli CC102 strain and a Han Chinese male genome and a standard bioinformatic pipeline on a Hadoop-based platform. Wall-clock time for complete assembly differed by 52.9% (95% CI: 27.5-78.2 for E.coli and 53.5% (95% CI: 34.4-72.6 for human genome, with GCE being more efficient than EMR. The cost of running this experiment on EMR and GCE differed significantly, with the costs on EMR being 257.3% (95% CI: 211.5-303.1 and 173.9% (95% CI: 134.6-213.1 more expensive for E.coli and human assemblies respectively. Thus, GCE was found to outperform EMR both in terms of cost and wall-clock time. Our findings confirm that cloud computing is an efficient and potentially cost-effective alternative for analysis of large genomic datasets. In addition to releasing our cost-effectiveness comparison, we present available ready-to-use scripts for establishing Hadoop instances with Ganglia monitoring on EC2 or GCE.

  13. Genome sequencing of chimpanzee malaria parasites reveals possible pathways of adaptation to human hosts

    KAUST Repository

    Otto, Thomas D.

    2014-09-09

    Plasmodium falciparum causes most human malaria deaths, having prehistorically evolved from parasites of African Great Apes. Here we explore the genomic basis of P. falciparum adaptation to human hosts by fully sequencing the genome of the closely related chimpanzee parasite species P. reichenowi, and obtaining partial sequence data from a more distantly related chimpanzee parasite (P. gaboni). The close relationship between P. reichenowi and P. falciparum is emphasized by almost complete conservation of genomic synteny, but against this strikingly conserved background we observe major differences at loci involved in erythrocyte invasion. The organization of most virulence-associated multigene families, including the hypervariable var genes, is broadly conserved, but P. falciparum has a smaller subset of rif and stevor genes whose products are expressed on the infected erythrocyte surface. Genome-wide analysis identifies other loci under recent positive selection, but a limited number of changes at the host–parasite interface may have mediated host switching.

  14. Microbial genome-wide association studies: lessons from human GWAS.

    Science.gov (United States)

    Power, Robert A; Parkhill, Julian; de Oliveira, Tulio

    2017-01-01

    The reduced costs of sequencing have led to whole-genome sequences for a large number of microorganisms, enabling the application of microbial genome-wide association studies (GWAS). Given the successes of human GWAS in understanding disease aetiology and identifying potential drug targets, microbial GWAS are likely to further advance our understanding of infectious diseases. These advances include insights into pressing global health problems, such as antibiotic resistance and disease transmission. In this Review, we outline the methodologies of GWAS, the current state of the field of microbial GWAS, and how lessons from human GWAS can direct the future of the field.

  15. The human Genome project and the future of oncology

    International Nuclear Information System (INIS)

    Collins, Francis S.

    1996-01-01

    The Human Genome Project is an ambitious 15-year effort to devise maps and sequence of the 3-billion base pair human genome, including all 100,000 genes. The project is running ahead of schedule and under budget. Already the effects on progress in disease gene discovery have been dramatic, especially for cancer. The most appropriate uses of susceptibility testing for breast, ovarian, and colon cancer are being investigated in research protocols, and the need to prevent genetic discrimination in employment and health insurance is becoming more urgent. In the longer term, these gene discoveries are likely to usher in a new era of therapeutic molecular medicine

  16. Detecting Genomic Signatures of Natural Selection with Principal Component Analysis: Application to the 1000 Genomes Data.

    Science.gov (United States)

    Duforet-Frebourg, Nicolas; Luu, Keurcien; Laval, Guillaume; Bazin, Eric; Blum, Michael G B

    2016-04-01

    To characterize natural selection, various analytical methods for detecting candidate genomic regions have been developed. We propose to perform genome-wide scans of natural selection using principal component analysis (PCA). We show that the common FST index of genetic differentiation between populations can be viewed as the proportion of variance explained by the principal components. Considering the correlations between genetic variants and each principal component provides a conceptual framework to detect genetic variants involved in local adaptation without any prior definition of populations. To validate the PCA-based approach, we consider the 1000 Genomes data (phase 1) considering 850 individuals coming from Africa, Asia, and Europe. The number of genetic variants is of the order of 36 millions obtained with a low-coverage sequencing depth (3×). The correlations between genetic variation and each principal component provide well-known targets for positive selection (EDAR, SLC24A5, SLC45A2, DARC), and also new candidate genes (APPBPP2, TP1A1, RTTN, KCNMA, MYO5C) and noncoding RNAs. In addition to identifying genes involved in biological adaptation, we identify two biological pathways involved in polygenic adaptation that are related to the innate immune system (beta defensins) and to lipid metabolism (fatty acid omega oxidation). An additional analysis of European data shows that a genome scan based on PCA retrieves classical examples of local adaptation even when there are no well-defined populations. PCA-based statistics, implemented in the PCAdapt R package and the PCAdapt fast open-source software, retrieve well-known signals of human adaptation, which is encouraging for future whole-genome sequencing project, especially when defining populations is difficult. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  17. Characterization of noncoding regulatory DNA in the human genome.

    Science.gov (United States)

    Elkon, Ran; Agami, Reuven

    2017-08-08

    Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.

  18. The zebrafish reference genome sequence and its relationship to the human genome

    Science.gov (United States)

    Howe, Kerstin; Clark, Matthew D.; Torroja, Carlos F.; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E.; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C.; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T.; Guerra-Assunção, José A.; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F.; Laird, Gavin K.; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M.; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Carter, Nigel P.; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M. J.; Enright, Anton; Geisler, Robert; Plasterk, Ronald H. A.; Lee, Charles; Westerfield, Monte; de Jong, Pieter J.; Zon, Leonard I.; Postlethwait, John H.; Nüsslein-Volhard, Christiane; Hubbard, Tim J. P.; Crollius, Hugues Roest; Rogers, Jane; Stemple, Derek L.

    2013-01-01

    Zebrafish have become a popular organism for the study of vertebrate gene function1,2. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease3–5. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes6, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination. PMID:23594743

  19. The zebrafish reference genome sequence and its relationship to the human genome.

    Science.gov (United States)

    Howe, Kerstin; Clark, Matthew D; Torroja, Carlos F; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T; Guerra-Assunção, José A; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F; Laird, Gavin K; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Elliot, David; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Begum, Sharmin; Mortimore, Beverley; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Lloyd, Christine; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James D; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Lanz, Christa; Raddatz, Günter; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Schuster, Stephan C; Carter, Nigel P; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M J; Enright, Anton; Geisler, Robert; Plasterk, Ronald H A; Lee, Charles; Westerfield, Monte; de Jong, Pieter J; Zon, Leonard I; Postlethwait, John H; Nüsslein-Volhard, Christiane; Hubbard, Tim J P; Roest Crollius, Hugues; Rogers, Jane; Stemple, Derek L

    2013-04-25

    Zebrafish have become a popular organism for the study of vertebrate gene function. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination.

  20. FGWAS: Functional genome wide association analysis.

    Science.gov (United States)

    Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu

    2017-10-01

    Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Comparative analysis of Acinetobacters: three genomes for three lifestyles.

    Directory of Open Access Journals (Sweden)

    David Vallenet

    Full Text Available Acinetobacter baumannii is the source of numerous nosocomial infections in humans and therefore deserves close attention as multidrug or even pandrug resistant strains are increasingly being identified worldwide. Here we report the comparison of two newly sequenced genomes of A. baumannii. The human isolate A. baumannii AYE is multidrug resistant whereas strain SDF, which was isolated from body lice, is antibiotic susceptible. As reference for comparison in this analysis, the genome of the soil-living bacterium A. baylyi strain ADP1 was used. The most interesting dissimilarities we observed were that i whereas strain AYE and A. baylyi genomes harbored very few Insertion Sequence elements which could promote expression of downstream genes, strain SDF sequence contains several hundred of them that have played a crucial role in its genome reduction (gene disruptions and simple DNA loss; ii strain SDF has low catabolic capacities compared to strain AYE. Interestingly, the latter has even higher catabolic capacities than A. baylyi which has already been reported as a very nutritionally versatile organism. This metabolic performance could explain the persistence of A. baumannii nosocomial strains in environments where nutrients are scarce; iii several processes known to play a key role during host infection (biofilm formation, iron uptake, quorum sensing, virulence factors were either different or absent, the best example of which is iron uptake. Indeed, strain AYE and A. baylyi use siderophore-based systems to scavenge iron from the environment whereas strain SDF uses an alternate system similar to the Haem Acquisition System (HAS. Taken together, all these observations suggest that the genome contents of the 3 Acinetobacters compared are partly shaped by life in distinct ecological niches: human (and more largely hospital environment, louse, soil.

  2. Genome-wide Studies of Mycolic Acid Bacteria: Computational Identification and Analysis of a Minimal Genome

    KAUST Repository

    Kamanu, Frederick Kinyua

    2012-12-01

    The mycolic acid bacteria are a distinct suprageneric group of asporogenous Grampositive, high GC-content bacteria, distinguished by the presence of mycolic acids in their cell envelope. They exhibit great diversity in their cell and morphology; although primarily non-pathogens, this group contains three major pathogens Mycobacterium leprae, Mycobacterium tuberculosis complex, and Corynebacterium diphtheria. Although the mycolic acid bacteria are a clearly defined group of bacteria, the taxonomic relationships between its constituent genera and species are less well defined. Two approaches were tested for their suitability in describing the taxonomy of the group. First, a Multilocus Sequence Typing (MLST) experiment was assessed and found to be superior to monophyletic (16S small ribosomal subunit) in delineating a total of 52 mycolic acid bacterial species. Phylogenetic inference was performed using the neighbor-joining method. To further refine phylogenetic analysis and to take advantage of the widespread availability of bacterial genome data, a computational framework that simulates DNA-DNA hybridisation was developed and validated using multiscale bootstrap resampling. The tool classifies microbial genomes based on whole genome DNA, and was deployed as a web-application using PHP and Javascript. It is accessible online at http://cbrc.kaust.edu.sa/dna_hybridization/ A third study was a computational and statistical methods in the identification and analysis of a putative minimal mycolic acid bacterial genome so as to better understand (1) the genomic requirements to encode a mycolic acid bacterial cell and (2) the role and type of genes and genetic elements that lead to the massive increase in genome size in environmental mycolic acid bacteria. Using a reciprocal comparison approach, a total of 690 orthologous gene clusters forming a putative minimal genome were identified across 24 mycolic acid bacterial species. In order to identify new potential drug

  3. Comparing sequencing assays and human-machine analyses in actionable genomics for glioblastoma.

    Science.gov (United States)

    Wrzeszczynski, Kazimierz O; Frank, Mayu O; Koyama, Takahiko; Rhrissorrakrai, Kahn; Robine, Nicolas; Utro, Filippo; Emde, Anne-Katrin; Chen, Bo-Juen; Arora, Kanika; Shah, Minita; Vacic, Vladimir; Norel, Raquel; Bilal, Erhan; Bergmann, Ewa A; Moore Vogel, Julia L; Bruce, Jeffrey N; Lassman, Andrew B; Canoll, Peter; Grommes, Christian; Harvey, Steve; Parida, Laxmi; Michelini, Vanessa V; Zody, Michael C; Jobanputra, Vaidehi; Royyuru, Ajay K; Darnell, Robert B

    2017-08-01

    To analyze a glioblastoma tumor specimen with 3 different platforms and compare potentially actionable calls from each. Tumor DNA was analyzed by a commercial targeted panel. In addition, tumor-normal DNA was analyzed by whole-genome sequencing (WGS) and tumor RNA was analyzed by RNA sequencing (RNA-seq). The WGS and RNA-seq data were analyzed by a team of bioinformaticians and cancer oncologists, and separately by IBM Watson Genomic Analytics (WGA), an automated system for prioritizing somatic variants and identifying drugs. More variants were identified by WGS/RNA analysis than by targeted panels. WGA completed a comparable analysis in a fraction of the time required by the human analysts. The development of an effective human-machine interface in the analysis of deep cancer genomic datasets may provide potentially clinically actionable calls for individual patients in a more timely and efficient manner than currently possible. NCT02725684.

  4. Significance of functional disease-causal/susceptible variants identified by whole-genome analyses for the understanding of human diseases.

    Science.gov (United States)

    Hitomi, Yuki; Tokunaga, Katsushi

    2017-01-01

    Human genome variation may cause differences in traits and disease risks. Disease-causal/susceptible genes and variants for both common and rare diseases can be detected by comprehensive whole-genome analyses, such as whole-genome sequencing (WGS), using next-generation sequencing (NGS) technology and genome-wide association studies (GWAS). Here, in addition to the application of an NGS as a whole-genome analysis method, we summarize approaches for the identification of functional disease-causal/susceptible variants from abundant genetic variants in the human genome and methods for evaluating their functional effects in human diseases, using an NGS and in silico and in vitro functional analyses. We also discuss the clinical applications of the functional disease causal/susceptible variants to personalized medicine.

  5. Genomics and the Ark: an ecocentric perspective on human history.

    Science.gov (United States)

    Zwart, Hub; Penders, Bart

    2011-01-01

    Views of ourselves in relationship to the rest of the biosphere are changing. Theocentric and anthropocentric perspectives are giving way to more ecocentric views on the history, present, and future of humankind. Novel sciences, such as genomics, have deepened and broadened our understanding of the process of anthropogenesis, the coming into being of humans. Genomics suggests that early human history must be regarded as a complex narrative of evolving ecosystems, in which human evolution both influenced and was influenced by the evolution of companion species. During the agricultural revolution, human beings designed small-scale artificial ecosystems or evolutionary "Arks," in which networks of plants, animals, and microorganisms coevolved. Currently, our attitude towards this process seems subject to a paradoxical reversal. The boundaries of the Ark have dramatically broadened, and genomics is not only being used to increase our understanding of our ecological past, but may also help us to conserve, reconstruct, or even revivify species and ecosystems to whose degradation or (near) extinction we have contributed. This article explores the role of genomics in the elaboration of a more ecocentric view of ourselves with the help of two examples, namely the renaissance of Paleolithic diets and of Pleistocene parks. It argues that an understanding of the world in ecocentric terms requires new partnerships and mutually beneficial forms of collaboration and convergence between life sciences, social sciences, and the humanities.

  6. Forces shaping the fastest evolving regions in the human genome.

    Directory of Open Access Journals (Sweden)

    Katherine S Pollard

    2006-10-01

    Full Text Available Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202 genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements are dramatically changed in human but not in other primates, with seven times more substitutions in human than in chimp. The accelerated elements, and in particular the top five, show a strong bias for adenine and thymine to guanine and cytosine nucleotide changes and are disproportionately located in high recombination and high guanine and cytosine content environments near telomeres, suggesting either biased gene conversion or isochore selection. In addition, there is some evidence of directional selection in the regions containing the two most accelerated regions. A combination of evolutionary forces has contributed to accelerated evolution of the fastest evolving elements in the human genome.

  7. Genome-Wide Detection and Analysis of Multifunctional Genes

    Science.gov (United States)

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  8. Human reliability analysis

    International Nuclear Information System (INIS)

    Dougherty, E.M.; Fragola, J.R.

    1988-01-01

    The authors present a treatment of human reliability analysis incorporating an introduction to probabilistic risk assessment for nuclear power generating stations. They treat the subject according to the framework established for general systems theory. Draws upon reliability analysis, psychology, human factors engineering, and statistics, integrating elements of these fields within a systems framework. Provides a history of human reliability analysis, and includes examples of the application of the systems approach

  9. YersiniaBase: a genomic resource and analysis platform for comparative analysis of Yersinia.

    Science.gov (United States)

    Tan, Shi Yang; Dutta, Avirup; Jakubovics, Nicholas S; Ang, Mia Yang; Siow, Cheuk Chuen; Mutha, Naresh Vr; Heydari, Hamed; Wee, Wei Yee; Wong, Guat Jah; Choo, Siew Woh

    2015-01-16

    Yersinia is a Gram-negative bacteria that includes serious pathogens such as the Yersinia pestis, which causes plague, Yersinia pseudotuberculosis, Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity. To facilitate the ongoing and future research of Yersinia, especially those generally considered non-pathogenic species, a well-defined repository and analysis platform is needed to hold the Yersinia genomic data and analysis tools for the Yersinia research community. Hence, we have developed the YersiniaBase, a robust and user-friendly Yersinia resource and analysis platform for the analysis of Yersinia genomic data. YersiniaBase has a total of twelve species and 232 genome sequences, of which the majority are Yersinia pestis. In order to smooth the process of searching genomic data in a large database, we implemented an Asynchronous JavaScript and XML (AJAX)-based real-time searching system in YersiniaBase. Besides incorporating existing tools, which include JavaScript-based genome browser (JBrowse) and Basic Local Alignment Search Tool (BLAST), YersiniaBase also has in-house developed tools: (1) Pairwise Genome Comparison tool (PGC) for comparing two user-selected genomes; (2) Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomics analysis of Yersinia genomes; (3) YersiniaTree for constructing phylogenetic tree of Yersinia. We ran analyses based on the tools and genomic data in YersiniaBase and the

  10. PGSB/MIPS Plant Genome Information Resources and Concepts for the Analysis of Complex Grass Genomes.

    Science.gov (United States)

    Spannagl, Manuel; Bader, Kai; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus F X

    2016-01-01

    PGSB (Plant Genome and Systems Biology; formerly MIPS-Munich Institute for Protein Sequences) has been involved in developing, implementing and maintaining plant genome databases for more than a decade. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable datasets for model plant genomes as a backbone against which experimental data, e.g., from high-throughput functional genomics, can be organized and analyzed. In addition, genomes from both model and crop plants form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny) between related species on macro- and micro-levels.The genomes of many economically important Triticeae plants such as wheat, barley, and rye present a great challenge for sequence assembly and bioinformatic analysis due to their enormous complexity and large genome size. Novel concepts and strategies have been developed to deal with these difficulties and have been applied to the genomes of wheat, barley, rye, and other cereals. This includes the GenomeZipper concept, reference-guided exome assembly, and "chromosome genomics" based on flow cytometry sorted chromosomes.

  11. Non-genomic effects of vitamin D in human spermatozoa

    DEFF Research Database (Denmark)

    Blomberg Jensen, Martin; Dissing, Steen

    2012-01-01

    The spectrum for vitamin D (VD) mediated effects has expanded in recent years. Activated VD (1,25(OH)(2)D(3)) binds to the VD receptor (VDR) and mediates non-genomic effects through the alternative ligand binding-pocket (VDR-ap) or regulates gene transcription through the genomic binding......-pocket. VDR and VD-metabolizing enzymes are expressed in human testis, male reproductive tract and mature spermatozoa, and VD is considered important for male reproduction. Expression of the VD-inactivating enzyme CYP24A1 at the annulus of human spermatozoa distinguish normal and infertile men with high...... specificity, and CYP24A1 expression is positively correlated with all semen variables and suggested as a marker for both semen quality and VD responsiveness. Moreover, spermatozoa are transcriptionally silent and are therefore a unique model to study non-genomic effects. 1,25(OH)(2)D(3) induced a rapid...

  12. Genome-wide comparative analysis of four Indian Drosophila species.

    Science.gov (United States)

    Mohanty, Sujata; Khanna, Radhika

    2017-12-01

    Comparative analysis of multiple genomes of closely or distantly related Drosophila species undoubtedly creates excitement among evolutionary biologists in exploring the genomic changes with an ecology and evolutionary perspective. We present herewith the de novo assembled whole genome sequences of four Drosophila species, D. bipectinata, D. takahashii, D. biarmipes and D. nasuta of Indian origin using Next Generation Sequencing technology on an Illumina platform along with their detailed assembly statistics. The comparative genomics analysis, e.g. gene predictions and annotations, functional and orthogroup analysis of coding sequences and genome wide SNP distribution were performed. The whole genome of Zaprionus indianus of Indian origin published earlier by us and the genome sequences of previously sequenced 12 Drosophila species available in the NCBI database were included in the analysis. The present work is a part of our ongoing genomics project of Indian Drosophila species.

  13. Comparison of phasing strategies for whole human genomes.

    Science.gov (United States)

    Choi, Yongwook; Chan, Agnes P; Kirkness, Ewen; Telenti, Amalio; Schork, Nicholas J

    2018-04-01

    Humans are a diploid species that inherit one set of chromosomes paternally and one homologous set of chromosomes maternally. Unfortunately, most human sequencing initiatives ignore this fact in that they do not directly delineate the nucleotide content of the maternal and paternal copies of the 23 chromosomes individuals possess (i.e., they do not 'phase' the genome) often because of the costs and complexities of doing so. We compared 11 different widely-used approaches to phasing human genomes using the publicly available 'Genome-In-A-Bottle' (GIAB) phased version of the NA12878 genome as a gold standard. The phasing strategies we compared included laboratory-based assays that prepare DNA in unique ways to facilitate phasing as well as purely computational approaches that seek to reconstruct phase information from general sequencing reads and constructs or population-level haplotype frequency information obtained through a reference panel of haplotypes. To assess the performance of the 11 approaches, we used metrics that included, among others, switch error rates, haplotype block lengths, the proportion of fully phase-resolved genes, phasing accuracy and yield between pairs of SNVs. Our comparisons suggest that a hybrid or combined approach that leverages: 1. population-based phasing using the SHAPEIT software suite, 2. either genome-wide sequencing read data or parental genotypes, and 3. a large reference panel of variant and haplotype frequencies, provides a fast and efficient way to produce highly accurate phase-resolved individual human genomes. We found that for population-based approaches, phasing performance is enhanced with the addition of genome-wide read data; e.g., whole genome shotgun and/or RNA sequencing reads. Further, we found that the inclusion of parental genotype data within a population-based phasing strategy can provide as much as a ten-fold reduction in phasing errors. We also considered a majority voting scheme for the construction of a

  14. DOE Human Genome Program contractor-grantee workshop

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1996-01-01

    This volume contains the proceedings for the DOE Human Genome Program`s Contractor-Grantee Workshop V held in Sante Fe, New Mexico January 28, February 1, 1996. Presentations were divided into sessions entitled Sequencing; Mapping; Informatics; Ethical, Legal, and Social Issues; and Infrastructure. Reports of individual projects described herein are separately indexed and abstracted for the database.

  15. Human genome program report. Part 2, 1996 research abstracts

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-11-01

    This report contains Part 2 of a two-part report to reflect research and progress in the US Department of Energy Human Genome Program from 1994 through 1996, with specified updates made just before publication. Part 2 consists of 1996 research abstracts. Attention is focused on the following: sequencing; mapping; informatics; ethical, legal, and social issues; infrastructure; and small business innovation research.

  16. Reconsidering democracy. History of the Human Genome Project.

    NARCIS (Netherlands)

    Marli Huijer

    2003-01-01

    What options are open for people—citizens, politicians, and other nonscientists—to become actively involved in and anticipate new directions in the life sciences? In addressing this question, this article focuses on the start of the Human Genome Project (1985-1990). By contrasting various models of

  17. Human genome program report. Part 1, overview and progress

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-11-01

    This report contains Part 1 of a two-part report to reflect research and progress in the U.S. Department of Energy Human Genome Program from 1994 through 1996, with specified updates made just before publication. Part 1 consists of the program overview and report on progress.

  18. The Human Genome Project: Biology, Computers, and Privacy.

    Science.gov (United States)

    Cutter, Mary Ann G.; Drexler, Edward; Gottesman, Kay S.; Goulding, Philip G.; McCullough, Laurence B.; McInerney, Joseph D.; Micikas, Lynda B.; Mural, Richard J.; Murray, Jeffrey C.; Zola, John

    This module, for high school teachers, is the second of two modules about the Human Genome Project (HGP) produced by the Biological Sciences Curriculum Study (BSCS). The first section of this module provides background information for teachers about the structure and objectives of the HGP, aspects of the science and technology that underlie the…

  19. The human genome; you gain some, you lose some

    NARCIS (Netherlands)

    Kriek, Marjolein

    2007-01-01

    Copy number variations (CNVs) in the human genome are inherent in both evolutionary progression as well as the etiology of disease. The introduction of this thesis will review CNVs that appear to be neutral as well as CNVs that appear to be related to a phenotypic trait. This will be followed by a

  20. Reconsidering democracy - History of the human genome project

    NARCIS (Netherlands)

    Huijer, M

    What options are open for people-citizens, politicians, and other nonscientists-to become actively involved in and anticipate new directions in the life sciences? In addressing this question, this article focuses on the start of the Human Genome Project (1985-1990). By contrasting various models of

  1. Enhancing Biology Instruction with the Human Genome Project

    Science.gov (United States)

    Buxeda, Rosa J.; Moore-Russo, Deborah A.

    2003-01-01

    The Human Genome Project (HGP) is a recent scientific milestone that has received notable attention. This article shows how a biology course is using the HGP to enhance students' experiences by providing awareness of cutting edge research, with information on new emerging career options, and with opportunities to consider ethical questions raised…

  2. Templated sequence insertion polymorphisms in the human genome

    Science.gov (United States)

    Onozawa, Masahiro; Aplan, Peter

    2016-11-01

    Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.

  3. Genome Sequence of Novel Human Parechovirus Type 17

    OpenAIRE

    B?ttcher, Sindy; Obermeier, Patrick E.; Diedrich, Sabine; Kabor?, Yolande; D?Alfonso, Rossella; Pfister, Herbert; Kaiser, Rolf; Di Cristanziano, Veronica

    2017-01-01

    ABSTRACT Human parechoviruses (HPeV) circulate worldwide, causing a broad variety of symptoms, preferentially in early childhood. We report here the nearly complete genome sequence of a novel HPeV type, consisting of 7,062 nucleotides and encoding 2,179?amino acids. M36/CI/2014 was taxonomically classified as HPeV-17 by the picornavirus study group.

  4. The human genome project and the future of medical practice ...

    African Journals Online (AJOL)

    Contrary to the scepticism that characterised the planning stages of the human genome project, the technology and sequence data resulting from the project are set to revolutionise medical practice for good. The expected benefits include: enhanced discovery of disease genes, which will lead to improved knowledge on the ...

  5. Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor

    DEFF Research Database (Denmark)

    de los Campos, Gustavo; Vazquez, Ana I; Fernando, Rohan

    2013-01-01

    Despite important advances from Genome Wide Association Studies (GWAS), for most complex human traits and diseases, a sizable proportion of genetic variance remains unexplained and prediction accuracy (PA) is usually low. Evidence suggests that PA can be improved using Whole-Genome Regression (WGR......) models where phenotypes are regressed on hundreds of thousands of variants simultaneously. The Genomic Best Linear Unbiased Prediction G-BLUP, a ridge-regression type method) is a commonly used WGR method and has shown good predictive performance when applied to plant and animal breeding populations....... However, breeding and human populations differ greatly in a number of factors that can affect the predictive performance of G-BLUP. Using theory, simulations, and real data analysis, we study the erformance of G-BLUP when applied to data from related and unrelated human subjects. Under perfect linkage...

  6. 76 FR 66076 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-10-25

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 19...

  7. 77 FR 60706 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-10-04

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special.... Nakamura, Ph.D., Scientific Review Officer, Scientific Review Branch, National Human Genome Research...

  8. 76 FR 19780 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-04-08

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research Institute, National... . (Catalogue of Federal Domestic Assistance Program No. 93.172, Human Genome Research, National Institutes of...

  9. 76 FR 3917 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-01-21

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Branch, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076, MSC 9306, Rockville, MD...

  10. 75 FR 56115 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-09-15

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS...

  11. 77 FR 2735 - National Human Genome Research Institute; Notice of Meetings

    Science.gov (United States)

    2012-01-19

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... personal privacy. Name of Committee: National Advisory Council for Human Genome Research. Date: February 13... Extramural Research National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076, MSC 9305...

  12. 76 FR 3643 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-01-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...

  13. 78 FR 24223 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-04-24

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial...: To review and evaluate grant applications. Place: National Human Genome Research Institute, 3rd floor...

  14. 77 FR 2304 - National Human Genome Research Institute; Notice of Meeting

    Science.gov (United States)

    2012-01-17

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome....S.C. 281(d)(4)), notice is hereby given that the National Human Genome Research Institute (NHGRI... meeting of the National Advisory Council for Human Genome Research. Background materials on the proposed...

  15. 78 FR 21382 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-04-10

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... applications. Place: National Human Genome Research Institute, Suite 4076, 5635 Fisher's Lane, Bethesda, MD..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075...

  16. 78 FR 20933 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-04-08

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... review and evaluate grant applications. Place: National Human Genome Research Institute, Room 3055, 5635...

  17. 76 FR 22112 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-04-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special....nih.gov . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  18. 78 FR 31953 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-05-28

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... review and evaluate grant applications. Place: National Human Genome Research Institute, 3rd Floor...

  19. 75 FR 10488 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2010-03-08

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research...- 4280, [email protected]gov . Name of Committee: National Human Genome Research Institute Special...

  20. 76 FR 35224 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-06-16

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome...). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIR, National Human Genome Research..., [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  1. 77 FR 64816 - National Human Genome Research Institute; Notice of Meeting

    Science.gov (United States)

    2012-10-23

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute. The meeting will be open to the public as indicated below, with... invasion of personal privacy. Name of Committee: Board of Scientific Counselors, National Human Genome...

  2. 75 FR 8373 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-02-24

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  3. 75 FR 2147 - National Human Genome Research Institute; Notice of Meetings

    Science.gov (United States)

    2010-01-14

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Council for Human Genome Research. The meetings will be open to the public as indicated below, with... Extramural Research, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076, MSC 9305...

  4. 77 FR 22332 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-04-13

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special.... Agenda: To review and evaluate grant applications. Place: National Human Genome Research Institute, 5635...

  5. 76 FR 65204 - National Human Genome Research Institute; Notice of Meeting

    Science.gov (United States)

    2011-10-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute. The meeting will be open to the public as indicated below, with... invasion of personal privacy. Name of Committee: Board of Scientific Counselors, National Human Genome...

  6. 76 FR 22407 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-04-21

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...

  7. 77 FR 8268 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2012-02-14

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... applications. Place: National Human Genome Research Institute, 5635 Fisher's Lane, Room 4076, Rockville, MD..., CIDR, National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite...

  8. 75 FR 48977 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-08-12

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome.... Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research..., [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  9. 77 FR 74676 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-12-17

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075, Bethesda.... 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: December 11, 2012. David...

  10. 75 FR 19984 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2010-04-16

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075... Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome Research...

  11. 75 FR 26762 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-05-12

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial... . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...

  12. 75 FR 44800 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-07-29

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... for Human Genome Research. The meeting will be closed to the public in accordance with the provisions... Committee: National Advisory Council for Human Genome Research. Date: August 18, 2010. Time: 1 p.m. to 3 p.m...

  13. 75 FR 35821 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-06-23

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome Research [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  14. 76 FR 3642 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2011-01-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research....nih.gov . Name of Committee: National Human Genome Research Institute Special Emphasis Panel eMERGE...

  15. 78 FR 47715 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-08-06

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  16. 77 FR 31863 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-05-30

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special..., Human Genome Research, National Institutes of Health, HHS) Dated: May 22, 2012. Jennifer S. Spaeth...

  17. 75 FR 52537 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-08-26

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial....nih.gov . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  18. 78 FR 61851 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-10-04

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... a.m. to 4:00 p.m. Agenda: To review and evaluate grant applications. Place: National Human Genome...

  19. 76 FR 79199 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-12-21

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome.... Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human Genome Research..., [email protected] . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  20. 75 FR 80509 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-12-22

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Call). Contact Person: Camilla E. Day, PhD, Scientific Review Officer, CIDR, National Human Genome... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: December 16...

  1. 76 FR 28056 - National Human Genome Research Institute; Notice of Closed Meetings

    Science.gov (United States)

    2011-05-13

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... clearly unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research... D. Nakamura, PhD, Scientific Review Officer, Office of Scientific Review, National Human Genome...

  2. 75 FR 2148 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-01-14

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Initial....nih.gov . (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  3. 76 FR 66731 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-10-27

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 21, 2011...

  4. 76 FR 10909 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-02-28

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4076, MSC..., Human Genome Research, National Institutes of Health, HHS). Dated: February 18, 2011. Jennifer S. Spaeth...

  5. 75 FR 52538 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-08-26

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Person: Ken D. Nakamura, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome...

  6. 76 FR 35223 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-06-16

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Person: Rudy O. Pozzatti, PhD, Scientific Review Officer, Scientific Review Branch, National Human Genome...

  7. 76 FR 36930 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-06-23

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special..., Human Genome Research, National Institutes of Health, HHS) Dated: June 17, 2011. Jennifer S. Spaeth...

  8. 77 FR 35991 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-06-15

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4075, Bethesda.... 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: June 8, 2012. Jennifer S...

  9. 77 FR 61770 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-10-11

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) [[Page 61771...

  10. 76 FR 63932 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2011-10-14

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: October 7...

  11. 75 FR 8977 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-02-26

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane, Suite 4076, MSC..., Human Genome Research, National Institutes of Health, HHS) Dated: February 18, 2010. Jennifer Spaeth...

  12. 78 FR 66752 - National Human Genome Research Institute; Amended Notice of Meeting

    Science.gov (United States)

    2013-11-06

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... National Human Genome Research Institute Special Emphasis Panel, October 15, 2013, 01:00 p.m. to October 15, 2013, 02:30 p.m., National Human Genome Research Institute, 5635 Fishers Lane, Suite 3055, Rockville...

  13. 75 FR 32957 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-06-10

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... funding cycle. (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research...

  14. 78 FR 14806 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2013-03-07

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... unwarranted invasion of personal privacy. Name of Committee: National Human Genome Research Institute Special... p.m. Agenda: To review and evaluate grant applications. Place: National Human Genome Research...

  15. 75 FR 53703 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-09-01

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., Scientific Review Branch, National Human Genome Research Institute, National Institutes of Health, 5635.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...

  16. 75 FR 51828 - National Human Genome Research Institute; Notice of Meetings

    Science.gov (United States)

    2010-08-23

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... personal privacy. Name of Committee: National Advisory Council for Human Genome Research. Date: February 7... Research, National Human Genome Research Institute, 5635 Fishers Lane, Suite 4076, MSC 9305, Bethesda, MD...

  17. 75 FR 60467 - National Human Genome Research Institute; Notice of Meeting

    Science.gov (United States)

    2010-09-30

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., National Human Genome Research Institute. The meeting will be open to the public as indicated below, with... invasion of personal privacy. Name of Committee: Board of Scientific Counselors, National Human Genome...

  18. 75 FR 67380 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2010-11-02

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome... Review Branch, National Human Genome Research Institute, National Institutes of Health, 5635 Fishers Lane.... (Catalogue of Federal Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of...

  19. Isolation of human genomic DNA for genetic analysis from premature neonates: a comparison between newborn dried blood spots, whole blood and umbilical cord tissue

    Science.gov (United States)

    2013-01-01

    Background Genotyping requires biological sample collection that must be reliable, convenient and acceptable for patients and clinicians. Finding the most optimal procedure of sample collection for premature neonates who have a very limited blood volume is a particular challenge. The aim of the current study was to evaluate the use of umbilical cord (UC) tissue and newborn dried blood spot (DBS)-extracted genomic DNA (gDNA) as an alternative to venous blood-derived gDNA from premature neonates for molecular genetic analysis. All samples were obtained from premature newborn infants between 24-32 weeks of gestation. Paired blood and UC samples were collected from 31 study participants. gDNA was extracted from ethylenediaminetetraacetic acid (EDTA) anticoagulant-treated blood samples (~500 μl) and newborn DBSs (n = 723) using QIAamp DNA Micro kit (Qiagen Ltd., Crawley, UK); and from UC using Qiagen DNAeasy Blood and Tissue kit (Qiagen Ltd., Crawley, UK). gDNA was quantified and purity confirmed by measuring the A260:A280 ratio. PCR amplification and pyrosequencing was carried out to determine suitability of the gDNA for molecular genetic analysis. Minor allele frequency of two unrelated single nucleotide polymorphisms (SNPs) was calculated using the entire cohort. Results Both whole blood samples and UC tissue provided good quality and yield of gDNA, which was considerably less from newborn DBS. The gDNA purity was also reduced after 3 years of storage of the newborn DBS. PCR amplification of three unrelated genes resulted in clear products in all whole blood and UC samples and 86%-100% of newborn DBS. Genotyping using pyrosequencing showed 100% concordance in the paired UC and whole blood samples. Minor allele frequencies of the two SNPs indicated that no maternal gDNA contamination occurred in the genotyping of the UC samples. Conclusions gDNAs from all three sources are suitable for standard PCR and pyrosequencing assays. Given that UC provide good quality

  20. Human Cancer Models Initiative | Office of Cancer Genomics

    Science.gov (United States)

    The Human Cancer Models Initiative (HCMI) is an international consortium that is generating novel human tumor-derived culture models, which are annotated with genomic and clinical data. In an effort to advance cancer research and more fully understand how in vitro findings are related to clinical biology, HCMI-developed models and related data will be available as a community resource for cancer research.

  1. Workup of Human Blood Samples for Deep Sequencing of HIV-1 Genomes

    NARCIS (Netherlands)

    Cornelissen, Marion; Gall, Astrid; van der Kuyl, Antoinette; Wymant, Chris; Blanquart, François; Fraser, Christophe; Berkhout, Ben

    2018-01-01

    We describe a detailed protocol for the manual workup of blood (plasma/serum) samples from individuals infected with the human immunodeficiency virus type 1 (HIV-1) for deep sequence analysis of the viral genome. The study optimizing the assay was performed in the context of the BEEHIVE (Bridging

  2. Genome sequencing and analysis of BCG vaccine strains.

    Directory of Open Access Journals (Sweden)

    Wen Zhang

    Full Text Available BACKGROUND: Although the Bacillus Calmette-Guérin (BCG vaccine against tuberculosis (TB has been available for more than 75 years, one third of the world's population is still infected with Mycobacterium tuberculosis and approximately 2 million people die of TB every year. To reduce this immense TB burden, a clearer understanding of the functional genes underlying the action of BCG and the development of new vaccines are urgently needed. METHODS AND FINDINGS: Comparative genomic analysis of 19 M. tuberculosis complex strains showed that BCG strains underwent repeated human manipulation, had higher region of deletion rates than those of natural M. tuberculosis strains, and lost several essential components such as T-cell epitopes. A total of 188 BCG strain T-cell epitopes were lost to various degrees. The non-virulent BCG Tokyo strain, which has the largest number of T-cell epitopes (359, lost 124. Here we propose that BCG strain protection variability results from different epitopes. This study is the first to present BCG as a model organism for genetics research. BCG strains have a very well-documented history and now detailed genome information. Genome comparison revealed the selection process of BCG strains under human manipulation (1908-1966. CONCLUSIONS: Our results revealed the cause of BCG vaccine strain protection variability at the genome level and supported the hypothesis that the restoration of lost BCG Tokyo epitopes is a useful future vaccine development strategy. Furthermore, these detailed BCG vaccine genome investigation results will be useful in microbial genetics, microbial engineering and other research fields.

  3. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    Science.gov (United States)

    Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond

    2009-01-01

    Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536

  4. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    Directory of Open Access Journals (Sweden)

    Carmen Yea

    2009-06-01

    Full Text Available Although the human parainfluenza virus 4 (HPIV4 has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada. The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97% with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized.

  5. CRISPR Genome Engineering for Human Pluripotent Stem Cell Research.

    Science.gov (United States)

    Chaterji, Somali; Ahn, Eun Hyun; Kim, Deok-Ho

    2017-01-01

    The emergence of targeted and efficient genome editing technologies, such as repurposed bacterial programmable nucleases (e.g., CRISPR-Cas systems), has abetted the development of cell engineering approaches. Lessons learned from the development of RNA-interference (RNA-i) therapies can spur the translation of genome editing, such as those enabling the translation of human pluripotent stem cell engineering. In this review, we discuss the opportunities and the challenges of repurposing bacterial nucleases for genome editing, while appreciating their roles, primarily at the epigenomic granularity. First, we discuss the evolution of high-precision, genome editing technologies, highlighting CRISPR-Cas9. They exist in the form of programmable nucleases, engineered with sequence-specific localizing domains, and with the ability to revolutionize human stem cell technologies through precision targeting with greater on-target activities. Next, we highlight the major challenges that need to be met prior to bench-to-bedside translation, often learning from the path-to-clinic of complementary technologies, such as RNA-i. Finally, we suggest potential bioinformatics developments and CRISPR delivery vehicles that can be deployed to circumvent some of the challenges confronting genome editing technologies en route to the clinic.

  6. The genome of a Mongolian individual reveals the genetic imprints of Mongolians on modern human populations.

    Science.gov (United States)

    Bai, Haihua; Guo, Xiaosen; Zhang, Dong; Narisu, Narisu; Bu, Junjie; Jirimutu, Jirimutu; Liang, Fan; Zhao, Xiang; Xing, Yanping; Wang, Dingzhu; Li, Tongda; Zhang, Yanru; Guan, Baozhu; Yang, Xukui; Yang, Zili; Shuangshan, Shuangshan; Su, Zhe; Wu, Huiguang; Li, Wenjing; Chen, Ming; Zhu, Shilin; Bayinnamula, Bayinnamula; Chang, Yuqi; Gao, Ying; Lan, Tianming; Suyalatu, Suyalatu; Huang, Hui; Su, Yan; Chen, Yujie; Li, Wenqi; Yang, Xu; Feng, Qiang; Wang, Jian; Yang, Huanming; Wang, Jun; Wu, Qizhu; Yin, Ye; Zhou, Huanmin

    2014-11-05

    Mongolians have played a significant role in modern human evolution, especially after the rise of Genghis Khan (1162[?]-1227). Although the social cultural impacts of Genghis Khan and the Mongolian population have been well documented, explorations of their genome structure and genetic imprints on other human populations have been lacking. We here present the genome of a Mongolian male individual. The genome was de novo assembled using a total of 130.8-fold genomic data produced from massively parallel whole-genome sequencing. We identified high-confidence variation sets, including 3.7 million single nucleotide polymorphisms (SNPs) and 756,234 short insertions and deletions. Functional SNP analysis predicted that the individual has a pathogenic risk for carnitine deficiency. We located the patrilineal inheritance of the Mongolian genome to the lineage D3a through Y haplogroup analysis and inferred that the individual has a common patrilineal ancestor with Tibeto-Burman populations and is likely to be the progeny of the earliest settlers in East Asia. We finally investigated the genetic imprints of Mongolians on other human populations using different approaches. We found varying degrees of gene flows between Mongolians and populations living in Europe, South/Central Asia, and the Indian subcontinent. The analyses demonstrate that the genetic impacts of Mongolians likely resulted from the expansion of the Mongolian Empire in the 13th century. The genome will be of great help in further explorations of modern human evolution and genetic causes of diseases/traits specific to Mongolians. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  7. Human genome education model project. Ethical, legal, and social implications of the human genome project: Education of interdisciplinary professionals

    Energy Technology Data Exchange (ETDEWEB)

    Weiss, J.O. [Alliance of Genetic Support Groups, Chevy Chase, MD (United States); Lapham, E.V. [Georgetown Univ., Washington, DC (United States). Child Development Center

    1996-12-31

    This meeting was held June 10, 1996 at Georgetown University. The purpose of this meeting was to provide a multidisciplinary forum for exchange of state-of-the-art information on the human genome education model. Topics of discussion include the following: psychosocial issues; ethical issues for professionals; legislative issues and update; and education issues.

  8. Sequencing and Analysis of Neanderthal Genomic DNA

    Energy Technology Data Exchange (ETDEWEB)

    Noonan, James P.; Coop, Graham; Kudaravalli, Sridhar; Smith,Doug; Krause, Johannes; Alessi, Joe; Chen, Feng; Platt, Darren; Paabo,Svante; Pritchard, Jonathan K.; Rubin, Edward M.

    2006-06-13

    Recovery and analysis of multiple Neanderthal autosomalsequences using a metagenomic approach reveals that modern humans andNeanderthals split ~;400,000 years ago, without significant evidence ofsubsequent admixture.

  9. Accelerated Evolution of Conserved Noncoding Sequences in theHuman Genome

    Energy Technology Data Exchange (ETDEWEB)

    Prambhakar, Shyam; Noonan, James P.; Paabo, Svante; Rubin, EdwardM.

    2006-07-06

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detect"cryptic" functional elements, which are too weakly conserved amongmammals to distinguish from nonfunctional DNA. To address this problem,we explored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  10. The humankind genome: from genetic diversity to the origin of human diseases.

    Science.gov (United States)

    Belizário, Jose E

    2013-12-01

    Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.

  11. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae : Implications for the microbial "pan-genome"

    NARCIS (Netherlands)

    Tettelin, H; Masignani, [No Value; Cieslewicz, MJ; Donati, C; Medini, D; Ward, NL; Angiuoli, SV; Crabtree, J; Jones, AL; Durkin, AS; DeBoy, RT; Davidsen, TM; Mora, M; Scarselli, M; Ros, IMY; Peterson, JD; Hauser, CR; Sundaram, JP; Nelson, WC; Madupu, R; Brinkac, LM; Dodson, RJ; Rosovitz, MJ; Sullivan, SA; Daugherty, SC; Haft, DH; Selengut, J; Gwinn, ML; Zhou, LW; Zafar, N; Khouri, H; Radune, D; Dimitrov, G; Watkins, K; O'Connor, KJB; Smith, S; Utterback, TR; White, O; Rubens, CE; Grandi, G; Madoff, LC; Kasper, DL; Telford, JL; Wessels, MR; Rappuoli, R; Fraser, CM

    2005-01-01

    The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and

  12. Functional Coverage of the Human Genome by Existing Structures, Structural Genomics Targets, and Homology Models.

    Directory of Open Access Journals (Sweden)

    2005-08-01

    Full Text Available The bias in protein structure and function space resulting from experimental limitations and targeting of particular functional classes of proteins by structural biologists has long been recognized, but never continuously quantified. Using the Enzyme Commission and the Gene Ontology classifications as a reference frame, and integrating structure data from the Protein Data Bank (PDB, target sequences from the structural genomics projects, structure homology derived from the SUPERFAMILY database, and genome annotations from Ensembl and NCBI, we provide a quantified view, both at the domain and whole-protein levels, of the current and projected coverage of protein structure and function space relative to the human genome. Protein structures currently provide at least one domain that covers 37% of the functional classes identified in the genome; whole structure coverage exists for 25% of the genome. If all the structural genomics targets were solved (twice the current number of structures in the PDB, it is estimated that structures of one domain would cover 69% of the functional classes identified and complete structure coverage would be 44%. Homology models from existing experimental structures extend the 37% coverage to 56% of the genome as single domains and 25% to 31% for complete structures. Coverage from homology models is not evenly distributed by protein family, reflecting differing degrees of sequence and structure divergence within families. While these data provide coverage, conversely, they also systematically highlight functional classes of proteins for which structures should be determined. Current key functional families without structure representation are highlighted here; updated information on the "most wanted list" that should be solved is available on a weekly basis from http://function.rcsb.org:8080/pdb/function_distribution/index.html.

  13. Draft genome of neurotropic nematode parasite Angiostrongylus cantonensis, causative agent of human eosinophilic meningitis.

    Science.gov (United States)

    Yong, Hoi-Sen; Eamsobhana, Praphathip; Lim, Phaik-Eem; Razali, Rozaimi; Aziz, Farhanah Abdul; Rosli, Nurul Shielawati Mohamed; Poole-Johnson, Johan; Anwar, Arif

    2015-08-01

    Angiostrongylus cantonensis is a bursate nematode parasite that causes eosinophilic meningitis (or meningoencephalitis) in humans in many parts of the world. The genomic data from A. cantonensis will form a useful resource for comparative genomic and chemogenomic studies to aid the development of diagnostics and therapeutics. We have sequenced, assembled and annotated the genome of A. cantonensis. The genome size is estimated to be ∼260 Mb, with 17,280 genomic scaffolds, 91X coverage, 81.45% for complete and 93.95% for partial score based on CEGMA analysis of genome completeness. The number of predicted genes of ≥300 bp was 17,482. A total of 7737 predicted protein-coding genes of ≥50 amino acids were identified in the assembled genome. Among the proteins of known function, kinases are the most abundant followed by transferases. The draft genome contains 34 excretory-secretory proteins (ES), a minimum of 44 Nematode Astacin (NAS) metalloproteases, 12 Homeobox (HOX) genes, and 30 neurotransmitters. The assembled genome size (260 Mb) is larger than those of Pristionchus pacificus, Caenorhabditis elegans, Necator americanus, Caenorhabditis briggsae, Trichinella spiralis, Brugia malayi and Loa loa, but smaller than Haemonchus contortus and Ascaris suum. The repeat content (25%) is similar to H. contortus. The GC content (41.17%) is lower compared to P. pacificus (42.7%) and H. contortus (43.1%) but higher compared to C. briggsae (37.69%), A. suum (37.9%) and N. americanus (40.2%) while the scaffold N50 is 42,191. This draft genome will facilitate the understanding of many unresolved issues on the parasite and the disorder it causes. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Genomic characterization of large heterochromatic gaps in the human genome assembly.

    Directory of Open Access Journals (Sweden)

    Nicolas Altemose

    2014-05-01

    Full Text Available The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3. The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat families cannot be readily applied to complex satellite repeats such as HSat2,3, which lack a consistent repeat unit reference sequence. Here we present an alignment-free method to characterize complex satellites using whole-genome shotgun read datasets. Utilizing this approach, we classify HSat2,3 sequences into fourteen subfamilies and predict their chromosomal distributions, resulting in a comprehensive satellite reference database to further enable genomic studies of heterochromatic regions. We also identify 1.3 Mb of non-repetitive sequence interspersed with HSat2,3 across 17 unmapped assembly scaffolds, including eight annotated gene predictions. Finally, we apply our satellite reference database to high-throughput sequence data from 396 males to estimate array size variation of the predominant HSat3 array on the Y chromosome, confirming that satellite array sizes can vary between individuals over an order of magnitude (7 to 98 Mb and further demonstrating that array sizes are distributed differently within distinct Y haplogroups. In summary, we present a novel framework for generating initial reference databases for unassembled genomic regions enriched with complex satellite DNA, and we further demonstrate the utility of these reference databases for studying patterns of sequence variation within human populations.

  15. Genomic rearrangements by LINE-1 insertion-mediated deletion in the human and chimpanzee lineages.

    Science.gov (United States)

    Han, Kyudong; Sen, Shurjo K; Wang, Jianxin; Callinan, Pauline A; Lee, Jungnam; Cordaux, Richard; Liang, Ping; Batzer, Mark A

    2005-01-01

    Long INterspersed Elements (LINE-1s or L1s) are abundant non-LTR retrotransposons in mammalian genomes that are capable of insertional mutagenesis. They have been associated with target site deletions upon insertion in cell culture studies of retrotransposition. Here, we report 50 deletion events in the human and chimpanzee genomes directly linked to the insertion of L1 elements, resulting in the loss of approximately 18 kb of sequence from the human genome and approximately 15 kb from the chimpanzee genome. Our data suggest that during the primate radiation, L1 insertions may have deleted up to 7.5 Mb of target genomic sequences. While the results of our in vivo analysis differ from those of previous cell culture assays of L1 insertion-mediated deletions in terms of the size and rate of sequence deletion, evolutionary factors can reconcile the differences. We report a pattern of genomic deletion sizes similar to those created during the retrotransposition of Alu elements. Our study provides support for the existence of different mechanisms for small and large L1-mediated deletions, and we present a model for the correlation of L1 element size and the corresponding deletion size. In addition, we show that internal rearrangements can modify L1 structure during retrotransposition events associated with large deletions.

  16. Recombination analysis based on the complete genome of bocavirus

    Directory of Open Access Journals (Sweden)

    Chen Shengxia

    2011-04-01

    Full Text Available Abstract Bocavirus include bovine parvovirus, minute virus of canine, porcine bocavirus, gorilla bocavirus, and Human bocaviruses 1-4 (HBoVs. Although recent reports showed that recombination happened in bocavirus, no systematical study investigated the recombination of bocavirus. The present study performed the phylogenetic and recombination analysis of bocavirus over the complete genomes available in GenBank. Results confirmed that recombination existed among bocavirus, including the likely inter-genotype recombination between HBoV1 and HBoV4, and intra-genotype recombination among HBoV2 variants. Moreover, it is the first report revealing the recombination that occurred between minute viruses of canine.

  17. Characterization of canine osteosarcoma by array comparative genomic hybridization and RT-qPCR: signatures of genomic imbalance in canine osteosarcoma parallel the human counterpart.

    Science.gov (United States)

    Angstadt, Andrea Y; Motsinger-Reif, Alison; Thomas, Rachael; Kisseberth, William C; Guillermo Couto, C; Duval, Dawn L; Nielsen, Dahlia M; Modiano, Jaime F; Breen, Matthew

    2011-11-01

    Osteosarcoma (OS) is the most commonly diagnosed malignant bone tumor in humans and dogs, characterized in both species by extremely complex karyotypes exhibiting high frequencies of genomic imbalance. Evaluation of genomic signatures in human OS using array comparative genomic hybridization (aCGH) has assisted in uncovering genetic mechanisms that result in disease phenotype. Previous low-resolution (10-20 Mb) aCGH analysis of canine OS identified a wide range of recurrent DNA copy number aberrations, indicating extensive genomic instability. In this study, we profiled 123 canine OS tumors by 1 Mb-resolution aCGH to generate a dataset for direct comparison with current data for human OS, concluding that several high frequency aberrations in canine and human OS are orthologous. To ensure complete coverage of gene annotation, we identified the human refseq genes that map to these orthologous aberrant dog regions and found several candidate genes warranting evaluation for OS involvement. Specifically, subsequenct FISH and qRT-PCR analysis of RUNX2, TUSC3, and PTEN indicated that expression levels correlated with genomic copy number status, showcasing RUNX2 as an OS associated gene and TUSC3 as a possible tumor suppressor candidate. Together these data demonstrate the ability of genomic comparative oncology to identify genetic abberations which may be important for OS progression. Large scale screening of genomic imbalance in canine OS further validates the use of the dog as a suitable model for human cancers, supporting the idea that dysregulation discovered in canine cancers will provide an avenue for complementary study in human counterparts. Copyright © 2011 Wiley-Liss, Inc.

  18. Genome-wide Analysis of Gene Regulation

    DEFF Research Database (Denmark)

    Chen, Yun

    to protein: through epigenetic modifications, transcription regulators or post-transcriptional controls. The following papers concern several layers of gene regulation with questions answered by different HTS approaches. Genome-wide screening of epigenetic changes by ChIP-seq allowed us to study both spatial...... and temporal alterations of histone modifications (Papers I and II). Coupling the data with machine learning approaches, we established a prediction framework to assess the most informative histone marks as well as their most influential nucleosome positions in predicting the promoter usages. (Papers I...... they regulated or if the sites had global elevated usage rates by multiple TFs. Using RNA-seq, 5’end-seq in combination with depletion of 5’exonuclease as well as nonsensemediated decay (NMD) factors, we systematically analyzed NMD substrates as well as their degradation intermediates in human cells (Paper V...

  19. Learning about the Human Genome. Part 2: Resources for Science Educators. ERIC Digest.

    Science.gov (United States)

    Haury, David L.

    This ERIC Digest identifies how the human genome project fits into the "National Science Education Standards" and lists Human Genome Project Web sites found on the World Wide Web. It is a resource companion to "Learning about the Human Genome. Part 1: Challenge to Science Educators" (Haury 2001). The Web resources and…

  20. Human genomic disease variants: a neutral evolutionary explanation.

    Science.gov (United States)

    Dudley, Joel T; Kim, Yuseob; Liu, Li; Markov, Glenn J; Gerold, Kristyn; Chen, Rong; Butte, Atul J; Kumar, Sudhir

    2012-08-01

    Many perspectives on the role of evolution in human health include nonempirical assumptions concerning the adaptive evolutionary origins of human diseases. Evolutionary analyses of the increasing wealth of clinical and population genomic data have begun to challenge these presumptions. In order to systematically evaluate such claims, the time has come to build a common framework for an empirical and intellectual unification of evolution and modern medicine. We review the emerging evidence and provide a supporting conceptual framework that establishes the classical neutral theory of molecular evolution (NTME) as the basis for evaluating disease- associated genomic variations in health and medicine. For over a decade, the NTME has already explained the origins and distribution of variants implicated in diseases and has illuminated the power of evolutionary thinking in genomic medicine. We suggest that a majority of disease variants in modern populations will have neutral evolutionary origins (previously neutral), with a relatively smaller fraction exhibiting adaptive evolutionary origins (previously adaptive). This pattern is expected to hold true for common as well as rare disease variants. Ultimately, a neutral evolutionary perspective will provide medicine with an informative and actionable framework that enables objective clinical assessment beyond convenient tendencies to invoke past adaptive events in human history as a root cause of human disease.

  1. Restriction enzyme analysis of the human cytomegalovirus genome in specimens collected from immunodeficient patients in Belém, State of Pará, Brazil

    Directory of Open Access Journals (Sweden)

    Dorotéa Lobato da Silva

    2011-10-01

    Full Text Available INTRODUCTION: Human cytomegalovirus is an opportunistic betaherpesvirus that causes persistent and serious infections in immunodeficient patients. Recurrent infections occur due to the presence of the virus in a latent state in some cell types. It is possible to examine the virus using molecular methods to aid in the immunological diagnosis and to generate a molecular viral profile in immunodeficient patients. The objective of this study was to characterize cytomegalovirus genotypes and to generate the epidemiological and molecular viral profile in immunodeficient patients. METHODS: A total of 105 samples were collected from immunodeficient patients from the City of Belém, including newborns, hemodialysis patients, transplant recipients and HIV+ patients. An IgG and IgM antibody study was completed using ELISA, and enzymatic analysis by restriction fragment length polymorphism (RFLP was performed to characterize viral genotypes. RESULTS: It was observed that 100% of the patients had IgG antibodies, 87% of which were IgG+/IgM-, consistent with a prior infection profile, 13% were IgG+/IgM+, suggestive of recent infection. The newborn group had the highest frequency (27% of the IgG+/IgM+ profile. By RFLP analysis, only one genotype was observed, gB2, which corresponded to the standard AD169 strain. CONCLUSIONS: The presence of IgM antibodies in new borns indicates that HCMV continues to be an important cause of congenital infection. The low observed genotypic diversity could be attributed to the small sample size because newborns were excluded from the RFLP analysis. This study will be continued including samples from newborns to extend the knowledge of the general and molecular epidemiology of HCMV in immunodeficient patients.

  2. Multi-scale structural community organisation of the human genome.

    Science.gov (United States)

    Boulos, Rasha E; Tremblay, Nicolas; Arneodo, Alain; Borgnat, Pierre; Audit, Benjamin

    2017-04-11

    Structural interaction frequency matrices between all genome loci are now experimentally achievable thanks to high-throughput chromosome conformation capture technologies. This ensues a new methodological challenge for computational biology which consists in objectively extracting from these data the structural motifs characteristic of genome organisation. We deployed the fast multi-scale community mining algorithm based on spectral graph wavelets to characterise the networks of intra-chromosomal interactions in human cell lines. We observed that there exist structural domains of all sizes up to chromosome length and demonstrated that the set of structural communities forms a hierarchy of chromosome segments. Hence, at all scales, chromosome folding predominantly involves interactions between neighbouring sites rather than the formation of links between distant loci. Multi-scale structural decomposition of human chromosomes provides an original framework to question structural organisation and its relationship to functional regulation across the scales. By construction the proposed methodology is independent of the precise assembly of the reference genome and is thus directly applicable to genomes whose assembly is not fully determined.

  3. The noncoding human genome and the future of personalised medicine.

    Science.gov (United States)

    Cowie, Philip; Hay, Elizabeth A; MacKenzie, Alasdair

    2015-01-30

    Non-coding cis-regulatory sequences act as the 'eyes' of the genome and their role is to perceive, organise and relay cellular communication information to RNA polymerase II at gene promoters. The evolution of these sequences, that include enhancers, silencers, insulators and promoters, has progressed in multicellular organisms to the extent that cis-regulatory sequences make up as much as 10% of the human genome. Parallel evidence suggests that 75% of polymorphisms associated with heritable disease occur within predicted cis-regulatory sequences that effectively alter the 'perception' of cis-regulatory sequences or render them blind to cell communication cues. Cis-regulatory sequences also act as major functional targets of epigenetic modification thus representing an important conduit through which changes in DNA-methylation affects disease susceptibility. The objectives of the current review are (1) to describe what has been learned about identifying and characterising cis-regulatory sequences since the sequencing of the human genome; (2) to discuss their role in interpreting cell signalling pathways pathways; and (3) outline how this role may be altered by polymorphisms and epigenetic changes. We argue that the importance of the cis-regulatory genome for the interpretation of cellular communication pathways cannot be overstated and understanding its role in health and disease will be critical for the future development of personalised medicine.

  4. Group sparse canonical correlation analysis for genomic data integration.

    Science.gov (United States)

    Lin, Dongdong; Zhang, Jigang; Li, Jingyao; Calhoun, Vince D; Deng, Hong-Wen; Wang, Yu-Ping

    2013-08-12

    The emergence of high-throughput genomic datasets from different sources and platforms (e.g., gene expression, single nucleotide polymorphisms (SNP), and copy number variation (CNV)) has greatly enhanced our understandings of the interplay of these genomic factors as well as their influences on the complex diseases. It is challenging to explore the relationship between these different types of genomic data sets. In this paper, we focus on a multivariate statistical method, canonical correlation analysis (CCA) method for this problem. Conventional CCA method does not work effectively if the number of data samples is significantly less than that of biomarkers, which is a typical case for genomic data (e.g., SNPs). Sparse CCA (sCCA) methods were introduced to overcome such difficulty, mostly using penalizations with l-1 norm (CCA-l1) or the combination of l-1and l-2 norm (CCA-elastic net). However, they overlook the structural or group effect within genomic data in the analysis, which often exist and are important (e.g., SNPs spanning a gene interact and work together as a group). We propose a new group sparse CCA method (CCA-sparse group) along with an effective numerical algorithm to study the mutual relationship between two different types of genomic data (i.e., SNP and gene expression). We then extend the model to a more general formulation that can include the existing sCCA models. We apply the model to feature/variable selection from two data sets and compare our group sparse CCA method with existing sCCA methods on both simulation and two real datasets (human gliomas data and NCI60 data). We use a graphical representation of the samples with a pair of canonical variates to demonstrate the discriminating characteristic of the selected features. Pathway analysis is further performed for biological interpretation of those features. The CCA-sparse group method incorporates group effects of features into the correlation analysis while performs individual feature

  5. Opening plenary speaker: Human genomics, precision medicine, and advancing human health.

    Science.gov (United States)

    Green, Eric D

    2016-08-01

    Starting with the launch of the Human Genome Project in 1990, the past quarter-century has brought spectacular achievements in genomics that dramatically empower the study of human biology and disease. The human genomics enterprise is now in the midst of an important transition, as the growing foundation of genomic knowledge is being used by researchers and clinicians to tackle increasingly complex problems in biomedicine. Of particular prominence is the use of revolutionary new DNA sequencing technologies for generating prodigious amounts of DNA sequence data to elucidate the complexities of genome structure, function, and evolution, as well as to unravel the genomic bases of rare and common diseases. Together, these developments are ushering in the era of genomic medicine. Augmenting the advances in human genomics have been innovations in technologies for measuring environmental and lifestyle information, electronic health records, and data science; together, these provide opportunities of unprecedented scale and scope for investigating the underpinnings of health and disease. To capitalize on these opportunities, U.S. President Barack Obama recently announced a major new research endeavor - the U.S. Precision Medicine Initiative. This bold effort will be framed around several key aims, which include accelerating the use of genomically informed approaches to cancer care, making important policy and regulatory changes, and establishing a large research cohort of >1 million volunteers to facilitate precision medicine research. The latter will include making the partnership with all participants a centerpiece feature in the cohort's design and development. The Precision Medicine Initiative represents a broad-based research program that will allow new approaches for individualized medical care to be rigorously tested, so as to establish a new evidence base for advancing clinical practice and, eventually, human health.

  6. Evolution and Diversity in Human Herpes Simplex Virus Genomes

    Science.gov (United States)

    Gatherer, Derek; Ochoa, Alejandro; Greenbaum, Benjamin; Dolan, Aidan; Bowden, Rory J.; Enquist, Lynn W.; Legendre, Matthieu; Davison, Andrew J.

    2014-01-01

    Herpes simplex virus 1 (HSV-1) causes a chronic, lifelong infection in >60% of adults. Multiple recent vaccine trials have failed, with viral diversity likely contributing to these failures. To understand HSV-1 diversity better, we comprehensively compared 20 newly sequenced viral genomes from China, Japan, Kenya, and South Korea with six previously sequenced genomes from the United States, Europe, and Japan. In this diverse collection of passaged strains, we found that one-fifth of the newly sequenced members share a gene deletion and one-third exhibit homopolymeric frameshift mutations (HFMs). Individual strains exhibit genotypic and potential phenotypic variation via HFMs, deletions, short sequence repeats, and single-nucleotide polymorphisms, although the protein sequence identity between strains exceeds 90% on average. In the first genome-scale analysis of positive selection in HSV-1, we found signs of selection in specific proteins and residues, including the fusion protein glycoprotein H. We also confirmed previous results suggesting that recombination has occurred with high frequency throughout the HSV-1 genome. Despite this, the HSV-1 strains analyzed clustered by geographic origin during whole-genome distance analysis. These data shed light on likely routes of HSV-1 adaptation to changing environments and will aid in the selection of vaccine antigens that are invariant worldwide. PMID:24227835

  7. The Echinococcus canadensis (G7) genome: a key knowledge of parasitic platyhelminth human diseases.

    Science.gov (United States)

    Maldonado, Lucas L; Assis, Juliana; Araújo, Flávio M Gomes; Salim, Anna C M; Macchiaroli, Natalia; Cucher, Marcela; Camicia, Federico; Fox, Adolfo; Rosenzvit, Mara; Oliveira, Guilherme; Kamenetzky, Laura

    2017-02-27

    The parasite Echinococcus canadensis (G7) (phylum Platyhelminthes, class Cestoda) is one of the causative agents of echinococcosis. Echinococcosis is a worldwide chronic zoonosis affecting humans as well as domestic and wild mammals, which has been reported as a prioritized neglected disease by the World Health Organisation. No genomic data, comparative genomic analyses or efficient therapeutic and diagnostic tools are available for this severe disease. The information presented in this study will help to understand the peculiar biological characters and to design species-specific control tools. We sequenced, assembled and annotated the 115-Mb genome of E. canadensis (G7). Comparative genomic analyses using whole genome data of three Echinococcus species not only confirmed the status of E. canadensis (G7) as a separate species but also demonstrated a high nucleotide sequences divergence in relation to E. granulosus (G1). The E. canadensis (G7) genome contains 11,449 genes with a core set of 881 orthologs shared among five cestode species. Comparative genomics revealed that there are more single nucleotide polymorphisms (SNPs) between E. canadensis (G7) and E. granulosus (G1) than between E. canadensis (G7) and E. multilocularis. This result was unexpected since E. canadensis (G7) and E. granulosus (G1) were considered to belong to the species complex E. granulosus sensu lato. We described SNPs in known drug targets and metabolism genes in the E. canadensis (G7) genome. Regarding gene regulation, we analysed three particular features: CpG island distribution along the three Echinococcus genomes, DNA methylation system and small RNA pathway. The results suggest the occurrence of yet unknown gene regulation mechanisms in Echinococcus. This is the first work that addresses Echinococcus comparative genomics. The resources presented here will promote the study of mechanisms of parasite development as well as new tools for drug discovery. The availability of a high

  8. Characterization of Human Cytomegalovirus Genome Diversity in Immunocompromised Hosts by Whole-Genome Sequencing Directly From Clinical Specimens.

    Science.gov (United States)

    Hage, Elias; Wilkie, Gavin S; Linnenweber-Held, Silvia; Dhingra, Akshay; Suárez, Nicolás M; Schmidt, Julius J; Kay-Fedorov, Penelope C; Mischak-Weissinger, Eva; Heim, Albert; Schwarz, Anke; Schulz, Thomas F; Davison, Andrew J; Ganzenmueller, Tina

    2017-06-01

    Advances in next-generation sequencing (NGS) technologies allow comprehensive studies of genetic diversity over the entire genome of human cytomegalovirus (HCMV), a significant pathogen for immunocompromised individuals. Next-generation sequencing was performed on target enriched sequence libraries prepared directly from a variety of clinical specimens (blood, urine, breast milk, respiratory samples, biopsies, and vitreous humor) obtained longitudinally or from different anatomical compartments from 20 HCMV-infected patients (renal transplant recipients, stem cell transplant recipients, and congenitally infected children). De novo-assembled HCMV genome sequences were obtained for 57 of 68 sequenced samples. Analysis of longitudinal or compartmental HCMV diversity revealed various patterns: no major differences were detected among longitudinal, intraindividual blood samples from 9 of 15 patients and in most of the patients with compartmental samples, whereas a switch of the major HCMV population was observed in 6 individuals with sequential blood samples and upon compartmental analysis of 1 patient with HCMV retinitis. Variant analysis revealed additional aspects of minor virus population dynamics and antiviral-resistance mutations. In immunosuppressed patients, HCMV can remain relatively stable or undergo drastic genomic changes that are suggestive of the emergence of minor resident strains or de novo infection. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.

  9. The "most wanted" taxa from the human microbiome for whole genome sequencing.

    Directory of Open Access Journals (Sweden)

    Anthony A Fodor

    Full Text Available The goal of the Human Microbiome Project (HMP is to generate a comprehensive catalog of human-associated microorganisms including reference genomes representing the most common species. Toward this goal, the HMP has characterized the microbial communities at 18 body habitats in a cohort of over 200 healthy volunteers using 16S rRNA gene (16S sequencing and has generated nearly 1,000 reference genomes from human-associated microorganisms. To determine how well current reference genome collections capture the diversity observed among the healthy microbiome and to guide isolation and future sequencing of microbiome members, we compared the HMP's 16S data sets to several reference 16S collections to create a 'most wanted' list of taxa for sequencing. Our analysis revealed that the diversity of commonly occurring taxa within the HMP cohort microbiome is relatively modest, few novel taxa are represented by these OTUs and many common taxa among HMP volunteers recur across different populations of healthy humans. Taken together, these results suggest that it should be possible to perform whole-genome sequencing on a large fraction of the human microbiome, including the 'most wanted', and that these sequences should serve to support microbiome studies across multiple cohorts. Also, in stark contrast to other taxa, the 'most wanted' organisms are poorly represented among culture collections suggesting that novel culture- and single-cell-based methods will be required to isolate these organisms for sequencing.

  10. Single genome retrieval of context-dependent variability in mutation rates for human germline.

    Science.gov (United States)

    Sahakyan, Aleksandr B; Balasubramanian, Shankar

    2017-01-13

    Accurate knowledge of the core components of substitution rates is of vital importance to understand genome evolution and dynamics. By performing a single-genome and direct analysis of 39,894 retrotransposon remnants, we reveal sequence context-dependent germline nucleotide substitution rates for the human genome. The rates are characterised through rate constants in a time-domain, and are made available through a dedicated program (Trek) and a stand-alone database. Due to the nature of the method design and the imposed stringency criteria, we expect our rate constants to be good estimates for the rates of spontaneous mutations. Benefiting from such data, we study the short-range nucleotide (up to 7-mer) organisation and the germline basal substitution propensity (BSP) profile of the human genome; characterise novel, CpG-independent, substitution prone and resistant motifs; confirm a decreased tendency of moieties with low BSP to undergo somatic mutations in a number of cancer types; and, produce a Trek-based estimate of the overall mutation rate in human. The extended set of rate constants we report may enrich our resources and help advance our understanding of genome dynamics and evolution, with possible implications for the role of spontaneous mutations in the emergence of pathological genotypes and neutral evolution of proteomes.

  11. The mobile genetic element Alu in the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Novick, G.E. [Florida International Univ., Miami, FL (United States); Batzer, M.A.; Deininger, P.L. [Louisiana State Univ. Medical Center, New Orleans, LA (United States)] [and others

    1996-01-01

    Genetic material has been traditionally envisioned as relatively static with the exception of occasional, often deleterious mutations. The sequence DNA-to-RNA-to-protein represented for many years the central dogma relating gene structure and function. Recently, the field of molecular genetics has provided revolutionary information on the dynamic role of repetitive elements in the function of the genetic material and the evolution of humans and other organisms. Alu sequences represent the largest family of short interspersed repetitive elements (SINEs) in humans, being present in an excess of 500,000 copies per haploid genome. Alu elements, as well as the other repetitive elements, were once considered to be useless. Today, the biology of Alu transposable elements is being widely examined in order to determine the molecular basis of a growing number of identified diseases and to provide new directions in genome mapping and biomedical research. 66 refs., 5 figs.

  12. Targets of balancing selection in the human genome

    DEFF Research Database (Denmark)

    Andrés, Aida M; Hubisz, Melissa J; Indap, Amit

    2009-01-01

    Balancing selection is potentially an important biological force for maintaining advantageous genetic diversity in populations, including variation that is responsible for long-term adaptation to the environment. By serving as a means to maintain genetic variation, it may be particularly relevant...... to maintaining phenotypic variation in natural populations. Nevertheless, its prevalence and specific targets in the human genome remain largely unknown. We have analyzed the patterns of diversity and divergence of 13,400 genes in two human populations using an unbiased single-nucleotide polymorphism data set......, a genome-wide approach, and a method that incorporates demography in neutrality tests. We identified an unbiased catalog of genes with signatures of long-term balancing selection, which includes immunity genes as well as genes encoding keratins and membrane channels; the catalog also shows enrichment...

  13. Genomics: The Science and Technology Behind the Human Genome Project (by Charles R. Cantor and Cassandra L. Smith)

    Science.gov (United States)

    Serra, Reviewed By Martin J.

    2000-01-01

    Genomics is one of the most rapidly expanding areas of science. This book is an outgrowth of a series of lectures given by one of the former heads (CRC) of the Human Genome Initiative. The book is designed to reach a wide audience, from biologists with little chemical or physical science background through engineers, computer scientists, and physicists with little current exposure to the chemical or biological principles of genetics. The text starts with a basic review of the chemical and biological properties of DNA. However, without either a biochemistry background or a supplemental biochemistry text, this chapter and much of the rest of the text would be difficult to digest. The second chapter is designed to put DNA into the context of the larger chromosomal unit. Specialized chromosomal structures and sequences (centromeres, telomeres) are introduced, leading to a section on chromosome organization and purification. The next 4 chapters cover the physical (hybridization, electrophoresis), chemical (polymerase chain reaction), and biological (genetic) techniques that provide the backbone of genomic analysis. These chapters cover in significant detail the fundamental principles underlying each technique and provide a firm background for the remainder of the text. Chapters 7­9 consider the need and methods for the development of physical maps. Chapter 7 primarily discusses chromosomal localization techniques, including in situ hybridization, FISH, and chromosome paintings. The next two chapters focus on the development of libraries and clones. In particular, Chapter 9 considers the limitations of current mapping and clone production. The current state and future of DNA sequencing is covered in the next three chapters. The first considers the current methods of DNA sequencing - especially gel-based methods of analysis, although other possible approaches (mass spectrometry) are introduced. Much of the chapter addresses the limitations of current methods, including

  14. Short template switch events explain mutation clusters in the human genome.

    Science.gov (United States)

    Löytynoja, Ari; Goldman, Nick

    2017-06-01

    Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press.

  15. The human genome project and the Catholic Church (1)

    Science.gov (United States)

    Moraczewski, Albert S

    1991-12-01

    The Cathlic Church has not made any formal statements about the Human Genome Project as such. But the present Pope, John Paul II, has commented, albeit very briefly, on various aspects of genetic manipulation. Genetic interventions which are therapeutic (e.g. gene therapy), namely, directed to the correction or amelioration of a disorder are acceptable, in principle, provided they promote the personal well being of the individual being so treated. Genetic interventions which are not therapeutic for the specific individual involved but are experimental and directed primarily to improving humans as biological entities are of dubious moral probity, but are not necessarily to be totally rejected out of hand. To be morally acceptable such genetic intervention should meet certain conditions which include due respect for the given psychological nature of each individual human being. In addition, no harm should be inflicted on the process of human generation, and its fundamental design should not be altered. Any genetic manipulation which results in, or tends to, the creation of groups with different qualities such that there would result a fresh marginalization of these people must be avoided. It has been also suggested by a few that because the Son of God took on a human nature in Jesus Christ, one may not so alter the human genome that a new distinct species would be created....

  16. Genome-wide expression analysis of human in vivo irritated epidermis: differential profiles induced by sodium lauryl sulfate and nonanoic acid.

    Science.gov (United States)

    Clemmensen, Anders; Andersen, Klaus E; Clemmensen, Ole; Tan, Qihua; Petersen, Thomas K; Kruse, Torben A; Thomassen, Mads

    2010-09-01

    The pathogenesis of irritant contact dermatitis (ICD) is poorly understood, and genes participating in the epidermal response to chemical irritants are only partly known. It is commonly accepted that different irritants have different mechanisms of action in the development of ICD. To define the differential molecular events induced in the epidermis by different irritants, we collected sequential biopsies ((1/2), 4, and 24 hours after a single exposure and at day 11 after repeated exposure) from human volunteers exposed to either sodium lauryl sulfate (SLS) or nonanoic acid (NON). Gene expression analysis using high-density oligonucleotide microarrays (representing 47,000 transcripts) revealed essentially different pathway responses (1/2)hours after exposure: NON transiently induced the IL-6 pathway as well as a number of mitogen-activated signaling cascades including extracellular signal-regulated kinase and growth factor receptor signaling, whereas SLS transiently downregulated cellular energy metabolism pathways. Differential expression of the cyclooxygenase-2 and matrix metalloproteinase 3 transcripts was confirmed immunohistochemically. After cumulative exposure, 883 genes were differentially expressed, whereas we identified 23 suggested common biomarkers for ICD. In conclusion, we bring new insights into two hitherto less well-elucidated phases of skin irritancy: the very initial as well as the late phase after single and cumulative mild exposures, respectively.

  17. Genome-wide analysis of gene expression during adipogenesis in human adipose-derived stromal cells reveals novel patterns of gene expression during adipocyte differentiation

    Directory of Open Access Journals (Sweden)

    Melvin Anyasi Ambele

    2016-05-01

    Full Text Available We have undertaken an in-depth transcriptome analysis of adipogenesis in human adipose-derived stromal cells (ASCs induced to differentiate into adipocytes in vitro. Gene expression was assessed on days 1, 7, 14 and 21 post-induction and genes differentially expressed numbered 128, 218, 253 and 240 respectively. Up-regulated genes were associated with blood vessel development, leukocyte migration, as well as tumor growth, invasion and metastasis. They also shared common pathways with certain obesity-related pathophysiological conditions. Down-regulated genes were enriched for immune response processes. KLF15, LMO3, FOXO1 and ZBTB16 transcription factors were up-regulated throughout the differentiation process. CEBPA, PPARG, ZNF117, MLXIPL, MMP3 and RORB were up-regulated only on days 14 and 21, which coincide with the maturation of adipocytes and could possibly serve as candidates for controlling fat accumulation and the size of mature adipocytes. In summary, we have identified genes that were up-regulated only on days 1 and 7 or days 14 and 21 that could serve as potential early and late-stage differentiation markers.

  18. Widespread of horizontal gene transfer in the human genome

    OpenAIRE

    Huang, Wenze; Tsai, Lillian; Li, Yulong; Hua, Nan; Sun, Chen; Wei, Chaochun

    2017-01-01

    Background A fundamental concept in biology is that heritable material is passed from parents to offspring, a process called vertical gene transfer. An alternative mechanism of gene acquisition is through horizontal gene transfer (HGT), which involves movement of genetic materials between different species. Horizontal gene transfer has been found prevalent in prokaryotes but very rare in eukaryote. In this paper, we investigate horizontal gene transfer in the human genome. Results From the pa...

  19. Linkage Disequilibrium between STRPs and SNPs across the Human Genome

    OpenAIRE

    Payseur, Bret A.; Place, Michael; Weber, James L.

    2008-01-01

    Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this i...

  20. Genomic analysis of Xenopus organizer function

    Directory of Open Access Journals (Sweden)

    Suhai Sándor

    2006-06-01

    Full Text Available Abstract Background Studies of the Xenopus organizer have laid the foundation for our understanding of the conserved signaling pathways that pattern vertebrate embryos during gastrulation. The two primary activities of the organizer, BMP and Wnt inhibition, can regulate a spectrum of genes that pattern essentially all aspects of the embryo during gastrulation. As our knowledge of organizer signaling grows, it is imperative that we begin knitting together our gene-level knowledge into genome-level signaling models. The goal of this paper was to identify complete lists of genes regulated by different aspects of organizer signaling, thereby providing a deeper understanding of the genomic mechanisms that underlie these complex and fundamental signaling events. Results To this end, we ectopically overexpress Noggin and Dkk-1, inhibitors of the BMP and Wnt pathways, respectively, within ventral tissues. After isolating embryonic ventral halves at early and late gastrulation, we analyze the transcriptional response to these molecules within the generated ectopic organizers using oligonucleotide microarrays. An efficient statistical analysis scheme, combined with a new Gene Ontology biological process annotation of the Xenopus genome, allows reliable and faithful clustering of molecules based upon their roles during gastrulation. From this data, we identify new organizer-related expression patterns for 19 genes. Moreover, our data sub-divides organizer genes into separate head and trunk organizing groups, which each show distinct responses to Noggin and Dkk-1 activity during gastrulation. Conclusion Our data provides a genomic view of the cohorts of genes that respond to Noggin and Dkk-1 activity, allowing us to separate the role of each in organizer function. These patterns demonstrate a model where BMP inhibition plays a largely inductive role during early developmental stages, thereby initiating the suites of genes needed to pattern dorsal tissues

  1. Report of the second Human Genome Diversity workshop

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1992-12-31

    The Second Human Genome Diversity Workshop was successfully held at Penn State University from October 29--31, 1992. The Workshop was essentially organized around 7 groups, each comprising approximately 10 participants, representing the sampling issues in different regions of the world. These groups worked independently, using a common format provided by the organizers; this was adjusted as needed by the individual groups. The Workshop began with a presentation of the mandate to the participants, and of the procedures to be followed during the workshop. Dr. Feldman presented a summary of the results from the First Workshop. He and the other organizers also presented brief comments giving their perspective on the objectives of the Second Workshop. Dr. Julia Bodmer discussed the study of European genetic diversity, especially in the context of the HLA experience there, and of plans to extend such studies in the coming years. She also discussed surveys of world HLA laboratories in regard to resources related to Human Genome Diversity. Dr. Mark Weiss discussed the relevance of nonhuman primate studies for understanding how demographic processes, such as mate exchange between local groups, affected the local dispersion of genetic variation. Primate population geneticists have some relevant experience in interpreting variation at this local level, in particular, with various DNA fingerprinting methods. This experience may be relevant to the Human Genome Diversity Project, in terms of practical and statistical issues.

  2. Linkage disequilibrium between STRPs and SNPs across the human genome.

    Science.gov (United States)

    Payseur, Bret A; Place, Michael; Weber, James L

    2008-05-01

    Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.

  3. The genomic signature of human rhinoviruses A, B and C.

    Directory of Open Access Journals (Sweden)

    Spyridon Megremis

    Full Text Available Human rhinoviruses are single stranded positive sense RNA viruses that are presented in more than 50% of acute upper respiratory tract infections. Despite extensive studies on the genetic diversity of the virus, little is known about the forces driving it. In order to explain this diversity, many research groups have focused on protein sequence requirements for viable, functional and transmissible virus but have missed out an important aspect of viral evolution such as the genomic ontology of the virus. This study presents for the first time the genomic signature of 111 fully sequenced HRV strains from all three groups HRV-A, HRV-B and HRV-C. We observed an HRV genome tendency to eliminate CpG and UpA dinucleotides, coupling with over-representation of UpG and CpA. We propose a specific mechanism which describes how rapid changes in the HRV genomic sequence can take place under the strict control of conservation of the polypeptide backbone. Moreover, the distribution of the observed under- and over-represented dinucleotides along the HRV genome is presented. Distance matrice tables based on CpG and UpA odds ratios were constructed and viewed as heatmaps and distance trees. None of the suppressions can be attributed to codon usage or in RNA secondary structure requirements. Since viral recognition is dependent on RNA motifs rich in CpG and UpA, it is possible that the overall described genome evolution mechanism acts in order to protect the virus from host recognition.

  4. State of human genome at low-doses ecological influences

    International Nuclear Information System (INIS)

    Mel'nov, S.B.; Rytik, P.G.; Kruchinskij, N.G.; Kovalev, V.A.; Palamar, L.A.; Senyuk, O.F.

    2005-01-01

    The results of analysis of the state of genome (amounts of single strand breaks in DNA) of the persons exposed to influence of complex 'Chernobyl factor' in remote terms after a failure on ChNPP are resulted. Findings allowed to expose the increase of level of single strand breaks in DNA at the chronically irradiated persons mainly carry adaptive character and probably can be related to instability of genome. Thus at organism level growth of mutational pressure and strengthening of instability of cellular genome is related to the change of spectrum of biological characteristics, in particular individual reaction of somatic cells of victims on additional mutagens influences. The indicated changes can testify to existence of potential risk of remote genetic consequences of long-term irradiation influence in low doses

  5. Analysis of high-identity segmental duplications in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Carelli Francesco N

    2011-08-01

    Full Text Available Abstract Background Segmental duplications (SDs are blocks of genomic sequence of 1-200 kb that map to different loci in a genome and share a sequence identity > 90%. SDs show at the sequence level the same characteristics as other regions of the human genome: they contain both high-copy repeats and gene sequences. SDs play an important role in genome plasticity by creating new genes and modeling genome structure. Although data is plentiful for mammals, not much was known about the representation of SDs in plant genomes. In this regard, we performed a genome-wide analysis of high-identity SDs on the sequenced grapevine (Vitis vinifera genome (PN40024. Results We demonstrate that recent SDs (> 94% identity and >= 10 kb in size are a relevant component of the grapevine genome (85 Mb, 17% of the genome sequence. We detected mitochondrial and plastid DNA and genes (10% of gene annotation in segmentally duplicated regions of the nuclear genome. In particular, the nine highest copy number genes have a copy in either or both organelle genomes. Further we showed that several duplicated genes take part in the biosynthesis of compounds involved in plant response to environmental stress. Conclusions These data show the great influence of SDs and organelle DNA transfers in modeling the Vitis vinifera nuclear DNA structure as well as the impact of SDs in contributing to the adaptive capacity of grapevine and the nutritional content of grape products through genome variation. This study represents a step forward in the full characterization of duplicated genes important for grapevine cultural needs and human health.

  6. The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

    Science.gov (United States)

    Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

    2013-01-01

    Mitochondria are the main manufacturers of cellular ATP in eukaryotes. The plant mitochondrial genome contains large number of foreign DNA and repeated sequences undergone frequently intramolecular recombination. Upland Cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant in the world. Sequencing of the cotton mitochondrial (mt) genome could be helpful for the evolution research of plant mt genomes. We utilized 454 technology for sequencing and combined with Fosmid library of the Gossypium hirsutum mt genome screening and positive clones sequencing and conducted a series of evolutionary analysis on Cycas taitungensis and 24 angiosperms mt genomes. After data assembling and contigs joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G.hirsutum mt genome is 621,884 bp in length, and contained 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are found conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes and species closely related share the most homologous sequences. For species that have both mt and chloroplast genome sequences available, we checked the location of cp-like migration and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characters of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes suggest that evolution of mt genomes is consistent with plant taxonomy but independent among different species.

  7. Phylogenomic Analysis and Dynamic Evolution of Chloroplast Genomes in Salicaceae

    Directory of Open Access Journals (Sweden)

    Yuan Huang

    2017-06-01

    Full Text Available Chloroplast genomes of plants are highly conserved in both gene order and gene content. Analysis of the whole chloroplast genome is known to provide much more informative DNA sites and thus generates high resolution for plant phylogenies. Here, we report the complete chloroplast genomes of three Salix species in family Salicaceae. Phylogeny of Salicaceae inferred from complete chloroplast genomes is generally consistent with previous studies but resolved with higher statistical support. Incongruences of phylogeny, however, are observed in genus Populus, which most likely results from homoplasy. By comparing three Salix chloroplast genomes with the published chloroplast genomes of other Salicaceae species, we demonstrate that the synteny and length of chloroplast genomes in Salicaceae are highly conserved but experienced dynamic evolution among species. We identify seven positively selected chloroplast genes in Salicaceae, which might be related to the adaptive evolution of Salicaceae species. Comparative chloroplast genome analysis within the family also indicates that some chloroplast genes are lost or became pseudogenes, infer that the chloroplast genes horizontally transferred to the nucleus genome. Based on the complete nucleus genome sequences from two Salicaceae species, we remarkably identify that the entire chloroplast genome is indeed transferred and integrated to the nucleus genome in the individual of the reference genome of P. trichocarpa at least once. This observation, along with presence of the large nuclear plastid DNA (NUPTs and NUPTs-containing multiple chloroplast genes in their original order in the chloroplast genome, favors the DNA-mediated hypothesis of organelle to nucleus DNA transfer. Overall, the phylogenomic analysis using chloroplast complete genomes clearly elucidates the phylogeny of Salicaceae. The identification of positively selected chloroplast genes and dynamic chloroplast-to-nucleus gene transfers in

  8. De novo assembly of a haplotype-resolved human genome

    DEFF Research Database (Denmark)

    Cao, Hongzhi; Wu, Honglong; Luo, Ruibang

    2015-01-01

    The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-...

  9. The Bifidobacterium dentium Bd1 genome sequence reflects its genetic adaptation to the human oral cavity.

    Directory of Open Access Journals (Sweden)

    Marco Ventura

    2009-12-01

    Full Text Available Bifidobacteria, one of the relatively dominant components of the human intestinal microbiota, are considered one of the key groups of beneficial intestinal bacteria (probiotic bacteria. However, in addition to health-promoting taxa, the genus Bifidobacterium also includes Bifidobacterium dentium, an opportunistic cariogenic pathogen. The genetic basis for the ability of B. dentium to survive in the oral cavity and contribute to caries development is not understood. The genome of B. dentium Bd1, a strain isolated from dental caries, was sequenced to completion to uncover a single circular 2,636,368 base pair chromosome with 2,143 predicted open reading frames. Annotation of the genome sequence revealed multiple ways in which B. dentium has adapted to the oral environment through specialized nutrient acquisition, defences against antimicrobials, and gene products that increase fitness and competitiveness within the oral niche. B. dentium Bd1 was shown to metabolize a wide variety of carbohydrates, consistent with genome-based predictions, while colonization and persistence factors implicated in tissue adhesion, acid tolerance, and the metabolism of human saliva-derived compounds were also identified. Global transcriptome analysis demonstrated that many of the genes encoding these predicted traits are highly expressed under relevant physiological conditions. This is the first report to identify, through various genomic approaches, specific genetic adaptations of a Bifidobacterium taxon, Bifidobacterium dentium Bd1, to a lifestyle as a cariogenic microorganism in the oral cavity. In silico analysis and comparative genomic hybridization experiments clearly reveal a high level of genome conservation among various B. dentium strains. The data indicate that the genome of this opportunistic cariogen has evolved through a very limited number of horizontal gene acquisition events, highlighting the narrow boundaries that separate commensals from

  10. Millstone: software for multiplex microbial genome analysis and engineering.

    Science.gov (United States)

    Goodman, Daniel B; Kuznetsov, Gleb; Lajoie, Marc J; Ahern, Brian W; Napolitano, Michael G; Chen, Kevin Y; Chen, Changping; Church, George M

    2017-05-25

    Inexpensive DNA sequencing and advances in genome editing have made computational analysis a major rate-limiting step in adaptive laboratory evolution and microbial genome engineering. We describe Millstone, a web-based platform that automates genotype comparison and visualization for projects with up to hundreds of genomic samples. To enable iterative genome engineering, Millstone allows users to design oligonucleotide libraries and create successive versions of reference genomes. Millstone is open source and easily deployable to a cloud platform, local cluster, or desktop, making it a scalable solution for any lab.

  11. [Manipulation of the human genome: ethics and law].

    Science.gov (United States)

    Goulart, Maria Carolina Vaz; Iano, Flávia Godoy; Silva, Paulo Maurício; Sales-Peres, Silvia Helena de Carvalho; Sales-Peres, Arsênio

    2010-06-01

    The molecular biology has provided the basic tool for geneticists deepening in the molecular mechanisms that influence different diseases. It should be noted the scientific and moral responsibility of the researchers, because the scientists should imagine the moral consequences of the commercial application of genetic tests, since this fact involves not only the individual and their families, but the entire population. Besides being also necessary to make a reflection on how this information from the human genome will be used, for good or bad. The objective of this review was to bring the light of knowledge, data on characteristics of the ethical application of molecular biology, linking it with the rights of human beings. After studying literature, it might be observed that the Human Genome Project has generated several possibilities, such as the identification of genes associated with diseases with synergistic properties, but sometimes modifying behavior to genetically intervene in humans, bringing benefits or social harm. The big challenge is to decide what humanity wants on this giant leap.

  12. Los Alamos Science: The Human Genome Project. Number 20, 1992

    Science.gov (United States)

    Cooper, N. G.; Shea, N. eds.

    1992-01-01

    This document provides a broad overview of the Human Genome Project, with particular emphasis on work being done at Los Alamos. It tries to emphasize the scientific aspects of the project, compared to the more speculative information presented in the popular press. There is a brief introduction to modern genetics, including a review of classic work. There is a broad overview of the Genome Project, describing what the project is, what are some of its major five-year goals, what are major technological challenges ahead of the project, and what can the field of biology, as well as society expect to see as benefits from this project. Specific results on the efforts directed at mapping chromosomes 16 and 5 are discussed. A brief introduction to DNA libraries is presented, bearing in mind that Los Alamos has housed such libraries for many years prior to the Genome Project. Information on efforts to do applied computational work related to the project are discussed, as well as experimental efforts to do rapid DNA sequencing by means of single-molecule detection using applied spectroscopic methods. The article introduces the Los Alamos staff which are working on the Genome Project, and concludes with brief discussions on ethical, legal, and social implications of this work; a brief glimpse of genetics as it may be practiced in the next century; and a glossary of relevant terms.

  13. Experimental annotation of the human genome using microarray technology.

    Science.gov (United States)

    Shoemaker, D D; Schadt, E E; Armour, C D; He, Y D; Garrett-Engele, P; McDonagh, P D; Loerch, P M; Leonardson, A; Lum, P Y; Cavet, G; Wu, L F; Altschuler, S J; Edwards, S; King, J; Tsang, J S; Schimmack, G; Schelter, J M; Koch, J; Ziman, M; Marton, M J; Li, B; Cundiff, P; Ward, T; Castle, J; Krolewski, M; Meyer, M R; Mao, M; Burchard, J; Kidd, M J; Dai, H; Phillips, J W; Linsley, P S; Stoughton, R; Scherer, S; Boguski, M S

    2001-02-15

    The most important product of the sequencing of a genome is a complete, accurate catalogue of genes and their products, primarily messenger RNA transcripts and their cognate proteins. Such a catalogue cannot be constructed by computational annotation alone; it requires experimental validation on a genome scale. Using 'exon' and 'tiling' arrays fabricated by ink-jet oligonucleotide synthesis, we devised an experimental approach to validate and refine computational gene predictions and define full-length transcripts on the basis of co-regulated expression of their exons. These methods can provide more accurate gene numbers and allow the detection of mRNA splice variants and identification of the tissue- and disease-specific conditions under which genes are expressed. We apply our technique to chromosome 22q under 69 experimental condition pairs, and to the entire human genome under two experimental conditions. We discuss implications for more comprehensive, consistent and reliable genome annotation, more efficient, full-length complementary DNA cloning strategies and application to complex diseases.

  14. Los Alamos Science: The Human Genome Project. Number 20, 1992

    Energy Technology Data Exchange (ETDEWEB)

    Cooper, N G; Shea, N [eds.

    1992-01-01

    This article provides a broad overview of the Human Genome Project, with particular emphasis on work being done at Los Alamos. It tries to emphasize the scientific aspects of the project, compared to the more speculative information presented in the popular press. There is a brief introduction to modern genetics, including a review of classic work. There is a broad overview of the Genome Project, describing what the project is, what are some of its major five-year goals, what are major technological challenges ahead of the project, and what can the field of biology, as well as society expect to see as benefits from this project. Specific results on the efforts directed at mapping chromosomes 16 and 5 are discussed. A brief introduction to DNA libraries is presented, bearing in mind that Los Alamos has housed such libraries for many years prior to the Genome Project. Information on efforts to do applied computational work related to the project are discussed, as well as experimental efforts to do rapid DNA sequencing by means of single-molecule detection using applied spectroscopic methods. The article introduces the Los Alamos staff which are working on the Genome Project, and concludes with brief discussions on ethical, legal, and social implications of this work; a brief glimpse of genetics as it may be practiced in the next century; and a glossary of relevant terms.

  15. Reporting of Human Genome Epidemiology (HuGE association studies: An empirical assessment

    Directory of Open Access Journals (Sweden)

    Gwinn Marta

    2008-05-01

    Full Text Available Abstract Background Several thousand human genome epidemiology association studies are published every year investigating the relationship between common genetic variants and diverse phenotypes. Transparent reporting of study methods and results allows readers to better assess the validity of study findings. Here, we document reporting practices of human genome epidemiology studies. Methods Articles were randomly selected from a continuously updated database of human genome epidemiology association studies to be representative of genetic epidemiology literature. The main analysis evaluated 315 articles published in 2001–2003. For a comparative update, we evaluated 28 more recent articles published in 2006, focusing on issues that were poorly reported in 2001–2003. Results During both time periods, most studies comprised relatively small study populations and examined one or more genetic variants within a single gene. Articles were inconsistent in reporting the data needed to assess selection bias and the methods used to minimize misclassification (of the genotype, outcome, and environmental exposure or to identify population stratification. Statistical power, the use of unrelated study participants, and the use of replicate samples were reported more often in articles published during 2006 when compared with the earlier sample. Conclusion We conclude that many items needed to assess error and bias in human genome epidemiology association studies are not consistently reported. Although some improvements were seen over time, reporting guidelines and online supplemental material may help enhance the transparency of this literature.

  16. Human genome sequencing with direct x-ray holographic imaging

    International Nuclear Information System (INIS)

    Rhodes, C.K.

    1993-01-01

    Direct holographic imaging of biological materials is widely applicable to the study of the structure, properties and action of genetic material. This particular application involves the sequencing of the human genome where prospective genomic imaging technology is composed of three subtechnologies, name an x-ray holographic camera, suitable chemistry and enzymology for the preparation of tagged DNA samples, and the illuminator in the form of an x-ray laser. We report appropriate x-ray camera, embodied by the instrument developed by MCR, is available and that suitable chemical and enzymatic procedures exist for the preparation of the necessary tagged DNA strands. Concerning the future development of the x-ray illuminator. We find that a practical small scale x-ray light source is indeed feasible. This outcome requires the use of unconventional physical processes in order to achieve the necessary power-compression in the amplifying medium. The understanding of these new physical mechanisms is developing rapidly. Importantly, although the x-ray source does not currently exist, the understanding of these new physical mechanisms is developing rapidly and the research has established the basic scaling laws that will determine the properties of the x-ray illuminator. When this x-ray source becomes available, an extremely rapid and cost effective instrument for 3-D imaging of biological materials can be applied to a wide range of biological structural assays, including the base-pair sequencing of the human genome and many questions regarding its higher levels of organization

  17. Genome sequence analysis of the model grass Brachypodium distachyon: insights into grass genome evolution

    Energy Technology Data Exchange (ETDEWEB)

    Schulman, Al

    2009-08-09

    Three subfamilies of grasses, the Erhardtoideae (rice), the Panicoideae (maize, sorghum, sugar cane and millet), and the Pooideae (wheat, barley and cool season forage grasses) provide the basis of human nutrition and are poised to become major sources of renewable energy. Here we describe the complete genome sequence of the wild grass Brachypodium distachyon (Brachypodium), the first member of the Pooideae subfamily to be completely sequenced. Comparison of the Brachypodium, rice and sorghum genomes reveals a precise sequence- based history of genome evolution across a broad diversity of the grass family and identifies nested insertions of whole chromosomes into centromeric regions as a predominant mechanism driving chromosome evolution in the grasses. The relatively compact genome of Brachypodium is maintained by a balance of retroelement replication and loss. The complete genome sequence of Brachypodium, coupled to its exceptional promise as a model system for grass research, will support the development of new energy and food crops

  18. Integrated proteomic and genomic analysis of colorectal cancer

    Science.gov (United States)

    Investigators who analyzed 95 human colorectal tumor samples have determined how gene alterations identified in previous analyses of the same samples are expressed at the protein level. The integration of proteomic and genomic data, or proteogenomics, pro

  19. SIGMA: A System for Integrative Genomic Microarray Analysis of Cancer Genomes

    Directory of Open Access Journals (Sweden)

    Davies Jonathan J

    2006-12-01

    Full Text Available Abstract Background The prevalence of high resolution profiling of genomes has created a need for the integrative analysis of information generated from multiple methodologies and platforms. Although the majority of data in the public domain are gene expression profiles, and expression analysis software are available, the increase of array CGH studies has enabled integration of high throughput genomic and gene expression datasets. However, tools for direct mining and analysis of array CGH data are limited. Hence, there is a great need for analytical and display software tailored to cross platform integrative analysis of cancer genomes. Results We have created a user-friendly java application to facilitate sophisticated visualization and analysis such as cross-tumor and cross-platform comparisons. To demonstrate the utility of this software, we assembled array CGH data representing Affymetrix SNP chip, Stanford cDNA arrays and whole genome tiling path array platforms for cross comparison. This cancer genome database contains 267 profiles from commonly used cancer cell lines representing 14 different tissue types. Conclusion In this study we have developed an application for the visualization and analysis of data from high resolution array CGH platforms that can be adapted for analysis of multiple types of high throughput genomic datasets. Furthermore, we invite researchers using array CGH technology to deposit both their raw and processed data, as this will be a continually expanding database of cancer genomes. This publicly available resource, the System for Integrative Genomic Microarray Analysis (SIGMA of cancer genomes, can be accessed at http://sigma.bccrc.ca.

  20. Poor man’s 1000 genome project: Recent human population expansion confounds the detection of disease alleles in 7,098 complete mitochondrial genomes

    Directory of Open Access Journals (Sweden)

    Hie Lim eKim

    2013-02-01

    Full Text Available Rapid growth of the human population has caused the accumulation of rare genetic variants that may play a role in the origin of genetic diseases. However, it is challenging to identify those rare variants responsible for specific diseases without genetic data from an extraordinarily large population sample. Here we focused on the accumulated data from the human mitochondrial (mt genome sequences because this data provided 7,098 whole genomes for analysis. In this dataset we identified 6,110 single nucleotide variants (SNVs and their frequency and determined that the best-fit demographic model for the 7,098 genomes included severe population bottlenecks and exponential expansions of the non-African population. Using this model, we simulated the evolution of mt genomes in order to ascertain the behavior of deleterious mutations. We found that such deleterious mutations barely survived during population expansion. We derived the threshold frequency of a deleterious mutation in separate African, Asian, and European populations and used it to identify pathogenic mutations in our dataset. Although threshold frequency was very low, the proportion of variants showing a lower frequency than that threshold was 82%, 83%, and 91% of the total variants for the African, Asian, and European populations, respectively. Within these variants, only 18 known pathogenic mutations were detected in the 7,098 genomes. This result showed the difficulty of detecting a pathogenic mutation within an abundance of rare variants in the human population, even with a large number of genomes available for study.

  1. Barcode server: a visualization-based genome analysis system.

    Directory of Open Access Journals (Sweden)

    Fenglou Mao

    Full Text Available We have previously developed a computational method for representing a genome as a barcode image, which makes various genomic features visually apparent. We have demonstrated that this visual capability has made some challenging genome analysis problems relatively easy to solve. We have applied this capability to a number of challenging problems, including (a identification of horizontally transferred genes, (b identification of genomic islands with special properties and (c binning of metagenomic sequences, and achieved highly encouraging results. These application results inspired us to develop this barcode-based genome analysis server for public service, which supports the following capabilities: (a calculation of the k-mer based barcode image for a provided DNA sequence; (b detection of sequence fragments in a given genome with distinct barcodes from those of the majority of the genome, (c clustering of provided DNA sequences into groups having similar barcodes; and (d homology-based search using Blast against a genome database for any selected genomic regions deemed to have interesting barcodes. The barcode server provides a job management capability, allowing processing of a large number of analysis jobs for barcode-based comparative genome analyses. The barcode server is accessible at http://csbl1.bmb.uga.edu/Barcode.

  2. Human Ageing Genomic Resources: new and updated databases

    Science.gov (United States)

    Tacutu, Robi; Thornton, Daniel; Johnson, Emily; Budovsky, Arie; Barardo, Diogo; Craig, Thomas; Diana, Eugene; Lehmann, Gilad; Toren, Dmitri; Wang, Jingwei; Fraifeld, Vadim E

    2018-01-01

    Abstract In spite of a growing body of research and data, human ageing remains a poorly understood process. Over 10 years ago we developed the Human Ageing Genomic Resources (HAGR), a collection of databases and tools for studying the biology and genetics of ageing. Here, we present HAGR’s main functionalities, highlighting new additions and improvements. HAGR consists of six core databases: (i) the GenAge database of ageing-related genes, in turn composed of a dataset of >300 human ageing-related genes and a dataset with >2000 genes associated with ageing or longevity in model organisms; (ii) the AnAge database of animal ageing and longevity, featuring >4000 species; (iii) the GenDR database with >200 genes associated with the life-extending effects of dietary restriction; (iv) the LongevityMap database of human genetic association studies of longevity with >500 entries; (v) the DrugAge database with >400 ageing or longevity-associated drugs or compounds; (vi) the CellAge database with >200 genes associated with cell senescence. All our databases are manually curated by experts and regularly updated to ensure a high quality data. Cross-links across our databases and to external resources help researchers locate and integrate relevant information. HAGR is freely available online (http://genomics.senescence.info/). PMID:29121237

  3. Genetic Characterization and Comparative Genome Analysis of Brucella melitensis Isolates from India

    Directory of Open Access Journals (Sweden)

    Sarwar Azam

    2016-01-01

    Full Text Available Brucellosis is the most frequent zoonotic disease worldwide, with over 500,000 new human infections every year. Brucella melitensis, the most virulent species in humans, primarily affects goats and the zoonotic transmission occurs by ingestion of unpasteurized milk products or through direct contact with fetal tissues. Brucellosis is endemic in India but no information is available on population structure and genetic diversity of Brucella spp. in India. We performed multilocus sequence typing of four B. melitensis strains isolated from naturally infected goats from India. For more detailed genetic characterization, we carried out whole genome sequencing and comparative genome analysis of one of the B. melitensis isolates, Bm IND1. Genome analysis identified 141 unique SNPs, 78 VNTRs, 51 Indels, and 2 putative prophage integrations in the Bm IND1 genome. Our data may help to develop improved epidemiological typing tools and efficient preventive strategies to control brucellosis.

  4. Prolonged Integration Site Selection of a Lentiviral Vector in the Genome of Human Keratinocytes.

    Science.gov (United States)

    Qian, Wei; Wang, Yong; Li, Rui-Fu; Zhou, Xin; Liu, Jing; Peng, Dai-Zhi

    2017-03-03

    BACKGROUND Lentiviral vectors have been successfully used for human skin cell gene transfer studies. Defining the selection of integration sites for retroviral vectors in the host genome is crucial in risk assessment analysis of gene therapy. However, genome-wide analyses of lentiviral integration sites in human keratinocytes, especially after prolonged growth, are poorly understood. MATERIAL AND METHODS In this study, 874 unique lentiviral vector integration sites in human HaCaT keratinocytes after long-term culture were identified and analyzed with the online tool GTSG-QuickMap and SPSS software. RESULTS The data indicated that lentiviral vectors showed integration site preferences for genes and gene-rich regions. CONCLUSIONS This study will likely assist in determining the relative risks of the lentiviral vector system and in the design of a safe lentiviral vector system in the gene therapy of skin diseases.

  5. A Genomics Approach to Tumor Gemome Analysis

    National Research Council Canada - National Science Library

    Collins, Colin

    2002-01-01

    Genomes of solid tumors are often highly rearranged and these rearrangements promote cancer progression through disruption of genes mediating immortality, survival, metastasis, and resistance to therapy...

  6. High-density rhesus macaque oligonucleotide microarray design using early-stage rhesus genome sequence information and human genome annotations

    Directory of Open Access Journals (Sweden)

    Magness Charles L

    2007-01-01

    Full Text Available Abstract Background Until recently, few genomic reagents specific for non-human primate research have been available. To address this need, we have constructed a macaque-specific high-density oligonucleotide microarray by using highly fragmented low-pass sequence contigs from the rhesus genome project together with the detailed sequence and exon structure of the human genome. Using this method, we designed oligonucleotide probes to over 17,000 distinct rhesus/human gene orthologs and increased by four-fold the number of available genes relative to our first-generation expressed sequence tag (EST-derived array. Results We constructed a database containing 248,000 exon sequences from 23,000 human RefSeq genes and compared each human exon with its best matching sequence in the January 2005 version of the rhesus genome project list of 486,000 DNA contigs. Best matching rhesus exon sequences for each of the 23,000 human genes were then concatenated in the proper order and orientation to produce a rhesus "virtual transcriptome." Microarray probes were designed, one per gene, to the region closest to the 3' untranslated region (UTR of each rhesus virtual transcript. Each probe was compared to a composite rhesus/human transcript database to test for cross-hybridization potential yielding a final probe set representing 18,296 rhesus/human gene orthologs, including transcript variants, and over 17,000 distinct genes. We hybridized mRNA from rhesus brain and spleen to both the EST- and genome-derived microarrays. Besides four-fold greater gene coverage, the genome-derived array also showed greater mean signal intensities for genes present on both arrays. Genome-derived probes showed 99.4% identity when compared to 4,767 rhesus GenBank sequence tag site (STS sequences indicating that early stage low-pass versions of complex genomes are of sufficient quality to yield valuable functional genomic information when combined with finished genome information from

  7. Within-Host Variations of Human Papillomavirus Reveal APOBEC-Signature Mutagenesis in the Viral Genome.

    Science.gov (United States)

    Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao

    2018-03-28

    Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied with the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here we explored within-host genetic diversity of HPV by performing deep sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52 and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC), and were deep-sequenced. After constructing a reference vial genome sequence for each specimen, nucleotide positions showing changes with > 0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with varying numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the tri-nucleotides context encompassing substituted bases revealed that Tp C pN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep sequencing analyses, we show for the first time a comprehensive snapshot of the "within

  8. Comparative Genomic Analysis of Mannheimia haemolytica from Bovine Sources.

    Science.gov (United States)

    Klima, Cassidy L; Cook, Shaun R; Zaheer, Rahat; Laing, Chad; Gannon, Vick P; Xu, Yong; Rasmussen, Jay; Potter, Andrew; Hendrick, Steve; Alexander, Trevor W; McAllister, Tim A

    2016-01-01

    Bovine respiratory disease is a common health problem in beef production. The primary bacterial agent involved, Mannheimia haemolytica, is a target for antimicrobial therapy and at risk for associated antimicrobial resistance development. The role of M. haemolytica in pathogenesis is linked to serotype with serotypes 1 (S1) and 6 (S6) isolated from pneumonic lesions and serotype 2 (S2) found in the upper respiratory tract of healthy animals. Here, we sequenced the genomes of 11 strains of M. haemolytica, representing all three serotypes and performed comparative genomics analysis to identify genetic features that may contribute to pathogenesis. Possible virulence associated genes were identified within 14 distinct prophage, including a periplasmic chaperone, a lipoprotein, peptidoglycan glycosyltransferase and a stress response protein. Prophage content ranged from 2-8 per genome, but was higher in S1 and S6 strains. A type I-C CRISPR-Cas system was identified in each strain with spacer diversity and organization conserved among serotypes. The majority of spacers occur in S1 and S6 strains and originate from phage suggesting that serotypes 1 and 6 may be more resistant to phage predation. However, two spacers complementary to the host chromosome targeting a UDP-N-acetylglucosamine 2-epimerase and a glycosyl transferases group 1 gene are present in S1 and S6 strains only indicating these serotypes may employ CRISPR-Cas to regulate gene expression to avoid host immune responses or enhance adhesion during infection. Integrative conjugative elements are present in nine of the eleven genomes. Three of these harbor extensive multi-drug resistance cassettes encoding resistance against the majority of drugs used to combat infection in beef cattle, including macrolides and tetracyclines used in human medicine. The findings here identify key features that are likely contributing to serotype related pathogenesis and specific targets for vaccine design intended to reduce the

  9. Comparative Genomic Analysis of Mannheimia haemolytica from Bovine Sources.

    Directory of Open Access Journals (Sweden)

    Cassidy L Klima

    Full Text Available Bovine respiratory disease is a common health problem in beef production. The primary bacterial agent involved, Mannheimia haemolytica, is a target for antimicrobial therapy and at risk for associated antimicrobial resistance development. The role of M. haemolytica in pathogenesis is linked to serotype with serotypes 1 (S1 and 6 (S6 isolated from pneumonic lesions and serotype 2 (S2 found in the upper respiratory tract of healthy animals. Here, we sequenced the genomes of 11 strains of M. haemolytica, representing all three serotypes and performed comparative genomics analysis to identify genetic features that may contribute to pathogenesis. Possible virulence associated genes were identified within 14 distinct prophage, including a periplasmic chaperone, a lipoprotein, peptidoglycan glycosyltransferase and a stress response protein. Prophage content ranged from 2-8 per genome, but was higher in S1 and S6 strains. A type I-C CRISPR-Cas system was identified in each strain with spacer diversity and organization conserved among serotypes. The majority of spacers occur in S1 and S6 strains and originate from phage suggesting that serotypes 1 and 6 may be more resistant to phage predation. However, two spacers complementary to the host chromosome targeting a UDP-N-acetylglucosamine 2-epimerase and a glycosyl transferases group 1 gene are present in S1 and S6 strains only indicating these serotypes may employ CRISPR-Cas to regulate gene expression to avoid host immune responses or enhance adhesion during infection. Integrative conjugative elements are present in nine of the eleven genomes. Three of these harbor extensive multi-drug resistance cassettes encoding resistance against the majority of drugs used to combat infection in beef cattle, including macrolides and tetracyclines used in human medicine. The findings here identify key features that are likely contributing to serotype related pathogenesis and specific targets for vaccine design

  10. Predicting human height by Victorian and genomic methods.

    Science.gov (United States)

    Aulchenko, Yurii S; Struchalin, Maksim V; Belonogova, Nadezhda M; Axenovich, Tatiana I; Weedon, Michael N; Hofman, Albert; Uitterlinden, Andre G; Kayser, Manfred; Oostra, Ben A; van Duijn, Cornelia M; Janssens, A Cecile J W; Borodin, Pavel M

    2009-08-01

    In the Victorian era, Sir Francis Galton showed that 'when dealing with the transmission of stature from parents to children, the average height of the two parents, ... is all we need care to know about them' (1886). One hundred and twenty-two years after Galton's work was published, 54 loci showing strong statistical evidence for association to human height were described, providing us with potential genomic means of human height prediction. In a population-based study of 5748 people, we find that a 54-loci genomic profile explained 4-6% of the sex- and age-adjusted height variance, and had limited ability to discriminate tall/short people, as characterized by the area under the receiver-operating characteristic curve (AUC). In a family-based study of 550 people, with both parents having height measurements, we find that the Galtonian mid-parental prediction method explained 40% of the sex- and age-adjusted height variance, and showed high discriminative accuracy. We have also explored how much variance a genomic profile should explain to reach certain AUC values. For highly heritable traits such as height, we conclude that in applications in which parental phenotypic information is available (eg, medicine), the Victorian Galton's method will long stay unsurpassed, in terms of both discriminative accuracy and costs. For less heritable traits, and in situations in which parental information is not available (eg, forensics), genomic methods may provide an alternative, given that the variants determining an essential proportion of the trait's variation can be identified.

  11. Functional genomic analysis of C. elegans molting.

    Directory of Open Access Journals (Sweden)

    Alison R Frand

    2005-10-01

    Full Text Available Although the molting cycle is a hallmark of insects and nematodes, neither the endocrine control of molting via size, stage, and nutritional inputs nor the enzymatic mechanism for synthesis and release of the exoskeleton is well understood. Here, we identify endocrine and enzymatic regulators of molting in C. elegans through a genome-wide RNA-interference screen. Products of the 159 genes discovered include annotated transcription factors, secreted peptides, transmembrane proteins, and extracellular matrix enzymes essential for molting. Fusions between several genes and green fluorescent protein show a pulse of expression before each molt in epithelial cells that synthesize the exoskeleton, indicating that the corresponding proteins are made in the correct time and place to regulate molting. We show further that inactivation of particular genes abrogates expression of the green fluorescent protein reporter genes, revealing regulatory networks that might couple the expression of genes essential for molting to endocrine cues. Many molting genes are conserved in parasitic nematodes responsible for human disease, and thus represent attractive targets for pesticide and pharmaceutical development.

  12. GEnomes Management Application (GEM.app): a new software tool for large-scale collaborative genome analysis.

    Science.gov (United States)

    Gonzalez, Michael A; Lebrigio, Rafael F Acosta; Van Booven, Derek; Ulloa, Rick H; Powell, Eric; Speziani, Fiorella; Tekin, Mustafa; Schüle, Rebecca; Züchner, Stephan

    2013-06-01

    Novel genes are now identified at a rapid pace for many Mendelian disorders, and increasingly, for genetically complex phenotypes. However, new challenges have also become evident: (1) effectively managing larger exome and/or genome datasets, especially for smaller labs; (2) direct hands-on analysis and contextual interpretation of variant data in large genomic datasets; and (3) many small and medium-sized clinical and research-based investigative teams around the world are generating data that, if combined and shared, will significantly increase the opportunities for the entire community to identify new genes. To address these challenges, we have developed GEnomes Management Application (GEM.app), a software tool to annotate, manage, visualize, and analyze large genomic datasets (https://genomics.med.miami.edu/). GEM.app currently contains ∼1,600 whole exomes from 50 different phenotypes studied by 40 principal investigators from 15 different countries. The focus of GEM.app is on user-friendly analysis for nonbioinformaticians to make next-generation sequencing data directly accessible. Yet, GEM.app provides powerful and flexible filter options, including single family filtering, across family/phenotype queries, nested filtering, and evaluation of segregation in families. In addition, the system is fast, obtaining results within 4 sec across ∼1,200 exomes. We believe that this system will further enhance identification of genetic causes of human disease. © 2013 Wiley Periodicals, Inc.

  13. 77 FR 50140 - National Human Genome Research Institute; Notice of Closed Meeting

    Science.gov (United States)

    2012-08-20

    ... DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Human Genome..., Human Genome Research, National Institutes of Health, HHS) Dated: August 13, 2012. Anna Snouffer, Deputy..., Bethesda, MD 20892. Contact Person: Camilla E. Day, Ph.D., Scientific Review Officer, CIDR, National Human...